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PRIMARY NUCLEOTIDE SEQUENCE OF THE SHRIMP WHITE SPOT 
BACILLIFORM VIRUS (WSBV), DISCOVERY SYSTEMS CONTAINING THIS 
SEQUENCE AND DETECTION KITS AND ANTIVERAU TARGETS FOR 
DETECTION AND CONTROLLING SHRIMP VIRUS OUTBREAK AND SPREAD 

RELATED APPLICATION 

The present application claims priority to Chinese patent application No. 99124717.5, 
filed November 24, 1999. 

FIELD OF THE INVENTION 

The present invention is in the field of genomic discovery systems. The present 
invention specifically provides the complete shrimp white spot bacilliform virus (WSBV) 
genome and isolated fragments thereof in a form that is commercially useful, including 
detection kits, antiviral agents, reagents such as nucleic acid arrays, and computer-based 
systems. 

BACKGROUND OF THE INVENTION 

The shrimp and prawn (hereafter collectively referred to as shrimp) industry is a rapid 
growth worldwide industry worth billions of dollars. Worldwide, the shrimp industry relies 
on both the harvesting of wild shrimp and aquaculture, which is the controlled farming of 
fish, shellfish, and plants. Aquaculture, particularly aquaculture of shrimp, is growing rapidly 
due to increasing consumer demand for shrimp and other seafood. Aquaculture has been 
expanding at an annual rate far surpassing the growth of livestock meat, capture fisheries, and 
agricultural production. The aquaculture industry delivers high-quality protein for human and 
animal consumption and provides a substantial source of income and employment, 
particularly for developing countries. Aquaculture accounts for nearly 20 percent of the 
world's harvest of fish, shellfish, and seaweeds. The total worldwide value of giant tiger 
prawn production is the greatest of any aquaculture species. Aquaculture of giant tiger prawn 
significantly contributes to many Asian and Latin American economies, where the majority 
of giant tiger prawn production occurs. Shrimp accounted for approximately a quarter of the 
overall value of Asian fish exports in 1996. In the United States, harvesting and processing 
shrimp, including both aquaculture and harvesting wild shrimp, is a $3 billion dollar a year 
industry that employs over 1 1,000 people. Furthermore, shrimp aquaculture in the U.S. has 
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the potential to become a high-growth business. The risk of viral diseases to cultured shrimp 
is the primary obstacle to the growth of the shrimp aquaculture industry. 

Shrimp viral disease is a major worldwide concem of the shrimp industry. Both 
aquaculture and wild shrimp are vulnerable to viral infection, which can lead to devastating 
economic consequences. Furthermore, shrimp viruses may affect other crustaceans such as 
crabs and crayfish. Drastic declines in the populations of wild shrimp or other crustaceans 
due to viral disease can also dramatically affect other species in the food chain that depend 
upon shrimp for food and can lead to severe ecological consequences. 

Major pathogenic shrimp viruses include White Spot Bacilliform Virus (WSBV), 
Infectious Hypodermal and Hematopoietic Virus (IHHNV), Tara Syndrome Virus (TSV), and 
Yellow Head Virus (YHV). IHHNV and TSV are endemic throughout South and Central 
America, while WSBV and YHV are endemic throughout Asia. All U.S. shrimp species are 
susceptible to infection and disease from one or more of these four viruses. Susceptibility of 
U.S. species of shrimp to these viruses may lead to restrictions on the importation of foreign 
shrimp into the U.S. 

Past incidents of viral outbreaks illustrate the devastating affects that a viral outbreak 
can have on the shrimp industry. An outbreak of IHHNV in 1987 in the Gulf of California 
shrimp fishery reduced shrimp to levels that could not support commercial harvests until 
1994. Outbreaks in 1995 and 1996 on U.S. shrimp farms caused a 50 to 95 percent loss of 
production at affected farms. Shrimp exports from China to the U.S, dropped 75% between 
1990 and 1995 due to infection by WSBV. 

WSBV is regarded as one of the most highly pathogenic viruses of penaeid shrimp. 
No uniform name exists for WSBV. It is also known as White Spot Syndrome Virus 
(WSSV), Prawn White Spot Bacilliform Virus (PWSBV), White Spot Baculiform Virus 
(WSBV), Baculoviral Hypodermal and Hematopoietic Necrosis Virus (HHNBV), Rod- 
shaped Nuclear Virus of Penaeus Japonicus (RV-PJ), Systemic Ectodermal and Mesodermal 
Bacilliform Virus (SEMBV). Other acronyms include WSV, WSDV, and LNBV. The virus is 
a non-occluded, circular, double-stranded DNA bacilliform virus with a genome of 
approximately 300kb. WSBV virions are enveloped nucleocapsids with bacilliform 
morphology and a tail-like extension at one end. 

White Spot Syndrome, caused by WSBV, is also known by such names as Red 
Disease, China Virus Disease, and Shrimp Explosive Epidemic Disease. Infected shrimp 
display rapid reduction in food consumption and lethargy. Gross observations include a loose 
cuticle and a red color to the entire body and appendages along with small subcutaneous 
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white spots. Histological examination reveals prominent intranuclear inclusion bodies in the 
cuticular epithelium, subcutis, and connective tissues. Cumulative mortality rates reach 100% 
within 3 to 10 days of the onset of clinical signs. No significant resistance to WSBV has been 
reported. All native U.S. species of shrimp are susceptible to WSBV infection under 
experimental conditions. WSBV is widely spread throughout most of the shrimp growing 
regions of Asia and the Indo-Pacific, including China, Japan, Korea, Thailand, Indonesia, 
Taiwan, Vietnam, Malaysia, and India. Lethal outbreaks of WSBV virus have recently been 
recorded in Texas and South Carolina. Furthermore, the virus has been shown to infect other 
crustaceans including amphipods, ostracods^ swimming crabs, crayfish, copepods. and shore 
flies. The possibility exists that these organisms could act as a reservoir through which 
further shrimp infection, or infection of other species, can occur. 

In view of the serious economic and ecological risks posed to the worldwide shrimp 
industry and shrimp populations by viruses, particularly WSBV, a strong need exists for 
antiviral agents and detection systems. Detection systems should be highly specific, rapid, 
and sensitive. To facilitate development of antiviral agents and detection systems, knowledge 
of the complete genomic sequence and protein encoding sequences of WSBV is needed. Prior 
to the present invention, very few reports on WSBV genomic sequences existed, and only a 
small fraction of the entire WSBV genome had been sequenced. To date, only six WSBV 
sequenced have been patented, published or stored in public genome databases, such as 
Genbank. All sequences to date are short sequences ranging in length from 420 bp to 2424 
bp. J.S. Kim and others from Korea have sequenced 2424bp (wsu 92007,1997) and 420bp 
(wsu 89843, 1997); K.Mitsuo and others from Japan have sequenced two fragments, 1447bp 
(PN JP 1997201 196-A/2) and 1461bp (PN JP 1997201 196-A/l) in length; Chufang Luo et.al. 
from Taiwan sequenced I461bp (PMU50923, 1996); L.M. Nunan et. al. from the United 
States has reported 868bp sequence in J, Virological Methods (1997(63): pi 93-201). These 
known sequences are no more than lOkb in length all together. In addition, these sequences 
are randomly sequenced with no systematic analysis; therefore determining sequence 
function is difficult. Since the complete genome of WSBV is more than 300kb in length, the 
analysis of the complete genomic DNA sequence and it's complete structure, the 
determination of the expressed sequences, and prediction of the functions of encoded proteins 
are all new scientific achievements. These achievements are the basis for the present 
invention. The present invention is directed to providing the complete primary nucleotide 
sequence of WSBV and isolated fragments thereof, protein encoding sequences of WSBV, 
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and antiviral agents and detection systems based on the nucleotide and protein encoding 
sequences provided by the present invention. 

DNA Viruses 

Generally, transcription of a DNA virus genome occurs in the nucleus of the host cell, 
utilizing host cell polymerases and other host enzymes for viral mRNA synthesis and viral 
replication. Viral gene transcription is modulated by the interaction of specific DNA-binding 
proteins with promoter and enhancer elements in the viral genome. Commonly, the viral 
promoter and enhancer elements are similar in sequence to those of the host cell in order to 
allow the host cell's transcriptional activation factors and DNA-dependent RNA polymerase 
to bind the viral control elements. Cells from different tissues or species express different 
DMA-binding proteins, and this is a major factor in determining which species, and which 
cells and tissues of that species, that the virus can infect. 

Viruses, in general, depend on the host cell ribosomes, transfer RNA (tRNA), and 
mechanisms of posttranslational modification to produce their proteins. Generally, viral 
mRNA encoding non-structural viral proteins, such as DNA-binding proteins and enzymes, 
are transcribed first. These are followed by late viral gene products encoding structural 
proteins. 

Viruses utilize various methods to promote preferential translation of their viral 
mRNA over host cell mRNA. In some instances, concentration of viral mRNA in the host 
cell is so large that it occupies most of the celPs ribosomes, thereby preventing translation of 
host cell mRNA. Viruses may inhibit synthesis and/or induce degradation of the cell's nucleic 
acids. Many viruses increase the permeability of the host cell membrane, thereby reducing 
the ribosomal affinity for most cellular mRNA. 

Viral DNA replication begins at a unique sequence in the genome called the origin of 
replication, or ori. The ori is recognized by viral or host nuclear factors and DNA-dependent 
DNA polymerase. Viral DNA synthesis is semi-conservative and a primer is required by the 
DNA polymerase to initiate synthesis of the new DNA molecule. 

Viral Screening Tests and Antiviral Agents 

Viral screening tests and detection kits, such as nucleic acid arrays, can be developed 
based on either nucleic acids or polypeptides provided by the present invention. A nucleic 
acid probe to a virus specific nucleotide sequence, or an antibody to a virus specific protein. 
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is introduced into contact with a sample, such as a sample of shrimp cells, whereby the 
presence of the virus is detected using an assay system. 

Antiviral agents, either nucleic acid or protein-based, directly interfere with viral 
function or preferably, interfere with viral replication to stop or prevent spread of the virus in 
a population, such as in a population of shrimp. Knowledge of the nucleic acid and protein 
sequences of the virus allows antiviral agents to be designed to attack a number of viral 
targets necessary for viral replication or function, such as viral encoded enzymes or structural 
proteins. Attachment of the virus to the host cell is the first step in viral replication and is 
mediated by the interaction of a viral attachment protein and a host cell surface receptor. This 
interaction can be blocked by neutralizing antibodies, which bind to and coat the virion, or 
receptor antagonists which are peptide or carbohydrate analogues of the viral attachment 
protein and competitively block the interaction of the virus with the cell. Agents can be 
designed that bind to the viral attachment protein and prevent penetration of the virus into the 
cytoplasm or nucleus of the host cell and/or uncoating of the virus. These agents thereby 
prevent the virus from delivering its genome into the host cell. Viral mRNA expression and 
utilization can be targeted with anti-viral agents. Antisense oligonucleotides can be designed 
to bind to newly transcribed viral RN A and thereby prevent the viral RNA from being 
processed to mRNA in the nucleus, delivered to the cytoplasm, and bound to the ribosome. 
Many antiviral drugs are nucleoside analogues, which inhibit viral polymerases. Viral 
polymerases are often less specific for substrate than are host polymerases, therefore the viral 
polymerase will often bind a nucleotide analogue with a modified base and/or sugar several 
hundredfold better than the host enzyme. Antiviral drugs can therefore be preferentially 
incorporated into the viral genome. DNA viruses, such as WSBV, are particularly susceptible 
to these types of drugs due to the extent and rapid rate of nucleotide incorporation during 
viral replication. Inhibition of posttranslational modification of viral proteins, such as 
phosphorylation, may also inhibit viral replication. 



SUMMARY OF THE INVENTION 

The present invention is based on the sequencing and assembly of the WSBV 
genome. The present invention provides the primary nucleotide sequence of the WSBV 
genome (SEQ ID NO: 1) and predicted transcript sequences (SEQ ID NOS: 2, 4, 6... 280, 
282, 284, 286-293: See the Sequence Listing and the Figure Sheets for both the genomic and 
transcript sequences) and polymorphic sites on these transcripts summarized in Table I 
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hereinafter, and protein encoded sequence produced from each of the genes found in the 
WSBV genome. This information is provided in the form of sequences and annotation 
information and can be used to generate computer based discovery systems, nucleic acid 
detection reagents and kits such as nucleic acid arrays, protein based detection kits, and 
antiviral targets. 

The present invention provides these nucleotide sequences of the WSBV genome, and 
representative fragments thereof, in a form that can be used, analyzed, and commercialized. 
For example, the present invention provides the nucleic acid sequences as contiguous strings 
of primary sequences in a form readable by computers, such as recorded on computer 
readable media, e.g., magnetic storage media, such as floppy discs, hard disc storage medium, 
and magnetic tape; optical storage media such as CD-ROM; electrical storage media such as 
RAM and ROM; and hybrids of these categories such as magnetic/optical storage media. 
The present invention specifically provides a Sequence Listing in computer readable form 
stored on such media. Such compositions are useful in the discovery of drug and antiviral 
targets. 

The present invention further provides systems, particularly computer-based systems 
that contain the primary sequence information of the present invention stored in data storage 
means. Such systems are designed to identify commercially important fragments of the 
WSBV genome. 

Another embodiment of the present invention is directed to isolated fragments, and 
collections of fragments, of the WSBV genome. The fragments of the WSBV genome 
include, but are not limited to, fragments that encode peptides, hereinafter open reading 
frames (ORFs) and fragments that modulate the expression of an operably linked ORF, 
hereinafter expression modulating fragments (EMFs). The ORFs are provided in the 
Sequence Listing and in Figure 3. 

The present invention further includes kits, such as nucleic acid arrays, detection 
reagents and microfluidic devices, that comprise one or more fragments of the WSBV 
genome of the present invention, particularly ORFs. The kits, such as arrays, can be used to 
track the expression of many genes, even all genes, or rationally selected subsets thereof, 
contained in the WSBV genome. 

The identification of the entire coding set of sequences from the genome of WSBV 
will be of great value to all laboratories working with this organism and for a variety of 
commercial and ecological purposes. Many fragments of the WSBV genome will be 
immediately identified by similarity searches against protein and nucleic acid databases and 
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by identifying structural motifs present in protein domains and will be of immediate value to 
WSBV researchers and for commercial value for controlling WSBV infection in shrimp 
populations. A specific example concerns viral envelope proteins, many of which interact 
with host cells. Proteins of this family can readily be configured into screens and assays for 
detecting chemical modulators of the protein activity. The biological significance of this and 
other families of proteins for controlling viral replication is well known. Many of the known 
antiviral agents modulate the activity of these types of proteins. The WSBV genome will 
allow one to identify all potential antiviral targets. 

The present invention is further directed to isolated WSBV proteins encoded by the 
ORFs of the present invention, A variety of methodologies known in the art can be utilized to 
obtain any one of the proteins of the present invention. The amino acid sequence can be 
synthesized using commercially available peptide synthesizers. In an alternative method, the 
viral protein can be purified from cells infected with the virus. 

The invention further provides antibodies that selectively bind one of the WSBV 
proteins encoded by the present invention. Antibodies have use in viral detection and control 
and can be generated using the protein encoding sequences provided by the present invention. 
Such antibodies include both monoclonal and polyclonal antibodies, and fragments thereof. 
The invention further provides hybridomas capable of producing the above-described 
monoclonal antibodies. 

The present invention provides methods of identifying WSBV in a test sample, such 
as a sample of shrimp. Such methods comprise incubating cells extracted from the test 
sample with one or more of the antibodies or probes based on the nucleic acid sequences 
provided by the present invention under conditions that allow a skilled artisan to determine if 
the test sample contains the ORF or product produced therefrom. 

Using the isolated proteins of the present invention, the present invention further 
provides methods of obtaining and identifying agents capable of binding to a protein encoded 
by one of the ORFs of the present invention. Specifically, such agents include antibodies, 
peptides, carbohydrates, pharmaceutical agents and the like. Such methods comprise the steps 
of contacting an agent with an isolated protein encoded by one of the ORFs of the present 
invention and determining whether the agent binds to said protein. 

DESCRIPTION OF THE FIGURE SHEETS 

Figure 1 provides a block diagram of a computer system 102 that can be used to 
implement the computer-based systems of the present invention. 
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Figure 2 (Sheets 1-40) provides the primary genomic sequence of WSBV. 
Figure 3 (Sheets 1-160) provides: 

1 ) the predicted transcript sequence of the WSBV gene and starting ATG 
site (SEQ ID NOS: 2, 4, 6, 8 . . . 280, 282, 284, 286-293); 

2) the predicted protein sequence of the WSBV gene (SEQ ID NOS: 3, 5, 7, 
9.., 28 1,283, 285); 

3) results of a BLAST query run using default parameters that shoves 
proteins producing significant alignments with the predicted WSBV 
protein sequence of the present invention. 

4) comments 

5) TaqMan primer/probe sets. Oligonucleotide sequences useful as primers 
and/or probes for amplifying and/or screening for the WSBV genes provided 
by the present invention. 

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 

General Description 

The present invention is based on the sequencing and assembly of the WSBV 
genome. In this process, the primary nucleotide sequence of 5795 nucleic acid fragments was 
determined. These fragments were assembled into a single contiguous sequence of 305, 1 07 
bp. After assembly, the sequences were analyzed with various computer packages and 
compared with all external data sources. The result of this analysis was the identification of 
150 predicted genes/transcripts contained in the WSBV genome. The present invention 
provides the genomic nucleic acid sequence of WSBV (SEQ ID NO: 1), see Figure 2, Sheets 
1-40, as well as the predicted gene structure of all 150 identified genes (SEQ ID NOS: 2, 4, 
6. ..280, 282, 284, 286-293 ) and polymorphic sites on these transcripts summarized in Table 
1, and predicted amino acid sequences of all of the encoded proteins (SEQ ID NOS: 3, 5, 
7... 281, 283, 285), see Figure 3, sheets 1-160. 
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Table 1. - Summary of SBV Polymorphic Sites 
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The nucleotide sequences of the present invention, or representative fragments 
thereof, are provided in a form that can be readily used, analyzed, and interpreted by a skilled 
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artisan, in one embodiment, the sequences are provided as contiguous strings of primary 
sequence information corresponding to the nucleotide sequences provided in the figures. 

As used herein, a "representative fragment of the nucleotide sequence provided 
herein" refers to any portion of these sequences that are not presently represented within a 
publicly available database. Preferred representative fragments of the present invention are 
WSBV open reading frames and expression modulating fragments (ORPs and EMFs 
respectively, see figure 3 and below). 

The nucleotide sequence information provided herein was obtained by sequencing the 
WSBV genome using a shotgun sequencing method known in the art. WSBV genomic DNA 
was initially obtained for sequencing by extraction and purification of viral DNA from 
infected shrimp tissues using the method of Yang et al. (J. Virological Methods^ 67:1-4 
(1997)), which is hereby incorporated by reference. The nucleotide sequences provided 
herein are highly accurate, although not necessarily a 100% perfect, representation of the 
nucleotide sequence of the WSBV genome. 

Using the information provided herein together with routine cloning and sequencing 
methods, one of ordinary skill in the art is able to identify, clone and sequence all 
"representative fragments" of interest including open reading frames (ORFs) encoding a large 
variety of WSBV proteins. In very rare instances, this may reveal a nucleotide sequence 
error present in the nucleotide sequence disclosed herein. Thus, once the present invention is 
made available (i.e., the information in the Sequence Listing and figures in a useable form), 
resolving a rare sequencing error would be well within the skill of the art. Nucleotide 
sequence editing software is publicly available. 

Even if all of the very rare sequencing errors in the sequences herein disclosed were 
corrected, the resulting nucleotide sequence would still be at least 90% identical, and more 
likely 99% identical, and most likely 99.99% identical to the nucleotide sequence provided 
herein. 

Thus, the present invention further provides nucleotide sequences that are at least 
90% identical, or greater, to the nucleotide sequences of the present invention in a form 
which can be readily used, analyzed and interpreted by the skilled artisan. Methods for 
determining whether a nucleotide sequence is at least 90% identical to the nucleotide 
sequence of the present invention are routine and readily available to the skilled artisan. For 
example, the well known BLAST algorithm can be used to generate the percent identity of 
nucleotide sequences. 
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The present invention further provides a prediction of all of the genes within the 
WSBV genome. This information is provided in Figure 3. The information in the figures can 
be used to generate WSBV detection kits, antiviral agents, expression arrays, microfluidic 
devices, individual gene fragments, proteins, antibodies, promoters, protein and nucleotide 
based viral screens and the like, and to identify commercially important genes and gene 
products. 

Specific Embodiments 
Computer Related Embodiments 

The nucleotide sequences provided in the present invention, a representative fragment 
thereof, or nucleotide sequences at least 90% identical to these sequences, may be "provided" 
in a variety of mediums to facilitate use thereof. As used herein, "provided** refers to a 
manufacture, other than an isolated nucleic acid molecule, that contains a nucleotide 
sequence of the present invention, i.e., the nucleotide sequences provided in the present 
invention, a representative fragment thereof, or nucleotide sequences at least 90% identical to 
these sequences. Such a manufacture provides the WSBV genome or a subset thereof (e.g., a 
WSBV open reading firame (ORF)) in a form that allows a skilled artisan to examine the 
manufacture using means not directly applicable to examining the WSBV genome or a subset 
thereof as it exists in nature or in purified form. 

In one application of this embodiment, a nucleotide sequence of the present invention 
can be recorded on computer readable media. As used herein, "computer readable media" 
refers to any medium that can be read and accessed directly by a computer. Such media 
include, but are not limited to: magnetic storage media, such as floppy discs, hard disc 
storage medium, and magnetic tape; optical storage media such as CD-ROM; electrical 
storage media such as RAM and ROM; and hybrids of these categories such as 
magnetic/optical storage media. A skilled artisan can readily appreciate how any of the 
presently known computer readable mediums can be used to create a manufacture comprising 
computer readable medium having recorded thereon a nucleotide sequence of the present 
invention. One such medium is provided with the present application, namely, the present 
application contains computer readable medium (CD-R) that has the sequence contigs 
provided/recorded thereon in ASCII text format in a Sequence Listing. 

As used herein, "recorded" refers to a process for storing information on computer 
readable medium, A skilled artisan can readily adopt any of the presently known methods for 
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recording information on computer readable medium to generate manufactures comprising 
the nucleotide sequence information of the present invention, 

A variety of data storage structures are available to a skilled artisan for creating a 
computer readable medium having recorded thereon a nucleotide or amino acid sequence of 

the present invention. The choice of the data storage structure will generally be based on the 

O 

means chosen to access the stored information. In addition, a variety of data processor 
programs and formats can be used to store the nucleotide sequence information of the present 
invention on computer readable medium. The sequence information can be represented in a 
word processing text file, formatted in commercially-available software such as WordPerfect 
and Microsoft Word, or represented in the form of an ASCII file, stored in a database 
application, such as OB2, Sybase, Oracle, or the like. A skilled artisan can readily adapt any 
number of data processor structuring formats (e.g. text file or database) in order to obtain 
computer readable medium having recorded thereon the nucleotide sequence information of 
the present invention. 

By providing the nucleotide sequences of the present invention, a representative 
fragment thereof, or nucleotide sequences at least 90% identical to these sequences, in 
computer readable form, a skilled artisan can routinely access the sequence information for a 
variety of purposes. Computer software is publicly available which allows a skilled artisan to 
access sequence information provided in a computer readable medium. Software which 
implements the BLAST (Altschul et at, J. Mol BioL 21 5:403-410 (1990)) and BLAZE 
(Brutlag et at, Comp. Chem. 17:203-207 (1993)) search algorithms on a Sybase system an 
be used to identify open reading frames (ORFs) within the WSBV genome that contain 
homology to ORFs or proteins from other organisms. Such ORFs are protein-encoding 
fragments within the WSBV genome and are useful in producing commercially important 
proteins such as proteins used as drug or antiviral targets. 

The present invention further provides systems, particularly computer-based systems, 
which contain the sequence information described herein. Such systems are designed to 
identify commercially important fragments of the WSBV genome. 

As used herein, "a computer-based system" refers to the hardware means, software 
means, and data storage means used to analyze the nucleotide sequence information of the 
present invention. The minimum hardware means of the computer-based systems of the 
present invention comprises a central processing unit (CPU), input means, output means, and 
data storage means. A skilled artisan can readily appreciate that any one of the currently 
available computer-based system are suitable for use in the present invention. Such system 
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can be changed into a system of the present invention by utilizing the sequence information 
provided on the CD-R, or a subset thereof without any experimentation. 

As stated above, the computer-based systems of the present invention comprise a data 
storage means having stored therein a nucleotide sequence of the present invention and the 
necessary hardware means and software means for supporting and implementing a search 
means. As used herein, "data storage means" refers to memory which can store nucleotide 
sequence information of the present invention, or a memory access means which can access 
manufactures having recorded thereon the nucleotide sequence information of the present 
invention. 

As used herein, "search means" refers to one or more programs that are implemented 
on the computer-based system to compare a target sequence or target structural motif with the 
sequence information stored within the data storage means. Search means are used to 
identify fragments or regions of the WSBV genome which match a particular target sequence 
or target motif. A variety of known algorithms are disclosed publicly and a variety of 
commercially available software for conducting search means are available and can be used 
in the computer-based systems of the present invention. Examples of such software include^ 
but is not limited to, MacPattem (EMBL), BLASTN and BLASTX (MCBIA). A skilled 
artisan can readily recognize that any one of the available algorithms or implementing 
software packages for conducting homology searches can be adapted for use in the present 
computer-based systems. 

As used herein, a "target sequence" can be any DNA or amino acid sequence of six or 
more nucleotides or two or more amino acids. A skilled artisan can readily recognize that the 
longer a target sequence is, the less likely a target sequence will be present as a random 
occurrence in the database. The most preferred sequence length of a target sequence is from 
about 10 to 100 amino acids or from about 30 to 300 nucleotide residues. However, it is well 
recognized that searches for commercially important fragments of the WSBV genome, such 
as sequence fragments involved in gene expression and protein processing, may be of shorter 
length. 

As used herein, "a target structural motif," or "target motif," refers to any rationally 
selected sequence or combination of sequences in which the sequence(s) is chosen based on a 
three-dimensional configuration which is formed upon the folding of the target motif. There 
are a variety of target motifs known in the art. Protein target motifs include, but are not 
limited to, enzymatic active sites and signal sequences. Nucleic acid target motifs include. 
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but are not limited to, promoter sequences, hairpin structures and inducible expression 
elements (protein binding sequences). 

A variety of structural formats for the input and output means can be used to input and 
output the information in the computer-based systems of the present invention. A preferred 
format for an output means ranks fragments of the WSBV genome possessing varying 
degrees of homology to the target sequence or target motif Such presentation provides a 
skilled artisan with a ranking of sequences which contain various amounts of the target 
sequence or target motif and identifies the degree of homology contained in the identified 
fragment. 

A variety of comparing means can be used to compare a target sequence or target 
motif with the data storage means to identify sequence fragments of the WSBV genome. 
Software which implements the BLAST and BLAZE algorithms (Altschul et al., J Mol. Biol. 
215:403-410 (1990)) can be used to identify open reading frames within the WSBV genome. 
A skilled artisan can readily recognize that any one of the publicly available homology search 
programs can be used as the search means for the computer-based systems of the present 
invention. 

One application of this embodiment is provided in Figure 1. Figure 1 provides a 
block diagram of a computer system 1 02 that can be used to implement the present invention. 
The computer system 102 includes a processor 106 connected to a bus 104. Also connected 
to the bus 104 are a main memory 108 (preferably implemented as random access memory, 
RAM) and a variety of secondary storage devices 1 10, such as a hard drive 112 and a 
removable medium storage device 1 14. The removable medium storage device 1 14 may 
represent, for example, a floppy disk drive, a CD- ROM drive, a magnetic tape drive, etc. A 
removable storage medium 1 16 (such as a floppy disk, a compact disk, a magnetic tape, etc.) 
containing control logic and/or data recorded therein may be inserted into the removable 
medium storage device 114. The computer system 102 includes appropriate software for 
reading the control logic and/or the data from the removable storage medium 1 16 once 
inserted in the removable medium storage device 1 1 4. 

The nucleotide sequences of the present invention may be stored in a well known 
manner in the main memory 1 08, any of the secondary storage devices 1 1 0, and/or a 
removable storage medium 1 16. Software for accessing and processing the genomic 
sequence (such as search tools, comparing tools, etc.) reside in main memory 108 during 
execution. 
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Biochemical Embodiments 
Nucleic Acid Fragments 

Another embodiment of the present invention is directed to isolated fragments of the 
WSB V genome. The fragments of the WSBV genome of the present invention include, but 
are not limited to, fragments that encode peptides, hereinafter open reading frames (ORFs) 
and fragments v/hich modulate the expression of an operably linked ORF. Some of these 
fragments are identified and described in Figure 3. The isolated nucleic acid molecules of the 
present invention include, but are not limited to, single stranded and double stranded DNA, 
and single stranded RNA. 

"Nucleotide sequence" refers to a heteropolymer of deoxyribonucleotides. Generally, 
DNA segments encoding the polypeptides and proteins provided by this invention are 
assembled from fragments of the WSBV genome or single nucleotides, short oligonucleotide 
linkers, or from a series of oligonucleotides, to provide a synthetic nucleic acid molecule. 

As used herein, an "isolated nucleic acid molecule" or an "isolated fragment of the 
WSBV genome" refers to a nucleic acid molecule possessing a specific nucleotide sequence 
which has been subjected to purification means to reduce, from the composition, the number 
of compounds which are normally associated with the composition. A variety of purification 
means can be used to generate the isolated fragments of the present invention. These include, 
but are not limited to, methods that separate constituents of a solution based on charge, 
solubility, or size. 

In one embodiment, WSBV DNA can be mechanically sheared to produce fragments 
of about 2kb, lOkb, or 15-20 kb in length. These fragments can then be used to generate a 
WSBV library by inserting them into vectors, such as plasmid or lambda vectors, using 
methods well known in the art. Primers flanking each fragment, for example an ORF, can 
then be generated using nucleotide sequence information provided in the present invention. 
PGR cloning can then be used to isolate the ORF from the WSBV DNA library. PGR cloning 
is well known in the art. Thus, given the availability of the present identified gene coding 
sequences of the WSBV genome, it is routine experimentation to isolate any ORF, or other 
fragment of the assembly of the present invention, particularly using the information 
provided in Figure 3. Such fragments can be applied to an array, microfluidic device, or other 
detection kit format and used to detect expression of a viral gene (see below). 

As used herein, an "open reading frame" (ORF) means a series of triplets coding for 
amino acids without any termination codons and is a sequence translatable into protein. A 
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skilled artisan can readily identify ORFs in the WSBV genome using the gene coding 
sequences provided herein and/or the computer-based systems of the present invention. 

As used herein, an "expression modulating fragment" (EMF) means a series of 
nucleotide molecules which modulates the expression of an operably linked ORF or another 
EMF. 

As used herein, a viral sequence is said to "modulate the expression of an operably 
linked sequence" when the expression of the sequence is altered by the presence of the EMF. 
EMFs include, but are not limited to, promoters, and promoter modulating sequences 
(inducible elements). One class of viral EMFs are fragments which induce the expression of 
an operably linked viral ORF in response to a specific host regulatory factor or physiological 
event, such as a host anti-viral response. 

EMF sequences can be identified within the WSBV genome by their proximity to the 
ORFs identified using the computer-based systems of the present invention. EMFs may be 
found immediately 5' to the ORF. Alternatively, EMFs can be identified using known EMFs 
as a target sequence or target motif in the computer-based systems of the present invention. 

The presence and activity of an EMF can be confirmed using an EMF trap vector. An 
EMF trap vector contains a cloning site 5* to a marker sequence. A marker sequence encodes 
an identifiable phenotyp)e, such as antibiotic resistance or a complementing nutrition 
auxotrophic factor, which can be identified or assayed when the EMF trap vector is placed 
within an appropriate host under appropriate conditions. An EMF will modulate the 
expression of an operably linked marker sequence. A sequence that is suspected of being an 
EMF is cloned in all three reading frames in one or more restriction sites upstream from the 
marker sequence in the EMF trap vector. The vector is then transformed into an appropriate 
host using known procedures and the phenotype of the transformed host is examined under 
appropriate conditions. 

The sequences falling within the scope of the present invention are not limited to the 
specific sequences herein described, but also include variations thereof. Variations can be 
routinely determined by comparing the sequence provided in the present invention, or a 
representative fragment thereof, with a sequence from another WSBV isolate. Furthermore, 
to accommodate the degeneracy of the genetic code, the invention includes nucleic acid 
molecules coding for the same amino acid sequences as do the specific ORFs disclosed 
herein. In other words, in the coding region of an ORF, substitution of one codon for another 
that encodes the same amino acid is expressly contemplated. 



wo 01/38351 



PCT/USOO/28888 



Any specific sequence disclosed herein can be readily screened for errors by 
resequencing a particular fragment, such as an ORF, in both directions (i.e., sequence both 
strands). Ahernatively, error screening, or variant detection, can be performed by sequencing 
corresponding polynucleotides of WSBV origin isolated by using part or all of the fragments 
in question as a probe or primer. 

Nucleic Acid Fragment Uses 

The nucleic acid molecules of the present invention are useful for probes, primers, 
chemical intermediates, and in biological assays. The nucleic acid molecules are useful as 
hybridization probes for viral messenger RNA, viral transcript/cDNA, and viral genomic DNA 
to isolate full-length viral cDNA and viral genomic clones encoding the peptides described in 
Figure 3, and for use in viral screens and antiviral agents. 

Oligonucleotide probes have long been used to detect complementary nucleic acid 
sequences in a nucleic acid of interest (the "target" nucleic acid) in the form of detection 
kits/reagents. In some assay formats, the oligonucleotide probe is tethered, i.e., by covalent 
attachment, to a solid support, and arrays of oligonucleotide probes immobilized on solid 
supports have been used to detect specific nucleic acid sequences in a target nucleic acid- 
See, e.g., PCT patent publication Nos. WO 89/10977 and 89/1 1548. In other formats, the 
detection reagents are supplied in solution. 

The probe can correspond to any sequence along the entire length of the nucleic acid 
molecules provided in the figures. However, as discussed, fragments are not to be construed as 
encompassing fragments disclosed prior to the present invention. 

Each of the ORFs of the WSBV genome that can be routinely identified using the 
computer system of the present invention can be used in numerous ways as polynucleotide 
reagents. The sequences can be used as diagnostic probes or diagnostic amplification primers 
to detect the expression of a particular gene or groups of genes. This is particularly useful in 
the form of nucleic acid arrays employing 1 or more, 10 or more, 100 or more, or most to all 
of the WSBV ORFs in a single array. 

The nucleic acid molecules are also useful as primers for PGR to amplify any given 
region of a nucleic acid molecule and are useful for synthesizing antisense molecules of desired 
length and sequence. 

The nucleic acid molecules are useful as primers for the 5' nuclease PGR assay 
(hereafter referred to as the TaqMan assay). The TaqMan assay provides a sensitive and rapid 
means of detecting viral nucleic acid and therefore is well suited for use in viral screening 



wo 01/38351 



PCT/USOO/28888 



applications such as detection kits. The TaqMan assay detects the accumulation of a specific 
amplified product during PGR. The TaqMan assay utilizes an oligonucleotide probe labeled 
with a fluorescent reporter dye at the 5' end of the probe and a quencher dye at the 3' end of 
the probe. During the PCR reaction, the 5' nuclease activity of DNA polymerase cleaves the 
probe, thereby separating the reporter dye and the quencher dye and resulting in increased 
fluorescence of the reporter. Accumulation of PCR product is detected directly by monitoring 
the increase in fluorescence of the reporter dye. The 5' nuclease activity of DNA polymerase 
cleaves the probe between the reporter and the quencher only if the probe hybridizes to the 
target and is amplified during PCR. Therefore, only the target sequence of interest is 
detected. 

Preferred TaqMan primer and probe sequences are disclosed in Figure 3. It will be 
apparent to one of skill in the art that the disclosed primers and probes of the present 
invention are useful as diagnostic probes or amplification primers for screening for the 
presence of WSBV in a biological sample or for isolating or screening particular WSBV 
genes. 

The nucleic acid molecules are also useful for expressing antigenic portions of the 
WSBV proteins that can then be used, for example, to develop antibodies to the viral antigens. 

The nucleic acid molecules are also useful as hybridization probes for determining the 
presence, level, form and distribution of WSBV nucleic acid expression/ Accordingly, the 
probes can be used to detect the presence of, or to determine levels of, a specific viral nucleic 
acid molecule, either DNA or RNA, in cells and tissues of shrimp or other organisms under 
moderate or stringent conditions. One example of stringent hybridization conditions are 
hybridization in 6X sodium chloride/sodium citrate (SSC) at about 45 °C, followed by one or 
more washes in 0.2 X SSC, 0.!% SDS at 50-65 **C. Examples of moderate to low stringency 
hybridization conditions are well known in the art. Furthermore, probes corresponding to the 
viral peptides described herein can be used to assess expression and/or gene copy number in a 
given infected cell, tissue, or organism. For example. Northern blots can be used for RNA 
detection. Southern blots can be used for DNA detection, and Western blots can be used for 
peptide/protein detection. These uses are relevant for detecting the presence of virus in shrimp 
as well as for monitoring the distribution of virus throughout various cells and tissues of shrimp 
during the course of viral infection. 

In addition, each of the expression modulating fragments (EMFs) can be used in DNA- 
protein binding assays to screen for modulating peptides which may be present in cells and 
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tissues of shrimp or other organisms. This use is relevant for obtaining the specific host 
regulatory factors that interact with promoters in the WSBV genome. 

Nucleic acid expression assays are also useful for drug screening to identify compounds 
that modulate viral nucleic acid expression. TTie invention thus provides a method for 
identifying a compound that can be used to treat viral infection. The method typically includes 
assaying the ability of the compound to modulate the expression of viral nucleic acid and thus 
identifying a compound that can be used to treat a viral infection. The assays can be performed 
in cell-based and cell-free systems. Cell-based assays include cells infected with virus particles 
or recombinant cells genetically engineered to express specific viral nucleic acid sequences. 
Ce!l-fi*ee assays can be used to detect the ability of a compound to directly bind to a nucleic acid 
fragment or protein. 

The assay for viral nucleic acid expression can involve direct assay of nucleic acid 
levels, such as mJlNA levels, or on collateral compounds involved in the signal pathway. 
Furthermore, the expression of host cell genes that are up- or dovm-regulated in response to the 
viral protein can also be assayed. In this embodiment the regulatory regions of these genes can 
be operably linked to a reporter gene such as luciferase. 

Thus, modulators of viral gene expression can be identified by a method wherein a cell 
infected with virus is contacted with a candidate compound and the expression of viral mRNA 
detemiined. The level of expression of viral mRNA in the presence of the candidate compound 
is compared to the level of expression of viral mRNA in the absence of the candidate compound. 
The candidate compound can then be identified as a modulator of viral nucleic acid expression 
based on this comparison and be used, for example, to disrupt viral replication. When 
expression of viral mRNA is statistically significantly greater in the presence of the candidate 
compound than in its absence, the candidate compound is identified as a stimulator of viral 
nucleic acid expression. When viral nucleic acid expression is statistically significantly less in 
the presence of the candidate compound than in its absence, the candidate compound is 
identified as an inhibitor of viral nucleic acid expression. 

The invention further provides methods of treating viral infection, with the nucleic acid 
as a target, using a compound identified through drug screening as a gene modulator to modulate 
viral nucleic acid expression in cells and tissues infected with the virus. Modulation includes 
both up-regulation (i.e. activation or agonization) or down-regulation (suppression or 
antagonization) of nucleic acid expression. Generally, viral nucleic acid expression is down- 
regulated to prevent viral replication and treat viral infection. 
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Alternatively, a modulator for viral nucleic acid expression can be a small molecule or 
drug identified using the screening assays described herein as long as the drug or small molecule 
modulates viral nucleic acid expression in the cells and tissues infected with the virus. 

The nucleic acid molecules are also useful for monitoring the effectiveness of 
modulating compounds on the expression or activity of the viral gene in a treatment regimen. 
Thus, the gene expression pattern can serve as a barometer for the continuing effectiveness of 
treatment with the compound, particularly with compounds to which a virus can develop 
resistance. The gene expression pattern can also serve as a marker indicative of a 
physiological response of the virus to the compound. Accordingly, such monitoring would 
allow either increased administration of the compound or the administration of alternative 
compounds to which the virus has not become resistant. 

The nucleic acid molecules are also useful as antisense constructs to control viral gene 
expression in infected cells and tissues. A DNA antisense nucleic acid molecule is designed to 
be complementary to, and therefore bind to, a region of the viral gene necessary for 
transcription, thereby preventing transcription and hence production of viral protein. An 
antisense RlsIA or DNA nucleic acid molecule would hybridize to the viral mRNA and thus 
block translation of viral mRNA into protein by the host celTs translational machinery. 
Alternatively, a class of antisense molecules can be used to inactivate viral mRNA in order to 
decrease expression of viral nucleic acid and inhibit viral replication or function. These 
molecules can therefore be used to treat viral infection. This technique involves cleavage of 
viral mRNA by ribozymes that recognize one or more regions of viral mRNA that attenuate the 
ability of the mRNA to be translated by host cell translational machinery. Possible regions 
include coding or control regions; particularly coding or control regions encoding or regulating 
proteins that play critical roles in viral function or replication, such as entry into the nucleus of 
the host cell or virion assembly. 

The nucleic acid molecules of the present invention can be employed to create transgenic 
viral resistant shrimp. Several possible mechanisms could be employed to impart WSBV 
resistance to shrimp using the nucleic acid and protein coding sequences provided by the present 
invention. One possible mechanism of imparting WSBV resistance to shrimp involves 
transforming shrimp cells with viral nucleic acids that express an attenuated virion coat protein 
such that when the transgenic shrimp is infected with WSBV, the expressed coat protein 
envelopes the virus and thereby prevents translation of the viral DNA. In this example, the 
virion coat protein can either be const itutively expressed or regulated by a promoter that is 
activated upon WSBV infection. Shrimp cells can be transformed with viral DNA under 



wo 01/38351 



PCT/USOO/28888 



suitable conditions known in the art. The WSBV construct in a vector can be microinjected 
directly into host cells using micropipettes, [Crossway, Mol Gen. Genetics^ 202: 1 79-85 (1985)], 
or using polyethylene glycol [Krens et ai. Nature^ 296:72-74 (1982)]. Alternatively, shrimp 
cells may be transformed by incubating the shrimp cells or tissue v^ith an inoculum of bacteria 
that have been transformed with a vector comprising a gene that imparts WSBV resistance. The 
transformed shrimp cells are then grown and regenerated into shrimp such that the proteins 
expressed by the transformed cells impart WSBV resistance to the shrimp. 

Nucleic Acid Arrays and Detection Reagents 

The present invention further provides detection reagents and kits, such as arrays or 
microarrays, of nucleic acid molecules that are based on the novel WSBV sequence 
information provided in the present invention and particularly the transcript information 
(SEQ ID NOS: 2, 4, 6. . . 280, 282, 284. 286-293) provided in Figure 3 and polymorphic sites 
on these transcripts summarized in Table 1. 

As used herein "Arrays" or "Microarrays" refers to an array of distinct 
polynucleotides or oligonucleotides synthesized on a substrate, such as paper, nylon or other 
type of membrane, filter, chip, glass slide, or any other suitable solid, or semi-solid support. 
The development of arraying technologies such as photolithographic synthesis of a nucleic 
acid array and high density spotting of cDNA products has provided methods for making 
very large arrays of oligonucleotide probes in very small areas. See U.S. Pat. No. 5,143,854 
and PCT patent publication Nos. WO 90/15070 and 92/10092. Microfabricated arrays of 
large numbers of oligonucleotide probes, called "DNA chips", offer great promise for a wide 
variety of applications. In one embodiment, the microarray is prepared and used according to 
the methods described in US Patent 5,837,832, Chee et al., PCT application W095/1 1995 
(Chee et al.), Lockhart, D. J. et al. (1996; NaL Biotech 14: 1675-1680) and Schena, M. et aL 
(1996; Proc. Natl. Acad. Sci. 93: 10614-10619), all of which are incorporated herein in their 
entirety by reference. In other embodiments, such arrays are produced by the methods 
described by Brown et. aL, US Patent No. 5,807,522. 

The microarray or detection kit is preferably composed of a large number of unique, 
single-stranded nucleic acid sequences, usually either synthetic antisense oligonucleotides or 
fragments of cDNAs, fixed to a solid support. The oligonucleotides are preferably about 6-60 
nucleotides in length, more preferably 15-30 nucleotides in length, and most preferably about 
20-25 nucleotides in length. For a certain type of microarray or detection kit, it may be 
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preferable to use oligonucleotides that are only 7-20 nucleotides in length. For others, such as 
cDNA, longer lengths are possible and preferable. These can be of the order of Ikb or more. 

The microarray or detection kit may contain oligonucleotides that cover the known 5' 
or 3' sequence, sequential oligonucleotides that cover the full-length sequence; or unique 
oligonucleotides selected from particular areas along the length of the sequence. 
Polynucleotides used in the microarray or detection kit may be oligonucleotides that are 
specific to a viral gene or genes of interest. 

In order to produce oligonucleotides to a knovs^n sequence for a microarray or 
detection kit, the viral gene(s) of interest (or an ORF identified from the contigs of the 
present invention) is typically examined using a computer algorithm which starts at the 5' or 
at the 3* end of the nucleotide sequence. Typical algorithms will then identify oligomers of 
defined length that are unique to the gene, have a GC content within a range suitable for 
hybridization, and lack predicted secondary structure that may interfere with hybridization. 
In certain situations it may be appropriate to use pairs of oligonucleotides on a microarray or 
detection kit. The "pairs" will be identical, except for one nucleotide that preferably is 
located in the center of the sequence. The second oligonucleotide in the pair (mismatched by 
one) serves as a control. The number of oligonucleotide pairs may range from one to two 
millions. The oligomers are synthesized at designated areas on a substrate using a light- 
directed chemical process. The substrate may be paper, nylon or other type of membrane, 
filter, chip, glass slide or any other suitable solid support. 

In another aspect, an oligonucleotide may be synthesized on the surface of the 
substrate by using a chemical coupling procedure anid an ink jet application apparatus, as 
described in PCT application W095/251 1 16 (Baldeschweiler et al.) which is incorporated 
herein in its entirety by reference. In another aspect, a "gridded" array analogous to a dot (or 
slot) blot may be used to arrange and link cDNA fi-agments or oligonucleotides to the surface 
of a substrate using a vacuum system, thermal, UV, mechanical or chemical bonding 
procedure. An array, such as those described above, may be produced by hand or by using 
available devices (slot blot or dot blot apparatus), materials (any suitable solid support), and 
machines (including robotic instruments), and may contain 8, 24, 96, 384, 1536, 6144 or 
more oligonucleotides, or any other number which lends itself to the efficient use of 
commercially available instrumentation. 

In other embodiments, the array or detection reagent/kit can be produced by spotting a 
nucleic acid molecule onto the surface of a substrate (See Brown et. al., US Patent No. 



wo 01/38351 



PCT/USOO/28888 



5,807,522). In such embodiments, PGR primers to one or more nucleic acid fragments are 
used to generate nucleic acid molecules suitable for deposition onto a substrate. 

In order to conduct sample analysis using a microarray or detection kit, viral nucleic 
acid is isolated from a biological sample infected with WSBV and the viral nucleic acid is 
made into hybridization probes. Viral nucleic acid may be isolated from biological samples 
obtained from fluids, cultured cells, biopsies, or other tissue preparations from a shrimp or 
other organism of interest that is infected with WSBV. Viral mRNA is isolated, and cDNA is 
produced and used as a template to make antisense RNA (aRNA). The aRNA is amplified in 
the presence of fluorescent nucleotides, and labeled probes are incubated with the microarray 
or detection kit so that the probe sequences hybridize to complementary oligonucleotides of 
the microarray or detection kit. Incubation conditions are adjusted so that hybridization 
occurs with precise complementary matches or with various degrees of less complementarity. 
After removal of nonhybridized probes, a scanner is used to determine the levels and patterns 
of fluorescence. The scanned images are examined to determine degree of complementarity 
and the relative abundance of each oligonucleotide sequence on the microarray or detection 
kit. A detection system may be used to measure the absence, presence, and amount of 
hybridization for all of the distinct viral sequences simultaneously. This data may be used for 
large scale correlation studies on the sequences, expression patterns, mutations, variants, or 
polymorphisms among viral isolates. 

Using such arrays, the present invention provides methods to identify the expression 
of one or more of the ORFs of the present invention. In detail, such methods comprise 
incubating a test sample with one or more nucleic acid molecules and assaying for binding of 
the nucleic acid molecule with components within the test sample. Such assays will typically 
involve arrays comprising most, if not all of the genes in the WSBV genome, or rationally 
selected subsets thereof. The genomic sequence (SEQ ID NO: 1) and transcript sequences 
(SEQ ID NOS: 2, 4, 6. . .280, 282, 284, 286-293) of the WSBV genome of the present 
invention are provided in Figure 2 and Figure 3 and polymorphic sites on these transcripts 
summarized in Table 1 . 

Conditions for incubating a nucleic acid molecule with a test sample vary. Incubation 
conditions depend on the format employed in the assay, the detection methods employed, and 
the type and nature of the nucleic acid molecule used in the assay. One skilled in the art will 
recognize that any one of the commonly available hybridization, amplification or array assay 
formats can readily be adapted to employ the novel fragments of the WSBV genome 
disclosed herein. Examples of such assays can be found in Chard, T, An Introduction to 
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Radioimmunoassay and Related Techniques^ Elsevier Science Publishers, Amsterdam, The 
Netherlands (1986); Bullock, G. R. et al.. Techniques in Immunocytochemistry, Academic 
Press, Orlando, FL Vol. 1 (1982), Vol. 2 (1983), Vol. 3 (1985); Tijssen, P., Practice and 
Theory of Enzyme Immunoassays: Laboratory Techniques in Biochemistry and Molecular 
Biology^ Elsevier Science Publishers, Amsterdam, The Netherlands (1985). 

The test samples of the present invention include, but are not limited to, nucleic acid 
cKtracts, cells, and protein or membrane extracts of cells infected with WSBV. The test 
sample used in the above-described method will vary based on the assay format, the nature of 
the detection method, and the tissues, cells, or extracts used as the sample to be assayed. 
Methods for preparing nucleic acid extracts or for preparing cells are well known in the art 
and can readily be adapted in order to obtain a sample that is compatible with the system 
utilized. 

In another embodiment of the present invention, kits are provided which contain. the 
necessary reagents to carry out the assays of the present invention. 

Specifically, the invention provides a compartmentalized kit to receive, in close 
confinement, one or more containers which comprises: (a) a first container comprising one of 
the nucleic acid molecules that can bind to a fragment of the WSBV genome disclosed 
herein; and (b) one or more other containers comprising one or more of the following: wash 
reagents, reagents capable of detecting the presence of a bound nucleic acid. Preferred kits 
will include detection reagents/arrays/chips/microfluidic devices that are capable of detecting 
the expression of 1 or more, 10 or more, 100 or more, or most or all of the genes expressed in 
WSBV, particularly the genes provided in Figure 3. 

In detail, a compartmentalized kit includes any kit in which reagents are contained in 
separate containers. Such containers include small glass containers, plastic containers, strips 
of plastic, glass or paper, or arraying material such as silica. Such containers allow one to 
efficiently transfer reagents from one compartment to another compartment such that the 
samples and reagents are not cross-contaminated, and the agents or solutions of each 
container can be added in a quantitative fashion from one compartment to another. Such 
containers may include a container which will accept the test sample, a container which 
contains the nucleic acid probe, containers which contain wash reagents (such as phosphate 
buffered saline, Tris-buffers, etc.), and containers which contain the reagents used to detect 
the bound probe. One skilled in the art will readily recognize that the previously unidentified 
ORFs that can be routinely identified using the sequence information disclosed herein can be 



wo 01/38351 



PCT/USOO/28888 



readily incorporated into one of the established kit formats which are well known in the art, 
particularly expression arrays. 

Protein/Peptide Molecules 

The present invention provides nucleic acid sequences that encode WSBV protein 
molecules (Figure 3). The peptide sequences provided in Figure 3, as well as the obvious 
variants described herein, and using the information in Figure 3, will be referred to herein as 
the WSBV peptides of the present invention or peptides/polypeptides/proteins of the present 
invention. 

Enzymes and other viral proteins are produced during viral activity and replication 
and can be used as targets for screening and quantitating a particular virus, or as antiviral 
targets. Like viruses in general, WSBV utilizes the resources of the host cell for production of 
viral proteins. Viral proteins can be detected using an antibody, or binding portion thereof, to 
the protein or a probe that recognizes proteins or peptides of the present invention. Viral 
antigens present either on the surface or within the infected cell can be detected by various 
antibody tests, including immunoflourescence or enzyme immunoassay (EIA). Virus or 
antigen released from infected cells can be detected by such antibody tests as enzyme linked 
immunosorbent assay (ELISA), radioimmunoassay (RIA), or latex agglutination (LA). 
Protein-based tests such as these for WSBV antigens are useful for detecting outbreaks of 
WSBV in shrimp populations. 

The present invention provides isolated peptide and protein molecules that comprise, 
consist essentially of, or consist of the amino acid sequences of the WSBV peptides disclosed 
in Figure 3, (which are encoded by the transcript sequences that are also shown in Figure 3), 
as well as all obvious variants of these peptides that are within the art to make and use. Some 
of these variants are described in detail below. 

As used herein, a peptide is said to be "isolated" or "purified" when it is substantially 
free of cellular material or free of chemical precursors or other chemicals. The peptides of the 
present invention can be purified to homogeneity or other degrees of purity. The level of 
purification will be based on the intended use. The critical feature is that the preparation allows 
for the desired function of the peptide, even if in the presence of considerable amounts of other 
components. 

In some uses, "substantially free of cellular material" includes preparations of the peptide 
having less than about 30% (by dry weight) other proteins (i.e., contaminating protein), less than 
about 20% other proteins, less than about 10% other proteins, or less than about 5% other 
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proteins. When the peptide is recombinantly produced, it can also be substantially free of culture 
mediunn, i.e., culture medium represents less than about 20% of the volume of the protein 
preparation. 

The language "substantially free of chemical precursors or other chemicals" includes 
preparations of the peptide in which it is separated from chemical precursors or other chemicals 
that are involved in its synthesis. In one embodiment, the language "substantially free of 
chemical precursors or other chemicals" includes preparations of a WSBV peptide having less 
than about 30% (by dry weight) chemical precursors or other chemicals, less than about 20% 
chemical precursors or other chemicals, less than about 10% chemical precursors or other 
chemicals, or less than about 5% chemical precursors or other chemicals. 

An isolated WSBV peptide can be purified from cells infected with WSBV, purified 
from cells that have been altered to express it (recombinant), or synthesized using known protein 
synthesis methods. For example, a nucleic acid molecule encoding the peptide can be cloned 
into an expression vector, the expression vector introduced into a host cell, and the protein 
expressed in the host cell. The protein can then be isolated from the host cells by an appropriate 
purification scheme using standard protein purification techniques. Many of these techniques 
are described in detail below. 

Accordingly, the present invention provides proteins that consist of the amino acid 
sequences provided in Figure 3 (SEQ IDNOS:3, 5, 7... 281, 283, 285), for example, proteins 
encoded by the transcript/cDNA nucleic acid sequences also shown in Figure 3 (SEQ ID NOS:2, 
4, 6... 280, 282, 284, 286-293). A protein consists of an amino acid sequence when the amino 
acid sequence is the final amino acid sequence of the protein. 

The present invention further provides proteins that consist essentially of the amino acid 
sequences provided in Figure 3 (SEQ iDNOS:3, 5, 7... 281, 283, 285), for example, proteins 
encoded by the transcript/cDNA nucleic acid sequences also shown in Figure 3 (SEQ ID NOS:2, 
4, 6... 280, 282, 284, 286-293). A protein consists essentially of an amino acid sequence when 
such an amino acid sequence is present with only a few additional amino acid residues, for 
example from about 1 to about 100 or so additional residues, typically from 1 to about 20 
additional residues in the final protein. 

The present invention further provides proteins that comprise the amino acid sequences 
provided in Figure 3 (SEQ ID NOS:3, 5, 7. ..281, 283, 285), for example, proteins encoded by 
the transcript/cDNA nucleic acid sequences also shown in Figure 3 (SEQ ID NOS:2, 4, 6. . .280, 
282, 284, 286-293). A protein comprises an amino acid sequence when the amino acid 
sequence is at least part of the final amino acid sequence of the protein. In such a fashion, the 
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protein can be only the peptide or have additional amino acid molecules, such as amino acid 
residues (contiguous encoded sequence) that are naturally associated with it or heterologous 
amino acid residues/peptide sequences. Such a protein can have a few additional amino acid 
residues or can comprise several hundred or more additional amino acids. The preferred classes 
of proteins that are comprised of the peptides of the present invention are the naturally occurring 
mature proteins. A brief description of how various types of these proteins can be made/isolated 
is provided below. 

The peptides of the present invention can be attached to heterologous sequences to form 
chimeric or fusion proteins. Such chimeric and fusion proteins comprise a WSBV peptide 
operative ly linked to a heterologous protein having an amino acid sequence not substantially 
homologous to the WSBV peptide. "Operatively linked" indicates that the WSBV peptide and 
the heterologous protein are fused in-frame. The heterologous protein can be fused to the N- 
terminus or C-terminus of the WSBV peptide. 

In some uses, the fusion protein does not affect the activity of the WSBV peptide per se. 
For example, the ftision protein can include, but is not limited to, enzymatic fusion proteins, for 
example beta-galactosidase fusions, yeast two-hybrid GAL fusions, poly-His fusions, MYC- 
tagged, Hl-tagged and Ig fusions. Such fusion proteins, particularly poly-His fusions, can 
facilitate the purification of recombinant WSBV peptide. In certain host cells, expression and/or 
secretion of a protein can be increased by using a heterologous signal sequence. 

A chimeric or fusion protein can be produced by standard recombinant DNA techniques. 
For example, DNA fragments coding for the different protein sequences are ligated together in- 
frame in accordance with conventional techniques. In another embodiment, the fusion gene can 
be synthesized by conventional techniques including automated DNA synthesizers. 
Alternatively, PGR amplification of gene fragments can be carried out using anchor primers 
which give rise to complementary overhangs between two consecutive gene fragments which 
can subsequently be annealed and re-amplified to generate a chimeric gene sequence (see 
Ausubel et ah. Current Protocols in Molecular Biology 1 992). Moreover, many expression 
vectors are commercially available that already encode a fiision moiety (e.g., a GST protein). A 
WSBV peptide-encoding nucleic acid can be cloned into such an expression vector such that the 
fusion moiety is linked in-frame to the WSBV peptide. 

As mentioned above, the present invention also provides and enables obvious variants of 
the amino acid sequence of the proteins of the present invention, such as naturally occurring 
mature forms of the peptide, sequence variants of the peptides, non-naturally occurring 
recombinant ly derived variants of the peptides, and orthologs and para logs of the peptides. Such 
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variants can readily be generated using art-known techniques in the fields of recombinant 
nucleic acid technology and protein biochemistry. It is understood, however, that variants 
exclude any amino acid sequences disclosed prior to the invention. 

Such variants can readily be identified/made using molecular techniques and the 
sequence information disclosed herein. Further, such variants can readily be distinguished from 
other peptides based on sequence and/or structural homology to the WSBV peptides of the 
present invention. The degree of homology/identity present will be based primarily on whether 
the peptide is a functional variant or non-functional variant, the amount of divergence present in 
the paralog protein family and the evolutionary distance between orthologous viruses. 

To determine the percent identity of two amino acid sequences or two nucleic acid 
sequences, the sequences are aligned for optimal comparison purposes (e.g,, gaps can be 
introduced in one or both of a first and a second amino acid or nucleic acid sequence for 
optimal alignment and non-homologous sequences can be disregarded for comparison 
purposes). In preferred embodiments, at least 30%, 40%, 50%, 60%, 70%, 80%, or 90% or 
more of the length of a reference sequence is aligned for comparison purposes. The amino 
acid residues or nucleotides at corresponding amino acid positions or nucleotide positions are 
then compared. When a position in the first sequence is occupied by the same amino acid 
residue or nucleotide as the corresponding position in the second sequence, then the 
molecules are identical at that position (as used herein amino acid or nucleic acid "identity" is 
equivalent to amino acid or nucleic acid "homology"). The percent identity between the two 
sequences is a function of the number of identical positions shared by the sequences, taking 
into account the number of gaps, and the length of each gap, which are introduced for optimal 
alignment of the two sequences. 

The comparison of sequences and determination of percent identity and similarity 
between two sequences can be accomplished using a mathematical algorithm. {Computational 
Molecular Biology^ Lesk, A.M., ed., Oxford University Press, New York, 1988; Biocomputing: 
Informatics and Genome Projects, Smith, D. W,, ed., Academic Press, New York, 1993; 
Computer Analysis of Sequence Data, Part 1, Griffin, A.M., and Grififm, H.G., eds., Humana 
Press, New Jersey, 1994; Sequence Analysis in Molecular Biology, von Heinje, G., Academic 
Press, 1987; and Sequence Analysis Primer, Gribskov, M. and Devereux, J., eds., M Stockton 
Press, New York, 1991). In a preferred embodiment, the percent identity between two amino 
acid sequences is determined using the Needleman and Wunsch {X MoL Biol. (48):444-453 
(1970)) algorithm which has been incorporated into the GAP program in the GCG software 
package, using either a Blossom 62 matrix or a PAM250 matrix, and a gap weight of 16, 14, 
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12, 10, 8, 6, or 4 and a length weight of 1 , 2, 3, 4, 5, or 6. In yet another preferred 
embodiment, the percent identity between two nucleotide sequences is determined using the 
GAP program in the GCG software package (Devereux, J., et al. Nucleic Acids Res. 12(1):387 
(1984)), using a NWSgapdna.CMP matrix and a gap weight of 40, 50, 60, 70, or 80 and a 
length weight of 1, 2, 3, 4, 5, or 6. In another embodiment, the percent identity between two 
amino acid or nucleotide sequences is determined using the algorithm of E. Myers and W. 
Miller (CABIOS, 4:1 1-17 (1989)) which has been incorporated into the ALIGN program 
(version 2.0), using a PAM120 weight residue table, a gap length penalty of 12 and a gap 
penalty of 4. 

The nucleic acid and protein sequences of the present invention can further be used as 
a "query sequence" to perform a search against sequence databases to, for example, identify 
other viruses related to WSBV or functionally related protein sequences. Such searches can 
be performed using the NBLAST and XBLAST programs (version 2.0) of Altschul, et aL (J. 
MoL Biol. 215:403-10 (1990)). BLAST nucleotide searches can be performed with the 
NBLAST program, score =^ 100, wordlength =^ 12 to obtain nucleotide sequences homologous 
to the nucleic acid molecules of the invention. BLAST protein searches can be performed 
with the XBLAST program, score = 50, wordlength === 3 to obtain amino acid sequences 
homologous to the proteins of the invention. To obtain gapped alignments for comparison 
purposes. Gapped BLAST can be utilized as described in Altschul et al. {Nucleic Acids Res. 
25( 1 7):3389-3402 (1 997)). When utilizing BLAST and gapped BLAST programs, the 
default parameters of the respective programs (e,g., XBLAST and NBLAST) can be used. 
The results of one such analysis are provided in Figure 3. 

Full-length pre-processed forms, as well as mature processed forms, of proteins that 
comprise one of the peptides of the present invention can readily be identified as having 
complete sequence identity to one of the WSBV peptides of the present invention as well as 
being encoded by the same viral gene as the WSBV peptide provided herein. 

Variants of a WSBV peptide can readily be identified as t)eing a WSBV protein having a 
high degree of sequence homology/identity (also referred to as "significant sequence 
homology") to at least a portion of the WSBV peptide as well as being encoded by the same 
viral gene as the WSBV peptide provided herein. Viral genes can readily be determined based 
on the WSBV sequence information provided in Figure 3. As used herein, two proteins (or a 
region of the proteins) have significant homology when the amino acid sequences are 
typically at least about 70-80%, 80-90%, and more typically at least about 90-95% or more 
homologous. A significantly homologous amino acid sequence, according to the present 
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invention, will be encoded by a nucleic acid sequence that will hybridize to a WSBV peptide 
encoding nucleic acid molecule under stringent conditions as more fully described below. 

Paralogs of a WSBV peptide can readily be identified as having some degree of 
significant sequence homology/identity to at least a portion of the WSBV peptide, as being 
encoded by a gene from WSBV, and as having similar activity or function. Two proteins will 
typically be considered paralogs when the amino acid sequences typically share at least about 
60% or greater, and more typically at least about 70% or greater homology through a given 
region or domain. Such paralogs will be encoded by a nucleic acid sequence that will 
hybridize to a WSBV peptide encoding nucleic acid molecule under moderate to stringent 
conditions as more fully described below. 

Orthologs of a WSBV peptide can readily be identified as having some degree of 
significant sequence homology/identity to at least a portion of the WSBV peptide as well as 
being encoded by a gene from another virus. Preferred orthologs will be isolated from viruses of 
commercial or medical importance for the development of broad-spectrum diagnostic and anti- 
viral agents. Such orthologs will be encoded by a nucleic acid sequence that will hybridize to 
a WSBV peptide encoding nucleic acid molecule under moderate to stringent conditions, as 
more fully described below, depending on the degree of relatedness of the two viruses 
yielding the proteins. 

Non-natural ly occurring variants of the WSBV peptides of the present invention can 
readily be generated using recombinant techniques. Such variants include, but are not limited to, 
deletions, insertions, and substitutions in the amino acid sequence of the WSBV peptide. For 
example, one class of substitutions is conserved amino acid substitutions. Such substitutions are 
those that substitute a given amino acid in a WSBV peptide by another amino acid of like 
characteristics. Typically seen as conservative substitutions are the replacements, one for 
another, among the aliphatic amino acids Ala, Val, Leu, and He; interchange of the hydroxy I 
residues Ser and Thr; exchange of the acidic residues Asp and Glu; substitution between the 
amide residues Asn and Gin; exchange of the basic residues Lys and Arg; and replacements 
among the aromatic residues Phe and Tyr. Guidance concerning which amino acid changes are 
likely to be phenotypically silent are found in Bowie et aL, Science 247:1306-1310 (1990). 

Variant WSBV peptides can be fully functional or can lack function in one or more 
activities, e.g. ability to bind to host cell receptors or ability to form structural components such 
as the viral nucleocapsid or outer membrane, etc. Fully functional variants typically contain only 
conservative variation or variation in non-critical residues or in non-critical regions. Figure 3 
provides the results of protein analysis and can be used to identify critical domains/regions. 
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Functional variants can also contain substitutions of similar amino acids that result in no change 
or an insignificant change in function. Alternatively, such substitutions may positively or 
negatively affect function to some degree. 

Non-functional variants typically contain one or more non-conservative amino acid 
substitutions, deletions, insertions, inversions, or truncation or a substitution, insertion, 
inversion, or deletion in a critical residue or critical region. 

Amino acids that are essential for function can be identified by methods known in the 
art, such as site-directed mutagenesis or alanine-scanning mutagenesis (Cunningham et al. 
Science 244:1081-1085 (1989)), particularly using the results provided in Figure 3. The latter 
procedure introduces single alanine mutations at every residue in the molecule. The resulting 
mutant molecules are then tested for biological activity such as DNA binding. Sites that are 
critical for virus/host cell receptor binding can also be determined by structural analysis such as 
crystallization, nuclear magnetic resonance or photoaffinity labeling (Smith et aL, J. Mol. Biol. 
224:899-904 (1992); de Vos et ol Science 255:306-312 (1992)). 

The present invention further provides fragments of the WSBV peptides, in addition to 
proteins and peptides that comprise and consist of such fragments, particularly those comprising 
the residues identified in Figure 3. The fragments to which the invention pertains, however, are 
not to be construed as encompassing fragments that may be disclosed publicly prior to the 
present invention. 

As used herein, a fragment comprises at least 8, 10, 12, 14, 16, or more contiguous 
amino acid residues from a WSBV peptide. Such fragments can be chosen based on the ability 
to retain one or more of the biological activities of the WSBV peptide or could be chosen for the 
ability to perform a function, e.g. bind a substrate or act as an immunogen. Particularly 
important fragments are biologically active fragments, peptides that are, for example, about 8 or 
more amino acids in length. Such fragments will typically comprise a domain or motif of the 
WSBV peptide, e.g., active site or a substrate-binding domain. Further, possible fragments 
include, but are not limited to, domain or motif containing fragments, soluble peptide fragments, 
and fragments containing immunogenic structures. Predicted domains and functional sites are 
readily identifiable by computer programs well known and readily available to those of skill in 
the art (e.g., PROSITE analysis). 

Polypeptides often contain amino acids other than the 20 amino acids commonly 
referred to as the 20 naturally occurring amino acids. Further, many amino acids, including the 
terminal amino acids, may be modified by natural processes during the course of viral infection, 
such as processing and other post-translational modifications by the host cell, or by chemical 
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modification techniques well known in the art. Common modifications that occur naturally are 
described in basic texts, detailed monographs, and the research literature, and they are well 
known to those of skill in the art. 

Examples of known modifications include, but are not limited to, acetylation, acylation, 
ADP-ribosylation, amidation, covalent attachment of flavin, covalent attachment of a heme 
moiety, covalent attachment of a nucleotide or nucleotide derivative, covalent attachment of a 
lipid or lipid derivative, covalent attachment of phosphotidylinositol* cross-linking, cyclization, 
disulfide bond formation, demethylation, formation of covalent crosslinks, formation of cystine, 
formation of pyroglutamate, formylation, gamma carboxylation, glycosylation, GPI anchor 
formation, hydroxylation, iodination, methylation, myristoylation, oxidation, proteolytic 
processing, phosphorylation, prenylation, racemization, selenoylation, sulfation, transfer-RMA 
mediated addition of amino acids to proteins such as arginylation, and ubiquitination. 

Such modifications are well known to those of skill in the art and have been described in 
great detail in the scientific literature. Several particularly common modifications, 
glycosylation, lipid attachment, sulfation, gamma-carboxylation of glutamic acid residues, 
hydroxylation and ADP-ribosylation, for instance, are described in most basic texts, such as 
Proteins - Structure and Molecular Properties, 2nd Ed., T.E. Creighton, W. H. Freeman and 
Company, New York (1993). Many detailed reviews are available on this subject, such as by 
Wold, F., Posttranslational Covalent Modification of Proteins, B.C. Johnson, Ed., Academic 
Press, New York 1-12(1 983); Seifter et aL (Meth. Enzymol. 1 82: 626-646 ( 1 990)) and Rattan et 
al. {Ann. KY, Acad, Sci, 663:48-62 (1992)). 

Peptides or protein encoding sequences of the present invention can be modified or 
mutated, either naturally, such as by host cell mechanisms, or by techniques known to those of 
skill in the art, to disrupt protein formation or protein function, and thereby disrupt viral 
replication and function. These methods can be used to prevent and/or treat viral infection. 

Accordingly, the WSBV peptides of the present invention also encompass derivatives or 
analogs in which a substituted amino acid residue is not one encoded by the genetic code, in 
which a substituent group is included, in which the mature WSBV peptide is fused with another 
compound, such as a compound to increase or decrease the half-life of the WSBV peptide (for 
example, polyethylene glycol), in which the additional amino acids are fused to the mature 
WSBV peptide, such as a leader or secretory sequence or a sequence for purification of the 
mature WSBV peptide or a pro-protein sequence, or in which the WSBV peptide has been 
modified or mutated, either naturally or recombinantly, to disrupt protein function, and thereby 
disrupt WSBV function and/or replication. 
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Protein/Pcptide Uses 

The proteins of the present invention can be used in substantial and specific assays 
related to the functional information provided in the figures; to raise antibodies or to elicit 
another immune response; as a reagent (including the labeled reagent) in assays designed to 
quantitatively determine levels of the protein (or its binding partner or ligand) in biological 
samples; and as markers for infected samples in which the corresponding protein is 
preferentially expressed (either constitutive I y or at a particular stage of viral infection). 
Where the protein binds or potentially binds to another protein or ligand (such as, for 
example, a host cell receptor protein), the protein can be used to identify the binding 
partner/ligand so as to develop a system to identify inhibitors of the binding interaction. Any 
or all of these uses are capable of being developed into reagent grade or kit format for 
commercialization as commercial products. 

Methods for performing the uses listed above are well known to those skilled in the 
art. References disclosing such methods include "Molecular Cloning: A Laboratory Manual", 
2d ed.. Cold Spring Harbor Laboratory Press, Sambrook, J., E. F. Fritsch and T. Maniatis 
eds., 1989, and "Methods in Enzymology: Guide to Molecular Cloning Techniques", 
Academic Press» Berger, S. L. and A. R. Kimmel eds., 1987. 

The potential uses of the viral peptides of the present invention are based primarily on 
the function of the protein. For example, isolated WSBV peptides serve as targets for 
identifying antiviral agents, particularly for identifying antiviral agents that interfere with 
viral replication in a host cell infected with a virus that expresses the peptide. Specific and 
substantial uses for the molecules of the present invention are provided herein. Further uses 
can readily be determined using the information provided herein, that which is known in the 
art, and routine experimentation. 

The proteins of the present invention (including variants and fragments that may have 
been disclosed prior to the present invention) are useful for biological assays for viruses that are 
related to WSBV. Such assays involve any of the known protein functions or activities or 
properties useful for diagnosis of WSBV infection. 

The proteins of the present invention are also useful in virus screening assays, in cell- 
based or cell-free systems. Cell-based systems can be native, i.e., host cells infected with the 
virus, as a biopsy or expanded in cell culture. In an alternate embodiment, cell-based assays 
involve recombinant host cells expressing the viral protein. Cell-based or cell-free systems can 
be used in assays for protein activity, such as enzymatic activity. Cell-free assays can be used to 
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detect the ability of a compound to directly bind to a protein or nucleic acid fragment of the 
present invention. 

The polypeptides can be used to identify compounds that modulate activity of the 
protein. Both the WSBV peptides of the present invention and appropriate variants and 
fragments can be used in high-throughput screens to assay candidate compounds for the ability 
to bind to the WSBV peptide. These compounds can be further screened against a tlinctional 
WSBV peptide to determine the effect of the compound on the WSBV peptide activity. Further, 
these compounds can be tested in shrimp to determine activity/effectiveness. Compounds can 
be identified that inactivate the WSBV peptide to a desired degree (antagonists). 

Further, the proteins of the present invention can be used to screen a compound for the 
ability to stimulate or inhibit interaction between the WSBV protein and a target molecule that 
normally interacts with the WSBV protein, e.g. a host cell receptor. Such assays typically 
include the steps of combining the WSBV protein with a candidate compound under conditions 
that allow the WSBV protein, or fragment thereof, to interact with the target molecule, and 
detecting the formation of a complex between the WSBV protein and the target or detecting the 
biochemical consequence of the interaction between the WSBV protein and the target, such as 
any of the associated effects of host cell signal transduction such as protein phosphorylation, 
cAMP turnover, or adenylate cyclase activation, etc. 

Candidate compounds include, for example, 1) peptides such as soluble peptides, 
including Ig-tailed fusion peptides and members of random peptide libraries (see, e.g.. Lam et 
al. Nature 354:82-84 (1991); Houghten et al, Nature 354:84-86 (1991)) and combinatorial 
chemistry-derived molecular libraries made of D- and/or L- configuration amino acids; 2) 
phosphopeptides (e.g., members of random and partially degenerate, directed phosphopeptide 
libraries, see, e.g., Songyang et al„ Cell 72:767-778 (1993)); 3) antibodies (e.g., polyclonal, 
monoclonal^ anti-idiotypic, chimeric, and single chain antibodies as well as Fab, F(ab')2, Fab 
expression library fragments, and epitope-binding fragments of antibodies); and 4) small organic 
and inorganic molecules (e.g., molecules obtained from combinatorial and natural product 
libraries). 

One candidate compound is a non-virulent soluble fragment of the WSBV peptide that 
competes for substrate binding, such as for binding to shrimp cellular receptors. Other candidate 
compounds include non-virulent mutant WSBV peptides or appropriate fragments containing 
mutations that prevent WSBV virulence and thus compete for substrate. Accordingly, a 
fragment that competes for substrate, for example with a higher affinity, or a fragment that binds 
substrate but is inactive or non-virulent, is encompassed by the present invention. 
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The invention further includes other end point assays to identify compounds that 
inhibit WSBV activity. The assays typically involve an assay of events in the shrimp cell signal 
transduction pathway that indicate viral activity. Thus, the phosphorylation of a substrate, 
activation of a protein, a change in the expression of genes that are up- or down-regulated in a 
host cell in response to WSBV infection can be assayed. 

Any of the viral functions mediated by a WSBV protein can be used as an endpoint 
assay. These include all of the biochemical or biological events described herein, and in the 
references cited herein, incorporated by reference for these endpoint assay targets, and other 
functions known to those of ordinary skill in the art or that can be readily identified using the 
information provided in the figures, particularly Figure 3. 

The proteins of the present invention are also useful in competition binding assays in 
methods designed to discover compounds that interact with the viral protein (e.g. binding 
partners and/or ligands). Thus, a compound is exposed to a viral polypeptide under conditions 
that allow the compound to bind or to otherwise interact with the polypeptide. Soluble viral 
polypeptide is also added to the mixture. If the test compound interacts with the soluble viral 
polypeptide, it decreases the amount of complex formed or activity from the viral protein target. 
This type of assay is particularly useful in cases in which compounds are sought that interact 
with specific regions of the viral protein. Thus, the soluble polypeptide that competes with the 
target viral protein region is designed to contain peptide sequences corresponding to the region 
of interest. See Hodgson, Bio/technology^ 1992, Sept 10(9), 973-80, for a review of competition 
binding assays and other receptor screening assays. 

To perfonn cell free drug screening assays, it is sometimes desirable to immobilize 
either the viral protein, or fragment, or its target molecule to facilitate separation of complexes 
from uncomplexed forms of one or both of the proteins, as well as to accommodate automation 
of the assay. 

Techniques for immobilizing viral proteins on matrices can be used in the drug screening 
assays. In one embodiment, a fusion protein can be provided which adds a domain that allows 
the viral protein to be bound to a matrix. For example, glutathione-S-transferase fusion proteins 
can be adsorbed onto glutathione sepharose beads (Sigma Chemical, St. Lxjuis, MO) or 
glutathione derivatized microtitre plates, which are then combined with the cell iysates (e.g., ^^S- 
labeted) and a candidate drug compound, and the mixture incubated under conditions conducive 
to complex formation (e.g., at physiological conditions for salt and pH). Following incubation, 
the beads are washed to remove any unbound label, and the matrix immobilized and radiolabel 
determined directly, or in the supernatant afler the complexes are dissociated. Alternatively, the 
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complexes can be dissociated from the matrix, separated by SDS-PAGE, and the level of a viral 
protein target ligand, such as a host cell receptor protein, found in the bead fraction quantitated 
from the gel using standard electrophoretic techniques. For example, either the viral protein or 
its target ligand can be immobilized utilizing conjugation of biotin and streptavidin using 
techniques well known in the art. Alternatively, antibodies reactive with the viral protein but 
which do not interfere with binding of the viral protein to its target ligand can be derivatized to 
the wells of the plate, and the viral protein trapped in the wells by antibody conjugation. 
Preparations of a viral protein target ligand and a candidate compound are incubated in the viral 
protein-presenting wells and the amount of complex trapped in the well can be quantitated. 
Methods for detecting such complexes, in addition to those described above for the GST- 
immobilized complexes, include immunodetection of complexes using antibodies reactive with 
the viral protein target ligand, or which are reactive with viral protein and compete with the 
target ligand, as well as enzyme-linked assays which rely on detecting an enzymatic activity 
associated with the target ligand. 

Agents that modulate one of the viral proteins of the present invention can be identified 
using one or more of the above assays, alone or in combination. It is generally preferable to use 
a cell-based or cell free system first and then confirm activity in a shrimp, or other organism, 
infected with WSBV, 

Modulators of viral protein activity identified according to these drug screening assays 
can be used to treat shrimp infected with WSBV. These methods of treatment include the steps 
of administering a modulator of viral protein activity in a pharmaceutical composition to an 
organism, such as a shrimp, that is infected with WSBV, the modulator being identified as 
described herein. 

In yet another aspect of the invention, the WSBV proteins can be used as "bait 
proteins" in a two-hybrid assay or three-hybrid assay (see, e.g., U.S. Patent No. 5,283,3 1 7; 
Zervos et al (1993) Cell 72:223-232; Madura et al (1993) J. Biol Chem, 268:12046-12054; 
Bartel et al (1993) Biotechniques 14:920-924; Iwabuchi et al (1993) Oncogene 8:1693- 
1696; and Brent WO94/10300), to identify other proteins which bind to or interact with the 
viral protein and are involved in viral protein activity, and therefore are targets for inhibiting 
viral protein activity. 

The two-hybrid system is based on the modular nature of most transcription factors, 
which consist of separable DNA-binding and activation domains. Briefly, the assay utilizes 
two different DNA constructs. In one construct, the gene that codes for a viral protein is 
fused to a gene encoding the DMA binding domain of a known transcription factor (e.g.. 
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GAL-4). In the other construct, a DNA sequence, from a library of DNA sequences, that 
encodes an unidentified protein ("prey" or "sample") is fused to a gene that codes for the 
activation domain of the known transcription factor. If the "bail" and the "prey" proteins are 
able to interact, in vivo, forming a viral protein-dependent complex, the DNA-binding and 
activation domains of the transcription factor are brought into close proximity. This 
proximity allows transcription of a reporter gene (e.g., LacZ) which is operably linked to a 
transcriptional regulatory site responsive to the transcription factor. Expression of the 
reporter gene can be detected and cell colonies containing the functional transcription factor 
can be isolated and used to obtain the cloned gene which encodes the protein which interacts 
with the viral protein. 

This invention further pertains to novel agents identified by the above-described 
screening assays. Accordingly, it is within the scope of this invention to further use an agent 
identified as described herein in an appropriate animal model, such as a shrimp infected with 
WSBV. For example, an agent identified as described herein (e.g., a viral protein-modulating 
agent, an antisense viral nucleic acid molecule, a viral protein-specific antibody, or a viral 
protein-binding partner) can be used in a shrimp, or other organism, infected with WSBV to 
determine the efficacy, toxicity, or side effects of treatment with such an agent. 
Alternatively, an agent identified as described herein can be used in an animal or other model 
to determine the mechanism of action of such an agent. Furthermore, this invention pertains 
to uses of novel agents identified by the above-described screening assays for treatments as 
described herein. 

The viral proteins of the present invention are also useful for providing targets for 
diagnosing viral infection. Accordingly, the invention provides methods for detecting the 
presence, or levels of, the viral protein (or encoding nucleic acid) in an infected cell, tissue, or 
organism. The method involves contacting a biological sample with a compound capable of 
interacting with the viral protein such that the interaction can be detected. Such an assay can be 
provided in a single detection format or a multi-detection format such as an antibody chip array. 

One agent for detecting a protein in a sample is an antibody capable of selectively 
binding to a WSBV protein. A biological sample includes tissues, cells and biological fluids 
isolated from a shrimp or other infected organism, as well as tissues, cells and fluids present 
within the infected organism. 

In vitro techniques for detection of viral peptide include enzyme linked immunosorbent 
assays (ELISAs), Western blots, immunoprecipitations and immunofluorescence using a 
detection reagent, such as an antibody or protein binding agent. Alternatively, the peptide can 



wo 01/38351 



PCT/USOO/28888 



be detected in vivo in an infected organism by introducing into the subject a labeled anti-peptide 
antibody or other type of detection agent. For example, the antibody can be labeled with a 
radioactive marker whose presence and location in an infected organism can be detected by 
standard imaging techniques. Particularly useful are methods that detect fragments of a peptide 
in a sample. 

Antibodies 

The invention also provides antibodies that selectively bind to one of the WSBV 
peptides of the present invention, a protein comprising such a peptide, as well as variants and 
fragments thereof As used herein, an antibody selectively binds a target peptide when it binds 
the target peptide and does not significantly bind to unrelated proteins. An antibody is still 
considered to selectively bind a peptide even if it also binds to other proteins that are not 
substantially homologous with the target peptide so long as such proteins share homology with a 
fragment or domain of the peptide target of the antibody. In this case, it would be understood 
that antibody binding to the peptide is still selective despite some degree of cross-reactivity. 

As used herein, an antibody is defined in terms consistent with that recognized within 
the art: they are multi-subunit proteins produced by a mammalian organism in response to an 
antigen challenge. The antibodies of the present invention include polyclonal antibodies and 
monoclonal antibodies, as well as fragments of such antibodies, including, but not limited to. 
Fab or F(ab')2, and Fv fragments. 

Many methods are known for generating and/or identifying antibodies to a given target 
peptide. Several such methods are described by Harlow, Antibodies^ Cold Spring Harbor Press, 
(1989). In general, to generate antibodies, an isolated peptide is used as an immunogen and is 
administered to a mammalian organism, such as a rat, rabbit or mouse. The antibodies 
generated by the organism in response to the immunogen are then isolated. The full-length 
protein, an antigenic peptide fragment or a fusion protein can be used. Particularly important 
fragments are those covering functional domains, such as the domains identified in Figure 3, and 
domains of sequence homology or divergence between WSBV and other viruses, such as those 
that can readily be identified using protein alignment methods and as presented in the figures. 

Monoclonal antibodies can be produced by hybridomas, which are immortalized cell 
lines capable of secreting a specific monoclonal antibody. The immortalized cell lines can be 
created in vitro by fusing two different cell types, usually lymphocytes, one of which is a 
tumor cell. 
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Antibodies are preferably prepared from regions or discrete fragments of the WSBV 
proteins. Antibodies can be prepared from any region of the peptide as described herein. 
However, preferred regions will include those involved in function/activity and/or virus/host 
interaction. Figure 3 can be used to identify particularly important regions while sequence 
alignment can be used to identify conserved and unique sequence fragments. 

An antigenic fragment will typically comprise at least 8 contiguous amino acid residues. 
The antigenic peptide can comprise, however, at least 10, 12, 14, 16 or more amino acid 
residues. Such fragments can be selected on a physical property, such as fragments 
corresponding to regions that are located on the surface of the protein, e.g., hydrophilic regions, 
or can be selected based on sequence uniqueness (see Figure 3). 

Detection of an antibody of the present invention can be facilitated by coupling (i.e., 
physically linking) the antibody to a detectable substance. Examples of detectable substances 
include various enzymes, prosthetic groups, fluorescent materials, luminescent materials, 
bioluminescent materials, and radioactive materials. Examples of suitable enzymes include 
horseradish peroxidase, alkaline phosphatase, p-galactosidase, or acetylcholinesterase; examples 
of suitable prosthetic group complexes include streptavidin/biotin and avidin/biotin; examples of 
suitable fluorescent materials include umbel liferone, fluorescein, fluorescein isothiocyanate, 
rhodamine, dichlorotriazinylamine fluorescein, dansyl chloride or phycoerythrin; an example of 
a luminescent material includes luminol; examples of bioluminescent materials include 
luciferase, luciferin, and aequorin, and examples of suitable radioactive material include 

Antibody Uses 

The antibodies can be used to isolate one of the proteins of the present invention by 
standard techniques, such as affinity chromatography or immunoprecipitation. The 
antibodies can facilitate the purification of the natural viral protein from host cells infected 
with WSBV, as well as recombinantly produced protein. In addition, such antibodies are 
useful for detecting the presence of the viral proteins of the present invention in cells or 
tissues in order to determine the pattern of viral infection among various cells or tissues in a 
shrimp or other organism over the course of viral infection. Further, such antibodies can be 
used to detect protein in situ, in vitro, or in a cell lysate or supernatant in order to evaluate the 
abundance and pattern of viral infection. 
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The antibodies can also be used to assess subcellular localization of virus particles in 
host cells. The diagnostic uses can be applied, not only in genetic testing, but also in monitoring 
a treatment modality. Accordingly, where treatment is ultimately aimed at preventing or halting 
expression of the WSBV protein, antibodies directed against the protein or relevant fragments 
can be used to monitor therapeutic efficacy. 

The antibodies are also useful diagnostic tools, such as for use as immunological 
markers for aberrant viral protein analyzed by electrophoretic mobility, isoelectric point, tryptic 
peptide digest, and other physical assays known to those in the art. 

The antibodies are also useful for inhibiting protein function. For example, antibodies 
may bind directly to viral peptides to block binding of the viral peptide to a binding partner 
such as a host cell receptor. Antibodies can thereby serve as antiviral agents. Antibodies can 
be prepared against specific fragments containing sites required for protein fiinction or 
against intact viral protein that is associated with virulence. 

The invention also encompasses kits for using antibodies to detect the presence of a 
WSBV protein in a biological sample, such as a shrimp cell sample. The kit can comprise 
antibodies such as a labeled or labelable antibody and a compound or agent for detecting viral 
protein in a biological sample; means for determining the amount of protein in the sample; 
means for comparing the amount of protein in the sample with a standard; and instructions for 
use. Such a kit can be supplied to detect a single protein or epitope or can be configured to detect 
one of a multitude of epitopes, such as in an antibody detection array. Arrays are described in 
detail above for nucleic acid arrays and similar methods have been developed for antibody 
arrays. 

All publications and patents mentioned in the above specification are herein 
incorporated by reference. Various modifications and variations of the described method and 
system of the invention will be apparent to those skilled in the art without departing from the 
scope and spirit of the invention. Although the invention has been described in connection 
with specific preferred embodiments, it should be understood that the invention as claimed 
should not be unduly limited to such specific embodiments. Indeed, various modifications of 
the above-described modes for carrying out the invention which are obvious to those skilled 
in the field of molecular biology or related fields are intended to be within the scope of the 
following claim. 

Certain aspects of the present invention are described in greater detail in the non- 
limiting examples that follow. 
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Examples: 
Infected prawn 

Dead and moribund P. japonicus with evident white spots on the inside surface of the 
crust were collected from a prawn farm and kept at 4 ^C. 

Isolation of nucleocapsids 

Hepatopancreata gill and intestine were removed from P. japonicus and placed in an 
ice-bathed beaker, homogenized as a 10% suspension in TESP buffer (50 mmol/1, Tris-HCl, 
pH 8,5, 10 PMSF), then centrifuged at 6500 x ^ for 1 0 min at 4 The supernatant was 
recentrifuged at 30,000 x g for 30 min at 4*^C. The pellet was suspended in an approximate 
two volume of TESP buffer containing 1% (v/v) Triton X-1 00, After centrifugation at 5000 x 
^ for 10 min, the supernatant was centrifuged again at 25,000 x g for 20 min. The pellet was 
suspended in TESP buffer and differential centrifugation was repeated, and then the 
precipitate was resuspended in TMP (100 mmol/1, Tris-HCI, pH 7.5, 10 mmol/l MgCb/l, 1 
mmol/1 PMSF) buffer and treated with DNase and RNase. The mixture was incubated at 
37^Cfor 15 min and 30 ml of TESP buffer was added. Differential centrifugation was 
repeated again and the pellet was resuspended In 1 ml of TESP buffer, 1 /A of suspension was 
dropped on a copper grid, negatively stained with 2% (w/v) uranyl acetate, pH 7.6, and 
observed using a JEM-IOOCX II transmission electron microscope. 

Purification of viral DNA 

The nucleocapsid suspension was lysed with 2 ml of GTE buffer (6 mol/1 guanidine 
hydrochloride; 50 mmol/1 Tris-HCl; 10 mmol/I EDTA; pH 7.0), slightly mixed, and then 
centrifuged at 25,000 x for 1 0 min at 4 ^C. The supernatant was collected and 0.02 vol. of 1 
mol/1 MgCb and 0.6 vol. of isopropanol were added. After centrifugation, the pellet was 
picked out with pipette tip, washed twice with 70% ethanol and then dissolved in 1 ml of TE 
buffer containing 0.5% (w/v) SDS and 0.5 mg/ml proteinase K and incubated at 55^C for 2-3 
hrs. The DNA was precipitated again with 0.01 vol. of 1 mol/1 MgCb and 0.25 vol. of 
isopropanol and dissolved in 0.1 x TE buffer. The viral DNA obtained was quantified by a 
spectrophotometer. 
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Construction of a random "shotgun" library: 

WSBV genomic DNA was sheared with sonication. Mung Bean nuclease was used to 
blunt the end. The DNA fragments between 1 .8-2kb were recovered from an agarose gel 
following electrophoresis. The blunt end DNA was cloned into pUClS vectors. The vector 
was subsequently transformed into DH5a cells and plated onto LB plate. 
Construction of a restriction fragment library: 

WSBV genomic DNA was partially digested with SauSAl restriction enzymes. DNA 
fragments between 5-1 Okb were recovered from the agarose gel. pBluescript vectors were 
digested by the restriction enzyme and the ends were dephosphorylated. The fragments were 
cloned into pBluescript vector and transformed into XL-blue competent cells. Subsequently, 
the DNA plasmid was prepared. 

Large scale DNA sequencing 
PCR reactions: 

PGR reactions were carried out in a 25 //I volume containing 0.2mM dNTP, 1.5mM 
MgCb, 5 jjM of each primers, 2.5 unit of Taq polymerase, and a single white colony as 
template. Take out ImI as glycerol stocks. PCR reactions were done in a PE 9700. The 
cycling profile consisted of an initial denaturation at 95**C for 12 min (one cycle) followed by 
30 cycles of denaturation at 95°C for 15s, annealing at SZ^'C for 20s, and extension at 72*'C 
for 2 min. Then 5 ju\ of PCR products were visualized on 1% agarose gels stained with 
ethidium bromide. Excess primers and dNTPs were removed by digesting the PCR products 
with exonuclease 1 and shrimp alkaline phosphatase. 
DNA sequencing: 

Sequencing reactions were carried out in 5p.l volume containing \\x\ of "BigDye" 
premix (PE Applied Biosystems), Z.2\iM sequencing primer and 30-90ng PCR products. 
Sequencing reactions were done in a PE 9700. The cycling profile consisted of 30 cycles of 
denaturation at 95°C for 30s, annealing at SO^'C for 30s, and extension at 6(fC for 4 min. 
Excess Dye terminators were remove with ethanol precipitation, and sequencing was carried 
out on AB! 377 automated sequencer. 
Sequence analysis 

A total of 5795 sequences were assembled in a UNIX system using InnerPeace 
software designed based on the "Phred, Phrap and Consed" program originally developed by 
the University of Washington. Sequences were edited and finished as follows: a) for bad 
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sequence quality, sequencing was repeated; b) for regions with repetitive sequences, which 
may cause misassembly, primers were designed for walking on the original PCR products or 
on plasmid DNA; c) for mapping gaps, clones were sequenced that cover the gap; d) for 
physical gaps, PCR primers were designed between the gaps, then the PCR products were 
sequenced that cover the gap; e) for gaps that can't be covered by PCR methods, walking on 
WSBV genomic DNA was applied. 

WSBV cDNA library construction and WSBV cDNA clone selecti on 
Poiy(A)-mRNA was purified by using the "PolyATtract System 1000" kit (Promega). 
Double stranded cDNA was synthesized and cloned using the "SUPERSCRIPT™ Plasmid 
System for cDNA Synthesis and Plasmid Cloning" kit (GIBCO BRL). cDNA clones were 
transformed into DHIOII cells and then plated. WSBV cDNA clones were selected by DNA 
hybridization using Dig labeled WSBV genomic DNA as a probe (Dig labeling kit, 
Boehringer Mannheim). Finally, the plasmid DNA was prepared for automatic sequencing. 
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Claims 

That which is claimed is: 

1. An isolated peptide comprising an amino acid sequence selected from the group 
consisting of: 

(a) an amino acid sequence shown in SEQ ID NOS:3, 5, 7... 281, 283, 285; 

(b) a variant of an amino acid sequence shown in SEQ ID NOS:3, 5, 7. . .28 1 , 283, 
285, wherein said variant is encoded by a nucleic acid molecule that hybridizes under stringent 
conditions to the opposite strand of a nucleic acid molecule shown in SEQ ID NOS:2, 4, 

6. . .280, 282, 284, 286-293; 

(c) an amino acid sequence of an ortholog of an amino acid sequence shown in SEQ 
ID NOS:3, 5, 7.. .281, 283, 285, wherein said ortholog is encoded by a nucleic acid molecule 
that hybridizes under stringent conditions to the opposite strand of a nucleic acid molecule 
shown in SEQ ID NOS:2, 4, 6... 280, 282, 284, 286-293; and 

(d) a fragment of an amino acid sequence shown in SEQ ID NOS:3, 5,7.,.281,283, 
285, wherein said fragment comprises at least 10 contiguous amino acids. 

2. An isolated peptide consisting of an amino acid sequence selected from the group 
consisting of: 

(a) an amino acid sequence shown in SEQ ID NOS:3, 5, 7... 281, 283, 285. 

(b) a variant of an amino acid sequence shown in SEQ ID NOS:3, 5, 7... 281, 283, 
285, wherein said variant is encoded by a nucleic acid molecule that hybridizes under stringent 
conditions to the opposite strand of a nucleic acid molecule shown in SEQ ID NOS:2, 4, 

6... 280, 282, 284,286-293; 

(c) an amino acid sequence of an ortholog of an amino acid sequence shown in SEQ 
ID NOS:3, 5, 7. ..281, 283, 285, wherein said ortholog is encoded by a nucleic acid molecule 
that hybridizes under stringent conditions to the opposite strand of a nucleic acid molecule 
shown in SEQ ID NOS:2, 4, 6. -.280, 282, 284, 286-293; and 

(d) a fragment of an amino acid sequence shown in SEQ ID NOS:3, 5, 7. ..281, 283, 
285, wherein said fragment comprises at least 10 contiguous amino acids. 

3. An isolated antibody that selectively binds to a peptide of claim 1 , 
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4. An isolated nucleic acid molecule comprising a nucleotide sequence selected from the 
group consisting of: 

(a) a nucleotide sequence that encodes an amino acid sequence shown in SEQ ID 
NOS:3, 5, 7... 281, 283, 285; 

(b) a nucleotide sequence that encodes a variant of an amino acid sequence shown in 
SEQ ID NOS:3, 5, 7... 281, 283, 285, wherein said nucleotide sequence hybridizes under 
stringent conditions to the opposite strand of a nucleic acid molecule shown in SEQ rDNOS:2, 
4, 6... 280, 282, 284, 286-293; 

(c) a nucleotide sequence that encodes an ortholog of an amino acid sequence shown 
in SEQ ID NOS:3, 5, 7. ..281, 283, 285, wherein said nucleotide sequence hybridizes under 
stringent conditions to the opposite strand of a nucleic acid molecule shown in SEQ ID NOS:2, 

4, 6... 280, 282, 284, 286-293; 

(d) a nucleotide sequence that encodes a fragment of an amino acid sequence shown 
in SEQ ID NOS:3, 5, 7... 281, 283, 285, wherein said fragment comprises at least 10 contiguous 
amino acids; and 

(e) a nucleotide sequence that is the complement of a nucleotide sequence of (a)-(d). 

5. An isolated nucleic acid molecule consisting of a nucleotide sequence selected from the 
group consisting of: 

(a) a nucleotide sequence that encodes an amino acid sequence shown in SEQ ID 
NOS:3, 5, 7.. .281, 283, 285; 

(b) a variant of an amino acid sequence shown in SEQ ID NOS:3, 5, 7.. .281, 283, 
285, wherein said nucleotide sequence hybridizes under stringent conditions to the opposite 
strand of a nucleic acid molecule shown in SEQ ID NOS:2, 4, 6.. .280, 282, 284, 286-293; 

(c) a nucleotide sequence that encodes an ortholog of an amino acid sequence shown 
in SEQ ID NOS:3, 5, 7. ..281, 283, 285, wherein said nucleotide sequence hybridizes under 
stringent conditions to the opposite strand of a nucleic acid molecule shown in SEQ ID NOS:2, 
4, 6... 280, 282, 284, 286-293; 

(d) a nucleotide sequence that encodes a fragment of an amino acid sequence shown 
in SEQ ID NOS:3, 5, 7.. .281, 283, 285, wherein said fragment comprises at least 10 contiguous 
amino acids; and 

(e) a nucleotide sequence that is the complement of a nucleotide sequence of (a)-(d). 



6. 



A gene chip comprising a nucleic acid molecule of claims 4 or 5. 
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7. 



A transgenic non-human organism comprising a nucleic acid molecule of claims 4 or 5. 



8. 



A nucleic acid vector comprising a nucleic acid molecule of claims 4 or 5. 



9. 



A host cell containing the vector of claim 8. 



1 0. A method for producing any of the peptides of claim 1 comprising introducing a 
nucleotide sequence encoding any of the amino acid sequences in (a)-(d) into a host cell, and 
culturing the host cell under conditions in which the peptides are expressed from the nucleotide 
sequence. 

IK A method for producing any of the peptides of claim 2 comprising introducing a 
nucleotide sequence encoding any of the amino acid sequences in (a)-(d) into a host cell, and 
culturing the host cell under conditions in which the peptides are expressed from the nucleotide 
sequence. 

12. A method for detecting the presence of any of the peptides of claims 1 or 2 in a sample, 
said method comprising contacting said sample with a detection agent that specifically allows 
detection of the presence of the peptide in the sample and then detecting the presence of the 
peptide. 

1 3. A method for detecting the presence of a nucleic acid molecule of claims 4 or 5 in a 
sample, said method comprising contacting the sample with an oligonucleotide that hybridizes to 
said nucleic acid molecule under stringent conditions and determining whether the 
oligonucleotide binds to said nucleic acid molecule in the sample. 

14. A method for identifying a modulator of a peptide of claims 1 or 2, said method 
comprising contacting said peptide with an agent and determining if said agent has modulated 
the function or activity of said peptide. 

1 5. The method of claim 14, wherein said agent is administered to a host cell comprising an 
expression vector that expresses said peptide. 
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16. A method for identifying an agent that binds to any of the peptides of claims 1 or 2, said 
method comprising contacting the peptide with an agent and assaying the contacted mixture to 
determine whether a complex is formed with the agent bound to the peptide. 

17. A pharmaceutical composition comprising an agent identified by the method of claim 1 6 
and a pharmaceutical ly acceptable carrier therefor. 

18. A method for treating WSBV infection, said method comprising administering to an 
organism a pharmaceutically effective amount of an agent identified by the method of claim 16. 

19. A method for identifying a modulator of the expression of a peptide of claims 1 or 2, 
said method comprising contacting a cell expressing said peptide with an agent, and determining 
if said agent has modulated the expression of said peptide. 

20. An isolated WSBV peptide having an amino acid sequence that shares at least 70% 
homology with an amino acid sequence shown in SEQ ID NOS:3, 5, 7... 281, 283, 285. 

21 . A peptide according to claim 20 that shares at least 90 percent homology with an amino 
acid sequence shown in SEQ IDNOS:3, 5, 7... 281, 283, 285. 

22. An isolated nucleic acid molecule encoding a WSBV peptide, said nucleic acid 
molecule sharing at least 80 percent homology with a nucleic acid molecule shown in SEQ 
ID NOS:2, 4, 6... 280, 282, 284, 286-293. 

23. A nucleic acid molecule according to claim 22 that shares at least 90 percent 
homology with a nucleic acid molecule shown in SEQ ID NOS:2, 4, 6.., 280, 282, 284, 286- 
293. 

24. An isolated nucleic acid detection reagent that is capable of detecting the presence of 
1 or more genes from WSBV, wherein said genes are selected from the group consisting of 
SEQ ID NOS:2, 4, 6... 282, 284, 286, 288-292 and 293. 

25. The detection reagent of claim 24, wherein said reagent is a nucleic acid array. 
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26. The array of claim 25, wherein said array is comprised of short oligonucleotides from 
about 5 to about 100 nucleotides in length. 

27. The array of claim 25, wherein said array is comprised of polynucleotides based on 
the transcript sequences (SEQ ID NOS: 2, 4, 6... 280, 282, 284, 286-293), wherein said 
polynucleotides are from about 100 to about 1000 nucleotides in length. 

28. An isolated nucleic acid detection reagent that is capable of detecting the presence of 
10 or more genes from WSBV, wherein said genes are selected from the group consisting of 
SEQ ID NOS:2, 4, 6.,. 282, 284, 286, 288-292 and 293. 

29. The detection reagent of claim 28, wherein said reagent is a nucleic acid array. 

30. The array of claim 29, wherein said array is comprised of short oligonucleotides from 
about 5 to about 100 nucleotides in length. 

31. The array of claim 29, wherein said array is comprised of polynucleotides based on 
the transcript sequences (SEQ ID NOS: 2, 4, 6... 280, 282, 284, 286-293), wherein said 
polynucleotides are from about 100 to about 1000 nucleotides in length. 

32. An isolated nucleic acid detection reagent that is capable of detecting the presence of 
100 or more genes from WSBV, wherein said genes are selected from the group consisting of 
SEQ ID NOS: 2, 4, 6... 282, 284, 286, 288-292 and 293. 

33. The detection reagent of claim 32, wherein said reagent is a nucleic acid array. 

34. The array of claim 33, wherein said array is comprised of short oligonucleotides from 
about 5 to about 100 nucleotides in length. 

35. The array of claim 33, wherein said array is comprised of polynucleotides based on 
the transcript sequences (SEQ IDNOS:2, 4, 6... 280, 282, 284, 286-293), wherein said 
polynucleotides are from about 100 to about 1000 nucleotides in length. 
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36. An isolated nucleic acid detection reagent that is capable of detecting the presence of 
all genes from WSBV, wherein said genes are selected from the group consisting of SEQ ID 
NOS:2, 4, 6. ..282, 284, 286, 288-292 and 293. 

37. The detection reagent of claim 36, wherein said reagent is a nucleic acid array. 

38. The array of claim 37, wherein said array is comprised of short oligonucleotides from 
about 5 to about 100 nucleotides in length. 

39. The array of claim 38, wherein said array is comprised of polynucleotides based on 
the transcript sequences (SEQ ID NOS:2, 4, 6... 280, 282, 284, 286-293), wherein said 
polynucleotides are from about 100 to about 1000 nucleotides in length. 
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GGATCCAGACATCACCCCAACAATCGCAGACCCGATTACAGTAGCAGAAGrTGGAAACCCTGTCGTGAGGATAAAAACTCCTTCATGTGRTGAGTTGAAT 
AAATTCATGAGATTCTCTACTAATTTCTCATTTTTAAAATACGTCAAACGGTTTATGGCGCAGAGAGCTTCATCTAATAGTAGGAATGATATGTCCGCTT 
TCTTATTAGATTTGATTTTTGAAAAATTGATAGAGTTTTTATTATCAACAGGATGGAATAGAGAAGCTCTGGGAGGAACTACTGGCGCTATTCTTATGAT 
CAGGACGGCAACTTGTCGTCTAGCGCAATCTTATTACCCCCTCCCTGAAACCGTTGGTATGGACTTTTTGCAAACACCTGGGTTTGACTACAATAAATCT 
TTCCCTAACAATGAAAGGTATATATATGAATTTCAAGCTACAATGTAAATAATTGGTTAATAAAATAAAGGTATATTTTTAAAAAATGTGTrXATTTTTC 
CCCAACCTTAAACAGATCATTGCCAGGAGAAAATCGCATACTTAACAGATAATGCCTATTTGTATCATCGATGTCTTCGTCAAACGTTGCTCCAAACACA 
AGTGTGTTGATCCTATTTTTCTTCAAAACTGTTGGTTTGAAATATAATACAAATTTCTTAGCTGAAAATTCCCTTCCTGTAGCATTTAGACCTGGAGACA 
GTGAAACAGAGTCCACTGTTATATCCCTCTTTGTATTATTGTATACTCTTGCATACAATTCTAATTCTTTAGATTCGGCTTTGTTTACATTrAGTGTGTG 
GTCTCCGTCTCCAAGTAAAGATGTGAGGATTATGTCTCCTTTCTTTTTTACCAGTTCAGAATCGGACGTTCCTTGGCCAGCTACATATACCrTTCCATAG 
TTCCTGGTTTGTAATGTGCCGTTTACAAAGTTAAAGTGGTTTCCTCTAGCAACACCGTTAATGGTCAAGTTTATTATTTCAGATTCAACAGGGTAGGCGT 
CTTTATCCTTCTTGTCCAATTTCTTGTTAAGTTCTATGTTGGTTACAACTATAGATATAACCACGAGTATCAATGTCAAACCCGCCAGTATAGCGGCGTA 
AACC CCCCACAT GTGC ATTT TG AAG AAAG T TGT AC AAAGGTCAT GAGAGAAAAAAAC AGAT ATT AAAGTTTGTTATATTT TATTT AGTCGTAGAAATC AC 
CACTGAGATATATTGTGAGCAGCAACTGCCGTTTCAGCATACTCGAAAGAAGAATAGTTGCGATCCTTTGAAGCCGGCATCTTGAAATGATGTGCAGTAT 
TATTTCTGCTAGTTTTGTTGAAGTACTGGATAGCCTTTACTGAGTTGAAAGTTTCACGGGAACAGCTTGAfiCAGTTGATGGAGGAAGAAGTTCGCCTGCA 
CTGGCAATTCACAAAGGAGTGCTGATTATTGTTGTTATGATGGTACATTTTGAAGGTGAGAGCACGTCTTTCTTCAACCAAAGGTGTTTGTCTAATGTGG 
CTCAGGTACATCCACGCTCACCTTTTATACCAACGTCTTACAGTTCCTTCAGATCTTCCGTCATGGCTTTAATCTTGTTGGTTATCCCTCGATCTACGAA 
GAATCTAATATCTCCCTCAGATCTGTTCAAAACTTTCCACGACTCCATTGGGTCCAGAGTATAATCTGACCTTTCTATGTTTTTGACAACTACAGGATCT 
GGTTGAATATATGCAGATGATGCATCAAAATCTACTTCTTCGCCGCTCATGCCTGTTTCTAGTCTTGAAAGGAGTGTGTCCAAATCTTCCTCTTCTTCGT 
CTTCTTCCATTTCTTCTCCACTATCTAAAAATTCGTCTATATCATTATCATCAAAGTCTATGTCTATATCATCATCATCATCTTCCCCAAGAGATGAAGA 
AGGAGATGGGGTTCGTGTCGGGGGAGGTGGAGGAGTAGGAGTAGGAGTAGGTGGAGGAGTAGGAGTAGGAGTAGGAGGGGGTGGAGGTAACAAACCTTCT 
CCAGGCACAGTTCCTGTTCCTCCACCTCCTGTATTGGACCCAGTATTTACACTCGGTTGTTCAAATTCAAAAACTTCCTCCTCTTCTTTTCCGTCTTCTA 
CAGCACCTTCCTCGTCTCTTCCGATCGGCCTATTTTCAGGTTCCCTTCGCCCTAATACAGCTCCTTCGGTGTCATCTTCTTTTTCCTTCTCTTTATCTTT 

AAT AAAAGGAAC AT C ATTAT TATCC TT AT T TTTT TAG ATT C AAT T ATGGC AC CAGAAAAAT AAAATGATT TGTAC AAACACGAGTT ACCCCTCC AC ACG C 
CTGACCTCTGCCGCGAATTCTTCATACCCTTCCATATTCTTTGCTGCAAAGGTCACGTCATGTTGATGAGCTACCATTTCATATCGCATATCTCGCCAAA 
AAGATCCATCCGCAACAGCGGCCGTGCATTCTTCTTCAATGGCAGAAGTCATGTCAGCTGTCATTTT'rTTAATAAGATCACGTAGTATTTCCTCCCTATA 
CAAATCTATAACCTTCTGCCTAAAGACAGGCCTCTCTGCTAAAAGAGAAGGATTTTCTGTAAGCTGGTCCATGGTGTAATAAAGTGTTCATTTATCTAAC 
TTTTAAAACCCAAACTCTTCTAAAATATCTACAGATTCCAGGTACTCTGCCGCCATTGTTGTAATCATTGTATTTGCCTTGTCTATCACTGCTTGTCTTT 
TATCITCCACATCTCCCACCTCGTCTTCGTGTTTTTTCCTTTCTTCTTCTAGAAGCCGTCTAAAGACAGCTTCTCTAATTGAAGGTATTAGCCCCTCAAC 
TACCTGAGAAGCTTCCTGTTTTACATCATTCACAAATTCGGGCGATTGTAGTACGAGCGTGATCGCGCTCTTGTAGATGTCTTCCAAAATGTCCATTTTT 
GTTGTTTAGACCTAGATAGAGAGTGGGAGACGCTATTAATTTTTTTGTTCATGTGGCTCATGACCTCATGTCCGATATATGTATTGGTTGGGTGTAATAC 
AAGTCTCTACAAAGCAATAGAAGGGGTAAAAACAAAACACACCATGGTGCTAACTCTTTCCTGTACGACTAG(3CGCGTAGCGTCTAGCAAGGGGAATTTC 
TCTAAGGAAGATGCGGTGTTGGGGAACCAGTTCCCCATTTTAAAGAAATCAAACAACTTGTCAATTGCAAGACCTCCCTCAATAGAATCTTTTTCTGCAT 
CAGTGGAAAAAATATTCAGGGAATGGTUiCGAAAGTGGGGGAGAAfiAAATTTTCGACATATCTCAGAATGAAGAAGAATGGATGGATATCATATCCTTAGT 
GGAAAGTGTAT ATGAACCTGT ATT TTCTAAATC AC TT AAACC TGATAAATTGGC AGATAAAAC ATGT CT AACCGCGGCTGCCTT T GCAGCACTAGC T TCT 
GCCGTGGATGAAAAATTGACAATCTTATCAGGTAGTGATGGGAGTGTGCTTCAACGTACAACAAAGGTTATGAAAAAGGACCCCAAAAAAATAGCAGAAT 
CT CTTTT AAATAATG AAAAATGGAC ATCTATTTT ATT GGAC AGGT TAAAAAC AGCC AAG AAACT TCTAAGC AGAC GAGGTGCACT GAAAAGCGCC G AAAG 
AGTAGAAGTACTTCATCGGTTGAATAAACTCAAGGAGGCTCCTCTTCCCCACCATCCGAGCCTATTTGATAATTTTAGTGGAGGAAAAACATCAGCAGTA 
TCTGCTGGAACAGTCATCGCATCAGATATGCATTTCAAATTGGTTGAACATATTTTTAAGGTCTCCTTTAGAAAATGGGGTCCCTGTGGAGATAAAACTG 
AAAGCGGGGAAGAAGAAGATGAGGAAGAAGAAGAAGAAGAAAAGAAACATTCCATATCAAGATTCGTGCTTCAATTTATGAACGGACACAACGGGCAACA 
TTATCATAGGCCCGAAAGTGCTTCTGTTO'ACTTTTGTGATTATTATGACTATTTGGCCTACAGGAATCTCCCTAATGAGTACAAATTATCGTCAATGCAT 
CCTGGCACATTCAATATGGAGGATTTACCTTTCCGCCCTTTCGCAGTACCTTCAACTTATAAGACAGAATTAGAGTACAAAAGGTTTGTGCAATCAACAA 
ATCTTCCCCAGCTAAGTTTCGACTATGGGGAGTTTTTATGTTACTGTATCTTCGGAGCAGATTGGTACAAACACCTGGGGGATGTGGTAGATTCTCTAGA 
AAAT AGTTCCATGAT ATCATTT GAT TC TC AGACAT TG AGTGGTGTGTATAAG AAC AC TGC T AAT TAC AAAAGGTTGGGGAAGAAAAGAAACGGAAT AGCC 
G ATT T GGCCGTT AGG AGTATGGC AGAATTT ATCCG C ACTG AAGC GC AT AAGGC AT TGAC AGC AGAAG AG ATGGAAGAAGAAGAAGAGG AAGAAGAAG CGG 
AAGAAGAAGCGATGGACCAGGAGCCTGCAGAAGTAGftCTTCCTTTCAGTGCCTCATTTACGCCGTAAAATTCGTCAAGCTGTTTCTGTGTTAAATAACTT 
TGTGGAGAACGATCTTTCTATATTGGTTTCTAACTTCAAGAATGTGTTAACCGATGATACTCTATCAGGAACAGATACGGACAACTTTGGTTCTAGTGGA 
GAATITGAAGCATTATCTTCCCATTTATTCCTTTCAAGAATATTGGATGAAGTGCACATTCTTAGGAATACTGATATACAAAGAACCCTATTTTCAACGC 
ACGTGTCTCTGTCGGATAAATCTCCCCCTAGCCGTGTCCGTGGAAGCAATGTCAACTTTAArAATAACGCTGGGAACATTTCTTCCCTGCAAACGTATGG 
CGGTATAGAAGAGTTGCCTGAAAATGTACTAGTCGGTTTGTCCGGAGGATTTGAAGATACCGACATGTATTCCGGAGAGGATGTTGTTGTCGTATGGGAT 
GGTTGTGATGGAGGAAAAGTGCTAAGTGTCACCTTCAATTGTGGTGATAATTTTATCCAGCrCCATGAAAAAACAGCAGAAACTTTTAAGGATGATACGG 
ATTTAGTTGAACGAATAAGAGATGTGCTTCAGACTGCAAGTAAGACCGGAAACCTTAACAAAAAAGCATATTCAAGGAAGAACATCTACGCTGTTTTGCG 
T G AAAATGGCATTGAGCGCC CC GGGGACGAT TTTAC AGAA AA GG GC ATTGCTCT CAAGG AT AAAAC AAATC AACCCCCTCCCC C TGC AAGAAGTGC C AAG 
ATAACGGTTGAAGGAGTCAAGGGATTTTTCAGCGGTTTTCGTGACATTTTGGAGACGAGGGCGCTCACCACATATAGTGCAGAAACCTTCAGAGATTTAG 
GCCAAGGCATAGTAAAAGAGACCGAAGGACTGACAGCTGCAACAGTGGCAGAAACATCCTTCTCTGAAGGTTTAGCTGAAAGTTTAAGGTCTGATGCGAA 
TCTAGGTCTAGAATTTTCAGAGGACGCCAAAACGGTTGTATTCAAAAATGACACCTCTCGTTCTTTATTGGAAGAAACTAGGGCATTAAGAGCAAACAAT 
ACTTCTTTTTCGTCGTTTGCAAGGGACATGGGCGTCCAAGTTAGTGCCGATTTAGATGCTGAATTTGCTGCRGAGATGAGAGAAACATACCCCGATGCAG 
CCCTTGAACAAAACTTGAAAGATCTCGACAAATTCGAAGAGACTATACCAGAAAGTCAAGTGAAGAAACTAAAGAAAATAGACAGTTATTTGACAGAGAA 
TCCAGAAAGGGCTGGCAAAGAAATTAACGACACTGAACTGTCAAAGGCTACAGATTCAGTATTGGGGAAGAAACTAGGCAATGCAGTTACAGTGTTGATG 
AACAACTTTGGAAAGGTTACAATTGTAGTAGGGGCTTCTGTGGTGGCCGGGTTTTTAGGTCCAGCCGCTGTCGCCCTGGTGCATGCGTCCAGAGGGGCAC 
ATCTCAACGTCGTGGACCACACCAGCCCTAAAGGTGTCATCAGTTATAAAATTGTGGACTTrrCTTGTGCAGATAGAAACACCGGATGGGCTAAGCCAAC 
CAAGCACCCGTTCAGGGAAGAAATAGACCATGTTATCGCACTAGATGCATCATTCTTAACTGAAAATGGAGCATATGTATTCCCTGAAGACGGAGGACCC 
AAATCGAAATATAAGGCCTACGCACCAATCTGTGGAACAAAAGATGCTGCTCAAGGAGAATGTGGATCTTGGGCAACATTCGACGACCCGCATTCTGTAT 
TGCCTTGGGTGGCAAGCATGAAAGATTTGCCTAAAGGACAATCCCTCTCCTGCGATAAAGGGATGTCCACTTTAAAGGCAGTTTCTTCCGTTCTTTTGTC 
CATAGGAAAGGATGTTGCAGAGGCTATTTTTGAGGTTGCAGAGGACGCCGTGGTGGGGTTGGCGAGCAAGGCAATTTCAGCTGTAATAAATAACCCCTTG 
TTCATATTTGGAGTGCCTCrrGGATTTGGTATAGCTGCTACACGCCTCAATCCATCCAACTGGAAAACTGGCCTCATTGTATTrTCAATACTACTAGTGG 
TCATACTGATAGTTCGCTTCTTTGCAGGGTCGGGCCCGCTAACCCTGAATTGGTTTGGTGCAAAGAATTCAGCTAAAAGGAAACAGACTGAACAATTCGA 
AGACGGGGGAGGAAATCGTTCAAAAATAGTATTGGCAGAAAAGGACAACGCCAATAGTAAACTTCAATCGA6GAGGAATGAAACTGGGCCCATGAGATTA 
GAGGAGCTTCCTGGGCATGAAGATTTGCGCCCAGTTTTCTTCCCTGCCACAACAAATTATTCCAAATCTGCCAAGATTCTGGGCTACAAATCTAAACCCT 
TCAACGACTTTTATACAAAAATAATAAACACAGACATCATAAAAATGGATAGGTAAAATAACACATTAAATGTATATTTTATATATTTTTATTTTAGTAA 
CAATAAAGATTATACATGTATAAAAATTGCATTGTTTTATTCCTATACACCAACAATAGATATAGACTTATCACTAAGATTATAAGTTTCTGATGACGTT 
TTTCCATCACCCATTCTGATTTTATTCCTATTCTTCTCCAAAACAGATTGCATTCCGTGTCCATCAACATCCGTCTTCTTCTTGAAGATGTGGGTTATAT 
ATGGCTCCGACATATCTTGTACTGTCGATTGGATAGTTTCAGCTAGGGAATGTTTTTCTGGrCCAAATTCTCCTCTCCCATAACGGATGCGACGATAAGC 
TTCAGCAGAAGCCAGCAGGCAAAAGTGGGCCCAGCCAGCCTCGTCAGTACACATGGGGTGCrrCAGAGTTTTTGTTCTATTGTGCACATCAACATATGCC 
ATAAAATCGGAATGGGTCAAGTGCATGTCCTTTAATTTACCCGTTTTTGACAAGTAAGTGACTGGCAACTGTTTGGCAGTAAATTGTTCATTGATATGAG 
AAGTGGAGTATTTTTCTTGTTTAGAAGCATGTCGTATATAGAGAGCGACAGCTAAGGCAGTTATTGCTAGAATAAAGAAAACACCAACAACCATAGGCCA 
ATTCATTCGTTTAGTAGTTATTACAATTTTGTTCCACATGGTCATAATTTCTCAGAAAAGTTCCTCAACAGGGGAATTATAGTACGTGATAGAATTGCAG 
TGATAGGCACAGGCGGATATTCCGACGGTCCTCTTCCGAGAATATTTTGTCTAGATTGAAGCATTGCAGCAATCTCCTGTTTAGCATGGACAACCTGTAT 
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ATCATCACGAGGCAAAAGATTGTCCTTTGATCCTTTTGATAAAGCTTCGATGTTTATAAAGGAACCTAAAAGTTTCCACATTATAGCAATGTGATAAAAG 
CTGTCCAACGAAGCATATACAGCAGCTGGGAAAGTTGTCCATATTAATTGCCCACTTTGTAACGCAGAGGCTAAATTTTCAACCCCGATCTTGTTGCTTT 
GGCAGATGTTCGCCATCCAATCTAGTTTTGTTATCGTATCTAGAACTGGGGTATTCTTAGCCAGTTTGACTGCCTCCCCTTTCGGTTCTTTGGGGAAGTA 
AACTGGCTCCTTTTCAGTAGAGTTGGTTTTTATGTTTTTCTCTGATGCAGATTCTCCAAAATATAACAGTTGGCGAATAATGGAGTCAACTTCAATAGTA 
TCACTCTTCACTAATTCTATGTCCCTTTCGATTAGTTGGCACACGTCTTTATCTAGCCTCATCCTGTTAGAGTTTCCTGTTTCGGCTGCTGGTGCTGTCG 
CAGACGCAGTTGCTATTAGTCCATCAGATGCGTCCAGTATGGAATTGGGAGGAGGAGCTGCAACAGATAACATGGCAGTAATATCATGGATGATTTTGGA 
GTAGTCCAGTTCTAATACATCAAAATCCTGTTCTTCCCCTTCCTCCATTTTGTTGGTGTCCATGTTTGAGGAGGGATGGTTGTTGATACTACCCGAGCTC 
ATGCCGTACACCGAATGACGAGAGGAGGTAGGAGCAGGCCCTTTTTATAGCCCTAAAGGGCCATTATCGACAGTACTATTATAGAAGTTATTTTTATCCC 
TGATATTATACAAGCAGAGAAAAATTTCAGTCTACTTGCAGGAGACATCTTTTTAATAAATTTAGGGAGTCTTCTCCAAGTTTTCTTCTTATTGGTTTTC 
TGTTTCTTGTCTGTTGGTTTTTTGAGTCTTTTGTTGTTTAGCTTATGTATTTGGGGTTGCCGATTACGGTGCCTCTTCTTCCTTTTATTCCTGTCAGCAA 
GTAAAATATCCTTTTCGTACTTGATAAATCTATCAACGTACTTTTTTATTAGCCTGTTTCTGGTCGTGGAATTCCGTGAAGAAGATCGTAGAAAGTTTGC 
ATCAGCAGATGTGGATTCTGTATCTGAATCTTCTTCCTCCTCCTCTCCAAGAAGAGAATCGACGTTAGTGTCTGArrCAGAGTCATATTCTTCTTCTTCT 
TCTTCTTCCTCATCGCTTGAAGGGATGGCCGAAGACGTGGGAGAAGAAGAGGAGGATGAGGAAGAAGATGTAGAAGTAGAAGCCATTTCTTTGTTTGTTG 
TTGTTACCTGCCCATGTTCCTCCTCGACATCATCTTCTTCTTCAATCTTATTCAGTGCACGGTTCAATTGCTGCTGATAATGCCTTGTGTTTGTTACAGT 
GCTAGACCATAAAGCCCTCCTGGAAGCAGCCGATGCTCCAGGGTCCATTTCTGATCGCGTACATCTAACCTTAGCAATAACCAGAAATACGTGCTTTTTT 
TTTACTTTCAGCCAGCCGTGTTAGTCATCTACGGAATGGTGTCTTTGTCTTTAGATGAGGAGTCGTACAGGCAAAATTTGGGTAAACCTTCAAACTCGCC 
TGTGCAGTTATCAAAGCATATTTGTATCAAAGAACCTTCCGCACCTATTAAATTTCCAACCATAAAAGAGTCTATGTGTTCATTAACAAAAACACTATTT 
TCTTTAGTGAGTGTTAGTTTCTTTCTATTGCACAGTATTATGTCCACATCGTGCCCGATTGTGTTATTTTTATCATTGATGACAGAAATGTCGTAAGCAC 
TTTCTCCTTCTTCTTCCCCCCACTCCCTTTCTGCTTGTTCCTCCTCCTCGTCTTCAAATTCTTCGTCGTCGTCTTCTTCTTCTTCCTCCCCGCCAAACCT 
CTCCCTCGTTTCTTCCTCTTCGTCATTTATTATATGTCTCCTAAAATTCTTGTCCAATAATTTTGTAATGAAAACACACTCTGGCTTCTTCCTGACCTTT 
AAGAAACTTGCATCCATCATGCTGTTCATTCTCATATCACAAAGGCCATCATACCTATCATCTCTGGCCATGATCGCTTTGACGCTCAAGAATATATCGT 
ACATAATCTTAGACTCTTTTTTAGGTGAGAAAAGTTGGTTTTGGAACTGTCTAAAATTAATGAAACCACGGGACAATTTCTTAGCCAAACCCAGAAGATG 
AGTVACCTCCGTACCCTTCTCTGTGGGAATGTGCTAGTGCAGCATTATGTATAGTAAAAAGCAAACCAGGATTTrCTTTAGCCAAACTCTGTATAACATTA 
GTTGCCGTCACTACAGGTATGAGATTTGTCATATCTAATTCAGGTACATCTAAACATTCACTGATAGGAATCACTCCCTCTTTCTCTCCTTGACCAACAA 
TTCTCACCGTCACTAGTTGATGTGCATCTTTTATGGCTCTATCATAATCATGGGGGAAGTTTGATAGAGCTAACGTTGTAGCATTTTCTTCTTGCAAAGG 
TAAACACCCTTCGTCTTCATCATTGTCGGGAACATCTTGATCAAAGTCCATGCCGTTATTGTTATTATATGCCATTATATTAGAGAAAACATCCATCTTT 
CTATTCCTAGGACCAGAAGGGAGCATTAGAGCAGTTAATGAAGGTGCATCACTTTCTATTTTATTTGTCTTGGAAGTAACTTCGTTATTTTTCGTATTCA 
AAGTGTCTATCAGGGCAGATGCTGCATCACGCACTCGCTTCTCGCGTGGGAGAAGATTGTCACAATCAGATTTAAAGTGGACTAAATGCAGACACAAATT 
AGAAAAAACGAATTGAGATGCGGATTTAACAATGGACAATATTTGATTATGAGGAAGTACAGACGTGAAAGAAAGAGCATGCCCAAGATCGTCTGATGAT 
GTGATTTTATTTTGGGCACCTATAACTACTAGCGCTATGTGCATCGCGAGGACAATTACTTCAGTGAAGATACCATTTTCTAACGTCAAACCTTCTACGA 
ATGCAACCCCTCCTCTATTTGTCCTCATCATCATCTTTGAAAAATGTTCCATCAAGTGTACCTTTGTCATACTGGCTGTGACAATCTGTACGATATTCCC 
CATAGTCTTTAGGACTAACCCTTGATCGATAAATTGTTCCCTCATAACTGCGGCAGCAATATCTATGGAATTATTACTGTATCTATCGGCATATGTTCCT 
GCCTTTGAATCTTTGATCCTCTTATCCAAACTATCAAAAAATCTTCTCTCTGCGGGCGAGTCTGTAGAGAATATGAGGAACTGAGGGACTGAAAGTTGAC 
CGTTTCTGTGAGTCCTACCGATAGTCTGTAGGAAAGCAATGGCGTTATAAGGTACATCTAAAACGCAATGGTAACGTTTAGCGTACATGGAGTTATTAGA 
AGAGTCATGTAAAGAGAGACCGGTATTGCCTTTAGGTCCCAGCATTATAACGTCGACTTCTTTCGTGTTGTTGAATGCACTAATACATTTGTTTGTGTTG 
GCAGTTTTAGTGTTGTTTTTAACCAGAAACAATCCCCTATTCGTTATACGCGAACACAATTTCCTGTTTGTTATTTCAGCGTTACTATCCTCCCCTATTG 
ATTGTGTGATTGAATCTATAGGGTTGCCTGCTAGCGCCATGAGCGGTTCCGTGTCAGGGACTGAAGGCACAAGCATAACGTGAGCTGGGGAAGTTCTTCT 
ATTGGTTTTATTCTTTAAGGAGATTGTGAAAGCTGAAGCCATCATGACTGCACGGCACAAAAGTCTATAGCCTGAGAATATTGTATTAGCAACAGGCGAT 
GAATCGAATGTCCCCACGTCTACGATGCCGTATTTTGCGTCACTTTCATCCTTGGCGATCGTTTGTAAGATGCGTGTAGTCAAATTCTTGAGAAAAGAAT 
CCCCAGTTTGCTCTAGTGACATGACCATCTTTTTTGATTCGTTTGTTGTCCTTAAACGCTTCACTGCATCGGTGACAGAAGTTGCCTTGATATTCAATAG 
TGCACCTTTACATGCTGCCACAGAAGTACTTGCTGTATTAATGAAGTATTGTCTCCTCAACTTTTCAAAACACTCGTCGTCAAATACTTCACCCCATCCG 
TCTTCTTCATTAGCACCGTCGTCATAATCTTCGTTAAAATCAGTTTTACCTTTGGAAGGTTTTTCCTTTTTCTTGATAGCATCGTATAAATCTTGAGCAA 
CCATATCAATGCTGCTTATGTCAACTTCCGGAGACGTTATTATTCCTCCTCCTCCATTAGAAAGCGCATCCTTTTTCTGTGAGTAGGTTTTTATAGCATC 
TTGTACTATACTTCTTAGTGTTGTTTTCCCATCATGTGCGACATCCCTATCGCGCACAACCACCACCTTAAAACGCCTGTCGATATTTTCCATTACTTCA 
TATAAATCATCATCATTTGCAGCACGTTTCCTCTTTCTGGACGGGGAATTAATCTCTGCTACTAATTTTTCAACATCTTCCTCTTCCAATATTCCTCCTT 
CTTGGTGTTCTCGTATAATCTTTGTGAAGGCCCTTCTGACATGTCCTCCTACTTTTTCAGAATCGATGAGTACTTGTCTAGCGTTTAAACAGTACGAAGC 
CAATTCGTCTATAGCGTATTTTTGAAGAGGGGRGGCTTTACAATTAGTAGTAGAGCAGTCGACTCCAGCCATGCTAATACTCCTACTAACCAATTGTCCT 
CTATTTCTTAAAAGTTTAATAACCTGTTCCAGAAACACTGTCACGTACTCGCTGTGCTCAGCCATAGCATCTGGAGTAGCACTGGACACTTCTTTGAAGG 
CTTGTATGCTCGTGTAGGCTGGCGCACTTTTCCTTAGTATGTGGTCTIATCATGTGCAAATCAGCATTGCTCTGGAATGGCGTAGCGCTGGCCATAACCGT 
GAACGTATCATACTTTAAGATCTCTAAG7VAAATACGGAACGCATCAGCTTGAGATAATAGTTTCAAGTAGTTGGACATTGTAACTCGGCCCTTATTATTG 
GATGATTTCATGCGCTTCACTTTAAAGTTTCTGTCGTCACTCATTGCACTCTTGAACCTATTAATAAGGTCTGACACTGAAGGGTTTGCCCTTTTTTGTA 
TGTTGCGCAAAACTCTAGTATCTTCTTCCTCCCAAGTCTTTGCCAGTTCCTTTCTAAAAGAGTCACTAATATTTTGAGTCTTGTGGAACTCGTCACATAA 
TAATGCTGTGACAAAGTTGTCAATGGGCATAACCTTGTTGGAATCGAGGTCTGTTCCTCCTGTTATAAATTTCAGTACAAATTCCAGATTTGTGCGTAAA 
TCAGAGTACGTCATAAATAAAATGGTGGGCGTTTCTGTTAAGAAATCTCCTATTGATCTTGAAGAACGCTTACTATGAAATTCCTTAACATCCCTGAGAG 
TGAAAAATCTAATATCCACCGAGTCTTTAGCAGACGATACAGAATTTTCCACATCTACTGTCATGGACCCACTTTTCCCGTCGCTCTTGAAAGACGTTGG 
TTTATTAGCACAAGGCACTTGTAGTTTTCTCCATGAAAATTTAGGATCACGAAGACCTCTAGAATTTGTAACCACTTCTCGCATGCCCTGTTGACAGCTG 
TTGAAAAGAGGCTTAGAGCACGTCAACCATATAAAAAAGGGGTGCCTTTTCCACACGCCACTGTTGATGGCCGTAATTACCTTGTCCGAATCTTGACCAA 
AAATTGATGGCCCTACGTGTTTTTGTACATCTAAGAGTGCCTTTTCTTGTAGTATGACACTCATGACGAAGGCTGCCAATTCTCTTGTTTTTCCTACACC 
AGTACCATCTCCGATGATAAATCCTCCACTCGTTTTCTTGTCCACTTTAAAATTCCTTTCGACCTTATCTTTCACAAGGTCTATAATGGACATGTTCTTC 
TTCCACCCTCTCCCTTTGTAGAATTGAATCAGGGCTAGAC6CACTGCTAGTGCCTGCACAGAATTAATGTCCATCGGCCGAGG7LAGAAATTTGGGGTAGG 
CATCGGTTACATTTTTGCAGAAGCCCACCGACCGGAATGTTTCACTCTCATAAAGAGCGAATGAAGGTTCTGGGTGGGAGAGGGTGCCAATCTCTTTAGC 
ATCTTCTTCCTCCAGAAATAAAAGTTTGTGAGCCATGGTAATTCAACCAGAAAATCAACCAATATATGTTGTAACAAGAAGTTGCCTTAAGAAGACTCTT 
GTGTACTTTTTTTACGATACTTGAAACGAGTACGCCTGCACGGATCAAAAATTTGGTATACTGGTTAGCTGCTATTTTTGGCAAAATAATAGTTTTTATT 
TTCCCATGCTAAAGAGAACCAGGAGTTACGTACAGCCATACTAGTCTCTACAAGTAATCGTCTATAAGCCTCAGAAGTATCCACACCGTCCCTGAACCAC 
GAAGTTTCTTGCACGTCCGCCTTCTTATGCCCCCACTCGACGAGTGACTTCTTGAATAGAGAGAGGGGTGTTATTGTAGCTCCTTCGGGGGTTGAGGAAA 
TGTTTGACGCATCAAACGCTAGACGAGGGTCTGACTGTACTACCGGGCCACTCTTGGGGTTATGATCCAAACGAAACGATTGCAAACCCCACGCATCGTC 
GTCAATATCCACTCTGTACTCGGATGAGGAAGATGATGTTCCGAACAATTTTTCACTTCCGGGAGTTACATTGGAGCAATCAAAATAGCTGGGGATTGTT 
TCGCTGTCAAATATCCTATTGTGGTCTACTGAAAAAATATCGTCGGTAGCTCCTTCTTCGGGAAGAACAGAGTAGGATGTGGGGAAAGCGTACTGCAAAG 
GATTGGACACTTTCAATGTACTGGAAGTTTTTGATGCAGGTTTTCCTTTTAGGTAATCrAAATCAAACATATCTCGCGCTAGCACAGTAGCAGCATAAGG 
TCCGTATGAACGCATTAATAACCCCTCTTTCAATACAAAAGTATCCTCAGATTTTTTTATGATATGTCTTAGAAATTCTTTCCTCTTCATTAAACTATCC 
CAAGAACTTGATTTGAGATATATTTCACTTGTCCTCATCCCTCTCGCGTGTTTCTTTATTACGTATTCTGTCTGAGACAATCTCCTCTCATCGTTAAAAT 
TACGTTCAAAAAAGTCCCTTATTTCTTCCAGCCTTTGGTCACCTGTAGAATTGTACTCTGGAGCTGGAGTACCTCCAGGGGGAGTTCTGATATAAGAGGA 
GGGTTTATTTTGACTCAAAAGTGTAGAGAATAAAAGTTCAGATCCGCCGTGAACACTACCCGAAACCATGCGACTATTTCGCAAACATTCTGTGGCCAAC 
ATACTATGTTCATAGGGTTTTTGAGGTATGTTTCCGGCCAAAACATTAGCTGCTGGCTCAATGTAAAAGGGTGCAGTACAGAATGGCCTATGCTGCCCCG 
AGGAAGGTTTGTTCAGGACGGGAGCTAAACAACTTCTATGTAACAATGAAGACGGGCGGATGATTACATGGGGAATATTTTCTAGAGGAGGTAAAACAGT 
GCTAGGTATTGGTACAAAGAGCACATTCTTAGAAGTATTTGTTGCATCCAGGTTTA7VAGGCGTGATGATGGGCGCTTGGGCCAGTGCAATTTTCTTGGCA 
GTAGTTTCGTCGATACTTGATTTAGGTCTTTGCTGGTGAATCTGGTCCCCTCTTTTAGATTGGTCGGCCAAGAAAATTCCCATGGCAGGAACAGTCCCCA 
CTAACGGTGTCCTGTCATCGCCTATTATTGCATGCCTGTTTGAGTCTTGGGGATTATTAACGCCCGGTACACACTTCATTGTGCTCACATCAAACGTAAC 
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AAACGAAGGGTCTATGTTGGGCATCATAACACAGGGGTTTTCCATACATTCATACATACCTCTGAATTTCATATCCTTTGTTACATCTTGAGAAAGAAGG 
TCGTTATTTAGTACATCCTTCTTATTATTACAGAAACAGGTGATGGGGTTTTTGTATAAAAATTCAGGATCGTTGCATTTAATTTTGAATTCCGCCAAGG 
AAGAAGATTGACGCAATTTGGAGATGTAAGAAGAGTTGCGGAATTCTGTATGATTCCGTATTTGTTGTTTCTTTAATACATCATAATAGATTGGTGTCGA 
AGATGAAGGAGGTGGTGGTCCTTTGTGGGTTGAAGGATTGAAAGCGCAAGCAGTCATCGATGTTCCTCCCGAACCTCCGAAATATGTTGGGTATTTAGGT 
CTGCATTGCCAGCCCCCATCATCTGTCAGAACGACTATAGATGTGTGCACAGAACACTCTTTAACTACACGTTCTCCAGTGGTTGGGGATGTTTCTATGT 
GGTCACCGGGCACGTTTCCTGACCAACAATAAGGTTTAGGATCTTCTCCAGATTTATGTGTGATAAATGTTCCTCCAGCAACAAATGGTGTCATCACACG 
AACTGCTCTCGAGGTATAACACACAGATGTGCAAGTTTGTCCCTCTTTC7VAAGGAACTGGGTTGGAACAGTACCCGCCAGTATTGTAGGCC7VAAATATCC 
ACATCTTCATCATGTCCTTCTGATGTAGATGGAGGAGGAGGAGGAGCGTCACCACTTACGCCTGATGCACTATGGTGATGGATATCATGTACACCAAATA 
AAGAATCGTACAATTCAGTATCGTCTTTTTCTATATTATCCTTCAAATTGACCATCCTTTGTCTAAACTTTTTCCCCGATACAGCATCTAGATATTTGTT 
AGATTTTTCGATGGTGGCCTCACTCAATCTATCTATATGGTGTTCAGGCACGAGAAGAGATGAGGATCTATTTGCAGCTAAATCATGACTTTCGAACCGA 
GAATTCCTTTCCTTCGTTTCTGACGGAACGTTCCATTTATTCCCTTTAACAGTCAAATTCATCATGTCCAGGAGAGCGGGGTGAGGTGGGTAATTGCTTG 
CATCTTCCACCCCGTCATGCCTACCTCTGGCTTCTAGGTATTCTTCTAGTGTCTTTTCAGGGTTAGCTCCTTGAACTGCTTTCAATTTACTTTTCATCCC 
TAATTCGTGTATAGCTTGAAAATTAACAGACATTGGTTTGGCTACATTATAGGCCATTGCAGCGCAGACTAGCGCAAAAATACAGCCTAACACGAACACG 
GTCACGAATAACGTTATTTGCGCGGTGAAGCGCATACTTCGGTTGGTAGGGTAAAGTAACGGATCTCTTCGTCGTTCATCCATCGTTCCTGATATTACAA 
TGGATATCTTCCTAGATAAATCTCATACAGGTGCCACCCTTTTAGGGTCTTATAAGACGCAGACTGTGAAAAATAATTCTACGTTAAAAAAGCTTAAAAT 
GGGCGCGCCTACCAACGCTGATTTTACACGCACGGTATCAGGAGTAGCTTCTTCTCTCTATCTTGTTAACCCTGGAGCTCCTTCCGATAGAG/iAAAGTTA 
GTTTTGGCCTCTTCTTATTCTGACTCTTTTGTGTACAACTACAAGGATGCAGTGGTGACCGCTGAGGCTCCCAAGTGGTGTCCCTTTAACGAGCCAGCTC 
TTCATGAGCACATCATGAACAGACTTGAAAAAGCTGGTCTAATTAACAGATCTCGTTTTGTGTGTAACCCTGTTAAATCGGCTGGAGAAGTATGCGGATT 
TCGCTATTCTGGAGGAAGTACTCCCCAGAACTTAATTTTCCCGATTGGAGCATCAGAGATAAATTGGTTGGTCATACTTAGAAATGCTGCTCGTTTTGGT 
ACAGTGGCAGCATCGGCCAAAGACGCCATCGAACGCATTCCTGATCTAAGAGAAGGTGGTACAAGTAAACATGTGGCAAAGAATGCAATGAGGAGACTTC 
GTGTATGGCGAGCCTTTAACTGGATAGCGGAAGCCTCCAGATCAGCGGGCATGATTCGCTACGAACCCTTTTCGGTTGGCTGTGCTCTATACGATCACGA 
GGTAAGAAGAAGGCAACTAAAAGGAAGTTATGAGCGAAATGTACTATTTCTTGGTGACCATTTCTGTAAAGAATCAACTCTTCTGGCTGATATTTCAAGA 
GGGGGAAGAAGTTCTGATTTCTGGACGATCGTTGAAGCTGTCATCCGGTACAAGAATAGACATGCTCGAACAATCAGTAATGAAACTAATGCCATCCCTG 
AAGACTCTTCTATAAACTTGGAGTGGGAGGATGTTCTAGTAAAAAACCAACGCGACACAAATCAAGGAGATGATTCTACGTTAGAAAAGACCTTAGAAGC 
TGCCATCAAAGAACATGAAAGCATAGGAGAAAAAAGGAAAAAACATATCCTAGAGTTTATTAAAACATGTCTAACAGAGGAGCAACGAGAAATGATTTTT 
AAGGGAGTTGGAGGAAAAGGGAATTTGTCCCCGGCCCATCTAACAAATCTGGCCGATGCGATACTGGCTAATAATGCCAAAGCAGGCATATGGGTTATTA 
TGCAGAGTCTTTTAAAGCAAATCAATTTCTCTATACTCCATTTAATAGGGTACGAAGCTCAACGACTTTTAATGTTCAAGTTGTATATGCCTGCATTACT 
TGCTCTTTTTATATCTCAGAGGGGAATTGGTGACGTGTTTCTGAATGGTGTGTTTAACCTAGAAGTGATGAAAGAAAGAGCTGCAAATGCAAAAATAAGA 
GACATGGTTTCTCGAGACGCCTACAAGAACAATACTAACGAGTCT/VATT^TTTGGGCAATTATTCCCAATTCGATATCGCTACAATTTATGGTCAT^TGT 
CTGATTGTAATGCAATTCCTCTATCTATTAATATTGGAGTACCTCTCCATAAATCGAGGATGAACATGCAAGACATTGAAAAGACCATACAACACATAGT 
GGATATGGGTTTGGCAAAGATACGTGTAGCGGAAGAAAATAGAGTAGCTAGACGTCTTTTACATAGAAGAAAAATGCAAGAAAAGGCAGCCAGGGAGCGA 
GCAGCTGCGAGACAGAGAAGACTAAGAGGTGAAGAAGAAGAAGAAGAAGAGGAGGAGGAGGAAGAGGGAGAAGAAATGGAAGAAGAGGAAGAAGAAGCTG 
GAACATCAGGTGTTAATGGTTCTGGTTATGATCAAGAAGAAGAAGATGATGGTGAGGAGGAGGAGGAGGAAGAGGAAGAAGAGGATGAAGTiAGATAGTGA 
GGGTGAAGATATGAATGGGGAAAATAACTCAAGAAAACGTAAAAACACTGGTAATACTTCCTCCACTCAACAACCTCCCCAAAAGCGTCAGCGAGGTAAA 
AATGCGCCCATTTCAACCAAGGGAAAAAGGTTAAAGGAAAGAGATAATATTGGAGGATTTTTATTAGCCACAATTCAAAACGATGACCGCCAAGTGAATG 
TGGAGAGCATACAAAAACTTTTGACGGCCAGGCAAAGAAAATATGTTAAAGGTGGAAAATGTGGAGATAATGGACTACCAGAATTGTTAATTGAAAAGGT 
TACAAATCTACTAGATTCAGTGTTCCATTTCAGAAAGGGGTCTATTCTTAACAGCATACACGCAAATAGGCGATCAGAAACTGGCGTATACACCACAAAG 
GCAAACTGTATTTGTGATTACTATGAAAGAAACGTATCAAAAGACAGTAGTAATAATACACCCCATTCATCTGAATGTATTGAGCGGGCAAGAGAAAGGG 
ATGCTTCCTGTGCCGAATCTAACAAACGCCCTTGTCCTGTAGACTCTAACAACCCTGAAGATGTGGAACAACGTATGCGGGAATTGATCATGGACCCTCC 
TTCATTATCAGGGGTAGAAGATTCTCTCGCTATCGAAAGAGTACTTCAAAATGAGATATTATTCACAAGTCTTGTCACCAATCCAATTTTCAATGCCGTG 
TTAGGTGCTGAAAAAGGAGATCTTGGAAGATTTATCGTATTAAATAACATTGTAAAATTTATGAATATGACTATTGCTTGTCTAGTGGACGGAGATATGC 
CTATGCTTCTAGACTCAAGAGGCAAGACAAAAAACCTTCTAGAAAAGGGTACAGTGAAAAACACGAGAAAATTTTTCAAACCAAATATGACTGCAGCAGA 
ATTGAACGTGGCCACTGCTCAATCTGCAGGACACCAATACATGAACGCTGGTCATTGTCCAGAACCTGGAATCAAGCAAATGTTACTCCCTGATTGTATA 
ATGAAACTGAAATCCATCGCCATGGAAAAGGGTCGAGGAGGGAGATCGGCCCTTCACAGACAGAAATGTGACCATGCCTTTTGCAAGATGCTGAAGTGTT 
TATTCTTCAATATTGACCCTTCAAATGCTGCAGATACATTTATTGACCCTGCGTCACGTGCCACCTTATTTAGGCTTGATGATCTTTGTAGGGATAGGAA 
AAAATACAAGAACATTGACTGGGTAAAGGATTTACTAGACCCTGTAATGAAGGGAACAAATAAATGGGTGGGAACTGGAGAATACACTAACATTGGACGT 
GACTCAAACGTGGCCGCCCCTGTTGATTTTTACACAATTTTGAAATACACAATGATTGATGATGGTGTAATAAGTGTTCCTTCACGGAAACCAAATGACG 
TGTATTACAGTACTATTGAAAGAGCGGATGACTTACTCACCGAGAGCAGGGACGCTTCATGTGAAAGCTACCGCCCGACCTTGTTTGACGCCAGAGCGGT 
GTTGGAAGTCAACGGAGACGGACGTGTCCCTTACCCTTCCAGTGAAACCGTAGAAGACTTGGGAGGAGAAGAAGAAGAAGAGGT^AATAACAGGCGTTATT 
GACGACAGTACTGAAATAGAAGACGTTCAAGTACAGGACAGTAATCTATTCGATGTGGAATTATTCGATATCCCTGAAATAGAACAACATCAACAGGGTG 
GAGAAGAAGAGGAACTTCCTTCAGCAATATCTGAAGTGTTTGCTTCATTACCAGCTGATAACGATTCCTCCTCACCCGCGCATATTCCCTCATTTGGGAA 
TTCTGAGGAGGGGGAAAAAAGTCCAGAACCGTACAATATTTTTGATTCTGCCCTCGACCAATTATTGGATTTAATAGATAGCGATGGAAGAAACAATAAC 
CCTAAAAGAGTCGACTGGAACAGTGTCACCATTCAAGAATGAAAATATTACCAGAATAGCGTCTAATTACGTCCGAGCCTTTACTGACACGTGGTCTCAT 
TTGGTCAACATTTCTGGAGCTCCTCTAACTGCTGAGAAGAATCCAAGCGCTATCCCAGCTAACGAACTGAATAGATACTGGACCAAAACTAACGTGTTAT 
GCAACCCACTCTTTAAATTGGAGGACCACATAACGAGGGATGAAGATACTGGTACAATAACACTAAAATTCAAAATGTATATAGATGATAAAAATGGACT 
ATATCAGTCTGCCGTTTTAATGCTGGCTCTCGATTCGTTCGTTTCGCTTGCATCTTTTTCCCATGGAGCTGATTTGGTTTCCAATAAAAGTGAAAACAAA 
TTCTGCGTAAAAATTCCCCACGACACTCGCGCTGAATCTTTACTGAATAATGTTGGATTCCCCGCTGGATTAAGTGGGCCTTTTAAGAGATGGAGCATTA 
ACTACAAGGCTGCAAACTTGAGCGGTAAAAGCGGTATAGATGGCCTTTCAGGTTCCATGTTGACAGTACTAAAAAATAACACCAATAAAAGAGCAACTGA 
TATTTTACATTTGGTGAATAATGTTTCTGCCTCTGCACAACAACTTGACGATTCTGAAATGTCTCGTACTTTTAACCACCAAAAGAAAGTTGGAGTTTGT 
T ATG AT AT AAATGT GTC C AGTTC AAGGC AAGTGAAC C AGCGT AAT TT AC T CC ACC ATC AG AAT AT AAT AGG ACAAC ATCTGATC G AATT T AG AACC AAGC 
AACTTGAACGCGCACAAAATAAAAAAGTCAAGGAAGAGGAAAATGGTGAGCATGAAGAAAT6ACAAGTGAGGAGGAAGAAGAGGAAGATGAATATGAAGA 
AGGAGGTTGCCTATCAGATATTGACGAGGAAGATTTCTATGAGGATGGTTACGATGAAGAAGAGGGCGATGACAATAGAACTAGAAAGAAGAAGAAAATG 
GAAGAAGATGAAGAGGATGAAGAAGAAGAATATGACGATGAAGAAGATGAAGAAGAGGCAGAAACTTGTGGTGCTAATGGTGTTATTGATTGTGAAGACG 
ATGCAATCATTTTCCCCAATGGACAAAATTCAAAAAGGAAGAAAAATGGTAAAAAAACAAACATTAAAAAGCGGTCACGGAGGAAGGGGGAGTGCTCTGC 
TAACACTTTATCCTTTGTGGAAAAATACGTTGGAAATTGTAAGAGCCTAGGTATAT^GCCAGTAGGGTGTCCGCCCCCATCCACTGAATTTACTTCCCTA 
TTTATGAAGGGGAGTGAAGCTGACAGCTGTTATAATACTTGTCAGTCCACCAGAGGGGCTAGCCGTATAAGGTCACTACTCAATAAATACTCTGTTAAAG 
ATTTGATGCAGGTAAACAGCCCTTCGAGTTGGAAATGGGCTAACCCTCCCGACCGACGGTTCGTGCTGTTTGATAAGAAAACTAAGGAAGAAGTTGAGGT 
TAAGTTTGAAATTGAATGTGAAAAATCCGAGTATTTTGATGTCGTATCTGT^CTCCCTAGTAATATTAAAGTATGGTTAAAAGAGACGGCAAT^TAATA 
AAACATTTGGCTCTGATTGAAGACTTTCTTCCAGCTATGGGTGCTGCTACCCCAJ^AAATTCCCCTCAATTTGATTAAAACTATGACGAGCATTTTCTCTG 
TTAGAGATATTGTTGGATTTAAAATACCAGAAGAAGTGCTCAGTTTTATTCCTATAGAATGGAAGACATCTATTTCTGCAATGGGGCTCCTCTCTGTACA 
ATTTGATCGTATAATAGAAGTGATAGATTTAATGATAACTAATGGCGCCTTTGCGACGTCATGCTTGAACAACGCATTCTTCTTGGAAAGAGGAGTGGTG 
CCCAGAGATGGGAGTAACACGTGGCTCCACACGGACCTTGTGCAACTCTCCACCTCCATATTTAGAAGTATTCGCAACAGAGGAGTGAATATTGGCGGTA 
ACAACAACACTGGTAGCAATTCTTCTAGTTCTTCTTGTGGAGGGAATAAGGGCGATTATGGAGTACGTTGTGGATTGAGTATAAGCAAGCGTGGTATAAC 
CCTGAAACCACCACCTGCAGCGATGACTAATTCTTCTTCCCCGTCATCCTCGGCCATGATCTCATTGCCTCAGCCCACGCGCCAGAGCATAGATCTTTCG 
ATAACGACAATCATCCAAGATTTCTCAGAAGTTTCTGGGAAATTGAGGCTTAATGGATTACAGAAAAACATGTCTGACAAGAGCAAAGACGTGTTTAATG 
ATGCAATATACGACTCTGGCGCATTCAAGGCGCTCCTAACATGCACAGTCAACGATAAAAGTAGACGTAAAAGGAAAAGAAGGACTTTATTGGCATCTGG 
AGAGGGTGTGGTACGAAGAAACCTGATGGTGAGTCAGGGCAATGACGTCAATGATGCCCACCAGTTCCAGGAAGAATGCGGAATAAAAATTGGGGGCGGG 
GCTTCTAGGGTGTATAAAAGAGCCCAGCGCCGAGGTTCGGCAGTCAGTTCCAGAAGAAGAGTAAGGAACAAACCCCAGTTTACTATAGCAGTCTCTGACG 
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AAGACGACGACTGCGAAGAAGAAGGCGACTTTTCTTCCGAGTTAAATCCAACGCACTCTCAACTACTTCTATTCCAACAACGGCAACAAGATAGCTGCAC 
AGAAGACGACGACGTTTTAGTGTCTGTAGAAGAATATAACAACAGAGTAAGCGGTTCTTCCACCACAGCCGGAGACAGAGTTCTTGCAAAGGATCTTCTC 
TCTACTGTATCTCCGAACGAAAAGAGGAACTCTGCCGCCCTCGCCGCACTCACCATATCCCGGCACTCTCTTTTCAACGCTCTATCTGCAAAAACAAAGT 
TGGGAGAAAATGGACGTTTCTTCCTATAAGAGCACTATTGACTACCACAACATTGAAGATATGGACGATCTCCAGCGCGCCACCTACAAGGATCGTATGG 
AGACGGAATTGGTCCTCGAGATGGCTAAG7U\.GGAGGGAAGGTACGTCCGATCGTTGGCCACCATGGACGAATTGGAGGTACCTGAAGAACCAGCCACTTG 
CTACACTTGCGGCTACACCTTTATTAGACGCAGGGCACCCCCACCAAAACGCAAGTCAATATTCAGAGAGCCTTGCGCTTACCCAGAACTTCTCCCCGAT 
GCACCATCCCCCGTCCGTTTAGAAGAGCTTGTCGACGTGCCAGAAGGAGCGAGTTTTTTCACCTACCCTCCCTACGACGACGGATCTTCTACATCGTCTT 
CACAAGCCGAATGTGAAGATGATTATCCTCCACCATACGACCCATCAGAAAATCCACAGAGGTCCCAAGTGTGTGATTATTGTACCACACGTCAAGTCCT 
CAGTTCTATGACGGATCACGCCAGGGCCAACCTCATAAAAAATCTGAAGAGGGAGAAGAAGGCCCTGGGTCTTGGCCGTCGCAACAACTTTAGCTACTAG 
GGTTGTACAAGAAGAAAATAGGATTGATTGTATCGATGACGAACTTTGCATGTTTTTGACCATTCTTTTGCTTGCTGAGTGTACATAGCTTTATATTTGC 
CAATAAAACAGACCCCGAATGTATTTTTGTGTATTCTTTATTACTGCCTATTTTTATGGTTTAACGGAAAAAATTTTTGAAAAGTTTTTGAGATGGAGAT 
GAAGGGAAAAAGAGGGCGCTAGTTCATATACTGCCCAATAGGTGGTCGGGTCCAGAAACGTCTGTCTGTTCCAGAAATGTGTTTAGAGATTTCTGGACAA 
GTCATTTCTGGAAAGGGTTGTAATTTATTATAGTTGGTATATTATTTCTGGTATACATTTCTAGCTCCTCTCTGTAATCTGACGTTGGGTCGACCCAGCG 
GTCCACCCTTCGAACTTGACATAAGGCCTAGTCAGCGGTCCACCCCCTAAACTGGAGTGAGCTGAAAAAATTTTTGAAAAGTTTTTGAGATGGAGATGAA 
GGGTAAAAAGAGGGTTGAAAGGATAGCTGAGGCCACCTGGGTCAGCCGTGTTCCAGAAACGTCTGTCTGTTCCAGAAATGTGTTTAGAGATTTCTGGACA 
AGTCATTTCTGGAAAGGGTTGCAATTTATTAAAGTTGGTATATTATTTCTGGTATATATTTCTAGCCCCTCTCTGTAATCTGACGTTGGGTCGACCCAGC 
GGTCCACCCTTCGAACTTGACATAAGGCCTAGTCAGCGGTCCACCCCCTAAACTGGAGTGAGCTGAAT^AATTTTTGAAAAGTTTTTGAGATGGAGATGAA 
GGGTAAAAGGCTAGTAATAGAAGGTTGTGTAACTCGGCATAACCCAGAAACATATATCAAAAGGGATGGAAACAACTATTCTTTTTATACAGTAGTTTTA 
TTGTACCAAAAACTCAGAAATCTCATCACATGACAAATCTGAATGTCCCATTCCGTGGCCCTCAATAACGTACGACACTGGACCTGGGAACTCCCACCTA 
CATGTTTTACATACAGAAAATTTTGGCTTAGATGCATTATCAACAAACACTTTTACCACATTGGCATTCATTCTCTTCAGTTTGTCACACATCCACAAAG 
AGGCCATATTCTCCATAGAATCAGATCCATCAACAAATATATAGAATGTTACACCCTTGTCGATCCAGTCCATGGCGACAATAGTAAAGTCTTGTCCAAA 
AACACTGTTTCCGAAGATTTCTTTGTCCCCATCCATGTATTGGGAAAAAGTACAGTATGGTCCACAGTCCTTGATCTCCTCCAACCAGTCGTTCAAAAAA 
TCCCTAATAAAGAAAGACAGGCAATTCGACTTGATCTGGAAACCAATGTTGAGTTTCTTCGTATTTTCTTCATTGTTGGAGGGGAAGAAATACGCCAATT 
GTGTATCTGCTTCAAATGCTCCACACTTCCCTTTATGGATCAGGCGGTTCTCCATAATCTCCTTGCCAGAATTGGGGAGTAGGCCTTCGAAGAACAGGAT 
ATTGGGGCGGGTAGGAACTTTGCCAGTCAAGTCCATGTTGGCAAAGAGATCAGCAGGGTCTTCAAAGACTGAGGCCATTTTTTTGGTTGTTGCTGCTGAG 
AGAGGAGCAAGTAGTAGTGTGGTTACTAGCAAGGAGCTCACAGTCTTATATATAGACTAACACCCCCCCCCCCAAAAATTCTCCCACATCTTTCATACCG 
GGGTCGGCTAGAACCCAGAAACGGTTTGACTCACATCTGGTACATTATTTCTGTTATACATTTCTAGCCCCTTCGTGTAATCTGACATTGGGTCGACCCA 
GCGGTCCACCCTTCGAACTTGACATTCGATCGACTCAGCGGTCCATCCCCTAAACTGGAGTGACCCAGAAAAATTTTTGAAAAGTTTTTGAGATGAAGAT 
GAAGGGGGA7VAAGGGTTAGTGAATACAGGCAGTTCTGCCTTGCTCCAGAAATGATTCTAAAGATTTCTGTGCATTATTTCTGGTATACATTTCTAGCCCC 
TTCGTGTAATCTGACATTGGGCCGACCCAGCGGTCCACCCTCGGAACTTGACATTCGATCGACTCAGCGGTCCACCCCCTAAACTGGAGTGACCCAGAAA 
AATTTTTGAAAAGTTTTTGAGATGAAGATGAAGGGGGAAAAAGAGGGTGCTAGGGTGCGCTCCAGACATCTCTAGAGTGGCTAGAGCCCAGAAACATCCC 
AACTAGTTTCTGGGACATTTTTTCTCCCACTGACGCAGACTATATAAGCTAACCACTAAGCATATTTTGCACACATTTCATCCACCATCACTGCCGAATA 
CACTCTTGTGTTGCTCCGCATAGCCTAACAATGGCCCCAAACTCCTTCCAGAAATTTGCTCCAGTTATCAAGACTGAGAAGAAGGAAGAAGAAAGGGATG 
AACATGACGACCCTTTACGGCAGATTGATTTTAGAGATAGAAAGACATTAATCTGCCTCACTGCAAACTGTGTTTCGAGGAAGAGAAAAGCTGGATCTGC 
ACATGATCGAGTATACAAAGTACTACGCTACGGGAACCCATACAAGTACCGTCGCCCCAATAGAACACATCGAGGATTGGCCCTCTCAATGGATCAAGGT 
GAAGTAGGAACATGCCTCCCTCTGCGACCCATGGAAGAGACTGAAGAAAACCCCATCGACAAGTGCGGAGTGGCGTTCCTGTACTCCAACTACAATGAAG 
GCGATGGCATGACCCACCTTTACAACGACGAAGAGTATATAAAGAAGTGCAAAACAATTGAAGGAGGAACAAGAACGTGGGTAAAGAAGAACCGCCAAGA 
ATACTTCAGACAAGCTCTAGAGACATTGATGATGTCCCATTCTATAAAACAATATTCCAATTTTATTTTTTTCAAGGAGGATATGGAGGAAGGATTTGTG 
CACAAACTCCACACATTTATTAATATGGTACACCCTAAAAAGGTGTCTGTTTTGTGATATAATAAAGATTGATGTTTTTATAAATGAAACGTATTATTTA 
TACCAAAAAATTTATTATGATGTATTTCATTGGGATGGCTTTTCACTCACCAATCTGGCCTATGTGTGTGTTCATGCTAGCCCCTCTTTCACCCTTCATC 
TCCATCTCAAAAATTTTTAGAAAATTTTTCTGGGTCGCTCGAGTTTAGAGGGTGGACCGCTGGGTCGGCCTAATGTCAAGTTCCGAGGGTGGACCGCTGG 
GTCGACCCAATGTCAGATTACAGAGAGGGTCTAGAAATATGTACCAGAAATAGTGTACCAGTTATAAGAAAATGTGACCCTTTCCAGAAATGGCTTGTCC 
AGAAATCTTTGAACCCATTTCTGGAACAGACGTTTCTGGAGCATGGTCGACCCAGATATCTGTTGGCGGCTGTCAATAGTAGCCTATGTATTCAATGGTC 
TTTTACCCTCTTCTCCATCTCAAAAATTTTTAGAAAATTTTTCTGGGTCGCTCGAGTTTAGAGGGTGGACCGCTGGCTCGGCCTAATGTCAAGTTCCGAG 
GGTGGACCGCTGGGTCGACCCAATGTCAGATTACAGAGAGGGTCTAGAAATATGTACCAGAAATAGTGTACCAGTTATAAGAAAATGTGACCCTTTCCAG 
AAATGGCTTGTCCAGAAATCTTTGAACCCATTTCTGGAACAGACGTTTCTGGAGCATGGTCGGCCCAGGCAGCCTCACAGTTACCTGTTCACACCCTTCC 
TACTAGCACCCTCTTTCACCCTTCATCTCCATCTCAAAAATTTTTAAAAAATTTTTCTGGCTCACTCGAGTTTAGAGGGTGGACCGCTGGCTCGGCCTAA 
TGTCAAGTTCGAAGGGTGGGCCGCTGGGTAGGGCCCATGTCAGATTACAGAGAGGGTCTAGAAATGTATACCAGAAATAGTGTCCGATTATTAAAAATTT 
AACTTTTCAACATGTGGCATACACCGTATGAACGTATAGCCAAATATGTACCACAGTCAAAACATTTAGGTGAGAAAGATAATACACATGTATAAATTTC 
AAATACTTTATTTGCATACATAATAAAAAATCATACAGTTTCGGACTCATCTGTCCACACATCATTTATTAGGAACTGGTATCTATTATACCCTATTCTT 
CTACGTTTACAAGATACTTGTTCATCAGGCACTGCCATTCTTTTACAAGACACTATTGGTTCAACCAACATCTCTTTAACTTGATCAGCGTCAGGTCTTA 
GGGAAGTACGTTCTCTGCATACAGGACACCGGGTAATTTCTCCACGTTGCGTACTCAACACCAATCTGTCCCAACATTTTTCATCAATACAATGTCCACA 
GCGTGTCGTCCTGATGATATTGGTACTACTGAAATCTTCTAAACACACTGGACAATTATCGCTGTCGAAACGCACATTAGCATTCTTCTCATGTTCTTCT 
AGAGCCCTTTTAATTTCTTCCTTATCCCCAGAAATTATGTGTTCAGCCCAAGGATAATTTGAGGATGCAATATACCCCAAGCGACAATTCAGTATTGGCC 
AAGAACGAGGTGCTTTATTGACATCGAGACCGGTGGCAGCAACAAGGGGTTCTTCTTCTTGGAGAGAGAAATCTAAAGCAGCTTTCAATATGGTGGGTAT 
ACTAGGCTCGTATCCTCTCTGGGCAAACTGTGTTAATGCGGTATCATTGAACATCAATAATGCTGCTTCAGTTACACTTAAAGTACTAGATTTACCATAC 
AACATATTAATCTTTGCCCCATGGGCTCCAACTAGTGCATCAGAGACATCGTGTTTGTGATTGGCGATGGCAATCATTAGGGGCGTTGCACCATCGCTCG 
CCTTCTAGCATTCACTATACTTCTATCGAAATCGTATATTATATTCTTGTATTTTTTAGCAAGAGCATCAAGAAAATTGCACGAGGTTGCAACAGCTGCA 
GCATGGAATACGTTTTCATTATTTTCATTAATAAAGGAAAAGGCCAAGCTGTTTGTTGTTAGAATTTCAATACCTACTCTTGTGGTAAAGGGGGAAGATA 
TTGCAGCGCAAACCATTAGAGGTGACATGCCTTTCGCAGAAGGCTTCAGAATGTTAAGTTTTTCTTGTTTGACTAGTTTGCTAGTATGGGAATATTTGTT 
AATGATACAATAACACAATAGAGTATTCCCCTTTTCATCAAAATAAGTGTGTTTGAGCCACGGATTCTTGGTAAGAAAAGCTTCAAGATCTTCCATGAAA 
CACGAGACTTTTACACACAATCTAACATAGCCTTTAACTTCATGAGGGTCTGTTATCACAACGTTTTTATGTGATACGATCCTTTTCACAATAAGATTAT 
CGCACAGTTTGTCGGTCTTTTTTCTTACCATAATCTCCCTCAAACAGATCTTGAAGTAATTGTAATCAATCTCTTCAAAGGGCCTACTGTTAATCATCCA 
CAACAGTATTGCACCACACAAACCACTACCCACTGCTTCGTTTATTGCAACATTTTTTAGAATATTGTCCATCATTTCGGCAGAGAATTTTGGCATCGTG 
TCAAATATACTGTAAAGGGTTTGAAAATGCTGGATTGTTTCAGCCTTCTCTATAAACACATCAACAAACTCTACTGCCATTTCAGCAACAGTTTTTGTAG 
TAGTCTCACAGTACTCTTCTACCATTTCTTCTTCTATCGAGGAAAGAAAAGACTTGATAATCACACTCCTGTAAGCACTAGCCCTTTCTTCGTTGGTCCA 
ATTCAATCTAGCTATAGATCTAGCGTCCATGTTCATTTCTTCTCTGAAGTCAGTTGACACGGATGATGGTGATGTTTCTAGGCAAGAAAAAGGTCTCCCG 
ATAATAAAATTGCCATTGGATATCAGTCGTTTTGCCTTTGTAACACAAGGAGATTCGTCCACAAAATACTTGTATCCGAAAGATATGTCAAAAGGTTCAA 
GTGGTGCAGATTTTTTCATTTCAGCCACGTAATCAGAGGTGATATTGACGATTCTTGAAAAGAGCCTGAATCTAATAACACTCGAACATTTTTCAACGTA 
GAAAACAATACCACTTCTTGCAGAACTAGTAGACTTTTTCAGGCTAGCCAAAACACCGTCCAACTTCTTGATCCTTCTCATAACCTTCTCTCTTTCTTCC 
TCAGCCTGTTCCTTTGAAGTAAACTTGAATCCAGTTCTGCTGTCATCACCAGTGCCAAACTTGATGCCGTGCGTCTCGCGTCTCAAAAATCCATTATCCA 
TAGAGACCAGAAGAGAATATTTTACGAACAAAAAGTCGTCGTGGATGTTTTCGTAAAGGCCTCTGAAGGTTTTGCAGACGGTTGTCAATGCGTTGATAAA 
AGTCATTCCCTCGCAGATGGGGGAAGAATCAGACTTGGTATTGTTGTTGATAAAGAAGTAGATAATATCTCTAAACTCTTCTTTATTGTCTAATTTCTTG 
AAACTACTTGAAGGAACAGGAGGAGAATTTCTGGAGGTZVATTATGTCATTCAGAAGGGCCAATTCCCTTTCTGTGAAAACGTCCAGAAATGACATATATG 
GTTCAATGTTTTCAAGTACTTCTTCAAGCACCTGACGGTATCGTGGAGCTGCTTCAGCCATGTTGATGATGTCTCACATACGACTGTTGAGTTTATCCAT 
GCGTACGCCCGCTTTTATACAAAGATCCCGTGTAAGAAACTCCCTCCGGTTCAGTTCAGGATAGGGGTGTGTCCCAGTTTTACATCCAAAGTTAATATAT 
TTTTTTAATATAACAAAAAAATCGTACCGCTTATTGGCTGCTATAAAAGAGGGAGCACCTGCTCACTTGGACATCATTAACCATCATCAATATGGAGGGA 
GAACATCAATATTTGAACCTAGTCAGGGAGATCCTAGAAAGAGGAGTGAAGAAGGACGATAGAACTGGAACAGGAACTCTATCCATTTTTGGACCCCAAA 
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TGAGGTTCTCTCTTCGAGACGACACTATTCCAGTTCTCACTACCAAGAAAATTTTCTGGAGAGGAGTTGTGGAAGAACTCTTGTGGTTCATCRGGGGCAA 
TACAGACGCCAAAGAATTGGCCAAGAAGAAGATACACATCTGGRACGCTAATGGGTCGCGGGAATTTTTGGACAGTAGAGGGTTATACGATAGAGCAGAG 
GGAGATTTGGGACCCGTATACGGATTCCAATGGCGTCATTTTGGTGCTGAATATGATACCTGTTCTTCCGATTATACTGGAAAGGGTATTGATCAATTGG 
CCAATATACTAAAGACCCTGAGAGAAAATCCAGATGATAGAAGGATGATTATGACGGCATGGAATCCTATGGATCTTCACCTTATGGCTCTTCCTCCATG 
CCACATGACTGCTCAArTTTATGTGGCTAATGGAGAATTGTCGTGCCAGTTGTATCAGCGAAGCGGAGATGTCGGGTTGGGCGTGCCCTTCAATATTGCA 
TCATACTCTCTTCTGACTCATCTGATGGCCAGTATGGTGGGTCTAAAACCGGGAGAGTTTATCCTCACTCTTGGTGACGCACACATTTATAATACCCACA 
TTGAGGTGTTAAAGAAGCAGTTGTGCCGCGTCCCTAGACCATTCCCTAAGTTGAGGATTTTAATGGCTCCAGAAAAAATTGAGGACTTTACTATCGACAT 
GTTTTATCTTGAGGGGTATCAACCACACAGTGGAAACTTGCAGATGAAAATGGCTGTTTGAATCATGTTAAGGAATTTCCTTGTTACTCATTTATTCCTA 
GAAATGGTGTAATCGCTGTTGTGGGCGGAGCATATTTGTGTATATAAGAGCCCGTGTTAGCTCCTCGATTCAGTCACAAGAGCGCACACACACGCTTATA 
ACTAGCTCTCTCTCTCCACTCAAGATGGCCTTTAATTTTGAAGACTCTACAAATCTCTTTGCCAATATGGACTTGACGGCTGGCACAACAACAGACCCTA 
CCCGCCCCAATATCATATTCTTTGAAAGTCTACTCCCCAACTCTGGTATTGAGGTGATGAAGAGGCGTCTCGTACGGCAAGGAAAGTGTGGGAATTTTGA 
AGCAAGTGGAGGTGCTATGTCGTATTTCTGGCTCGAAGATAATGCAGAAGATATGGAGAATCTCAACAGTGGTTCCCATGTCAAGACAAACTGCTTGGCA 
TTATTCCTTCAAGAGTTTATCAGCAACTGGATTGAAGAGACTGATCGACATGGACAGTACTGTACTTTTCCCCAATACATGGACGGTGGGGATGGTTCAC 
GTGGGGGATATTTTACTTCGCTAGCCATGAAATGGATGGCTAGGGATGTGACTTTCTTTGTGTTTGTTGATAGGAATAATACTGTAGAAAATGCGGCATC 
CATATGGATGTACCAAAAACTACTAGCAATTGGTGCAAAGGTAGTAAAGGTGATTGTTGACAATGCATCAAACCCAATGTTTTCTGTATGTAATGCGTGT 
AGGTGCAAGTACCCAGGCCCAGTGTCATACGTTATTGAAGGCCATGGAGTGGGTCATTCTGATTTGACATGTGATGAGATTTCTGGATTCTTTGTATAAT 
AAAACCCCATAAGAAACAATAATCTTTTTTATTCAACACCCATGATTTTAGTTTTTATAGTATATAAAATCAAGAATAAAGTATGTAGATGTCTACTTTT 
CGATAGCTCCTTCACCTATATAGAGGAGGTGGATAAAAACAGAAGATATGGATTCTAATACTTCTATTTTACCGCCAAGCAAACGGCCAGGGTTAAATCT 
GTTACAGGTTTTAGGGATTATAATAACGGTAGCATTAATAGCTTCCGTTTCATCCTTTATATTTTATAGGGTAGGTAAACGCAAATATTACCCTTCTTCA 
TCCTCTTCCTCAGAATTATCTGATGTAGATAATGGGGTAGAAGGAGGAGGAGGAACAACAACGACACCAACTCAACCTTCACCTGACGGTGGAGATGGAT 
ACGTAGATCTTTCTCCTCAAAAGAAGGCTGAACTAAGAACTAGAGTTGCAAACGTCATCTTTCAAGAAGTGTCAAAGGATCAAGGAGTGGCCTTTAGACG 
GGCAATGAATGATTCAACTGATAAGATAATGGAAGAAACTGAGGCGAGAATCAATAACTTTTCAGAGCCATTCAGAGAAGCAACCGTAGAACGTGAAGTG 
TTTAAGGATGACACAGACAAAAACTTTATCCTTTCAACTCTAGATTTAACAGAGGAACAATTTAAGGACATTGTTATGGCTGAAGTGAAAAATCAATTAG 
AAAATTTTGACTATGAAGACATGACCCGTCTCATCTTTGATAACATCCCAGAGACTGATTATTTATGGACT^CTCATTTCGATCCGAAAAAATATGACAC 
GTACTCTGAAAAGGTATTAGGGTTCTCAGATATAAATAGTATAGAAAGAATATCCTCTACATTTTATAAAGGTAAAAAATATGAGGTAACTACTGGAAAT 
GTAGCTGTCCTCGTTGATTTTGAATCTGAAACAATAAAAGAGAAGGCAGGAAATAGTCTCATCCGTAATGTCGAGTTTATTGTTGTGGACGAACAGACCT 
ACAAATCTTTCTTCCCTGCATTCAATCAAGTTTTCTTCTCCTTTAAAGTAAATAAGGAGAAAAGGGAAGTTACTGTATCCATCAATAATGGATGTGTAGG 
TATAGTGGCCAATATTACTCCTCTAACTACGCCAGTTGGAGCAGCTTCCGGACACTACATCTATGGCACTAGCACAGCAAAGGAAAAGACCTATCTATTT 
GTAATAGACAAGTACGATACCACTGAATTTGTTTGTGGTCTGAGTAACAAGTCAACTCCTCTCATGGCTCTAAATATTCTCTTTATGAGTGATACTGTTT 
TCCCTTCATTTGACGRAGCAGAAAGACCTCTGACGGATGCCAAGGCAGTAGAAATTTTAGGTAAAAGACTAGGTGTAGGAAGATACACAAACGCCAACAT 
CAGAAATACTCAGTGAGATGGAAGGGGTTATTTTGGATAAGATAGAAACAATTGCAAAAAGGGCCTCTCCO'AGTTATGGGTCTATTGATGTGGGTACGGC 
TATTTTGCGCCGTCAATTCATGGAAAAAATTAGGGGTAAAATAAATGAAGAAACCACAATGGAGAAGATTATGGGCACAAAGGAAGAAAGAGAGGACACT 
ATAAGAAGTATAGTGGCTAATGTTATCAAAGAGAATACTGTTAAAGAAAATGTAACCGAAAAAATTAGAGCAATGACAGATAAGGAATTT^TGACAATA 
GGGAATTTATGCATGATTTTGGAAAAATTTCAACTGGAGATGGAGGAACCTTCCATCTCTTTGAAGATACACCGGGTTTTGAAAGTGCTTTAAAGGCAGA 
ATATAAAAACGTTCCAGGAGCAACTACTCCAAAATACGTATCTATGAACAGTTTACGTATCGATGCGATTAATGGAAAAATCGAAGAGGTTTATAATCCT 
TCACCTATCATGGGTATTAGAGAATACGGCACCATTCGCAGGGGCAGGTACGAAGAAAATGCAGGTTCGAAAGAATTGGTTTTTATGACCAAGATTGAAA 
AAAGACCCAATAATGTAGCTGAAAATCTCATTATTAGAGTTGCCAACCAGCAGTATAATGTTATGAGGATGGTGTTTTTTATAGACTACGAAACAAAGAA 
GGGGGTGTCCAAGGAGGAAATGTTTATACCATATAATGTTCAGAAAACAAAGGCTCTTAAAGGGCGTAGTACTTACTTTTCATTCGTAAGGAAAATTCCT 
GATG/^CCAGAAGGGAGT AT C AT AAT ACAC GC ACT AGGGTTT T ATTG AGG AAAT AAT AAT AAT AAT AAT AAT AATGGC ATT AC AGGAAAA GG ATAT AACT 
ATAGGGAATGTTTCTGCTGCCCTACGAGAGTTGATGTACTCACCCACACATATGCAGCATCACGATAAGCTAAACACATTCCTGGACAGAAATGTTGAAT 
CATCTTCAGAAGAGAAAATAAGACAAATTGTGGATAAAATACGATCCCAAACAACATCTGACATATCTGAAACAGTCAATAATGTCACAACTAATGGGAC 
TGCATTTTCCCTTTTCGAAGATACCTTAGAAGGTATGGTGAAAAAAAATATAGGTGATAACCTTCAGAGTGGGGACTTTATTGATGGCCGTAAAAAGCTC 
AATGACATGAAGAGTCTAGCTACTGGAGCCATCTTATCTAGACAGCGAGATTTTGTTGCAGAAAGTATAACAGGAACAAAGGACTGGCTCAAGGCTATAA 
TGGGTTGTGGTATTATAAGGTATACTGTATTTGTCAATAACCTTGCAAGATCAACACTCGATAATGATGATGACAAGGCAGCAACCTATTATAACACCCC 
TATATATGGCGGGTATTGTAAAATGGCTATAAAGGACTATGAAATACCAGATTCGTACAGCAAGGTCGAAGCGGAACATACAGTTGAAGGAAGAAAGATG 
ACCTTTAATATA7VAATGGAGAGGCGATACCATAAACAACCTAATAACAATCATCCCTTCAGTGACAGGTTATCTTGCTTCCATCTCTGAAGACGCAGATG 
TGCAGGCGCCATTACTTTTAAACTGCAACAACTGTTTTATAGAGGCAGATATGAGTAGCCTCTACATGGATGAGAAAAAAACAGAGGCATCATTTACCCT 
CAACTTACCGGAAATCGAAGGAGCTGATGCGAATGCAGTCTATGAAATATGTATAGTAGTAGTTTGATGGAGGACAAATAAAAAATTGGATACATTTCAT 
GTAGTTTTATTTAACTCTTAATCTTATATACATTTCATGTACATTTCATTAAAAATCACTATCAGAGCTATAGTACAGAAACTGGTGCTGGAGCTGATTA 
ATCCTCCTGGCCATGACGGCAGATGTCCTCCGTGAAGTGTTGCGTCTGCCGTCTCTGACCCTATCAACCATCCTACCTGAACCTGGGTTCAAATTAACAT 
TTGGTGCATCAAAGGGGGTGTGAAGAGGTGAAGAATAAGGAGAAGGTACAGGACTAGCATCATAGTCCCTCACAATCCTTGTTCTTCCCCTTCTAACAGG 
TGTGGATGCGGGAGGGGGCATATTAGCCGTGTCTAGCAACTCCAACATCTGTGCCTGGTATTCAAGTGCGTTAGACACAAACCTCTCAGTGTCTTCTGCT 
TCATTGTCATTATTAGGAATAACACCATAATCTTCCATGAGGGCGTCACGCCTTCCTGCCTGATGCCTCTCAAAGAGCCGTTTTCTTTCCTCAGGGTCAT 
CAGAGAAGAGTTTGTGAACCCAGCATGGTTTCTTAGCCTCCCCTACAGCCAAGTCAGGCACAAAAACTGGCTTATTTTTTACATTCTTATAGGGAAGAAG 
AGGGGGCAACACCTTGACAGGAGCAGGAGTGGGGGGTAAAGGAGTGGTATATTCACCCTTATATGAACTAGGTTTATGGAACTCCTGGCAAGGTATTTTG 
CC CTC CCTGTC CTCTTT T AG AGGGGCGGCCTT AC AGCCTCCAAAAACCTTCTTAGCT AT TTTAGTATGCATTTTCTCAATCCTTTTC AT ACCTTC AG AC A 
ACATCATGAACCCATTCTTATCAAACTGAAAACCCCTCAAGAAAAACTTATAAAAGGACTTTTCATTCTTTGCAAGAAGACATCCCGCCATTAGCCTAAT 
TGCACTCAAAACATCCCCTATCTCCCCTTCCCCATACTTTATTTCAGGAAACCTATTGAGTGCTGAGCAGGTACTTTCTATAGATGCTCTCAGAGCATCA 
TTTTCCAAGGTTGATGAGCTATAATCAACCTTCTTACAGCTGGCACACTTCTTAACCATATCCTTTATTTCCATTCTTTTCTTATGTTTCTTCATCATTT 
CTGATGAAGAGGAAGTTTTGATTGATGTCTTAAAGACGATATTCTTCCCTTTTATTGTCTTTCCTTTCTTGCTGACGATACACCCTCCCTTCTTGCTGCT 
GCTGCTGCCGCTGCTGCTGCTGCTGATGATGTCAGTTCCGGCCATTTTTAATGACTCTAAAAAACAAGGGGGTAACACATGTAATAAAATGCATTATAAT 
GTAACATTCTTTTATTTTAATAAAAGGCAGTAGAAAATTTGTACATCATTTACAGGCAGGCATATACAAAAACACATCACTTTATAGATTGGGCAGGGAA 
GATATTTGATCAGCTTCCGAAGGGTCAGTATCAAACCCATTCAACAATGAACTGTGTACATGTGCAATTTCGTCATCCAACCTAGCATTATTAGAGGCTG 
ACTTAATGAGACAGGAACCAATTAGCTGCTTATTAGCACTCCACTTTGAAGCGTACTTACGCTTCACCTTCAATTTCTTAGAGCACAAACTCTTACCAAT 
TACATCATTGACAGTTGTATGGAACAGGTTACCTCTAGATATACAATCTCCAAGCTTATTCTCAGTTTCCTGTCTCCTTTCTCTAAGAGCAGAAAGTTCA 
GCCAGATACCTATTGGCAATCTCATTATACTCCTCATCAATATTATCGAGTCTACACATATACCTCTTGACATCTACACACAGTCTCTTACTCTCAGAGG 
AAATATTAGACATCCTGTCCTCGTCACGCCCATACCTTGGCCTCTTTCTACTGTTACTACTACTACTACTACTTCCTGAGGAAGTAGAGGGCTGATTCAC 
TGATGACGAAGAAGGTATTACTTCTTCGTCATCAGAGTCTGAATCAACATTAAAACAAAGGTTAACATTAATGTTAAGAGAGGTTGAGGTAGGGGCGGAT 
GAGGTAGGGGCGGATGACGTAGAAGCACTAGGCATATTCAAGTCAGGCAAAGACATTGTGTCATCATCATCATCATCATCATTAACATCATAAAGTGAAC 
TACTAATACCACTAATAGTATCAAAGGTAATATTGTTACTTCTAGGGGGAACAGTCAAGTCAGGAAGTGACTCTGAGTCATCATTATCATCTACCTCTGG 
GGGGACAAACACTACAGTATCCTCATCATCTTCAATGAAATTGAAGGATGTCACACCAGACAACAAACTAGGGGAGACATTATTATCACCTTCATCATCA 
TCATCATCATTACTCAAGCATTGAGTAATGTCAATAGTATTGGGAGCTACATACTTAAGCCATTCACCTTCATATTCATAAGGGGGGATATCCTTGAATT 
TACCCATTAAAGCTGCCTCATGATTATCTTCAAAGGTCTCTTCACTGAATGCATGCATTTCCTGTCTAGATAGAGTTTCAAGCATGAAACTCAACTTGGT 
ATAAGCCCTCTCCATATAAACATCCTCATTCTCATTAACACCTTTGGCATTGGGGACAAAGAAAGTAGGTGCTAGAATAGTTGACATCTTATTAAGGGTA 
TCCTTTACACCTTCATCATTAACATAGTCAGCAGACAACAAAAATGAGTGAAAAGAGGCATACTTGTACACCTCTTCATACACTTTACGGAAGAACCACA 
CAGGGGGTATATAGGCCCTACACATGGGACATAACAGGTGATTTTCATTCACCATATGCTCTTCAAATCTCCAGGTGAAAAGACACTCCAGGTGGACACT 
TTTCCTGCAACACGGCATCCCTCCATTTGACCAGTTATAGCAATCCGTGTCCGAATCGTAATCACCCATACAGATGGGACAGGTAGGCACCTCATCAACG 
GCGACGGTTTCAGCCATCTTGTTGTTGTTGTTGTTTACTTTAAAGAGGGAGAAAAGCCAACGCTATAGCAACTCATTTCTTTACTTTAAACAGGTTCATT 
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ACAGTCCCCTTAAAAGGGAATATGTGCTTCATGGCAAGGGCATATTCTGTGATTGRAGGGCGATTTTGATAGAAATCAGTGTCTGTTCCGCTTTGCAACA 
ACATCAAAAAGTGCATGTTTGACATCAAATTAAAGGCAGGGCCTTCCATGTTAGAACAAGCAACAACATCTTCCAGTATACTGGAAGCCATTACACTGAA 
ACTGTACGTCATTGAATAGGCACCTAGACAGTGTCCCCTAAAGAACTCGGGAGGTGTGTGAATGTACTGCCTGATTAGTTCTTCCGTGTTCCTAAAATAT 
TCATCGTTGTAACAATAACCTAGAGGGTATGAGAGTCCCAAGTCAATCATTTTAGGTTGGCCAGATATGCTATCTATAACAAAGTTATCAGCCTTTATAT 
CGACATTAATAATTCCCTCATTTGCAACTCTTGTAATTACATTACAAGTCTCAGCCAAAACAAAAGGCATGTTCACCTTTAGTTTTTCAAACATACTTCT 
TACAACCTTTCGGGCAATGAAAGAGTCTGTTATTGGTGCGACTGTATCAACCCCCTTTACCTCTGGTAAACGTCCCATCCTTTGAAGCATGACAACCCCA 
TACAACAAGCGTGCGTCCCTTGACTCATAATACTCCCCACCATCTAAACTATCCTTGACACACCTAATGACATCATCAATACTGATACCTGCAAATGGCA 
TCTCTAAACATAACCCCTGAGGACACACACCAGTCACCCCAACTACCCCCTTAATATTCTTTCTGTATACAGAAAGGCCATTAACACACTCAAAAACAAA 
CTCTTCATAGTTCATATCCTTCATAAACTTGACAACAGTCCCCCTATTCACAATGTAATAGACCCCATATTTAATCTCAAAAAAGGTCTCCCAGGGACAG 
ATGAAAGTGTTAAAGTCCTTCACATCTTCAGGCACTTCGACCAAATCAGACAAACAACCACGAGCAGTCTTACGGAAGTTTTCCCAGTTCTCACACTGGT 
CAGAAACTTCCTTAATTAGCGTGCTAGTCTTTTGTCTAATTCGCTCACAGGCACTTGAAGGCAATTGGAACACCTCTACAGCCATATACATTATGTCCAT 
AATCTGGGAGGACGGAATATTACGACTAGACAACTCCTTAAGAATGTGTCTAGTGTCAAAAAATCTCATCTCTTCATCGCAAGAAGAGGTGGGCCAATAG 
AAGCCAACTTTAGGGTCCTTTCCATAGAAACAGGTATCATGGACATACTTGCTAGACTTAACAATTGACCTATGGCCAATTCTCAACTGTTGACAGGTAT 
CATAATTGTCCTCAAGACTACAACTCTTCTCAATGCCTTTAAAGTAGTAAGGGGAATCACGTTCAATCCACTCATTATACTGCCTCTTACACTGCCTAAT 
CACCTTGGACGCACACACGTTTGAAAAGGTGCGAGTAGTGGGAGGAGAGGGAGCAACTTCAGACTTCTTGAAACGTTTAGGGGGAGGGGGCGTGATGCTA 
AGGTTGATGACGTCATCCCAAGTCTTGATAGTGATGTTGTTGTTGTTGTTGTTGTAAGCGGGTlACAGTTGTGGCTTCGGAGGTTGAACCAGACGCGTAGT 
TGTTGTTGTTATATTCTTCCACAGGCCGTTTTTTATTCCCCTGATGGTAAACATACTGCTGGTGGTGGTGGTCTCCACCAGTATTGATGGTAGTAGTAAT 
TACAGTGGGTCCCCCCATTGCCTATGGAAGAAACAACAACAAATACACGATTAATTCAACGCCCGGCACACTAATAAAGTGATGTGTTTTTTATATAAAA 
AATAGCCCATAATATTTAACGTTACGTTAATCAAAAAAAACAACAACAATAATAGTATATTATTGGTACTTACTTTTGATGGGTTTGGAGCGGAGACTGA 
AGTAAATCCAGAAAACAACAGTTTTACTGTTGGGCACTACACTGCACTGGTGACGTAGTAGTAGAAGTAGTAGTGGTAATAGTCCACTATTGTCACTGCA 
CTGCACTAGTAGTAGTAGTAGTAGTAGTAAGAGCGTTGGATAAGCTCGGGTTGGCTGCTAGAGCTGGAGCTGGAACTGGAGCTTGTATGCTTGGCTGTGC 
CTCTGCTGAGACTGATGCTTCTGGCCTGCCCCGGCTCGCTTATATACAAGTTGTCCCCCCTCACCCCCACTTCCAGAAATTTGCCGTCGAACGCCAGTTC 
TCCAACAGAGTGGGGTCCAGATCTGGCTACGGGTTCACTCCTGGCCATACCCTGTTTAGGGGTCTAAATCCACCCCCGGCACTAAATGGGGAGGACCTAA 
GGGTGTTATAACAACATGTAAGCGTTGGGTAAGAAGATCTGGATCTGGATGACCCACCTTGTCCTTCTTATCCTATCCTTATCCTTGTCCCCTGTCTACC 
ACCACCTCACCCCATATCTCTCACCCCATCTCACCTATACCCCCATCTCACCCATCACCTCTATATTCCCCCACCTTATTCACTCGCTCCAGTTTCAACA 
CCCTGTTCTTGCCGAGCCAACCCATAACCAGATCTGGACCCCAGTCTTCCCTTTTATCCCTAACCGGCACCATTTATGCCCCCAGGCGCTAGCGGTGTAT 
ATAAGGCGGCGCGGCCAGGCCAGAAGCATCAGTTCTCTGCAAGCCAGCAGAAGAGCAACACAACAAGCACTCTCTCTCCTTCTACCTAGAAGAGACCTGC 
CAATACTCAAGCTACAAGAATGGCCTCTCCAGCCCCCGCCGCACCAAGTCCTTACACCATGTTGGACTCTAAGTTACTTAGTTCTGAGGAACTAAAGGAA 
CTAACTTCATACGTCTCGACTAGCTCTCGCCGGTCTGATATGAAGAAACACTTGCTCCATCTATTCGAGGAGCACGAGAAGATCTTCCAATTCATACAAG 
GTAAGCACAAGTTCTCACTATACACTTTGGACTTTGAAATTTTCTATGTTATGCTGAATATTTTGTTGGTTGAAGTGAAAAATATTCTAAGTCCAATTCC 
TTTACTCTTTGACAGTWSATCTCCAACCAGTACGGAGACTATGGATGTTTCACAATGGCCCCGCCTCACCTGAACGCrGCAGCCGATCTCTTGGATAAGGT 
GATGTCCGGACCTCTATCTCCCGAAGGCGCCCAAACTTCGTCCCCGGCTGCTTGTGTTGGTGCGAAGGTTGTGAAGGCACTGGTGAGCTTTTGCCAGAAG 
ACCCGCTTCACCACCAACATTGTGATGAGAGAAGTTAAAGCCATGGAGTTCCAAGGAGACGATTTTAACTACTCTGCCTTGTGTGCAAGTATGCCCCAAC 
GCCCCGTGACTGAGAGGCAGATGTTCGCCCTTATGAAGAGTGAGGACGAAGAAATGGGAGTGTCTGCAAACTTCTCTCCAGTCTCTGATGACGTCATCAA 
CCCTTCAAGCCTCCCCTCTGGACAAGAAGTCGACTCATCAACTTCCGCTCAAATTTCTGGTATGTTTCAAAACGTGTGGAGTTTGCTTGAAGAGTGTGGT 
AGTGGCTCTAATAGTAATAGTTCCCCTGTCTCTAGGACAGTCTTAGTTTGCACCCTGTTTATAATCCAAGTGTTCAAGTTTTTGGTGACTAAAGTGTCTA 
ATGTGAACGTACTTAACCAGTTGTTTGGACATGTTGTTTTTGGATCACTTGATGTGGCTCCAAGTAATAATAATAGTGTCCCATCAACTGTTGTTAACAA 
CAACAACAAACCCTCGACCTCTAATAATAGTAACAACATCAGTAACAAGCGTGTTGGTGGTAGTAATAACAGTGGCGGCGGAAGATCAAAGAAAGTTACA 
GCCACAGCCAAAAATCCCTTTAATAATGTAGATGGGGACAATCATGGCATGTTTGCCGGTGCCCCTGTTGATGTTAATTTGGATGACTTTGTTTTTCCCC 
AAGTTGAAACTCTTACAAGTAAGAGCACCATCCCTAAAGAAGAGGTAAATGTAGATGAAGATTTGAGTAAAATGTGCCGTAAAACTGCCCTTACCCCCCT 
AGAAATTCATACCTTTAATGTGTTCATCTCTGAGATTAACCCCTCCAAATATGACCGTTCAATGTTTTGCAAGGGATTTTrGACTGCATGGGATAAGTTT 
GTAGAGGGGGATACTGCTGGCGTTAAACGCTTCCGTAACTATATCCTCACTCGCTCAAACTATGCCTCAGCCGCCAGGGCCGTGTATGAAGCGTCAATTA 
AGGGGACTGTTTATTATAATGACAAGTCAAAGTTTCTGTTCCACGATAATGTTAACCCTGATCTGGACAAGAGCTGGGGTAACAAGAATGGGAAGAAACC 
TA6ACTCCCAGCTAACTTGATGGCATTCATGGGTATTGACATTGTAAAGGTGTGCGCTAAGGGGATTCAAAAGTATATGTTTGCAAAGCAATTCCAACAT 
CCGGAAGTGGAAGAACTTGTGCCTCCTATGGCTGTATACGCAAAGGTTGCCGCAGGATTGAAGTCGGGGACTTTGTTTGATGACTGGGACCTGCCTGAAT 
ACGAAAATTGTCAGTTTATCAAGTATGACACAGAAGGGTGCAAAAAGCACAGTGAGTTATACGCCAAACAACTTCTCCGCACAGGACTTAATCAATACAA 
TAAACTGGAAGAGGGACAGAGTGCATTCCCATTTGCAAATATTGTGACGGTAACATCCGCCTCTAGTGATGATATTCACGGTGACACAATCATTGAATTG 
ATGTACAAGACAAAGGATGGCGTAAAGGGAGTCTCAAAAATTGAGGACGAAAACATCATCAAGGTGAATCCAGCAGAAGAAAAGAAGAATAATAGAGTAC 
AAGCCGAGAAGACCCTGTATTTTGAGATTGATTCCGATGATGAGGTGTGTGAGAGAACAGAGGAAGAATTCTTCAGGCCTACATCTGTTGTTGCTGCCCC 
GACAACACCCCTCGTACCTTCTAATGTGGAGGAAGAGGAAGAGGAAGAAGAGCAGATGGAAGAAGAGGAGGAAGAGGAAGTAGAAAGGGAAGAAGGATCT 
GATAAGGAAGATGACGGAGACGCACCAGCACAGGAAGAAATGGAGGAGGAGAAGGAAGAAGAACAACAACAACAGCCAGAAGAAGAAAGCAATGGTAATG 
AGAACCAAGAAGAAGAACAACAACAACAACAACAACCAGAAAGAGAAGAGGAGAATAAGGATGCAGATAGTGACAGCGACAGTGATAGCAGCAGCAGCAG 
TAGTAGCAGCAGTAGCAGCAGTAGTAGTAGTAGCAGCAGTAGTAGCAGCAGCAGTAGTAGCAGTGAAAATGAAGCTGAAAAGAAGAAAGAAGAGGAAGTA 
CCTGCCAAGATTCAGAAGAGAAAGAGGCTAAGTGAT^GGCCATCAGAAGCTGCTTCCTCTCCCAAGAGAATGAGAGTAGAAGAAGAACAACAAC/^CAAC 
TATCACCATCATTGGACATACTCCAGACTGCAGTTGATGAGATGATGGAAGAAATTCCTGCGCCTGAGCCTATCGTTGCTACAACCTCACCCAAGGCAGC 
GACACTTGCACTCAAGACAGGATTTAGTTACTCTTCATTCGTAAGAGGAGATGACCTTTCAGTAGCTGGTAATACTTCCCCTACTGAACCAGCAGCTGTG 
CCCGCTGCTGCCACTTGCACTTCCGATGTTGGAAATGACTTTTTGGACATGTTGGACGGTTTACCTGGCGATATAGTAATGCAACCTGGCGAATGCGACG 
TGACCGCAAAATTCTTTGAGGGCATCACCCTACCAGATGGTACTGATAATGAATGCACAGGTTTCGATGATCTTCTTAAAGCCACCGAGACTGATAACAT 
TATAACCACCACATGCTTTACCTCCCCGATTCACCCTTCTAGCAACTCAGCCCCCAGAAAGGATATTGATAATTGCAGTTCTATTAAGAGGTCTAGGGCA 
GGTTCACTTTTTGACACTGATGATGATAGTGAAACAAATGAGGTTGAAAAGGAAGCCCCTAAACGTAAGAAGCACTTGAAAAAGAGGGGTAACAAGTCCC 
ACCGTGGTTCCTCTGGTTCTGCTTCTTCTTCTCATTGTATGAGTAGTGATGAAGAATCAGAGGATGAAAGGGATATGAAATCAACATCAAAGGTTCACAA 
GTCACCAAAAGCTCATGTTAAACATTCCCCTAAATATGATGCTGTAAATAGTGATGTAAATAACTCATACAACAATGTTAATAGTACAACATGCATGTCG 
TCATCAGATAGTGATGCAGAAGCACAGCCTAAAAGCCATAATAAAAGCCACTCTCGTAAACACTCTTCTTCCTCCACAAGTGATAAGAAACAGAACCAAC 
AATGCTCAATCAATACTCAAAATGTCAAGAAGACTGTTGTACAGTCTCCACCTAGTTTTAGAAGTTTTAGTCCTAAGAAAGATGAGCTTGGTGATTTCTT 
GTCACGCAAGCACACAAAGCCAGTTAGGCCCTATAACAAGAAGCGTGATAATGTTAACACCACTAATAATGTAGTACAGAGGTCTGCCTGACCGACTCAA 
ATGATACTCAATCAATGTACAATAATAATCTTAGTACTTAACAAGAAGAACTATATTTTTATAATATTTTTACATGTCTTAATAACAAACAAAATAAAAG 
AAAACCAATGTATTATATGTTTAAAATCAACCCCATTTGCATGATTAAACTAACTATAGTGTGTAAGGAAAGAAAAAAAACATGATTATTTCTGCCATTA 
AAACAACAACAAAAATTCTAAGCTTCTCCTTTCTTTCTGTGTCTTTGCAGATTCGGGAGTCTTTCTAAAACCACAGCAATACAAAAAACAACACAAATTC 
TCTACTGCTCTCCCTCACCAATCCCCCTATCGCCCTTATGCTCCTTCTCCTCTCTATACTTCCTCCCCATTCACTTCTGACTATCAATTTTCTGATTCAT 
TTCAACAACAATGCCACCAATACAATTACTACCCCACACCCCCTCCATCTATGTCCTCGACGACCTCTCCATCGTCTTTGTGGGATGATGACGATGATGA 
TGACGAAGAAGACGAAAAAGATGTCAAGCAAGAAGTCTCGAACCGTCCCCCCATTTTTTCGTACATGGAAACTGTATCTTTTAGTGATAACGATGAGGAC 
GATAACAAGGGAGAAGAAGAATGTTTTGGATCAAACTTTGATATGTTTGGTGATTCAGATAACATGCCATCAACTTCTACTGCCCCTTTCCCTCCTCCCT 
CTACAACAACACCACTTCCTACTCCTCGATCCATCATGGATACTGATTCGGATGAATGTGACGAAGAAGGAGCAGCAGCAGCATCAGCACCGTCTATTGC 
CGCCTCTTCTTCTATCCCTGTCGGGATCTCTGAAGCTGAATTGAAAAAAATGGAAAAGAAAAAGAGGAAGGAAATTAAGAAACTCAAAAAGATGATGAAA 
GATCCTCTCCCTCACCTATATGTAGGAGGAGAACCTCCTGTCGCAGCAGATTATAAAACAAGGGCAAACATTTCCCTTTATAAAGTTGACCCTAGTATCG 
ATATGTGCGGTGTCGCCCCTCCTCAATTTTGCGCTGAATTGCCCACCCCATCCATAGATGTGTATACTTCTTCCTATGTATTTCCTCCTCCCACACCTGC 
CATGCATAATAAGAAAGGGTCCAAGAAATGTCAATTCCTTAAGGGGAGAAAGGCTTTGAGGAAATGGATTCACGAGAATGTATGCATGGCCCCTCCCGGT 
AAAAGGGGAGGTGTATTTTTGGCTCACTTGGAACAAAGATTCTTGGCTGAACATGGAGATGAATACAAGGTCCCAAGGATGTTTGTTTCAAGAGTATTGA 
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ACAAAGCTTTCCCCAATCTGATTGCTCGTGCAGACACACTGTGCAGTGATATGACATTCTATACTAACCTTTGTTGGATAGTTAATGGAGTTGTCGTATG 
CTTTGATAAAGATGATGGAGGAATACATGGCGATGCGTCAGAGTATGCAACAGGAGAAAATTTTGATACTGTAGTGTTCCACAAGAGGGAAGAACAAAAG 
ACCAATGGGAGTGCCAGTAAGAAGAGGCGTCTCACGCCTGACACTAGTAATATGGGAACAAGCACTGATGTGCAAGAATTCCAAACGATGGGAACAAATA 
CTGATATGCAAGAATTCCAATCAATGGGAACAAATACCAACCCCATAGAGACTTCATCAGTGGGTGTGAATACCAACCCACTTCCCAACCCTCCCCCAAG 
ATTGGTAATTACTCCTTTAACGAATGATGTACCAGAATTGGACATGATGTGGCTTTATTCGCCTTCCAGAGGAGGTGGAAATTCTAGAATGAGTGCAAAT 
ACAGGAACATCTCCCCTGTCTAACACCCCAATTCCTACCTGCTTCACAGGAGGTGCGAATGTAGTAGTGCCTAATGGATTTGTCCCTCCCACGTTCCCCT 
TAGAATGTGACGAAGATGATCCAAGTATTCCCAATTCTTACAATTACGAAGAGGATAAAGTCTTTCATCCATTTTATGAGTATATGGCCAAATATCTATC 
CCCTCTTGTTCCATCATATAACAAGGGACAGACTTGTAATGTTGTCCAGGAGTGGTTCAAGGGATCCTTCTCTCTTGCAAAGCGTAGAGGAACAGTCCCC 
AAATTCTGTAGTAACATTTCCCACGCTTTCTTTTGTAATATGGATGTATGTACTGCCATGTGCAAATGGGCG7UVGACTGTAATTAGACATGGACAATATT 
GTAATAGATGTATCGTAAGGAGGTCATGTACATCCATGCTCGCATATCACTACATTGTTTGCAGAGACGCTTCATGTGATGTTCCCAAGTGCAGGGAAAG 
GGTTCGCAACGACATGGATGACTGATTGATTGGTTGATATGTGACATTTTCTGTATATT6TGTAAATAAGATACCAATAAACTAATGTTTTATATATGAT 
TCTATTTTTTTAAAAACCTTTAAAAATATACATATAAAATGATGTATTTTTGAAACTACACTCTGGCAGAATCAGACCAGACCCCTGACCTAAAGCAGAC 
CACAGGGGAGTCTTAGAGAGGGGTGTGAATCTGGCTAGGGGTTCATCCCTCAATGTTAATCACACGCAAGTAAAAACACCACTTCCTAGAAAGGGAGGGA 
AGGGGTGACTTGGTGATATCATAACTGGGGAATTTCCTCrCCAGATATCTGGCTGTACACGTGTGAGCGCTTCTGGGCGCGACAAGAAAAAAATTAGTGA 
TATCATAACTGGGGAATTTCCTCTCCAGATATCTGGCTGTACACGTGTGAGCGCTTCTGGGCGCGACAAGAAAAAAATTAGTGATATCATAACTGGGGAA 
TTTCCTCTCCAGATRTCTGGGTAAGAAAAAAAATTAGTGATATCATAACTGGGGAATTTCCTCTCCAGATATCTGGGTAAGAAAAAAAATTAGTGATATC 
ATAACTGGGGAATTTCCTCTCCAGATATCTGGCTGTACACATGTGAGTGTTCTAATCTCATTTTTTATATAGAAAATATAACTGAATTAGCTCTCTAAAC 
TTTTCCTCATTTCTTACTCCTCTTTGTTTGTATGTCCTACTACCAGTGTGCATATAAGAACTGGAGAGATGGGAGTTTTAAATAACAACCCTTCTTAATC 
CCTTTCATGTTTATATTTAATAAATATAATCACCATGGATATTTCAAATAAGACATTATTTTTAGTAGTCGGTACCTTTTTTCTGACCACTTGTGCATCT 
TGCAGTCCAACTCAAATCGTCTGGAACCTCATGGTAGCTTCATTTGTTGGATTTCTAGGACATAAACTACTTAAAAACATTACACCTGTCAATCTGGATC 
TTGTCGGAAAAAGCTTCGTATTCTCTGCAAGTTTAACCATCTCAGAAGAAGCCCGTTTATTGAGGATTGGAAACGTATTAAGGGACTATAATGGCAACAA 
TTTTGAAGAGTATGAAGAAGAAGAAGATAGTGGTATTGAAGAATAATAATAATTUITAATGGTTTGAATAAAATATAGAGATACATTTATATTGTTTTATT 
TGCATTATATATAAAAAAGCACTACAATIATTTGTACACATACATTGAGAGAAAAAATTGATACAATTTCTTCCTTTTTTTTACTGGTATCTGATTTCTTG 
ATATTCGAGAGAGTTAGTAGTAGCAGAAGAAGTAGCACCAACTCCGGCAGCAGAAGTTGTAGGGATAGGGGCGGTGGCACCGGTGGCAGCGGCAGCGGCA 
GCAGCTTCACGGCGTTGTTGTTGTTCCAGGATAATCTTTGCGCGTTTATTCATTTCAGTGGTGCGTGTAAAGATGCCTCTGTTGTATTGGACCACCTTAT 
TCCTATATTCTCCGATGTCCTTATCGCGCTTCTCAAAAAACTTGGCCATTTCTCCACAACGTTGACGGGCTGCCCATGCAGTCATAGCCATAAAATCAAA 
TTGGTCAGAGGTGATATTATGGTTGTTTAGAAGGTCCACAGCGTAGGCCATACAGCTAGCGGCACGTTGAATATTGGAGTACTTGCTGTTTCTTGGTTTA 
ACGGTATGGATCACCTTACTTCCATCATTCAAGAATTCTGCCTCTTCTCTGTGCAGAAGAGATGTGCCTCTGCTAACTGTCCCGTCGTCGAGGACAAAAG 
CAGGTCCAGGTATGGGTGTTTCGCAATCTCCGTAAGAAGTGTCAGCAACCGTGAAGTGGGAAGGATATAGTCCATTATCTCCAACTCCAGAAGTGAGTGC 
AGCGCTATAAACTTCTCCGGTGATTTGGCGAGTGACTGGATTACTTAGCCCTGCGTCAATTTCAGACATATTTCCTTCTTCGGGTGTGGGAAGGAAGGGC 
CTAGGTGGGGCGAGAGTGTCGTTGTTTCTTTCAGCCGTAGTAGTAGCAAGAGATGTGACCCAAGAATATTTCTTGTTCTTAGACTGTCTTCCAGAACCGT 
CGTCGATAACAAGAGATGGTTCGTCCTCGTCACTGCTACGGTTGCCATTCTGGTTCTTCTTCCTCTTTGAATCCTTCTTCTTAGACTTGGACTTGTTCTT 
GGGGCCAGGAACAGGTATAACATCCACATTGGGTGTAGAAGACATCCTCGTTCCAGTAGATTCAGTCAGAGATGTAAGTGCTGGCACGGGAGGCATGGCC 
GTAACAGAGATACCTTGTGGTACTCGGAACATTGCAGAAGAAGACGTAGAATTAGAGTTGATACTTGTAACAGCAGAAGCAGAAGTCCGAGAGGCGATGG 
CAGCAGCTCTGGCGGCAGCAATCATAGGGGCTGTAGTGGTGCAAATCGGCAGAGTGTTGGATGAAGTCGTGGCAGCAGAGGTAGAGTTGATGTGAGACAT 
CTTGATTTCTTGGTTGGTTGGTTGGTTGTGGACTATTGTGTTCTCAGAGAGGACTTGTAGTCGACTGTGCTCTACCCAGATCCACAGCCCCTTTCATATC 
CGATTTTCGGGGGTTGCCAGATACTAATATACTCCTCCACGCCCTCTTCTTGCGCCATATCTATAAGGGTCGAAAGTTGTATACCTTGAGTTGATCCGGT 
TATAACCGCCATTGTTGGGTCTTTCTGAGAATCGAGAGTTGCGATCACGGCATGGCCTCGTGGGGGAATTTTCATTGACGTAAGCAGTAACGTCAGAAGA 
ATTGTTGGAGCGGGGTTGGTTAAAGTGCCTCTGGTGTTGGTGGCGGTGTTGGTGGTGGCGGTGGTGGTGAGCAGCAGCAGGGGCGTCGTAGTGTTCGTAT 
TGCTCACAATAATAGTCCTTGGGCACTTGCATTCTTCTTGGATGAATATATCTTTCTCTTTCTTCGTATCTGGGTTCGTCCTCTCTTTTTATGTGTTCAG 
TTTTTATGTGAGTTACTCTGTCCATCTCCATTTTCTCAGAATTTCTTCCCCCTCCATTGACGTCATCCGAATAACTCAACTGGCGCCGGTGATACTGATG 
ACGTCTTATTCTTCGGACCTTATTATCCCCTCCTTCTGAATCATCAGATGATTCAGAGTCAGATGAATCAGACGAATCGGATGAGTCAGAGGAAGAAGAA 
GAAGAATCGGATGAGTCAGAAGAGTCCATGGAGCCACTCCCAATTCCACTACTAAATTCAGCTTCCAATTCTCTGACGACATCTATATTTTTAGTATCTT 
TTTCTTCCACATCTTGATTCGTATTAGGGTATGTCACACCAGTTTCGTTGCTTTGTTCTGCGAGGATATTTTCGAAACCTTCAAGAAAAGGAATGTCAAT 
AAAACCTTCCATTTCTACGATCTCCCCTTCTTCCAATTCTTCCTTTTTGTTCTCTTCTTGTTGGTTTCTTGAGACGGTGGTTTCAGTAAAATTTGGGTTA 
TCTGTAGATCCAAATCCACCAGTTCCACGCACGCTTCTTGCTTGATTTTTGTTGTCCTTTTTAGAACTAGAGTCAATAATCGTTCTCTCTCCCGTGGTTT 
CATTAATATACACAATCTGTTCTTCCTCGACATCACAATATCTTAAGAAAATCAACTGGGCAATGCTGGTTCCCTTTCTGATTGGCACACTTTTTTCTGC 
ACTATGGTTGCGCAGAATCACTTTCAATTCTCCCCTATAATCCACATCAATCGTTCCAGTAGGTACACTAGTGTTGTTCTTCCATGTCATCCCAGAACGT 
GACACAATTTGTCCATAACAGCCGTCGGGAAACTTGTCTATTCCATATCCAGTAGAGATCTTGGCCAGTCCCATAGGTTCGATATCCATTTCTTCAGAGG 
GAAATAGGTCGTAGGCGACAGAACCGGGCGTGGCACGTCTGGGCGGAAGTGCAGTTTCCTCCCCGGGAGGGGCGAATCTCATAAACACGACAGATGCAGA 
TGAGTCCATGGCTGGATACAGGGAGTACACAGTGGAGACGGGATGTAATAAGTGTTCAACGTCTGGAGGACAAGTGAGTACCTTCCTCGGTTCCTCCCAC 
CCTTTTATAATCGTGAAACCCCCACACCCCCGCTAAGGCAGTAAAAAAAATTGACCCACCATCACCCCTTGTCCTTGAAGGAGGAAACTAGAAGATAACT 
TTTATCATGGCAACATTTACTGAACAGGATCATAAAAATGCGTTTTTATATGCTAATGAGAAGCTGAGGCAGGAAAGAATATACAGACTTAAAATGTCTG 
AGCCTTCAGTTTATGCTTTTATTGACATAAAAGAAATAGAAAATGGTTGGGAAAAAGAATTCGGGCTTTTAGTACAACCAGGACAGAAATTAGCTCCTTT 
CAGGGATATTTCTTATGACTCAAGCAAACTTGATTGTGACGCATTTTCTTGCATACCTTCAGATATACTTCATTCTGATAATGAAAAAAGAGTAGGAGAG 
TGCAACTTTGCCGAACACACCTCTGTCTCATTTCCTGTCAAGAACCCTGAGGGAAAAACATTGCGCCATTTCACGGCATGTGGTCCAGGGTGTTACCGGA 
GGTACAAGCAAAGAGACCCCCATACTGGTTTGCCAGTAGCCAGAGGCGTCCTGATGCAAGATCACGTTGACCATGAAACTGGAAATAAAATGTGTGAATA 
TCTGAACCAGAGCTTAGTCATGTGGGCTGCTGTCCCTTGGATACGACCTGGAGACCTTACTGAAGGTTACAACACAACACACGTGCCTGGCTTTGCATTC 
AAGGAGGATGACGAAAGGGATTCAAAAAGAGTTAAATATGAAAATGTGGTCATTTCAAAGGCTTATTGTGATTTCTTTAAACAGTATTATGACGCAGACT 
CTGGCTCATGTTATCGATCTGGATGGATGAAATTTGTCCATTTAATGTTTGGGCAGTATTrTACTAATCTTTCTTACAATTTAGCTAACCCAAAACCTTA 
CAATTTAACTGGAAATACATGGTCTGATGTAGTTTCTGTATTGACAGATAACCCTATTGTAGATGCGGGTGCTGCTCCGTCCAGATCTGAAATGGATGAA 
ATTATCACCAAGAAGAAGTTCAACGTATTCCCTTCTGAGCAGACATCTGCTCGCCAGAAAGCAGAGAATATAATACGTTCTCAGTACGGAGATGGTGTAG 
AAATAGACCCATCTTCTGTGGATGCTTTAATGCAGTTTGTTAATAGGGAAGGTGTAGTAGGAACAGAGAAAAAATCCGACCGCCTCATGCGAGTGGCAGA 
CGCTGTTATGGATGCGGCTATGCGCCTTCAAGTCATGGGTCTGGACGACAGTCAATCTAGACGATTATTGTTAAAAAATATGATTAAAATGAGTAGAAAC 
AACCCAGAATATGCAAGACATTTTTCCAGTTCTCTCAAATTAATTGGAGTAACTTTGGCCATAAAAAGATCTGTTTTCTCTAAAGGCGCTTCAGCTAAAA 
GAAAGGAAACGGCCATTAATAATGGCGAACAGCATAGAAGAAGCAGATGGTCACCCGAGACGGTGACAGAAGAAGATGCACTTTTATTTGCAAGAGAAAA 
TATCACTGAAGACCCAAAACACCCTGCGCCCTTTGTGGACATTTTACACTCACCAGATATCAATTCATCTATCAAGAGCGGTTCTTCTTCCTCAATATGG 
AACGATATACTTTCAAGAATTTCTTCGACCAGAAAACTGGAAGAAAAGGCGAGTGTTTTCGTTAAAAATCTTGTGGTGAAAGTAGTGAGACAGTTCTTGG 
ACATTTTAGAAGGTAAACTGTTTTCGGACGGATACGAATGGGACGATAATATTCCTCTGATGATAGGCGTGGACCAAATATTGAGAGAAGTCATTAAGGC 
GGCCAGTAATATGTGCGCTAGATTTGCCTCCTCAGCTCTGGAGTCTAGTTTGGTTACTGGCTTTATAGACTCGGCGAGTGCTATCACTTCTAGGTTAGCC 
GTACAACTGGCAGCCAGAACGTTCTCCGTATTCTTGGAGGAGTCTGTTATAGAATTTGTAGTTGCCGCAAGCCTTCGGCTAGCGATACAGGCATTTGCCG 
ATCTGGCAACTCTTGCCGCGTCAGCCTTAACCGTTATTGGAATTGTTATATTCGTAATACAAGTACTTGGGTTGATTTTGGATCTTGCGCTAGGGTTAGG 
TTGGTACGATCACATTTTTAGCCCAGAGGATTTAAAGAAGCAGGTTCTAGTGTTTAGGAGAGAGTTTGCAAAGGCAGGAAATGTAGATGTGGGTGTAGCT 
CAACCAGTCACCCCTGAAGAAATCGTCGCTATCAACGTTTTCCTTCAAACTGAAGAAAACGGGGAAGAAAAGAAGGAAGAAGGAGCACGAAAATCAAAGA 
TTGATTTTCTTCAAAAATACTTCCATTCTACTCCTTTGATGGGAAAGAAAAGTAAGTTTGTCTACATACAAGAAGCAGCTCAAGAATACTTGGGAGGAAG 
AACAATGAACGCTTTTGGGCAGCGTATAATAACAGCTGCTGATGATAGTGACACCACCACCACCACACAAGAGGGAAGGAGGGATGACGAAACAGTGACT 
AAGAAAATGAGGAGTATTATTCTAGAAACAGGTCAAACTCTTAAGGATTACTCGTCTGCTGTTAACTATAACGCCTCCCGTCTAGATTACGTGGGAGAGG 
AATGGGTAAGAAATACTGCCCTAAAAGAAGAGACAAGGAGCAACACTACTAGTGATAACCTATTCAAGAAAACTGTTTCTCTTGCTAGTATGGCCGGCGC 
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ATTTCTGGTTCTAGGAATAGGTGTATTGGTAGCGTCCCATATTACACTTTTACGTTTCACCAATATTGGTCTAGCTTTTGCGTTCGCAGGTCTCCTAGCA 
TTTATTGCACTTATGAGTATATCATATATAAACATGAATGCTATGGGTGTAGTGAATTCGGACGCAATATACAGGTCTACTGCTCTAGTTGGAGATATCA 
AAACAGACCCCAGAAGAGTAGGAATGGTCCAGCGCCACGTAGGTGTCGGGGCTAAATACAACATGATTACAGATTTCGTCTCTCCAATGTTAGACGAGAT 
CGAGAGTGACTAAAAAGTGAGGGTTATAACCCCCCCCCCAACAAAGGCTTTAAATTAGACTCAAACCAAATTCATAATCATTGTGAAGACATCGACGGTC 
GAGGAAGGTCCGTGCTCAAAAGGTCCCAAAAATATTCCTCAATCAAAAAATACCATGGATTCGTTGACTAATACAGTAACACTCCTCGTGAATGACCGTC 
TTGGAAATCATCGAACAAACAAACCAATCACCGAACAAGATGTGGAAAATACTCTTAACCTTAACAGTCTAGAAAGGGCAAGTCTTTTGAAGTTATATTC 
TGTTTTCATCAAAGAAATGCAGTCCTATTCTGGATGTATACCCAAGAACAAGTACACGAATGTGCAAGAAATATTCGAAGATGGACTAATTACTTTCGAA 
TGGAGAGATGGARCAAAAGTACACAGATCAGTTTCACCAAGTTCCCCTATACCTCTTTCTACAAAAAAATCGCCTCGGTCCTCACCTTCCCCTCCTCCAT 
CGATGCCTTCTATTAAAGAAGAAGAATTTGAAGAAGAATTTGAAGATGATGAAGAAATATACGAAACAGATGAAAATGTGGAAGATTTCATAAATGGTGA 
TGGAGAAGATTCAGAAGAGGAAGAAGAAGAAGATATAATTGTTGATGACGAAGAAGAAGAGAATGAAGAGGGAGAAAACAAGTACGTTCTCGCATTCTCT 
AATCATCTGAGGCGCCAGACTGCCGCCGCTGCCGCCGCCGCCGCAGCTGCTGCTGCTGCTGACATTGAGAAGAAGGACAAAAACCACGCAGTTAGTGCGC 
ATGACTACACACTGTCCGCCCTCCAGCAACAGCAACAAAAACTGCTCCAGCAACAGCAACAGCAACAACACCAGCAGCGCTCCTCATCTGAGAAGGTCAC 
CTCCACACCCAACAAATTCAACAAGTTTTTACTGCCGAGTAATGGCTTCTCAGAACAGACCGAGCTCTTTGTTTGTTTCGATGTGGATAAAATTGCGCAA 
TATAATGGACTCGTGGAGCTAGACATCTTACCCATTGTTGCTGAATACATTATCAACGGCCTTGGTCTGAAATGCAGTATGGAAACTCCCCCAGTGAAAC 
CGTGCAGGAGAAAGGAAGTGAAAGATGTGTGGTGTCAGCCTAAAACTAGCTTTGAAAATGATGCTGTGGAAGATAAACATCTCGCATTCGCAGAATCGCC 
TATACTTCAAAGGCCTAGAGATTTCCCTATCCCTAAAAAAATCACCGCCTATTTTTGTTTAGACGATTCTGTAGACATTAAAAACCCCTGGGGTTCGTGT 
CCGCTTTTGAAAAGTGGATCAAACTTTCGAGTGTCCGAATATTCGCGTCATTTTAATGAATTCTCAGGAGTTAAAAATGACGATGACACGTCTTCAAACA 
C TTGC TTT AT AT AC T CT C AAAAAAACC CC AAC AT TGAAATTGT ATC AAAATTTVAAT AT T G AAT T T GAGGT AATGATGG A G GG AAT T AT AACCC AT AG AAA 
AGATTTGTTCGAGACGGGCATTTTGAGCGATTCTTCATTAGCTACGGCTATGGCATTTTGCCACCCAAAAGCTAGAGTTCGAAATGTTGCATTGTTTTAT 
TTTTCTGTATATTTACCATTCTCTAAAATAACTCGCAAAGAAACTATAAAATGTTCAGAGACGGATAAGGTACATATTGGTTCAGACGCGATCTTTTCTC 
CCCCAAGTGATAATCCTAATATAAGTGCTCACCAGAACAATAACAACAATAATAACAATAATACCAGTGTGAATATCGAGGACAGACCTATCCGAAATAA 
TAATATAAGCAGAAAAATGACCATCACAAACTACCAATGTATGGCATGCAAGGAAAGATGCACAAACAATTGCACTAACGGTAACTATCCCGATCGTGGT 
AACCAGCACTTGTCRCATAGTGTAAAAGGGGAAGATTTCTTTAAGATTTTAAATAATAGTAAAGTAGATTCATTGAAAAAATTGAGCAGAGTACTGATTC 
CCGCTCCTCCCTCTGGAAATTATACATCTAAGTTTTGTGATAGAAGCTCTATGTGCCATAGCTTCTTTTGTAGAGGGATCGAACCAGTGTCTACTTCTTT 
CTCATCGGACAGTTTTGAAAAGACTAAACTTGTTTTGTATGGTAAAGTCGTTGACGTTATCAATAGTTATTCTGCCATAAAAACTTCCCATAATAATAGA 
ATCAGGGTCTTTTTTAACTCTGAGGAAAAAGATAATAAGACTATCCCCTCTAGAGCCGAAAGTGCAAAAAATGCATTCAAGGATATACTTGTTCACGAAT 
GTAATAAAGAACGAGCTGTTTCATATTTTGAGCAAAACAAATTATCCTCTAAAGATGGGCATCTATCTAACAAGTGGTGGATCGAACTTAATGACTTGAA 
CATTATGTTTGAGAAACACGTGGAAGATTTTTACAAGAAATGTTCTAAAGTAAATGATGCAGAATCTTTAAAGGATATTTTTAATGATTTTGAAAAAACT 
TGTGATAAATACAAAACTGCCAAGAGGGCAATTATTGGAGCACAAGACCCTTCTACTTCTACTCCCTCTAAAAAGGAGAATGGTATCACTAGGATTATTA 
GTACATTATCCGAATTTCATTCAAAAGATGAAGCTACAGTAAGTGCCCTTCTCGACAAAACAATGCTCTTGGGATCGAGGACAATAATGTCTGGTGTTAG 
ATGTGTTATACGTAACAATAGTGTGTTTTCGGGCTTTGAAAATAAGAACACTAATAATAATTGGGAACTTGAGATTAGACACTATGTCATCTCTATGGGA 
GGTGCTGCAGTGACAAAGATTTCCGATGAAGATTTGGAACAATTCACGCCTGTAAGAGGTGCTGTCTCTGTCACTACAGCACCTAATGATAAGCTACCTG 
T AGGGGC AC ATC AG AC ATG G AAGG ATG AAC AAAC ACT AAAAAC AAAC ACT AAACGT AAT AGTCT ATATGACTCTTA C AA TTC AAAAAGG AAT AAT AGGGA 
TAATAATAAAATAAAAAATCGTTCATTAAAACTATCAGATTTTAATTGGAGAACACCCAATATCTCTATTCAAGAATTTAATGCAAATAAAGATGATGTT 
AACAAGAAGAGGTACGCAGAAGTCGTGGCGTCAGCTGCTCCAAAGTCACCTTCACCAACGAGCAGCAGCAGCAGCAACAGCAACAGCAGCAGCCCTCCTC 
TTTCACCGCTCTCACCAACAGTGAAGAATAGTAATAATAAACCATTGTATATTCCTCCCCATAAAAGAATGACCACTACTGCTGTTTGACCAATTTATGG 
GTTATTTTATAATGTTGTGTGATATTTTATACAATAAAAATATATAAGAAAACAACCAAATGATTTTGTATATCCTTGTTCTGTGTTGGGCGGAAATTCC 
TCCGCGCCCATTATGATGAAAAAATTGCTAGAGGCCTTGAATCCTGGTCACGACAAGGTTAACAACAAAAAAAATTACCTGTATGTGCAAGTCATATCCT 
CCAGTCCCACGCATGAGAACCTGTTATTGCACCACGATTCGTGTAACAAAGTCGTATCTGCACATGTGCATAAATTCCTTCACGCTAAAATGAACGACAA 
AATTGCCGGGAGCAATGGGCGTGAGAGCCGGTTCCTGGAAAACCCCACCTCCGTTCCACGCACTACCTCACTATTGCACCATTTTTTGGTGTATAAAAAG 
GCACGGAGAGGTAGCTAAAACATCCACTCGCAGAGACATCACCGTCAAGATGACGTCTCCTACTAGTGCTCTCCGCGTTGCTACTACTGCCCTGCTGTTT 
GCTGTGGTGGTTACGGCTACTGTGATTTCTGAAGCTGGGGGAACTTlAGATAGACGACCGCTGCGCCTTCCCGCCATGTGACTACTTCCCGGAACCrGAAT 
GACGTGGAAGAAGATGTGTTAGAGCTGCTCGAAGGGGGAACAACAACAATAACAACGGAACCCGTGATGGAGCCAAAGTCTTTGGCGAAATCGCAAATGG 
ACTGATTGGGCGTAAGAGGCGTGAAGCAGTATCTCTAACACCAGTCCATGAAGATATGCCAGATTTCTTCCCCTACCCTCACCCTGAGCATCCCATTGGA 
GGGCGTCTACCACGTGAAGCAGCTCCTCCACTTAAAGGTGCACTTGGACGTAAAAGGCGTGAAGCAGCTCCTCCACTTAAAGGTGCGCTTGGACGTAAGA 
GGCGCGAAGCAGAATCCTTGGAGGAAGAACTTGTGTCTGCTGAAGAAGAACGTGAAAAGCGCGAAGCAGCTCCCCCACTTAAAGGTGCACTTGGACGTGA 
AAAGCGCGAAGCAGCTCCCCCACTTAAAGGTGCACTTGGACGTAAGAGGCGCGTW^GCAGCTCCCCCACTTAAAGGTGCGCTTGGACGTGAAAAGCGCGAA 
GCAGCTCCCCCACTTAAAGGTGCACTTGGACGTAAGAGGCGCGAAGCAGCTCCCCCACTTAAAGGTGCGCTTGGACGTAAGAGGCGCGAAGCAGAATCCT 
TGGAGGAAGAACTTGTGTCTGCTGAAGAAGAACGTGAAAAGCGCGAAGCAGCTCCCCCACTTAAAGGTGCACTTGGACGTAAAAGGCGTGAAGCAGCTCC 
TCCACTTAAAGGTGCGCTTGGACGTAAGAGGCGCGAAGCAGCTCCCCCACTTAAAGGTGCACTTGGACGTRAGAGGCGCGAAGCAGAATCCTTGGAGGAA 
GAACTTGTGTCTGTTGAAGAAGAACGTGAAAAGCGCGAAGCAGCTCCCCCACTTAAAGGTGCTCTTGGACGTAAGAGGCGCGAAGCAGCTCCCCCACTTA 
AAGGTGCTCTTGGACGTAAGAGGCGCGAAGCAGCTCCCCCACTTAAAGGTGCTCTTGGACGTAAGAGGCGCGAAGCAGAATCCTTGGAGGAAGAACTTGT 
GTCTGCTGAAGAAGAACGTGAAAAGCGCGAAGCAGCTCCCCCACTTAAAGGTGCTCTTGGACGTAAGAGGCGCGAAGCAGCTCCCCCACTTAAAGGTGCA 
CTTGGACGTAAGAGGCGCGAAGCAGCAGCAGCAGCTATGCCTCCCCCTGAAGACGATCTCGACTTCTTTTACGCACCTGTTGCTTTGCCTCTACATGGAG 
TATGGAAAGCACCAGAACCTACAGGTTAAAATGTGGCTGAACATCACATATACAAGTTAATTGGATAATTGAAACAATTTGACTTTTTAGTACTTAATGT 
TTATTAATTGAAACAATTGGACTCTTTCCTACCTAAGGTTTATTAATTGGTACATTGTTAACACTTGCTCTTTGCTCATAGCAGACATACCTGCTATTTT 
TAACGCTTTGTTTAACGTGCTTTTTACAATTGATATTGGTACATGTTTGTGACCTCTTGAATAAATTTTGTTAGCATCAATATTTATTTTATTTTTATCC 
TTTTCTTGTGTGTTAGAATTAGAAGGAGATGTGGAGGTTGTTGATGTAGTAGATGCTGTGCAGAGTTGCTCCTGGTGTTGCTCCTCTTCTTGATTCACAT 
TGTCCTCTTCAACATACTCCTCTAATCCAAAGCGCTCTCGAATTTTCTCCTTTACACCCTCACCACAATCTCCTATCGAAGACTGTGCTAATATAATTTT 
TGTAGTAGAAGTAGAGAATAATGGTACAACGTTATTGATGTTGTTACTTCTGCCCATAGCTGGTAGTATTTTTGAAGGCATGCCTTAAAAAAGGGATATA 
TCTTTGTCCTTTTCTATACAAAAGGATAATCGTTCATCCCCATGAAAAAGTGTTAGATGGGGCAATGTACACCTATAGAACAGTGTTATTTCTTCTATAT 
CTTCCAGTATTTATACCAGACCCTCTTGCCCCATAGTTTCGTATCTCCATTTTACTATTCCTGTCTCTCTGGTGAACCCATTCTGAGCCAGGTGTTCTGC 
CCATAATTTCTGATGTTTTATCAACAACAATATCAGCTGAAGGGGCAAAAGGCCCCCCATCTTCCAGTGCACTATTCCCATTAGAAGGAAGTGGACTAGC 
TGGCATAGGTGACAATAATGAAGGGAGTGGTGAAAACACAGGATAATCATCCACACCCTCTGTCACTAGAGATTCTAGATCCGGGGGAGGAGGTGTAGGC 
AACATAATTTCCATCTGCATTTCTTCCCCCGACGTGGACGGTTGCTGTTGTTCTACTCCTTCTACCTCTACTTCTACTCTTCCTCCTTCTTTATTATTTT 
CTGCTTCTAGTTCCTTCATTACCATTTCCACGCCTGTTGCTAGTCTCATTTTCTTATTGTTATTTTTTATGCGTTCTTTAGCTTGCTTTGTTACTTCATA 
TGTAATATCAGATGAGTAGTATTTGCCTGGACAAATAAGGTTATCACTCATGGCTGTATAATTAGTATCAGAATTGCCATGTGCATTATCTTTTGTTATA 
AACTCTTTGGGTGGTTTATTTGCACCAGCCGTGAAAGTATTTGCCAGAAGAGCCGAGGCTGATGAAGAGTTTTGCTTAACATACTTTTCATCGATAGTTT 
TATTAAATAATGTAACTAAACTGGCCGAACACTTACTATCCCGGCAATTCGCACATGCATTATTATTCTTCTTTTTATTATCCATTGAGGCCTCCTTTGC 
AGTTTCAATTAAAGCTGATATTTGGCCCATGATTTTAAGGCTCGCACACAGTGCCATTTCCTGTCCTCCTTTCTCTAGTTGAATTGATCCCCATTCATTA 
AATTTTTCTTGTAGAGTCAGGAGACTACTCTTGAGAAAAGAGTGTAAAAGTGTTTGAGTAGTTATACAATTTTTCAACGTTTCAGTGATTGGGTCGAACC 
CGAATTTGATCTGATTTCTTACACTATCCAGTAAATCCTTGTTTAATTCTTCCGTGTCAGAGGAAGGGAAAGAAATTATATCATTAGCTGGAATTGATTC 
AGAAAAGTGAATATTTTTAAGTACATTTCCGTCTGTTGAAGAAGGGTGCTTTAGATCACACGTGCTAAGCATTACATCTAATGATAAGAAAGGGGTAACT 
CTTTCTCCTGTATCGAGATTTCCTACAACATTTTCTGGTGTCAAGAGATCGTCTGTTTCTGTCACTCCCACCAATCTCTTGTAATCTTCCTCAGCAACAG 
TTTGGTTGTTGTTATAGAAAGACCCATAAGTCTGAAAAATTCTCCTCGCAGTCTCATTATCTGCATCATTAGCTCCTCCTCCTCCTCCTCCTCCTGCCAC 
ATTTAAAAGATCTAGGTAGCTACTCATCTGGGAGGGAAGGGGAGGGTTAAGGGTTGTATATAAAGAGGCTTGTCTGAAGACGCGAGTGATCACAACTTCA 
ACGATGGACGTCGAGTTCGGTTTCTTCCACGGTCTGCTCTCCAAGGCCCTTCTCCCAGATGAAAAACATCAACCCGTTATAAGGCGTCTTTGTGCGGATG 
ATTCTAGAAATAAGGGAGAGGATGGCTGCTGCTCGTTCTGTGGAAGTUVGAGGAACAGGAGAGAGCAATACTGCGTGCCTTGAACAACTAATAGACGTTTG 
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TTCCTTTATAGGAACTGTCTCATCTATTGGTACAATTATCAATTCTAATCTTTCAACCAGTTGTTCTAGACTACAAAAAACGTCAGACAGTTATGCGGCA 
TTATCCCATTCTAGTTTTCTGGATGTGGTATATCCAAGTTTGAAGAAAfiCAACTGAAGACGTATTGCCTCATTCTTTACGTGCCATTTGGAATAAACAAC 
TTCCAAAGTTGTATGAAAAAACTCTTCAACCCATAGAAGAAGAGGATATTGGTTATAAGGATTATGTTGTTTCAATTGAAGACGACGACAATGTTGATGA 
TGGTGACCAACAAGAACAAATGATTATTGATGAAGAATCTTATAAAACTATTGGAGAAAAATCAACCATTGAACTGATAGGCATGTATAACAATAACAAG 
TTTGGTAATGAATTTATAAGGATTCCTTTAAGAGAAACTGCGTTGCACGCACAATCTCTGAGGTACGACACTGAAGCTAAATTTGTTAACCACAAGGACT 
CTATACCTCTATTTTATGAAAACAGCACGTGCACATGTAAGGAACGTCTTATTGATTTTTCTGAGAGACAACTACAACAACTAAAACAAGATGGAATGGA 
TAAACCAACGGACAAGTAGAGAGGATCTTTTCAACACGTATACAGGGAATGCAGTAATACGTTCGGCAGCTAAGCAAGCACTGGCTATTGAAAAACACGC 
AGCAGAAAGAAGAGGAGAAAAGGCATGGACGACTTCAGCAGCAGCAGCAGCTTCTTCTAATTTTAATAATGTACAACAAGATTATACTGATGATGATATT 
ACACAAGTGTCTATTGCAAACAGTGTTTTGAATAACCCCTTTTTAAAGAGATATGCAAAACTTATAGATAATTTAGCAATATCTTCTTTACCTCCTGATA 
TAGAGGATGATGTCATTATACACACTAGAGATGCCTCCAACTCTACAGTCAGAGTAGATGGAGCCAATATCTATTTCGCCATAATTGACGGTGATTTATG 
TGTATACCCTAAACAATATATATCTGATAAAGTGCTGTGTGGTTCTCTCAACCGGGAAAAGGCACTGTTCTATAATAGCTCCAAGAATAAGTGGACGTAT 
GGATGTAACCTAAACTTTGATATCGTTGACGCTGCCATCATGAAACACCCCGACTACAAGGAAGAGACTACATCTACAAAACATATACGTAAAATATTGG 
GTATCGGAGCATCGGAAAAACTGAACATTACCCACTATTTAAACTACTTTATCCAATAAATCACCATGTCTATATACAGCCCAGT^T^ATGTCAGCCATGAT 
GAATGAGAATGAAGAAAAAAACGTTCTCAGTAATAGGAGGGCTGTAATATTGATCTTTTTCATAGGTGTTGTCATTTTATCACTCTTGTTCTTAATCAGA 
CTAGATATAGATATCAAATATGGGAATGTGGGCGAGAGAGCCACTATTGTAAAGAAGACTAGTCCCATTTTTAAGGAACCTATCCAGACAGCCATCAACT 
TTAACAACAAAAAGATTTCCCACATTGAAGATGGAGGAGAAGAAAGTGATGTTATAGCTTCAAGGACTGTTAGTTCTCTCCAGAGAGATCCTCGTGTACT 
TTTAACCAGTTCATTTGGATTGTATAATAGGACAGATTATATGAAAAAATAAAAGTACCAAATACCATATCAATTGTTTTATTTCATCATTAGTTAACAT 
CTTTTTAAATCCACAATCAGAGCTAAAACATAGTGCAGTTATACAATACAATAATACATTTCTCCCTCCAATGATACAATGTTGATGTTCTCCTCCCATA 
TACTTTTCAATCAAAGCCGAGTCTTGTTCCTTGGATAATTTATCCATGACCAGCTTAGAATTTCTCCAGTACATGGGACGAAACACTAACCTCATACTAG 
GATTCGTATCATTGTGGGAAGGGTTTATACAACTTCTTGATGTCTTTACATCCTCAGTAGGCATTTCACCCAATAAATTAAACACATCTCTTATAATAGA 
AACAGACTCAGAATCATACTTTAAATCTTCTATGCATTCACCTCCTATTTTAGATTTATTAAAGGGTGCAAAGAAAGGGGTTTCTTCCTTCTTCTTCTTT 
TGGACATCATTTTTACCTAATACACTTAAACCTATCTGTTCTCTTACAAAGGACTGAAAATTGATATCTATATAATGTACATCTTCAAATTTTTGGATAT 
TGCGATTAACATTTTTGTATGATTCTGGAACAGGGATCTTCTTTGCAGTCGTCATCATTCTGGAACACGTGCAGGCAGGTTCTGGGACAACACCCATGGA 
TAGGGCGATAGCATTGGCCAGTTTCACTAGCCACATTTTTGTTTGTGGATTAAAAGTCTTCTTCTTTCCAAAACGAAGAAAGAAGACGCTACATACGTCT 
CCTTGCTCGTCACCAATTACACGTATTGTGTCCCTTTTGTTTAATATTTTGTAGGCAAATTCCAGTAGTACTTCCTTTCCTTCAACAAGAGGTATTCCAA 
GTTCTTCAAAATCCTTTTCTAAATAATCCACACTCTCCTCTAAAGACAAGGAAGAGGATGTGCATTTGGGCAGCAAGATTTCTGTGGTGCACATATTATT 
ATTCAAACCTTGAGAAGAGGACATTTTTCTCTTTACTGTTCAGGACGAGGAGAATTGGGAGGAGTTATGTTAGTTAGTGATACGTCTTGCCGCTGCTGAT 
GCTGGAGGTAGAATGAAGACCACAGCGTGAGGGGCGACCATGGCTTAGATATGGCCACGCCGAGCACTGGCATGAATTGCCCCTGAGAGGGTAGCCATTT 
AGTGTTTTCACTCGGTATTTTTCCCTCTTTATTGAGAGTTGCCAAAAGTTTATTATAGTTGTCTGTGCATGTCGATACCTTGTCTTTAAAATCGTCCAAT 
ACTTTTGGAATCCTCTCCCAGTTCTTGCGTTTTATTTGCTTGATTCTGGGAGTCTTTTTTAGTTCAGGCATATACTGCAACATTGTGTATTCTAGTGCAG 
CAGACATAAAATTAGTTCTATCTGTGCATATTTTATCCTCTTCTGTTTTACCACAAAATTCACTATGTATTATGGGCATTAATTGCTGGTACACAATAGT 
ATTATCCTCTAGACCGCACATGTTCATATACACGATAACTAAAAGGTTCATGGTTTTTACACTATAAGGAACACCAGAGAATCTTTCATCGTCGATGCCT 
TGAACTGGTTCATAGAGCACCACGTTACTATTGTTCCCGTCTTCTCTCTTATTGGCGTCCATTAAAACTCCACTTAGACGTGATAATAAACTATTGTCAT 
TGTACGTTATATTGTCAAGAGAAATGCATTTGTCGAATTCGTTAACAATTTCTGCCCACTTTTCTTCATCCATCACGGGTTGTTTATTGTTAAGTTTTGA 
TGATACAGAACATCTTGAATGTTCATCCTCTTCTATATTATTTTTAGCGAGCGATGACGTCATTTCGTTCGTGGCTGCAGTTACAGCTTCTAAAAAGTCA 
GAATCTTTATACTCGAACGCTTTCCTTGCCTCATCTAACCAGTCGTCAGTTTCTAGATCTGCCTCCTCTTCCTCTTTTTCACCTTCCTCCATTTTTTCCT 
CCTCCTCCTCTACATCATCCATTTCATTGAATTTTTTAAAGAAATCTAAAAAGTCACTCGTGCTGTTATTTTGTTCAGTTTTTAAGATTGACGTCTGCTC 
AGTAGATAGACCTGGTATTGACGATACTATTCCTCCTCCTCCTCCATTTTCTTCTCCCAATGCTTCTGGATTCGTATCTAAATTATTACCAAAAACATCA 
ATCCTTTCTTTAGTCAATATTCCCCTATCATTAATTATACCACCTTCAGTGTGGCTCTCAAAAAATCCACGAATTGATTCGAAAACTGACGATGATAATT 
TTTTAGTAACAGGTGCACTTTTTCTGGTTGCATTAACCTTACTTTTCTTACGGGTAGATTTTTGAGGGTCATTACCACTTTTACGTTTATTATTAGAAAA 
TGGGATAAGATCTTCAGTATCAGATTCTACATCCGACCCTGTATTTTTACTCTCATTATTATTAGCACCGGAAAGATGTATACTGATATCTGATAAGGTT 
GTTATTGTACCAAAAACACGTTGATTAAAGGACCCCATTGTCAAATATGACATGACATTTGCCGCTGCATTTACAGCCAGGGCTCTATGAAGTGCGTGTT 
TCAAACTAACGTCAGTTTCCAAATCAATCAAGTGTGTTGGAGGAAAACACAACTGAAGC6GGTCTCGTACAAAATCTATAGCTTGAGACAGATAGAAAAA 
TTTGATTGTCCAGAAAGGAAGTACTAGTTTCTTATCACATTTTCTACAATTACTTTTAGACAAATTCCCCATAGAAATATTACCAACACTATCATCACAG 
TCTTTATTATTCTTCCTAGCTTCTTGTTCACTCAAAAAAATCTCCCATCCGCCCACACAAGTACAGTACGTGTTTTCAAATACTGCAACTTCTTTCCTTA 
CCCATGTGTCGTACTCACAGTTACTAACAATTGCACTATTGTAGTCTGATCCTTTAGTTAGCATAATCAATAGTGTACTTTCAGGTGAAAGCATGAGCGG 
CGATTCTGCCAAATTATATATCGTATTTGGCTTTGTTCTTCTTTTAGCTGCAAAACCACCCAACTCGTTTGCTTGATTTTCACCCGAAGAATTACTGTTC 
ATGCACCTAAAGAACTTGACATGGCCAATCTTTAAGAACGTTTCAGAGTCCTTAAATTGTGTACCAGTAAGACGACACACGTTTTTATGATGTCCAACCA 
TCAGGTTCCATTTCACCAGCACATCAGAATCGTGGGAGAAAAGATTCATAGAAAGTGTTTGATTTTCATCGTTGGTGTGTTGCTTCAAAACATTCAATAA 
ATCTTCCCACCTCTTGGATTCTTTTCTGTTACCAGACATATTATAAGAATCGTACTCCAATATCTTCCTCTTTTCTAGTAGGGAGAAGTGTTTCCCAGGA 
GTCTCCAGATTCAAAAGGCATGATGTTAAACGTACGATATCATCCTCTGCTTCGGTGAATGATGCATTAGAAAGGTACATGCATTTGCTCCCCGTAGTCT 
TTAGAACAGGGATCCTCATTATGTCCAGTAATGCAGATGCCATCAGACCTTGAGGAAGACGCATCATCATCCGCTCGCACGAGTAGAATAGTAGATCGTT 
GGTCTTCCTATTGAGGCAATTTTCATTCCTTGATTTTGCCTTTTCCTTACTTCTTTCTTGTCTGATCTGTCGTTTGGCTCTCGTCATCATCTTGATGGCT 
GGAGCCTTTCCATCCGTAAAGAGTACAGTATACCTCCCTTTAAGGTTTTTAGACTTGAGGAGATTTTTCATCACTCGGATCATCCTAAAAGCGTAGTACA 
AGATGACCCCCTCCATCTGGCTAGCAAAAAATGCATCTTTGAATCTGGATTTCAAAACTGCTGTCGGGGAAGAATTGAATGAAGTAGTAGGCACGTTCCT 
GAACAAGAAACTGTTAGTCAAAAACTTCCTCTTCTTCATAACATCTGTGTGTTTGTTGATCCAGTAATCGAGACCCTCATAGTCGTCCATTTTGTGATTA 
GAAACTGGAGGAGCCATCATCATGTCTTCGATTGGAGGTTCAGTTAGCCAATCAACTTCCTCTTGGGATGAAGGGATTATAATAGGCACTCTTTCAATTC 
TAGCCTTAATTTTCTCGTGTTTATATTTTCTAGAAAAGAATTCAGAAATAACCTTCTTTTCATCCCTGCAGTAAAAATTTTCATAAATATGTTTCTTGTT 
CTGCATCCGTGAATGCATAGTATCTTCTTCCCTGATAACATTAAAACCGTCATGTAAAACGTCTATTATCTGTTGATTTGTAGTATTGATAAACAGGATC 
ATCTGACCCATATTTTTCTGTCCCTTCAACATGGTGGTTGCAATTTGCCTGCCCAACATGACCAGATCTGATACATTATAAGTGGGCAAATCAACCATGT 
CGAGTGTGTAGTCACAGGTAGGTGAAGAAGAGGAAGATTTCTTGCCTCTTCCAAAATCCAGGAAGTTCTTGGAGTTGATGTTGGCAAGAGAAGTGGCTTT 
GAGAAGTGAGCTAGTTGTGCAATCTTCTTCTCCTGAACAATAATCCTCGTCGTCTTCGTCATCGTAACAATCTTCATATCCAGTTTTCTCCTTGATGACT 
CTCCCATTCTGTGCACCGTACGCATTTTCGTACACTTTTTTGGCTTCTGGGACACCCATTTCTGTTGATGTTACTCCCACGGGATGTTAGACGATAATGA 
ATGCTCACTGATGGGCGCCGTAATTTTTACACACTCTGCGTAGTCTGGTGTTTTTACCCCrCCTAGAAGAACTTCGTGGAGCTGCTTTAAAGGTTGTAGT 
GCTTGGAATCTGCAGCAAACGCAGCTCTCTACAGAGACTTCTCGGAGACTTGCCAGAAGCGAGTTTCCTCTTCTTAATAGGGGTACTTTCTTCCTCTTCC 
TGTTGTTGTTTGTAGAGTGTTATAGATTCAGTAGGGAAGTAGTTTCTAGATCGAAGAACAGTGTAGTCTTCTTCCTCTTTCACAGTCTCAATAGAATTGG 
ACACCTCGTTGAACACTTCTAGGGCCAATTCCCACACACCTTGTTCCTTAAGTGTCTCGACCACGTAATCAAATTCGGATTCACTCAATAACTGTAAGGA 
GGAAGAATCTCCATCCATGATACGAATCTGGAACACCTGTCTCCTCCTTGACTGTTAACTGATGTCTGACCCCTTGTAACCACTTTACTGATGTTTTATA 
CCCTCTCCCGACGGAAGATGTATACGTCACAATTTTTTTTAAAAAATAACGTTATACACCAAAAATGAGCGATACAGGGCAGATGGAAGAAAATAGGCCT 
GCTACCCAGAAAAGGAGACCTGGAGATGAAGAAGAGGAAGAAACTGGTAGTAGTAATGXrCCATATTATGCCAACTTTGGCGATGACGCCACGTACTCCA 
TGTACACTGGAGAAGGAAAAAGGGGTAAATTTGTATTAGAGCCACCTAAAGAAAGAAGTGTACAAAGGGTGCAAAAACCACCTAAAGAAAAGGAGGAAAG 
GGAACAACGTTCTAATGTTCGGACACGGAGACCTGGTCAAGAGTTTGAACAGAAAGTGCTACAAGATCGATCGCGAGAACGGTCAGAAAAACTTGGGCAA 
AATTTGGCAGAGAAAGGATTGCAAGAACGGCAAAAGAAATATACTCCAAAGGTAGCACAAACAATGACAJUVAAAAATAATCAGGTTTCGTGAAGGAGGAA 
GAAAATTCAAGGCGCCGCAACAGCAGACATCTGACAAAGGTGCAGCAACCAATGTTCTTGAAAGGGAAGAAATTGAGATGGCTGCAGAAAGAGAACAACC 
AGTAGAAATTACAGGAGATACTATATTAGGTGGGCTAGGAGAAGAAGATGACGAAGATATGGGAGAGGATGAATTAACTATACAACATTCATCTATGGCT 
GTATCACAACCCGTTCAACAAATCGTTGTCAGTTCTCCTATACCGCCAAAGCCCACTAGGCCCGCTCCTGATATTCCCATACAAGAAGATATAGTGGGGA 
AAAATATTAGCCAGTTACCACCATTACCACTTGATGATTATGAGGACGAAGAAGACGAACATTTGTACGAAGAAGTGAATGATTTCTTAGTGGCACCACC 
AACAGCAGCAGCAGCAGCTTCCACAAGACCTCCCAGGCCTAATATTCCTCCTCCACCTCCrCCTGTTGTTGCTGTTGCAGACGAAACCTTGAAGAACTTG 
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GCTTCAATTGCAGCCTTGGAAAAGGAGGCCGAGGAACTUUVGAGCGGCCGCAGTTGAAAGGGAAAGAGAAGTAGAGGAACAAAGAGCGGCCGCTGCTGCTG 
CTGCTGCTGCTGCTGCCGCCCAACGGGAAGCAGACGAAAAAAGGGAAAGAGAAGCAGAGGAACAAAGAGCGGCCGCTGCTGCTGCTGCTGCCGCTGCTGC 
CCAACGGGAAGCAGACGAAAAAAAGGAAAGAGAAGTAGAGGAACAAAGAGCGGCCGCTGCTGCTGCTGCTGCCGCTGCTGCCCAACGGGAAGCAGACGAA 
AAAAGGGAAAGAGAAGTAGAGGAACAAAGAGCGGCCGCTGCTGCTGAAAGGGAAATACTTGCCCAACAACTTCAAGAAATGAAAGAACAAATGCGCATAA 
AGGAAGAGGAGAGGCGGAAAGAACTAGCAGATAAGGAGGAAGAAAAACGTCGAGAACTAGCAGCCAAGGAGGAAGAAAAGCGTCAAGAAATATTAGCTAA 
AGAAGAGCAACTTGAAAAATTGAATTTCCAGTTGGGTACAGAAATCACGTCCAAAAGAGCACTCGAACAAATGTTAGAAGAAGAGAAGGCCTCACGCTCA 
CGGTCCGCAGCCAGTGCACAGTTAGCGATCCAAGCAATAGAATATGAGGATGAACTTCCTCAGGCAGTCGAACCTCAAGGACAGTTAGTTCCTATGGATA 
CGGATTTGTACGGAAAAATGTACGATCTCAAT7UVGAAATTAGAAGTACAGAATAATACATTAACTTCTGCATTTGAAGACGTGAACAAAACAAACGAACA 
GAACCAATTGGTTGCTCAATCCCTTGAAAAATCCGCTAAAGCCATTGAAAAATTAACTAGTCAAAAACATCTTCCTGTGGATGATCCTGCTTTTATGCAG 
AGAATAATAACAGAGAGGGATTTTTCTTTAAAGAATCTGGGAAATGTTTACAAAAGAGTTCTCGGGGTCTATTTTACATTAAAAAGGGACCTTTTTAAAT 
CGAAGGCATTAATTACAGATAAAGAATCAAGGGATCTGGAGGTGCGTCTAACAGATGTATCGACAGATCTCAGGGCTAATGATCTCAATACAATACTGGA 
AAGGTTGGATGTATCCGTTAACATACGCTCTGGTGGAACATTATACACTAAATTTACAGAGGCAGACACGGCATTAGCAGATCAAGTTCCTTCGAGGATT 
GAAATAAGTAACAGATCAAGATCTGCCTTATTGCCATTTTCATCTGCAGGTTTGGATACTAATTTTACTAATAGTTCCGACAAGTACAATGAAATAGTGA 
ACCAACTAAGCAGTATAAATGAGGCTATGAATATTTTGAAAGAAAATATTGTCCCAACATTGAACCAAATCAAAATTGATGTCACCAATCTATTAACAGT 
TTCAAGCTCTCGTCAATATGCTATTGAAGAAAGGGTGTATTCTGATGTGTCCCGAATGGATTCTGAAATAAGAAAATTCCTCGCTATAATGAACAGTAAA 
ATATCCCCTTATTTTAAAGGCGATTGGACGGATGAAAGACAACGCTCTATTGCTGACAGTATTTCCTCTCAGATAAAATCAAACGATAAAATTAAAGAGA 
GTGTTGCTACACTACACGATATCAATACAACCTCAAGAATACGTAGTAATCCCCTCCTGCACAAATCCTCAGTTTTATCATCTCCAGACTTTTTAAATGC 
TGTTAACGACTTTAGAAATTTTCTCGATATCCAAGGAGGTTCTCAATTTACTTATGATGTCCTTTCAGGCCAAAATATTGATGACCTTTCACTGGCATCA 
AAAACCACTGAAAAGGTTACAGAATTGTGCCTCGAATTATCCATAATTTTAGACGTGATCCATAAAAATGCTTTGAGTTTAAATTTGCCTGCAATrACTT 
ACCCCGCAGGGGAAACATCTATGGAAGAAAGTGGTTCATTGGCTGTTGACATTAGACAAGAAATTGGTAAGAACATATCAGATTCTAGCGCTGAACTTAG 
TCGCACACTGTCAGAGGCGTTACAAATTTTTCAGCAACAACAGCAACAACAACAGCAACAATTCCAACAGCAGCTGTTACAACAACAACAGGACCAACAA 
AATCAACAACAATTATTACAACAACAAATAGAAGAACAACAACGGGTTCAGGAACAGCAGCAACAGCAGCAAAGGGACCAACAACAACAGGAACAACAGC 
AAAGGGAACAACAGCAGCAACAGCAGCAAAGGGAACAACAACAACAAAGGGAACAACAACAGCAGCAACAGCAGCAACAGAGTGACCAGTTTCGACAACA 
ATTATTGCAACAACAGCAACAATTTCAGCAATTACTACAACAACAAGGAAGAAGAAGAGGGGGTGACGATGGTGATGAAGAAAGAGAGGAAAGAGAAGAA 
GGGGCTGAAAAGGATGATTGTGTGCGTAAGGTTGCAGAATCAGTAGCGACAAAATATACTGCTGACTTGACTACCTTATTCCAACGAGAAGAAAATAACT 
TCCAATCTAAAATAGCATCAGCAAAATTGGGAACCCTTGTCTTTGCCACCCCTCCTTCACCTATCATGAACTTGACAAAATTGAGAGAGGAATATTCCAC 
ATTCACAACCCAGTGTTTTTCAAAACTAACAGCTGAAAATAATAGTATTATGCGTATTTTCCCCGAGAGGATTGTAGAAGTATGCAAGAGTAAGAATCTC 
AATT T AATG GGAAAAT ACTTGT AC ATT AT AACTACCGCAC AAAC AGAAATGGAAG AT CGAGTG AAG AAT ATATTGTC TGGT ATT TTC AATCAAATTG AAG 
AGTTTTCAAACAATGTAAAACAACAACAACAACAACAAGCTGCTTCAGCTTCTTCTACTAATCCTCCTCCTCCTTCTACTCCTTCTACTACTCCTCCTGT 
TACAAGCATGCAAGTTTGTGAGTTGGATGATCAACGTACCCTAGAAAAGGCTGCCATAGTAGAGGCAATTACTCTGGCCAATGCTGTACTTCAAACTACA 
AAATCCGCTTCAGCTCCTTCCACGGCGGCTGAGCGAGAAATTGCTCTAAAGCTAGAGAATGGGAAAACATCTATCCGTATGGAAAAAGTGGATCTAAGTT 
CAGGAGCTACTGGTGTTTCCGACCAACAAAAATGGATCGACGAAAGTACTTCCAAACAAGAATTGGAAGATTTCATTGCAGAAGAAAACTTTGTAGAAAC 
TGCACATAATGAAATGGATATTGGATTAATTTTGGATGCCAAGAAGAACGATCCGACCCGTGATGCCAATCTTAGGCTCGTGAAACCTCATGGAATAAAT 
GTGCAGTCTTTCCCATATTACGTACTCCGCACATGGCTAGGAGAAACAGATATATTGGATGAAGATACTGTACATCCTGAATATTTCCGCCAATACATTG 
ATCGCAATTGGAAGGTGGAAGAACATGAGCGTGAAGATACATTAAAGGCACTGGGTGTTTCTTTATCAGATACGTTGGCACACATCAAGGACTACTATTC 
TCCCAGTGTCAAAAATGATGCATCAAAATCAGTACCATTTGCGTTGAACACTCTATTGTACAACATATTTGCTATCGACGGAGGAATGATTTCTAGCCTT 
TCAAGAACAGCCTTTATTTACCGTAAATTTTTAAGGCAATCTATGACAGATAAAGAGGTTGCTCAAGGACCTGTTCGGTCTCAACTGTGTGAAGCGACAA 
TAGCGTCTCTTTTCACGGCATGTAGTAACCTTCTTCGATCGTCTCCCTTAGCCGATAAAGTAGAACCACGCCTTCAAGAAAAATTAGCGGCCGCTGCTGC 
TGTAGACACGTCAACCGGAGACATGTTTCGTATACGAGTTTGTCATCTCATGTACAATTTTATAGTGGCTTATGTGAACCTATGCAATAATCGTATAAAC 
TATACGTTT^AATGTATTGAGAGCGTCGGGTCTGGCAAACAAAAAAGTCGTGGCCGGTAAAACAACTAAAGGGCATACATCTTCTTCCCACCGGTTTGGAT 
CTTATGATGTCACATATGATTTTTCAGTATTGTACAAGATTCTTCAACTACAAAAACAGAATATCTCTTTACTTTTAGAGAAGGGTTTCAATGCATGGGA 
ATCGTGTGTTGCCGCCATGGCAGCCTTCACTGCCGACCCTTCTCTCTCCATCTCAGATGCCGACCAATCCATACTCTTCCCACTAGAAGGGGGAGAGATT 
GTTATTGAAAAACATGAAAACGATGCAGAAAA/^AATGTCGATATGGTTCAAGAATTGTGGAAGGATACTGCACTCACACTCATGGCAAAGGAACTAAATT 
CATACTACAACTGGTTCATATACAGTAAGGATACCGATATGGAAAAGTTGGCGAGAGTGTGCAGGATGATTATCGGTATAGTTAAGGCTGTTCTCAGATT 
AACAAATAAAGCCGAGAGTTTAGTAGACACTAATGCTCTTTCTGATATTTTCAAGCTACCTGTGATTCCAATTGATGATACCAAAACTCTGGCAATAAAT 
ATTGTGGTGTTCACTTTGAATAATGTAATTAAACCATGGATGGTTTCGTTCAAGCAAATGTTTAGACAGAAAGACGGGGGAGTTTCTTCAGCCTATTTCT 
CGTTTCAAAATATCCAACAACAAAAACATCAACAAACTGCTTCCATTTTAGATGCATGGGCTTGCGCGCCCGGTAAGTTAACAAAGGCAGCACACGTGTT 
TATTTCAGGATATGAAAATCATATCAAATTAAAAAAAGATGATCTTTTATGGGGTGCTTCAATGAAATTCCCTGCCGACGGACGAGGCACTGTGGTCGAA 
GGATGGGCCCAACAATACAATAACGAAAGCGTATTGGAAGATTTTACCGACTTTTCAATTGAAGTAAACGCACCTGCCTCTGGACTCTTAATACCGCCAG 
ATCCTTTACTTTCTTCCATGTTCGGTAAAGGAAATGGTGGAAGCAGCAGCAGCAGCAGCAAGGATAATACAATTATTGGAAAGGGAGGATTAATTTTAAA 
GCGCCAAGTTGTTGGACAAGAACAAGCACCACCAATAAACACGTCGTCGGATACTAAAAAAATAAGACGTGATGCAAATATTGAACCAATAATAGGGACG 
CCTTATAGTGTAATTAAGGCAAGTAAAGGAGTATCAATTTCAGTACTGGATGATTTCAATGAGGATAGTCCAGAAGATTTCGCCCTTAAAACTTCCATCA 
TCAATGATGCCATACGAGAAATAGGGCAACGCATGACTTATACAAGACCTATATTTGATCATCAAACACAGAAAAATATACACTACAGTTCACCTAAAAT 
TATTCTCGAAGGATCAGATTTAAAGAATGGACAACGTTCAGGACAATCTTGGGCTCCTTCTTCATCATCCTTGACTCTGGCCTCCGATTGGAATCTACCT 
TCTCTGGAGCTTTTATATAGAGAACTTGCCACAAAACAAGTAGAGAAGGAAGAAGAAGAAAAGAGCGAAAGGGAAGAAGATAAAGGACAAAAACTTAATG 
AAAAATTATCATTTGTCGTGAATAAAGCTATCGGAACTATCCAACAACAACATCAATATTCTGAAAGGGGAGGAGGAATGAAGAGGTATCAGCAACACTC 
TGCTGATCAAGCTAGTAATGGTGGCATAGATGATATAGAACTTATGAATAGTAAAGATGCTACTTCCATGAGAAAGGCAAAACTGGCATTAGCCGTTACT 
AATAAAATTGCAGCAGCAGCAGCAAGGGATGGGGAAAATTCATCAGCTAAACCGTCAAACTTTGGCAATAGATTGGATGAAGCAATAAACCCTGGAGCAC 
TTTTATTACGTAGAGGAGGAGGAGTAAGAGGAGGACAAACACCCCAGAGTTCAATGCTAACAATGTTCCGTCCTGGACAAACTGGTGGCAATAGTAGTTG 
GTGGACTACTAATACACCCCTTATTCAACGCACAACTAGTGTTGGGAATAATTTAGTTGTGCTTGTACCCAACCTATTGGATTCACACCCTCCTACATTT 
AATTAGGAATAAGGAATTATACACATTTATTTGTATGTTTTACATGTATTTTATTCAACAATAAAGGTTATGATCATACATAATCAACTGGTTGTTTTAT 
TACATAGTTTCCATTACTGACCCTTCCAAAAATGAATTTTGTACCTCCGAATACAATTTCTTCCCTCGGTTTACAAAGGAAATTGCACGGAGCGAATCTG 
CTCACCCATTCACCAACAACACCCTCATTAATCTTCCCATGAGTGATGTGAGGATTCAGTCTGTTGTCAGCGAGTTCAACATTTCTCCCTATTGCATCTG 
AAATGTGTTCCATTATTCGTCTACGCAAATCGAGTGCATCATCTCCAACACATACTTTGGATACGAGAATGTTGTTGTTCATCCCAAACAACTCAAAATT 
ATCCCCAGTGTCGATTTTGATAGCTGGTGCAAAACAACTCCCAAAATCTGTTTTCATCAGAGTATTGTATTCTTCCATATTTCGCAAATAGAAAACAGAA 
ATGGTCACATGTGGAGGCAACTGAGAAAATGATTTGGAGACGTGTTTCTCAGATAATAATTTTCCCGTTTCTGAATAGAAAGGAAGTAACTCTTCGCTGT 
AGAGAGCAAAGAACATTTTCATTTCTTCACCTTTAAAATCAACTTGATTTATAGTGAAGATTCTATTGTTGTGTTTGATTGAAAATTGTTTCACCACTTC 
AAACATGCCATTGATTTGTTTGTACATTTGCTTGACTGCGCATCTTCTTTGCACCTGTGGTTGAGTAGATGAAGGGGAGATTTGGTGGCCATTATCTGGG 
TGGTATCCATTCAATATGTTGTAGCCGGCACAAGGGATAACTGGGTCCACAAACTTGAAAGCCTCTCTATTAAACCCTCTCCCTAGAGGAATAAGTCCAC 
AACGAATTTCGTCTGCGTCTTCTGCCTTGTCCATTAACATCATATTTAGATGCACAACATATCTCAGAATCCCATTCAAGGTGTGTTTTTCTGGATAGAG 
GCAAATAACATCATCCACATATTCCCCATCTACATAAAGTCCAGTAACAACAATCTTTAACGCTTTAGATAAAACAATAGTTTCTCTATGAGAAAATATT 
GCCTTGTTTACTTTGGTGGAGAGTTGTTCGTCACTGGGAATGGGCGTAGAGGCCAAGAGTAGAGCCTTGGAGGAAAATTTCTTGTCCATAGGAAGTCTGT 
GCCGGATATTGTCTACCATGTTAGACATAGAAGTATTATCACTAAAGTCAAAGAAGACTACTCCATTTTCTGGAGAGGGTTTGTTGAAGGATAGATGTTC 
TTCCATGGTGGTGGTGGTGGTTGTGATGGTTTGAATGAGGTGTGGTACAGTCATTATATACACGTGGGCCCTTTTTAAAAGGGGGAGGGAGGGGGAGAGG 
GTTGCCATCAAGCAATATCTTCGTCCAGAAACACCTGGTCCAGAAATGGCCATTiAGATACTTCCTCTATTTCTGGAGCrATTACATTTCTGGTGTAATCT 
GACATTGGCCCGACCCAGAGGTCCACCCTCGGAACTTGACATTCGGTCGAGCTAGCGGTCCACCCCCTAAACTGGAGCGCCCTAAAAAATTTTTGAAAAG 
TTTTTGAGATGGAGGAAGAGTAAAATTCTCTAGTGAAAACAGAAGGGTATACCCTCTCATTTCTGGGTCGACCAGTCTCCAGAAACGCCTGTTCCAGAAA 
CACACAAAAGTTAATGTACGTTTCTGGAGCTATTACATTTCTGGTGTAATCTGACATTGGCCCGACCCAGCGGTCCACCCTCGGAACTTGACATTCGGTC 
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GAGCTAGCGGTCCACCCCCTAAACTGGAGTGCCCTGAAAAAATTTTTGAAAAGTTTTTGAGATAGAGGAAGAGTAAAATTCTCTRGTGAAAACAGAAGGT 
TATACCCTCTCATTTCTGGGTCGACCAGTCTCCAGAAACGCCTGTTCCAGAAACACACAAAAGTTAATGTACGTTTCTGGAACCAACAATTTCTGGTGTA 
ATCTGACATTGGCCCGACCCAGAGGTCCACCCTCGGAACTTGACATTCGGTCGAGCTAGCGGTCCACCCCCTAAACTCGAGTGCCCTGAAAAAATTTCTG 
AAAAGTTTTTGAGATTGAAGAGGAGTAAAA7VACTCACTATATGAAAGGTGTGTAGAACACCACATCCTTTCTAGGCACGGCCATGTCCAGAAACGCCTGT 
TCCAGAAACACGCAAAAGTTAATGTACGTTTCTGGAGCTATTACATTTCTGGTGTAATCTGACATTGGCCCGACCCAGAGGTCCACCCTCCGAACTTGAC 
ATTCGGTCGAGCTAGCGGTCCACCCCCTAAACTCGAGCGACCCTAAAAAATTTTTGAAAAGTTTTTGAGATGGAGTAAGAGTAAAATTCTCTAGTGAAAA 
CAGAAGGTTATACCCCACCTTTTCTGGATGCGACTAGATCCAGAAACGTATGTTCCAGAAATACCCATAAGTCCTTTCCTCTATTTCTGGAACCAACCAT 
TTCTGGTGTAATCTGACATTGGCCCGACCCAGAGGTCCACCCTCGGAACTTGACATTCGGTCGAGCTAGCGGTCCACCCCCTAAACTCGAGCGACCCTAA 
AAAATTTTTGAAAAGTTTTTGAGATGGAGGAAGAGTAAAATTTCTTGCCTGAAAGGTGTCCAGAAGTGTAAACACACACACGCATTCTCGCACCTTGTCA 
CCATCATTAATAAAAAATGTCCCCTCCCGCTCCCCCCTCAAAATCGGGTATAAATAGAGCTCACCAGCACACCACAGACATCATTCTCAAGACATTTCAA 
GTACTGAGAACATCCTCTCCTCTTGTGATATTATTCAAGAAAACCTACTGAAATCGTCCCTAAAATGGATTTTGAAGGAACTACCAGTTCTACCCCCTCA 
AAAATGTCCCAGTTGTATTCATCAGTGAAGAAAGTTGCAGAGCATTCCTTTGCCAATCTTCATGACAAGGCTACTCTTGCATCAAAGGTTATTAAGGACC 
TGGAAGGGGAGAGGAAGAAAATGTCTACCCCAAAGTCCTCTTCTGATGGACAAAAACTGGACAAGGCTATGTTGGACGATATTATCAACGAGTATCAGGC 
CGTTAAGAGCACTGCAGATAATTCCATTGAATCGACCATCAAGGAAATTGAAAATGTACTTGAAAGTGTGCGCAGAACCAAGATTGAAAGTGAAGCCAAG 
AACAGTGTAACTTCCAGCCCAGAAAAAGTGTTTTCTGTCGAGGATTTAGAAATCTACTCCAAGGGGCGAGTGTGCAAAGGTCTCAAGTTAAACGCCAACT 
GTTCAAGAATTGGAGGCAAGTATGCAGTGTCAATGAGTATCAAAAAACACAACGTCTCCTCATTTGAGAACAACAACAACCAAGTTTTCTCTGAAGAACC 
CAGGGATTGTTTTATGCTTGAAACAACCTATCCTCTTGTTGGGTTCGAAACTTCTACAGAAGATGGAAATACATATGCAGTTTTCTTGACTGGTGTTGGG 
CTAGAAAGATCTCTACCTAAATATGTACCAGTTTTCGACATGAATGCAGGTATTCAAACCCTAAACATGACTGGTTTGAGGATGGCCAAGCTTCCTGTTC 
TGTGCATGTTTGGACGTACAGAATATGACAACTTGGAAGATTTTTACATCACTTCAATTGAGACGCAGTCTTTTGACGAAGAGGAAAATGATGCCAGAAT 
GAGGTGTCACACCGAAGATTTGGAGAGGAAGAAGCGCATGAATGACGCACCAGCGATTACACCTCATGTGGCCGTGTACGACTACAGTGGAGACGGGAAA 
GAACAATTGCTCTATATGATAACCGAGTATGAAAACACGGCTAGTTGGTGCAACGCAAACGGTGTGGTCACATCTGACAGTGGATTTTCTAACGAATGTG 
CAATTAGTGATATGAATGACTTGTGCTGTTTTGCTGACTGCATCGATGTTACTGTTAATAATGAAGAACATGAAGAACGTTCTATGAATATTGTGGTCGA 
ATCTGACAGGCGTCTTTTTGATGCTAGTCCTTCCCCCATCAAGACGGAAGAAGATGGAGAAAATTCATCGTCATCGTCTTCCTCTCCAACAGTTCCTCCT 
CCTACACCATACGAAGGTAACGCAGTTGTGGAGGGGGAGGAAGAAGAGGAAGAAATTGATGAAGACGAAAGTAGCAAGTATGAAGGTTCAGAAGATGCTC 
TTGTTATGAAGAAATTAGCCAAGCTTTCTACTATGAAACAAATGAGAAGGGTTAAGAATGAACCTGCACTCAAAATTACTTCTGGGGGTAACAATAGTAG 
CAGTAGTATCAATAACGAAGATGATGGTGATGATGACGATGCCGTTGACGCTACTGCATTATGCCCCCAAACTGAAGCTACAGTGAAAAATTCCTTCATG 
GCCCCAAACGACGAGAGAACTGAAAATATTTTGTATGAAACTATGCAAATTTCTCTTGCTAAAATTTGTAATAATCCATCATCTATGAGCAGTTACCGTG 
TATTCACCAACAAACTCCAAGAGTGTTTGAATACCATGGACGATAGTATCCGTCGCCGTCCAACCATTTGGACTGAAGAAAGTCAACAATTTGCTAAGGG 
TTTGTTGTTTGATGAGGTTGTCACATCAATTGTGGCACATCAGATGGCTCAAGATATTTGCAAGTCTGAAATATTTGGAGGAATGTTTAACGCCAACTCT 
ACCAACATTAAGGGTAAATATGAAGGACAAAAGAAGAGTCTGTATGGAAACAAGCACATTTCTTCCTCGTGCTTCAAAACCAACACGGAATCTAATGTGA 
ATAATGCACTATTTGCGTGGGTGAAATCGAAACTCCATTCTGGCACAGTCATACCTAACGTATTCTCCTTCAAAATGGCATCAGAAAAGCCCTCAAAAAT 
GAAGCGCAAGCGTACCTCTAGTGCTTCATCATCTAACGATGAACACCAAGAACCATCAACAAAAATGATGAAAAATGATGAAGGGGAAAAGGTTGCACAA 
GAATCATCATCTCCTTCTTCATCGTCTACACCAGAACAACAACAACAAGCTGGTCATGACAAGGAAACTATCAATTTAATTCCCCTCAGTTTCATAAAAA 
TGCCACGCAGTAATGTCAATGGCTCGGCTTCATATTTGTCTGAAATATTCGGTCAACGTCTTTGTGGACTGTCTGATGCTTCCAGCACATTTAAGAGAAT 
GTGCAAGACTTTTGAAGATCTTGAAAATGAAATCATGAGGAGCTCATTCACTAGACTGACTAGATATGAGAGGGAGGTAACTCGCTTGTATGAGAAATGC 
AGGTCTCAAGCTGTAGATATTGAGGAAAATGAAATGGATGTTTTGTCTCACCAAGGGGAATTGTTTGCCGAGTTCTTGGAGGACCCGATCGCTTACTTTG 
AAGAAGTACTGGAGAATATTAAGAGTTGGAGCCTAGAAAACGTTAACACCCCTAAGCGCAAAAACAAGTATGCAAAGGTACTGGTGAGCGTTAATGCTAT 
TCGTAGGACATATGAAGAATACCATGCGTTTAGCAAGTTTGTACCAATGTTCTTGTTCAATCTGATTAAGAGAGAATTGGAAGGAGACAACTATACCCAT 
GACGTTCACTTTTCrTCCACTTGCCTGTGGTACCTGACTGTAATGACCAGGAACAGGATTTGCGATGTGCTCCAGTACATCAACAACAACAATAATGATA 
ACGAAGAAACCGATATTGTGGAGGAAGAGGAGGAAGGAGAAGGAGAGGAGGATAAAATGGAAGAAAGTATGGACGTAGAACAACAGAAGCAAGTTCGCAA 
GGGAGGGAGAAAGAAGGGTCAAAAATTCAACAGTATTGGGGATCAAGTCATTAGAAAATTTGTGAAAAGTTTGTGTGAAAATTCGATGGTAGTTTCTATT 
GCAATTAATAGTTTGATCTCTGGAATAAGCTGGATGAACAAGAAAATCCCTCCCGGTTTCTTGAAGGATTCTAGCACAATCAATACCCTTGATGAGGTCT 
CAAGGTTTGTGTTTAGCGATGTCAAAATCAATAGGAAAATCAATGGAACAGATGATAAATATGAAACTGTTTTTGGAGTCAGTACGCGTGTGGATTCACA 
TATTGTAGGCCCCTTTAGTATACCTGTTGATTTTTCAAGCGCAGGACTAGATAAGGCCTCATGTGGCAAATTGTACGTTAACACCATAGACGGAAAGGGC 
ATTTTGACAATTTCACCCAAATATGATTCATTAAACGATGAGGATGTTGATTCTACTACAACAGACAAGCTAGAGAAGGATATTTTGCATTTGTCTAAGC 
ATGACACCTTTTTCAATATTAATAAGAATAAGGTTCTTCCATTCTATAATATTTCTCCTAGCTCTTCTCTCACTGAAAAGAAAAAGACAAAATTCAATAG 
GAAGAAGATCTCATCTGGTATGAGCAATAATAATGGCATGTGTGTACAAACTCCTTCTAGTTCAAATTCAGTCTCTTCCGTCTCGTCTATTGTAGCTCCT 
TCATCTTCTGTTCTGGCTCTATCTTGCTCCCTTTCTTCTACAAAGAAAAAGAGCATCTGGAACGAGAACATGTTTTTGACATCTAGGAACATGTGGAGGT 
GTGGATTTGTTGTACCACCCAAACTTTGCAGTTTTATTGTTAACCATAGACACGCTGTAAAACTTGTAGCTGAAACTGCACCTAAAACAAAGTTGTGTAG 
GAATATTATTGATAGGAATAGGAAGATTAGATTTAACGGTCTAAAGAAGGTATGCAAGAGTGTTAGCGCCTTTACCGGCGAGTCTACATATTTGCTCAAT 
AAGAATATGACTGCAACTTCACCTAGTGATTTGRACCTATGTATTTATACTTCATCTTTAAATGACCCATTGTATACTTGTAAATTGACCCATGAAGAGT 
ACCAGGACGGAAATGCTTTGGATGACTATGGTGCAGTTTTTGTAAACTATACATTTAAGAGTATCAAATCCTGCTCGTCCAAAGATGAAACCGCTGACGA 
CAATGCTGCTGCTGCTGATGATGATGGTTCTACTACTTCTACCTCTTCTTCAACTGATACTGATGCTGCTGCAATCCAAGACTTTATGCATGTAATGATT 
AAGAAAATTGATGCCATGAAAGACATTAGAGGAAAGTACAAGAAATCTTTGGCAAAGAAAACAAAGAAGCATTAAATGTATACCTTTATTTTTACCAATA 
AAAATTATATGTTTTACATGTTGTGTTTTATTCAAATTCATATGGTTCATCCCCATTTAAATCGTCTAACCCCATATCCCAGTCAAACGCTGCTGCTGCT 
GCTGCTGGTTCAATGTCTGAGCCATAGATGGATTCATTGACATCTAAAAGTTCATCTCTAAAATTTCCTGGTGTACGCGGTTGACGAATAATTGGTGCAA 
TAAATGCCTCCTCTTCTTCTTCTTCTTCTCCTCTGACTCCCGTTTCCTTGACAGCTTGACGTTGCGCTTTTATTATCTCTTCAACGTTCCTCCCTGGAGT 
AGCGGGAGGAAGATCAAAATCAATACTGCCAAGCTCAGAGTCTATATCAGTAAAGGACATATCAGTAGCGTTTTCTTCTTCTGCTCTTGCTGTAGACGGA 
GGAGGAAGAGGAGTAGGCTTGGGTAGAACTGGTGTAGGTGGAGGTTGAGGGGTAGTAGAACGAGAGTCAAATCTTACACGGGAAATAGGTGGAGAAGGAG 
CAGTATTAGGTATAGGAGGAGGAGGAGGGGGACTAGAAATGAGTTGAACCGGAAAGGGCTGCTGCTGCTGCTGCTGGGGAGGAGGAATATTACTGGGTGA 
TTGTTGTTGAGGAGTAGGGGGAGGAGGAATATTATTGGGCGGCTGTGGTGGTTGCTGCTGCTGCTGCTGCTGCTGCAGCTGAGGCGGCGGCTGTGATTGG 
TTGCCTAAAAAGTACAGCGGGTACTGTTGTTGTAATTGTTGAGGAGCATATGGGTTATAATACTGGTTATATTGATAATAAGGATTTAAGTATCCTTGGT 
AAGTAGGTTGTTGAACCTGGAAAGGTGCCTGTGGTGGAGGAAGAGGAAAGGGAGGAGGTTGGAAAGGGGGAAATTGTTGTTGTTGTTGCTGACGTTGCTG 
TTGTTGTTGTTTGGGCACAAACACTCCAGTCGATAAATTAAATGCTTGACTTGTTTCCTCCACTGCACTCTTTATCATTTTTTCAACCGGTCTTGCTGAA 
GCTCTTGCTAAAACACCAGCTGCACATTTCTCCAGCATGGAGATAGAATGCTCTGATTTACTCCTCATCCCTCCTCCAACCAAAAGATCCATAGTCGATG 
ACATTTTTACCCACACGTTATTATTTATAGCTCTAAAATAACGAGGTATTGATAGGCCACGTACAGTGGCGTTAGAGTTATCAGAAGCTACAGATACAAA 
ATTCTTATCAAAGTTCTTGTTCGGTAATATACGTGTACCAAACAGGTTATTACACACAGTAGCAAGTTGTTCAAAGTATACTCGTTCTTTCACAGTTCGC 
TGTATACGATCCTCCACGATACTGCTCATTAAATTGTTTATCTCTGTTGCATTTCCATCGTTAAGAGATTTCAATATTTTTGAACTTAATTCTTGCCACT 
GAATCTCTCCTGACCGTATATCTTTCCTATGGGAAATTGCCCAACAACTATTAGCCAGAAGAGCTTTCACCAATTCAGCACCCACATTTGTGACAACAAA 
AGTGTTTGTTCTTTTTCCTCCGAACTTTTTCTTCCCTTGTTCAGATGACAATAAAGATGATGAAGTAGGAAAAGCCTTGGTTTTATTGGCCTTTAAAAGG 
TTAGCTAAAAGAGACCTTATTCTGTACTGTATAGCCTTCTCCTTGACCACTGCCTCGTCTGCCAAGTATGACACGTAATCGTATTGTGGCGCATTTTCAC 
TCCTTACACCTAAAATAGAGCCCGAAGTTATAGCACCTCTCAATGGTCGAGACACGGTCATCATTTTTCCAGCAGGCACTGCTAATGCCTTTGATTCTGT 
GTCACTTCTCTTTCTTTTTTTAGAGGGAGGAAATATGTGCTTTTTCATTTGTTTTGCCCGCTTCGATACAGATGTAGTAGGTGTAACATGTTCGTCTTCA 
ACATTCTCCTCTTCTTCCTCCTCTCCATCTTTTTCTTCTTTCTCTTCACTCTGTTCTTCCTTTTCTTCCCGCTGCTGTTCCTCCCCTTCCCCTTCCTCTT 
TCCCTTCCTCTTCTTCTTCATTACCTCCTCCTCTTCCACCACCTTCTTGATCAGTGTCTCTTTTTCCCTCATCATTTTCTAAATCTGAACAATCGCCAGC 
AACACAATCGCCACTCCTTGACAGAATTTTACTAGTACGAATCTTACGCCTTTGCCTATTCTTACTAGAAGTTGGCGCGTCCAATTTAGAGACCTCCTGA 
CCCATGTTTCGTTGGAATCTGGATTCGGCGTCATCCTCCTTTTTTCGTTGTTGGTTCCTAATTATATGCTTTTTTAAAGCAGTATTAGGTTTATGTTTAG 
GTGGCATATTTGTAAGGTGGGATAAACATTACAAGAAAACAAAAATATTCAAGACATTATTTTATTTTGTAGGCACCGCTGTTTTAGTATGTACATCAAA 
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TTCAAAACCATCTTCCATTACATATATCTCAATCTCTTCTTCTTCTCCTCCTCCTCTTTGTTTGGACAAATAACTATCACTACATTTTCTTAAAGCTAGA 
CAACATAGAGTACAATTATACACCTTTTCTAAAGCAGCCTTTGTAGACTTACAATCAGAATACAATAGTTTATGGTCAATTAGTATATGGTCAGGGTTAG 
TTATTTCATCTTCACATATCCAACAAAAAACTGTAGGGTACCCATCTCTTCTACCGCTACAATCATAACAAGTCATGGTAATACATCCACCTAATTTTAT 
GTTAGGTGTCTTACATTTAGGGCATTTTACAGTCTTAATTTCCTGTCCTTCTTTCTCAACCTTCCATTCTAATGTCCTCTTTTCATCCACAGTGAACACT 
GGATCCTTTATTGATTCAGATGTCATGTTTTGACGTACAATATTTACCTGTTTAACAATTTCCATTTTTGCACCTTGGTTAGGTCTGATACCCGATAAGG 
CACGCCTATCTACACAATGAAGGACAAAAGAATGGGCTCTATGGAACACACATTCTGGCCGCTCCAACACCATTTCCCCTATAAGATTCTTGACAAACAT 
TTTCGCCCATAATCCTAATGCACTCTCATGGTTCAATTTTTCATAGACATAAAAATAGGTTGCAATTAGAGGTATACTCACACCTACAACTTGTTCAAAA 
TCTCTATCATCAAAACCCCCAAAAACACTGTTTAAGTAATGGACTTTAACCCTATTCTGAAACCTCTCCCTGGTGTTGTGGATTACAGATATGTCGTTGA 
GAAGCCCCTTCATATTCTCTGGCATACGATCAGTCGAGTGCCAGTTGGCTAGCGCTAGAAGAACCTTGCCCTCCGGAGACTCTGGGTTCCATCCTCGAAT 
CTTCTTTTCGGGAGTTCTTTTTAGTATAGAAACAGCCTGCCTACAATTTGTCACCTCTTCATTTGGAGGTACAAAATTTGACGCCAAGAAACCGCAGGAC 
AATCTCATGGGCCATTTATGGAGTCCGGGACCAGTArCATATCTTACTAGCTCATCGGTGGGAGGGTCATCAAATTCAGCATCTTCTACATCCTGGAAGT 
AATAGTCTTCCAGTTGTTCGGGAGGAGGGTACGGCAATACTGGAGGAGGTACATCCACTGTTACAATGTCTTCCATCTCATTAGGCTGGTCACATACATT 
GGGTAGTAAACACTGGGTACAGATCAGGGAACATTTGCTATCACCAGTTTCTCCCTCCACCATCTTAAAGAGTTTAACGGGCGGTCTGAATCTATTATCC 
ACCACAGTAAATTTTGAATCCTTGTCTAGTGCAAAGTCCTTGTCCATCTTACACATCTCCATGAATTCGGCCTGAAGAAGTTCCACAGTATAGAAGCATT 
TCTCTGCTCGCTTGATGAGTTCGGTGATGAGAAGTTTGTTGGCTGCAATGTATTCTGGGTCGACATTTTCTCCTGCTAAAATCGCCCTCTCGAAT7VAAGA 
TGAAAATTCGTGACATCTACCTATAAACTGTACCATATATTCTCCACAACATGGGCACCTAAAAACGTCCTTTAGTTTTTCACTACCAGCCGAGACCTTT 
TCGTACACATACATTAGGTGTTTTATAGTACAATCAGGGCATAGTTTTGCAGAACACGTTTCTTGTCGTGCTTCAAAAGTGTCCATGAAAGCATGTACGG 
GGAAATAATCGTAAGAGGCTTTCATGTTTTCATGAGAACAGTTGAAGAATTTTGGTTTCATGTCCAAATCCGCGTAGCACAACCCACATGTTTTATTGGT 
GGTATTCCCACTCAATATCTTTGTAACGTGTGCTTGTATTACCTCCACTTCGTCAGATGGTTGGTACACCATCGCCGCTTGACTTTCTCCCCAAGACTCT 
TCTTGCAGTGTCTGGCTAACCATGCATTTGGCGATrGCTTCTTCTAGATGCATATTTTCAGCGTTTTCATGTTCGAGGATGAAAGATCCTAGTTTTCTGG 
CTGAAGGTATGCCGTGTCTAGATACCAGGGACAGTGGTGTATTTTCTAAGAGCGAAGCGTTCGGGGAGATTATTAGACCAGAAGGCGACATTACGGATTT 
CTTTGGCTCCTCTACTCTTTCAGTATCTGCTGCCAAAAGATAGGGAACATTGTAGATAGAGTATCTGTACACTATATCAATCATCTTGGCTATTTTTGTT 
CCAAAAAGAATATGGTGTAATTCTTGGGGCACATATTCCATAGCGTCATACACGGGCCTGGCAGCGTACATGCGAAGATGACCCTTATTGTAGTGGTTGT 
TGGTGGATGGACCACTTAAAAATGCTTCTGTTACGGAGAAGGCCATATTTTCCCAAATCCTGTCATTGTTTCTTCCTTCATGCATATCAGAACCAAATAA 
AACACTAAACGTTCTAGCGTCTCTTATTTTTGTCCCCACTTGCAAAGCAGAGAAGGGCCACATGTACCACTCCTTATGAAACACATCCTTCAAAGTGTTG 
AGAGAAACTACCACATCTTCATAAATGACACTTTTCATGTGGCGGTGCTTCTTGTTCTGATCTCGTTTGATGGCAACAAGGAGTAAGAAAATGCGTATCA 
AATCCATATCTGATATCATCTCGGCGTACGCCTCACACAACGCATTCTTCTCCTCCACTGCATCGCCTGCATTTCCACGTATAATTTCATCCAACATCAA 
GCGCGTATTTTTCATGACAAACATGCCACTATAAATACGATGAATTAGATCCTTGTATGCTGGCCCATCTTGTTTTTTGCAACTCTTTAAATGTCCAATG 
AATTCTTGTTCGTTTTGGTCAAGAGTTGTAAATGGAACTGGGTCGTCGTTACTTTTCATCTTGATGTTCATGTCGAGGGCTTTCTTCATTAATAGAGATA 
TCTCTTTCAAAAGACCGGTTGGATTAATCACAACTGTTTCTCCCTCCAAAAGTTCATCCCTATAAGAATTGCGGTTGCAGATGTCCAGTATGGCCTCCAG 
AAGATCACCACTTTCTCCCACTATACCCACAATTCGTTCATAGGTGGACCTCTTATTATTTCTGCCTCCCATTAGGTCGGTGATCAAGATTGCAACTTCT 

gggacagcagctgcaactatggatgttacaccttccatgttgaacgagggaatgaatgcgataaagtctggggcctccatcttatatacccttcagatgt 
gaaaaaaaacaacatgaggaaaaaagtaaaaaattcaatatacatgtatgtctttattggctaggaagaacacatttcacaagcaccagggttattgata 
gaacataaaagagcagcagcaggatctggaacaacaatagaggacttcttctcctcttcttcttctcggacaggagcagaaaaagctgcaacaggagagg 
gagcttccttcttgacttcttggagtacattcttgtcgactgtgaactggacagctctagctgcgccctttgtgcgtagataatagagagtcttgatccc 
cttctcccatgc:atacatagtcattgaccgcaccttgctgagttcgggttcttccacaaacaagttgagggattgagcttggtcaacaaacatacctctc 

TGAATAGCCATGTCCAAAGTAGTACGAGGATTAATTTCCCATACAGTTTTGAATAGTTCCTTGGTCGATTTAGGGATATTAGGAAGCGTCTGAATAGATC 
CACCACTTGCCATAATCCTCTGTTTAGTTACTGAAXrCCATTCTCCCAGTTTTATGAGCTCTCTAATCACATATTCGTTCACCACTTGGAATGATCCTGA 
AAGT AC AT T ACGATT AT AC ATGTTGG ACGTTAAAGGCTC AAAGG ATT C AG AGTTGCCC AGGAT C TGTGC AGTGGATG C AGT AGGC AT GGG AGC AAC AAAC 
ATTGAATTGTGAACACCATACTTCATAATGTCCCTTCTCAATTGCTCCCAATCGTGAATTGGCAAAGAGTTG7\AATATATGTCCCTATTCTTAATATTTT 
CCTTCCCCATGTCAAATTGGAAAATCCCTTTGCTCAAAGGACTACCCTCAAACAGCTCATATGTTTCTCCCTTTTCTTTGGCAATTTCACATGAAGCTTC 
CAAGGCACCATAGTATATAGTTTCAAAAATCCTCTTGTTAATTAGTGCCGCTTCTTCAGATTCGAAGGGGATTCTGAGTTTGAAGAACAAATCTGCTAGr 
CCCTGCACACCCAATCCCATTGGCCTAGTTTTCATArTGGAAATGCGGGTCTTGTCAACTGCATAGAAATTGACATCAATCACCTTGTCGAGATTTCTGG 
TCATGATTTTTACAACCCTCTTCATCTCCCGGTAATCAACATAGGGCCTTAGGGAAGGGATGGGAGAATACTTCACAAACTTGTTGACTGCGATAGAAGC 
CAAATTGCACACTGCAGTTTCCTCCGAATCACTGTACTGGACAATTTCAGTGCACAAATTTGAAGACTTGATGATGCCGACATTTTCTTGGTTAGACTTT 
CTATTGATGGTATCCTTAAAGCACACATAAGGTGTTCCAGTTTCGATACGTGCAGAATTAATTTGGTCGAATAATGCACGTGCCTTCACCACCTCTTTAC 
CTTTGCCTTCAGCCTCATATTTCTCGTACAACGCCTTAAACTCTTCGCCATGGACGTCGGAAAGGCCAGGGCACTCGTGGGGGCACATCAGGGACCAATT 
CTTCCCAGCCTTCACTCTCTCCATGAAGAGATCAGATACCCAGATAGCTGGGAAAAGATCCCTCGTCCTCAAATCTTCATTACCGGCATTCTTTCTGCAG 
TCAATAAAGTCCTTCACGTCCAGATGCCAATCAGAAATATAGATGGCTGCAGCTCCTCTCCTCTTGTCTCCTCCCTGGCTAACCTTTTTCACAGAGACGT 
TAAAGATTTGGAGGAACGCCATGAGACCAGGGTGGGTACCACTCCATGACGAAATGGGGCTTCCTTTTGCCCTCAAATCATGAAAGTGGATGCCGAGTCC 
TCCAGCAGTCTTGGAGATGATTGCCGCCTCCTTAAGAGTATCATAAATACCCTCAATGCTATCATCTTGAAGGCCCAGAAGGAAGCACGAGGAAAGTTGG 
GGTGTGACTGTTCCACAATTAAACAGTGTGGGGGAAGCATGGGTAAAATA6TGCCTCGACATGAGATCATATGTTTCAATGACAGATTTGATGTCTGATG 
CGTGAATGCCGACAGCCACACGCATAATCATGTCCTGAGGGCGCTCAACCAAGATTCTCTTCTTTCTATCAGTGGGGGAACCAATCTTGATCAAATAAGA 
GT ATT C T AG C GTCCTC AGTC CG AAAC AGGTGAAG AG ATAATCCATTT TGT AATCGAT AAC AGC ATCGAG AAT TTC AGC AT TGGCC AT AAC ATTTTC AT AG 
TAGGTATCATTAACTACZTGATGCTGGTTTACCAGTTCCAGGGTGGATTGCGTGCCTCAATTTCTGAGTTGTTGCACTAAAACTATTCCACTCTTTGGTGT 
TTTTGTGGATGTTCGAACAGATGAATCTTCCTGCCAGTTTTCCAAAATCAGGGTGGTCAACAATTTTTGTCTTTGCATAATCGGCCAGAAAATCGTCCAT 
TTCTTGGAAGGAGATGGTAGCGGGAAGACGGTCCATGATGTGAGATGCAAGTTCTTGAGGGTTAATTGCGTTCTTGTCAAGCTTGGGCACATACTGGTTG 
ACTGGAAGACAGGCATTTTCAATCCTCTTGATTATCTTTTCAAGACTAATTTCTTGCTTAGTGCCATTCCTCTTTGAGATGAATGATTGTTGCTGGTTAG 
AACCCATTTTATGTTATTAAATTGGGATGTTGTTATTTGGAGTATTTTCCTATACGTCTTCTCGGCTCGAGGAGGGTGAAAGATGTGAGTGAAGAGGCTT 
GGACGGTAGCTGTATTTATACCGGCCCGGTGAAGGTCGCTGGATATTTTTGAAGGAGGGGAAGAAATATATAGTATTATACGTAGTTCTTTGATGTAGAA 
AGAAATCAGAAGGTGAGGTGACAGCTGCTGCTGCTGCAGTATTAGTAGTGCAGGTGCAAAACTGTTCCTCAAAAACTTTTTAAAAATTTTTCTGGATCAC 
TCGAGTTTAGGGGGTGGACCGCTGTCTAGGCCTGATGTCAAGTTCCGAGGGTGGACCGCTGGGTCGACCCAATGTCAGATTGCACCAGAACCGACTGTTG 
CTCCAGAAATGTGCATTAACCTTTGTGTGTTTCTGGAACAGACGTTTCTGGGCATAGCTGCGCCCAGAAAGGGAAGGTGCAACATACTGTTTTCGCTATA 
GAATTTTACTCTTCCTCACTCTCACAAAATTTTGAAAAATTTTTCTGGGTCACTCCAGTTTAGGGGGTGGACCGCTGACTAGGCCTGATGTCAAGTTCGG 
AGGGTGGACCGCTGGGTCGGGCCAATGTCAGATTGCACCAGAAACGACTGTTGCTCCAGAAACGTACATTAACCTTTGCGTGTTTCTGGAACAGGCGTTT 
CTGGGCATAGCTGCGCCCAGAAAGGGAAGGCTGTATCTTACTGTTTTCGCTAGAGAATTTTACTCTTTCTCCCTCTCACAAAATTTTGAAAAATTTTTCT 
GGGTCACTCCAGTTTAGGGGGTGGACCGCTGGGTCGACCCAATGTCAAGTTCGGAGGGTGGACCGCTGGGTCGGGCCAATGTCAGATTGCACCAGAAACG 
GAATAGCTCCAGAAATGTGCATTAACTTTTGTGTGTTTCTGGACAAGTCGTTTCTGGTCTAGGAGGCAACTAGAAATATAAGCTTGTAGCCTAGAATTTA 
GTACACCAAGGATAAAGAAATATAACAAATACCATACTATGTTTTATTTGAGTAATATAGTATCCAAGTTAAAATAACAATCACGCACTTGTACATTAAT 
AGTCTGTAAAAGATGGGCAAACGTACACCTGAGAGTATTGTCCCGACAATCGTACATGAACCCTCTAGTGCACGTATCACGTTTTCGCCCTATTACCATT 
GATGCACAAATTTCCTCCTTCATTCTAATAGAGGCAGATTGTTTGTCAAAATAACACTCCCTATAGTAACAACCAGGATTTCCATTGGATTGACACTGTA 
GTTGTTCGAACGGTTTTCCTGTATAACATTTCCCAGCTGCGTGGTAGTGACGCACAAAACTCATGCAAGAATGTGCGTGGGTAGTTTCATTTCTAAGAAT 
CTCCCACGCACTAGAGGCACAGAAAAAATCAACTGCTGCGTGGCAGTGACGCACGGCCACACACCCGCATTCTTCACACCATAAAAGGACATGATTCGTG 
TTAGTCGTCACATCTCTCAGAAACCCCCATGTGCATTGACGTCATGGGGAATTGCTATTGCACCACCTTATGGTGTATAAAAGTGGCCCTGGGTGGCATC 
GGTTAGTAGACAGAAACAAACCGTCAAGATGGTGTCGTCTATTACCCACCTCTCTCTGTTGTTCGTCGTGGCTGTAGTAGCTTCCGTCGTTTTTACAACT 
GAAGGAGCTAGTGTGAGAGTGAAACGGTGTGCTGTTAGCCCGTGCCCCGACGTTATTGACCCCGACCACCGCTGCCAAGGGCGACTGTGCCGCAGGTCTA 
CTCGAGGAGGTGACGACGACGACGACGATGACGATGGAGGAACTTTCGATACAGTAGGGTCTGGTATACTTGGACGCAAAAAGCGTGCCGCACCTCCACC 
TGAGGATGAAGAAGAGGATGATTTCTACCGCAAAAAGCGTGCCGCACCTCCACCTGAGGATGAAGAAGAGGATGATTTCTACCGCAAAAAGCGTGCCGCA 
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CCTCCACCTGAGGATGAAGAAGAGGATGAGTTCTACCGCAAAAAGCGTGCCGCACCTCCACCTGAGGATGAAGAAGAGGATGAGTTCTACCGCAAAAAGC 
GTGCCGCACCTCCACCTGAGGATGAAGAAGAGGATGAGTTCTACCGCAAAAAGCGTGCCGCACCTCCACCTGAGGATGAAGAAGAGGATGAGTTCTACCG 
CAAAAAGCGTGCCGCACCTCCACCTGAGGATGAAGAAGAGGATGAGTTCTACCGCAAAAAGCGTGCCGCACCTCCACCTGAGGATGAAGAAGAGGATGAG 
TTCTACCGCAAAAAGCGTGCCGCACCTCCACCTGAGGATGAAGAAGAGGATGATTTCTACCGCAAAAAGCGTGCCGCACCTCCACCTGAGGATGAAGAAG 
AGGATGATTTCTACCGCAAAAAGCGTGCCGCACCTCCACCTGAGGATGAAGAAGAGGATGATTTCTACCGCAAAAAGCGTGCCGCACCTCCACCTGAGGA 
TGAAGAAGAGGATGATTTCTACCGCAAAAAGCGTTAAACTACGCACGAAAGTGACGGTGGTTGAAGAATAGACTAATATTGTTGATATGTTAACCCCTTT 
TTTTCATGAAATGTGTACACACCTGCTATATATACGTGCATATTTGAATAAGGAATAAAGTTTATCTGCATCTGTATTTTTTTATTAGTATCCTTTTTGT 
GGGATAAATAATCGGTATTTGTGTCCAAAGATAGTGGTGCCATGATGCCATATACAAAGCACGTGATACTTTCCAAGAAGCAGAGTCGTATGACCATTGA 
CGTCAATGCGTGGGCAGAGGCGGAGGTGGTGATAAAGCGTTTCTGAGAAACATTGGGCGTATGACGTCAACTACATTATTCTTCCTCCTCCTCCTCCTAT 
TGCCTCTGCCAGTTCAGTATTTTATTTTTCTTCCTATACAATAAAAAGTATCAGATGAATATTTTTACTGTTTTTCTGTTCATCCCTCTTTCCTATTGTA 
AAAAAACCAATAACTAACTAATCATGGATAACTTGAAAGGGGAATTTGTTGCGCTTAAAACAGACCTCACCCATTACAAAACACAGTTGGATAGATCTAT 
ATTGGTATTTGTTGATGTTGTTGGTAGATTATATGTTATAGTAAATAGTGAACAAACAGCTAAAAAGGAAGGTCTAGCAACTAGAGTGGCAAAGCAAGCC 
ACAGAGATACAACAATTCAAGGACGAAATAAACAACAAATATAATGCTCTAACAAATACTTTGGATGATATCATCTACATTTTTGATCATGGAGGGAGTT 
TCAAAAGAGCAAAACATAAGGCCATAATTGAAGCGAGGGAATACTCTAAACCGCTGAGGGAATTAGAGTGCATGTTTACGCGTATAGCGGACATGTTAAC 
CTTGACTTTTATGACTGTGTACACCAATATCATTACTGAATTTAGACACTCTAGTGAACAAGCCACTAATAGTATAAATGTCACCCTCGGACGTCTTTTC 
TTGTGTGACGACTTGTGCAATCAATTACCAAAAGAAGAGGAAGAAGAGGAAGATTTGAAACAGAAATTCATTACTTTCCATGCGAACCTATACATGCTGG 
ACACACGCCTAAAGAAAGATTTGATAATTTTCAAAGATGTCATACAACAACTTCACGTGATTTTGCAAAAGGATACCTATGCTGTAAAAGAAGGTGTGGC 
CATTAGATGTGCGAAACAGATGAACGAAATAAGTCAATACAGGGACAACCTC7UVGGATAATTACAATACATTTTCAAACATTTTGAATGAAATTGTCTAC 
ATTTTTGATCACGGGGGACATTTTGAAGAAGTAAAACACAAAGCCATAACTCTGACTAGAAATTACTTGAAAACACTCATGGGATTAAAATGCATGTTCA 
AACGCATATCCGAAATGTTGTCATTGACTTTTCTAACAGTGTACACTAATGTTATAGCAGAATTTATAAACGCTAGCAATATTTCTGATAGAGAGATCAA 
TAATTATCTTGTCCAACTTGTAACATGTAACGAATTGTGCAACCAACTCCCCAAACCTAAACAATACCGTCCCCTCAGTTTGATAGATAACATAGCTTAT 
TTTTCTCTTTCTGTCCAAAAACATCTGAGTGGGTrTCTTTAGTATGTGGAGTATTATTGTCTTGAAGCTCATAAGTATTCAACCTCTCCTACTGGTAACA 
TCTCTTCCTCTCTATAATCCCAATATGGATAGTTGTTGCCTGATATCGAGGATAACACCAGAACTGGCTGGCAAGTT7UVCCTGGATCTTCATACCAGAAA 
ACAACTTCAAGATTGTCCAGAACTCACTCCCAGACGACCAAGTTATCTCCCAATTCAGATATTTCGACCATAGACATTGCTATACGTTTATGGAGATTTT 
GATGGCAAACATTAAAATCCAAGACAGGAAACAAAACACCACAGCCATATGTGAATTGACAACTGGAAGAGAAGGACTTTTATGTAGAAGAACCATACCT 
GTATTTTTGGGTTCAGAGGAAAAACGAGAAGAGTTATTGGGGAATCTCCCTGAAGGTGCAGAAATTTTCAGGCCTAGAGAAGTTATGCAAGTAATTGGTA 
CTCTCTTGGACAAGAAACTAGAAATTGACGACGGTATAGCTTCTGTAAAGGCTGCCCTCTGTGCTGGTTCATCATCGTTATACCTAATCATGAGCCACAT 
AGTGAAAATGACCTTTTCTGCTATCACAAACATGAAGGATATAAACGAAGAATATTTCGTAGACTTTATATTTCGTCATAAACAATTCCTCAACCCTGAA 
TTCTTCAAGCACCTTATATCTTTGCTCAAGAATTCCAGGAAGGAACATGTTGCCCATCTAGTAAGACGTCTAGAACACTTTCTCATGCTATGGACCCTTT 
CCAAGATGAGGTTCACAGAAATGGAAGAAAACTACTTCCCAATCTCCAGCGATAGTGATTACGGCATCTGTGAAAAATGTGCACGAAAAACTCCCAAATA 
CAAGCTCCGTATTTTTAGGGAACGAAAATGCTGCGATAGATGTTGCCGTCTTTATCACCAACAACCGCCTCCGGAGGTGTATAATTGGGATGGAAAAArA 
ACCCAACAATCCAATAAAGGCTACATTAATGCAGGCGATGAAATTATCGGCATGCTAAACTCAAATGATAAGGGAAAAACATTCCCTCCTATACCTAAGA 
TGGTTGTACGAAGAGTGGTGGACGGTGTCTACGGGCAAGGAACTATCCTGTCAAAGATTTTGAAGTTCAGGCAGGCAAATATCCCCACGTGTCTATTCGT 
GACATGCAATAAATGCAATAGGATTTTCAGGCTCACTATCTTAGGGCCTACAAGAAACATCCTTTGCCCACCTTGCAGAAAGAAAAGTGTTGCAGTAAAT 
ACACAACAGAAAGGAGAAAATAAACCTTCGTTTGTGCAAAAAGGAACAAAACGTCTACGAGTGGATACCGGTAGCAACAAGAACACGTTAGAAAAATTCT 
GTTCCTGGGAAAGATTCAATACTGAAGTTTTGCTCCCTTGGCTTGGCTACACTATTGAGTCTAAGTGGCAGAACTGGGAATCTTTTCTGGGTTATTCGAG 
TACCAGATATAAGGAACTGTGGGCCTTTGTGAACAAACAGGAAATATCTTCCATGAAAGACTCCTACATAAAAATTGAAGACATCGACCAGTTATTGAGG 
AGTATCTTGCAAGACCAGAAGGGTGTATTTGAGACCGTCTGCAAAATAAAGAGCAGAGATGGTTTGTGAATTGGCCACACTGATTCCGATACCGACTAAA 
AAGGCTGATTGATGGCAACCCCCCCCCCTCCAGACTCAGCCGCATGAGTATAAATATGGCCACTTCTCACACCACAGCATCATTCCCTCGTCATCGGTCC 
TACCGTCAACTTCCATTATTACTCCAATAATACCAACAACCCCAGAAATGGAGTCAATCAAACTGTTCACCGTTGCTGGTCTGAATATGGAGCAAGCCAA 
CCAAGTGGCTGAAGAAATCAAGTCAGAATATAAAACCGAGGAGGAAAAGAGGATTGCCCAGGAAGTGTTTGACAAATTCACCAAAAAACTCATTATGCAA 
GTAGATACGTCTAAACACTTACTTACAAGAGAAAACCCCAACCGTTTTGTATCCCGCCCCATTGTCCATGAAGATCTCTGGGAAATGTACAAAAAAGAGG 
TTGCCTGTTTTTGGACATTGGAAGAGATTGATTTCGAAAGGGATCCTAAAGATTGGGAGAAACTCACTCAAGATGAGAAGGATTTCATTCTCCAGATTCT 
GGCGTTCTTTGCATCCTCTGACGGAATTGTAATTGAAAATCTTACAACACGTCTTCGTCAAGTGGCGCAGATTCCAGAAGCGAGGAGTTTCTTTGACTTC 
GAAGTTGGAATGGAGAGTATTCATGGCAACGTCTACGGAGAACTGATTGATAGACTGGTGCCCGACGAAAAAGACAAGGCTATCTTGTTTAACGCTGCAC 
AACACTTCCCCGCCATCAAGAAGAAGGAGCAGTGGGCTATTAATTGGATGCAAAGCAATAACGATTTGGCGGAACTAATTGTTGCCTTTGCTGCAGTTGA 
AGGAATCTTCTTTAGTGGTGCATTCGCATCCATTTTCTGGATCAAGAACAGGGGTATTTTGCCTGGTCTCACCTCCTCCAATGAGTTCATTTCTAGGGAC 
GAAGGTCTTCATCGCGACTTTGCATGCATGCTGTTGAAAAAGGGTTTTGTTGATACCCCATCAAGAGAAAGGATTCTTGAAATTGTCACTGAAGCCGTCC 
GAATTGAACAAGAATTTCTCACAGTTTCCCTGCCTGTTAAATTAGTGGGAATGAACTGCAAGTTGATGAGCCAGTACATTGAATTTGTGGCAGATAAACT 
ATTGGTTGAAATGGGACTAGAAAAGCACTATAATGTTACCAACCCCTTCCCATTCATGGACAATATTTCCCTCGAGAATAAGACCAACTTTTTTGAAAAG 
AGAGTCGCCGAGTATCAACGTGCCCAGGTCATGGCTTCTATCAATAAGATCAAGAAGGACCAACAAACCCAAGAAACTGGTTCTCCTCTCCCAATTCTGA 
CTGCACCTCCTCCAGTCTCTTCCTCATCATCCGAACAAGAAGATGTTGAAGACGGCGTCGGGGACTACATCAGTTATGACGATTTTTAGTTCCACTATTG 
TGTCAATAGGTTGTGTATTGTATTATTATTGTTATAATATTTTTAAAAAATAAATGTTCTATAAGACTAAAACAATGAATTTACTTCCAATATTCCTGAC 
AACCTTTTTTGTTGCGGTAGATGCATGCTCTTGCTCTACCATCTGCCTTTTACCTGATGGGAAGAAACAACCCTTGGTTTTTGATTCTGTATTAGAAGAG 
GTGGTATACCCTACAGATGTGTGTGGGCCAAAGGGAGCTGGCGAATTATTCACTGGTGTGGATCTTTTGACCCTCTGTATAGGAGGTAAAAACAATGGAG 
GTGAATGGTCAGGAAAAGGTCCTTGTCCAAGGATCAATAACGCTGTCGTTGAACGAGATTACTCCCTTGACGAGGAGGATTGTAAAGGGTTTAGAAAGGG 
GTTCCGAATTCCTGGCACTGACCATTTTCATACTGTCTTTTCCCTTTGTTGGGTAGACAGAGATATGCACGCCAAGTGGGTGCGCAACAAAATAAACCCT 
GGTATAGTAACTGATGATGAAGATTTGGTAGATTCTGGTATTAGGACTAAATTTAAATACTCTTCTAAAATTTTTGGTAAAGGATTCAATCCGAGACCTC 
TTTACTCCCTCGACTATCAAGAGAGGATTAAGATATTAAAGTCTCATTTTAACAAGAGGACGGGTAATTTCTTTGCTCGAGGCCACTTGGCTCCGGCTGG 
AGATTTTTTCCTCGCTTCAGAGAGATGGGCAACTTTTGCTCTAGAGAATGCAGTACCTCAAATACAGAACCATAACAATGGT6AATGGAAAGATATTGAA 
AATCGTGCAAGAACTACGCCAGGTGCCGCGTGGGCTGAGACTGGACCAATATTTTACCAACACAAGAAGAAGGAATATCTAGACAAGAAGAAGAAGTACA 
TCCCTATCCCTCATGCCCTCTACAAGATTGTGTACGACAAGAATAACAAGGAATTGTTCCGTGTACAGAGTGATATGTCTTGGAAATAAAATACATAATT 
AAAGAATTTATATTGTTTTATTTTGTCATTTATTTGATACATTTGTTGTATGTTTTAAAGACAATACAGATAATTTTCCTCCATGACGAAACACTGGTGA 
TATGTCTGGAGCTCTGATGAATGTCTTGCCAGATGCCATCCTAGTTTTTCTCCTGGGGAAGGTGACGTCCGTTTGAGAGAGGGTTCGCTGGGTTTCAACT 
GCAGCCAAGAAAATATCCCTTTGTCCAGTGACGGTAAATTGTGTACCGTTATTTAGTGTGATGACATTATCCCTCTCTGATTGTTTCAAAGCTTCATAAT 
CTTCCTCCTCATCTTCATAGTCATCTGCAACATTCTCAGTGTCTAATAGAACAATAACAGAGTTGGGGGAAAAATCAGTCGTGCTACGTCCAGACTGGCC 
AGTATTTTTACTGCTCCATGGTGTAGGATCATGCTTTCTAGGCTTGTCTGAATGTAGACCATTATAGTTGGCTCTCCCCATCATATCATACAGTGTACAC 
ACTAGTGTATTACCAATGCGTTTTCCACTCTTATTGTTATCACAATAATCCTCATCTTCTCCAACAAGAGTTGAGTACACACTGGAATCTGTTAGAATTG 
TATGAGAAACTAATGCCTCTTTAAAAATATCGTCACTCTCTTCTAACCAGAAATCTGATACACTATCATTACATTTATTTGCAGTATTATCAATAAATGA 
ATTGAGTAAAGGGTGTAAAGGTTCCACGCCAAATAAAGGTATAGGCACAAACTTTGCCTCATCCGTGCGAGGTAAAAATGGCCTGTCTTCGTTGTATGAT 
TTTGAAAAGGTTGTTGCAAGAACACAATTTCTTGCATTTTTTAATGCCTCAGCTGATATACGTAGACGTATACTGTTTAACTTTTGCTCTGTTGTCTGGT 
TATTAAATCCTCCACACAACGAGTACATGGCATTATCGATATTCTCCCTCGCATGATCCTCTATGACCAAACCAGCCTTGGTCATTAGAAAATGGTCTGG 
GTCTTTACCGAGAGGAAGAATATCTGCATTGAATTTTGAAATTATATCATTGACAACACGTTCACAGTGCACAGATCTCTTTATGAATTTAGAGGCTAGA 
GATATGAAATGTCTATTTAAATTGGAACCTAGCCTATTCCAAGGAAGTGAATTTTGATGTCTTTTATTATCCCTCTCGTCAGAAGGGAACAAAAATTGAA 
GTAATCTGGTCCAATTAACCGCAGCTGCAGTTTCTCTCTGGACGAATGTTTGACGTTGTTTTAGTAGCTGAAGAGACACTTGGGTTGGCTTGTCACTACT 
AGCTGG7VAGAGGGTGTTGAAAGCCACAGACGCCAAATCTGAACGTCATCTTTAGATTGGTGTGTAGAGATTTTGTGTATTCCTCTTCGTGGATTTTTCTT 
CTCCTTTCATTGGGCACAAACTCCTTTGTCTTTAATTCTTCCAAGAAGTCTTGTACATTGTCCTCATTCGCGAACGTTACCTTCCCATCTTGCCCTCGTT 
TCATCACGTCGTTGTTCTTAAACAGGCGCGCAGAAAGCTGCATCAGGGACATTTCTTCCAACTTTTTAGGGTCTTCATGGATGTCCTTCTTCCTCTTCCC 
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CGAAGAGATAAAAGACCCTATAACATTCTCAATGCAATGAACTGCATACTGTCTCAATTTTCTGTTTGTCATCCCTGAACCAAAGCCTATTACTGGAACA 
ACCATGCCGCACTTGTGGTTGTATGTGAGCCCACAAGCAACTGTCAATTTCTTCAGAAGGATCTCGTATTTTTCACTTCCATTAACATCGTTCGTAAAAC 
TCTGTTT AT AC T TGTGAGC AAC CCA GTTC ACT AT AAAT AAC AAGGC AT A AG AAGC AG AC ACC TT TCGTCCC AAAAGTT T AG AAATTTG AAGGGAGAG CC A 
GTCTGCCATGTAGCCATCTGATTCGTGTTCGGCGATAGGTGTCCTCATCATTGTGACGGATGCTTTCTCCACTCTATCTGCTACAAATTTCACTGCTTTA 
ACAATACTTTTTCTGCACGCATTTTTGTCATTAGAGTTTATTCTTGCATATTGGAGTTCACAGGGGTAATATTCTTCCCCGTCATTATCTTCATTGGAGG 
ATGAATTTTTAGTTTTGTTGTCGTTCTTTATATACTGTTTCAATTTTGAGAAGGGAGTATCCAGTACGTTTTCTCTACATGCACTAGCCGCAGAAAGAGT 
GAAACCGAGCAGGTAAAGTTGTACACATGAAAAGGCACCTCCAAAAATATTACCAATGAAATAAGCAAATATTAATCCCCTGATGTGGTTGCTAGGGAAT 
CTAAAGTTGTGCTCCAGACTATAATTCGTTTCAACGTTAAAGTACCTTTCCAATCTACGAGGCACTTTAATGTTTATGTCTGGAAGGATTTCAAAATTCT 
CACTAAACATACCCGACAACTTGTTTGATTTTCCCAATACAATATCTGACCTTATGTATTTTTCATAAAAGTCAAAATCCGTCATCGTTAAATCAGTTGG 
GGGCTTTTTTGCATTTTTAAGAGTATCCGAAAGATGCACCAAACCAGTTCTGTAGTTGTAGTTTGCCATTGCTTCATCGGGCAATAATTTGCTCACTTCA 
TTGTATGCATCGACAACTTTTTCTGGAGCAATTTTTTCAGCATATAAACGTTGCCATATTTTCTTATCAGGATCTAGTAGTTCAACGTGTCTAGATTCTA 
TTGTCTTCACATCGACAAGAAAGTATCCTTCACGTATACTGGTGCAGGCCAATCTCTCCATCTTCTTGTTGTAGTAGTCTATCCTCTTCCrGAAGCAAGA 
TATTATGGAAGCAACTCCATCCAACATGTGGTACAGGGCTGATTGGGTACACCACTCGTCACTATTTACTCCATCACGAGCAGTGAAGAAATCTTGAAGG 
AAACTGTATATAGAGATGAACGAAAACAGTCTTCTGGATGATGTTTTCAGTCTCATGTCAGTTCTCTGCATAAGTTTCTCGTGCCGTATAACTCTGTCTA 
CGAATTCCCATGTCAGGTTTTCATTTACGTATTCTTTTAGGAGTGCCTTTTCTGCTTCTATAGTTGTAGTAGAAGTTGAATCCATAATGTAGGAATGAGG 
GAGGTACTTTGGCACGCACGTGTGCACTGACCGAGACACAACGGTGTACTGAACAAGGGTTAAAGACGTTGGCTGTTTTACACCTCTCAGAGGAATCCCT 
CTACCAGAATAAGTCCTTTCATTTTTTTAATTAACACACGCAAAACTGACGCTGAAATCGTTTCGCAATAATGGGAGAGGAAGTGGGAGGAGAACTTTCG 
CCTGTGGTCGTAGGGATACTTTCGGTGGTGTTAACCGCCATAATTATCTCTACTATATTTGCCATCTACTACTTTTATTTCCGTCCAAAGGAAGGGTTAA 
AGAAATCGCGAGTCAGTAAAGACCCCTTGGCTATACGCGTAGATGCTAACCATGTCCTCTCCAATAATTCTCCCTACATGATGTTCAATACAAACATTAA 
GGCTAGAAATACACTAACAGAATTTGATCCCAAAAATATGAATGAAGAGGAACGAGCAGAATTTTTCAAGATCCATTATTTTACTGGGTTCCCTAAAGAT 
GTGG AAAAG G AACAGGT GC AG A AG AAG AAGGAGG AAG AAG AAG A AGACGAAG AGG AAC AG ACG AC AGAT ACT AACCC C AAAAAT AAGAAAGTGG AAT ATG 
TAATGCTCGAATCTCTGAGAGGGGATAAGGCTGGATTTTTGATAGATGAGAATAAGACCATGTCTATACAAGGTTCTCCTGGTACAAAAACTGTCTGTAA 
GAAAACGACCCTACCAAAGTTTGTCCCAACCCCTAAAACGGAAAAGTATGTTCCTTCAGATGAAAGATTTTTCAGGATGGAAGAAGGAACAGGAGAAAAG 
GCAGAACATTATTCCTTCACGTCTGTTAAGACAGGAAGAACCTTAATCATGGGCTCTTGTGGCTACCCTGTGATGAAGGATATAAGCAAAGGTACAAGCC 
AACAGTACATGTACACATTTGAACCAATAAAGGAAAAAAAGTAATGTCTTTGTTTATTTCCCCCTTTTTTAGAAAAATTGGAACAACAACAATGTCGTTG 
GCTGTGACAGAAGATTACGGGCACAATGAAAAGTTGATCAAACGGTTACAAACCTCTGTATATCACACCCCTCTATTAGGTGCAGACCATGTAATGAAAT 
CCATATCAGACTACATAATTTCTCGTCGATTCATGAACTACACAAATTTATTAAAACAAGTTGAATATGTTTTCGATGAAGAAACAGGAGCAGTTATAGC 
TAATATCTGTCTGTTAAAAATCCTAGAAAGATGCGCACAGAAAGGAGGAATATATGATGCACCAGAAGATGTTGCATTCTTCAATTCTAAGATGGGGGAA 
GTAACGCGCCTATTTACTATTATAGGAGGTAGGCCCAATATGACGGTGCGGGTTAATTTTAAACATGGGCAGACAAATAATCCTGCCTATGGTTATCTCA 
CAGATGATRATGATACTACTACTGTTACTCCTCCTGTTACTCCTCCTCCATCTCCAGCTGCAAGAAGATCCCCTTTTTTCACACGCACTCTCATATCCGA 
GTCGTCTTCAGTTGACCATTATGTATTGATGCATGATAACCCAAAAAGATCTTCATTTAAGGTGTATGATATTCACGCAGAAACCTTTCCCCATAAAGCT 
CCTTCTGTTCCTACCTTCCCCCCTAAAACCTCGTTTGAAATTTCTGACGTGACTCTCGATTGTTCAATGGAGATTTTTTCACGAGACAGGGATGTTTTAG 
ACAATGTTCACGACTATATTGCTAACGACCCCGTACCATTTTTAGTGGATGTTGTGCACCGTGGATCTAGTCTCCGTTGAGTAAGTACTTTAGTTGCAGC 
CGGTCTCATTAGACCTGAAAAATACCCTTCATTTGTTAGAAAAGTGTGCTTAGATCAAAATGGCATCCACTTCTTCATCAACTAAGAAAAGAGTACATGA 
AGAAGATGAAAATCTCATCCCACAACCCAAAAAGAAAAAATCAAAGAAAGTACTACCATTTCCTGTTGACAAGTATAGAGCTGTGGATAAAAAGGTGGTA 
AATCTCATACACAAGATATTAGATCAAGAAAAGGACCACCTTTCTAGTACCGAACTGCAAATGATAACTGAATGTAATGGTGCGCGAGAAGATCTGCTTA 
AACATCTTCTAGACGAAGGAGAATTTAACCCTACTATAATTGAAGTAGTATCATCCATGCCTATTGAAACAATATACGAAATACTCTCTTCTTCTGCTGA 
CG AC AAGAAGTT TGT AC AG AT ATC ATT ATC AATGTTGAT CCACATACTTTTCTTC GC TG AT AAGGGT ACT ATGT GGGT ATCC AAC GCGTGCGT TC AA AAT 
GTTTTGGGGAACGACTATAAAGTGGAATTTGAAAATATACGTAAAAAGTATCTGATATTGGAAGACTTACTGAACGGCGTTTCAAATCATTGGTCTGAAC 
ATGGTCCTCTTTCTCACATGCTCCATTCTTCAATCCCTATTGTACAAGACATGTTATTGAACAGGCTGGTGCGTTACTTTAGCACGTATGATGGAGATGC 
TCAATTCGATATATCATTCATAATTAATAGTGTCTTGTGGGGAATTGATAAAAGTGTTCTCAACGAATTGACACAATTGATATCGAGGGGTGTTTTCATT 
GTGTCGTACGTACCGATGCGTGTACGAACACCTTCAAAGGACAGTAATCGGCCACAAAATACTCCTTCACAAAATATGTCAGCACTAGGTATGAAACTCA 
ATACATTTTCATCCAGAATCTCAGTGTACAGAAACAATACCTTTAAAAAACTAACCGAGTTAGTGCATAACTTTGATTACGGTTCCAAAGATGCATCATC 
ATCATCTCCTCCTCCTCCTTCATTATCGGACAGCGTCAACACTTTTGTGAGGTTGTACACCAACTATGACATATTCTTAAAGGTGATTTCCGACTGGAAA 
ATGCCTTATGGGTTCTTTAAGAAAACTTTTGACGTCCTTTATTCTAAGGGGTTGATGACATTATCAGTGTCTGAATATACACTCAAAAAAGAGTTGGTTA 
CGTTTTTGCGCGCCTTGAAGGAAAGGGAAATTTTAATCTATAAAATGGAGAAGAGAGACATTATATGTATACTGAAAAAGTCTTTGTTCGGATTTAATTT 
CAGGTGTTTAAAACAATTACTCCCTCTCTTCAAACACTTTTTAAAAATTGAAGAGGTTAAACATATAGCACGTTTTGTCTTTAGAGATTACAGTCTCATG 
TGCAAAACTCAAAAAGATTTGCAGAGTTTCCCTGCCATACAGTCTGCTTCACTTTTCATGGAAGAATTCCCTTGGCTTGCAAAAACTTGGATCGACGACG 
ATGATGATGAAGGAGGAAAGGGACATACCCTATTAACATTTGCTATAGTGCACAGATATCCCTTAATAAGCCAACTrATTTCACACCCAATTTTAAAATC 
GTTAGTGAATACTACATGTAGAGACAAGCACTTTACTCCCCTCATGCACCTCGCCAACACGTCTATAATGTACCAATGCAATACACTCTTGTGCCTTATA 
ATAAATGGAGCTAAACCAGAGTTCATAAACAAGTTCAACGAGAATGTTTTGCATATAGCGATTGAAAATGTTAACTATGGAGTCATCACTGAATTGAGAG 
GAACATTATCCAGCGAACAAATTGAAAAAATGGTCAATGTAAGAAGAATGATGGATAATACAACACCTTTAATGATCGCCTTGGCGAGGGAGAATATTGT 
ACTCGCTCAGCTTTTTGACGGTCTTTACAAGCCCAAAATAAAGGTCCGTTTCGGTTCTTCAAAGAGGCTAAGGATACCAGAGTTTGTCCTCTrAAAGGGC 
CTAAAGGAATCAGTTGCATATTTGGAAACGAGGAATATATCCTACGATATTAACATCATAAAGGATGCAGTAATGGACAACAGTCTTTTTGAAGAGGAGT 
ACGAAATAGCAGCAGCAGGACTGCGAGGC7VATAACTGCGACCCTGAAGCAGACGAGAAGACTATGAACACGTGGAACTTTTTCACCAAAAATTCAACCAA 
ATGGGCAAGCTCTATTTTCCAAAAGAATAGGCAGAAATTTGTAAAGATTGTGGATGGTATGAATAGGACATATGAAGACTCTGAATGTGCAATATGCTTG 
GATAGTCTGGACGGGGATCTTCCTTCAGGGAGAACAACGTGCGGTCATTGCTTCCACAACGTCTGTTGGTTATCCTTGATAAGGATGAGCGGGCCGAATA 
ATGGCAGCCGCGCAAGAGGAGGAGGAATAAAATGCCCGTCCTGCAGACAAGTCACCTGCCTCGGAAAAAGACTAGGGGTTGCCGACTATGATATTGAAAC 
AGAGGAAGAACGTGACACGAAAAATGTCGTGCCTTCGGTAGAAGAAGGAAGAAGGGAATGGAGGAAGATTGGTGTTGACAGATATGAATTTCTTGTAGGT 
GGAGTGTGGACAAATGAAATAAAACTATAAAATGAACTAGAATATTGGTATTATTTTTACACCTTACACATGTTCCAGAAACGTCTGTTCCAGAAATGTG 
TTTTGAGATTTCTGGACAAGCCATTTCTGGAGTAATCTGACATTGGGTCGACCCAGCGGTCCACCCTCGGAACTTGACATTGGGTCGACCGAGCGGTCCA 
CCCTCTAAACTCGAGCGACCCAGAAAAATTTTTAAAAAATTTTTGAGATGGAGAATGAGTAAAATCTCTTCCCTGAAAGGGAGGTCTGAAGAGGCTATGA 
CGGTTCTAGGTGCCCGCCCTCTCCAGAAACGTCTGTTCCAGAAATGTGTTTTGAGATTTCTGGACACTCCATTTCTGGAGTAATCTGACATTGGGTCGAC 
CCAGCGGTCCACCCTCGGAACTTGACATCAGGCCGACCCAGCGGTCCACCCTCTAAACTCGAGTGACCCAGAAAAATTTTTAAAAATTTTTTGAGATGGA 
GAATGAGTAAAATCTCTTCCCTGAAAGGGAGATCTGAAGAGGCTATACCGGTTCTAGGTGCCCACCCTCTCCAGAAACGTCTGTTCCAGAAACTTACTAA 
AGTTAAAGTACGTTTCTGGACAAGCCATTTCTGGAGTAATCTGACATTGGGTCGACCCAGCGGTCCACCCTCGGAACTTGACATCAGGCCGACCGAGCGG 
TCCACCCTCTAAACTCGAGCGACACAGAAAAATTTTTAAAAAATTTTTGAGATGTAGAACGAGTAAAACACTCTAGTGGATTAGGGTGTTAACAGCCTAC 
CCTTTCTGGTCGCAGCTAGACCCAGAAACATCTGTTCCAGAAACTTAATAAAGTTAAAGTACGTTTCTGGACAAGCCATTTCTGGAGTAATCTGACATTG 
GGTCGACCCAGCGGTCCACCCTCGGAACTTGACATCAGGCCGACCCAGAGGTCCACCCTCTAAACTCGAGCGACACAGAAAAATTTTTAAAAAATTTTTG 
AGATGTAGAACGAGTAAAACTCTCTAGTGGATTAGGGTGTTAACAGCCTACCCTTTCTGGGCGCAGCTATACCCAGAAACATCTGTTCCAGAAACTTACT 
AAAGTTAAAGTACATTTCTGGAGATGTGCGCACGTATGCATGCATGCATACGTACCTATCTGTATTTTTTTAAAAGTAGGGGAAAATGCTATTTAATAAA 
ACAACCACCATTCTCCGCTCACTATACAGCTGACCTTCGTTGAGTAACCATGTCTAGCGGAAAAGTAACCTACGAAATCGTTGAAGGGGGATTGTTGAAC 
AACAAGTACCTTCTAGATGGAGGTGCAGCAATCTGTCTGCAGTCTAATTGTGTTGCAAGAAAACGTCACGCCGGTTCCCTCCACGATAACCTCTTCAAGA 
TGCTAGGATTTGGCGACCCCTATAAACAGAGACGGGGAAAAACAAACAGCAAAAATCTGGCCATAATTGAAGATAGACCTCAACTCGGGTCAGTATCAGT 
TGTCCAACACCCGACAGAACCAGAAAGGTTTTGCTCCATGACATTCTTATTTGCrCAGTACAATATGGGTAATGGAAGAAAATGTTACTTCCCTAACGAC 
AAAGAGTATGTTGAGAGCTGCAAGAAGCACGAAAGGGTCCACAAATCTTCCACAGAAATGAAAAGATTGCGCTTGTATTACTTTAACAAGTGTCTTCACG 
CGATCGCCAAATCACCTGCAATGAAGAAGTACAACAAGATAATCTTCCCTGCCAGAATTGGGTGCGCGGCAGCTGGAGGAGATTGGGAGAAGTACCATGC 
TTCTATTCGAGATTTCTCCACAATCATTGATAAGGAAGTGATAATAGTGTCTCAAAGGATGTAATTAAAAAATAAAACCGTCGCTGGCAACCCCCGCCAC 
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CACCACCATCGAGGCGGGTATAAATAAGGGGCGCTGGCACATGGGTGGCACACTCGCATCATGTCTTCCAACCGATTCAGTCAGCTGAGGGGCAACGAGG 
AGATGGTTGGGGACTATTCAAGATGGACAACTGTCAAGAACAGGAGGAACAGACAGCAACAGTATTCCCATAGTTTCCGTCCCCAACAACAACAACAACA 
TCAAAAAAGAACATCAACCAATTCTCCTCCTGCTCCACCTCCTCCATTCCCCATCATTAGTTGGGGAGCCCTCGGCAGCTACTCAATGTATCGACTGGAT 
GACCAGTGCAGAAATTGCGATGAAACTGGCTATTACAATTTCCACTCTTATGATAGAAAGAGGGAAAGAGTTCGCTCATTAAACAACACTCCAAGTGAAG 
GCATGTGGCGGCGCACAAGTAGATCTTCCCCCTTCCTTAATAAGAAGAAGGACGTTGACGAAGCTCCACCTCCTCAATCAAACCAACACATGTACCCCCT 
CAACAAGTACAGTTTCCGTGAATATACTCCTTCATCAAAGCTTGTGAATTGGCGAGACCCTTCACAAGAAAAACAGGACAAGATCTTACAAGAGGAAGAA 
GCTCGCGCCCCTACACCCACTCCCCAAGAAAAGGAACCAGAAGTAGAAACTAAAGATGATGTTGTCATCGAGGAAGAAACTGCACCAGAACCAGAACCAG 
AACCAGCCCCAGTTCCAGACCCAGATATTCCCGCAATAACTGCAACTACTACTACTACTACAGTTGCAACACGTCACGACGATTCTTCTACAGTATTTCT 
CAGAAATGTTATTCTGAGTATCGTGTTTTGGTTTCTGGGTGTTTATTCTGCATTATTTGCAAAATGTATTAGATCTAAGAAGGAATAAATAAAATGGTAT 
ATGAAATTTAAATCTTTATTGTTTCTTTCCAATTACTCCATTGAAATTGTCCTCCrTGTGAGTGCGCCTTCTGGCAACACTCCCATGTCCACAAGTGCTA 
AAGGTTGTATAGTTCCATCCCTCTTTGGTGGGAATGGTCCTCTATCTGATTGACAAACAAAGGGGAATCTATCCTTCAAATCACCCTCTGAAACAGAAAC 
AGTTTTTTCTCTTGTATAATAATATCTGCTCATTAATCCTTCATATCCAAAATGGTGATGGTGTGCAAATTTCCTTAGATTAGGTCCCCCTACGCTTCTA 
AAGTAGAGGGTTGAAAGATCATCAAATGGTTCTGGGCCTTTAACTCTTACCCCCTCCAATGTTGTTTTTGTTCCACTAAGATCTGAAGGTCTCATGAAAT 
CTGCATAGGTTATTTCCACACCTCTTGTACAAGCACTCTGAGCAGCTGTAGGTGCGCAAGATTCGAATCCTCCATGGTAGGGTGTCAATACAGGAAGTAC 
ATGGACTCTTCTCATGACTCTAGGGACTCTTTTACTCGCTTCATAGAAACCGACCATGTCTTTTTGTCCTCCCTCTACATACTTTGATGATGATGATGTT 
CTAACGCGGAGATTTTCTGTAGCCTTTCCTTGCTTATTGATTGTGAAAACTTCCGCATCTTCTTCCATVATCCACATAATCTGAAATAGATAATCTCCTCT 
TCTTTGTACTACCGGCAGTGCTTCCTCCTAACATTTCCATATTGCTGTTAACTACTTCCACCACAACTTCACTCCCATCAGTTGGTTCAATGCTTGCGCC 
TTCATTTATTGTCGCAGAATCGGCCGATATTGTTGGCGAATCTATGTCTGTAAAGCAAGGCGTTGCATACAGGCTCACTATATCCTTGACACCCTCAACG 
TAGGGGGAAAATGTTCCCACGTTAAATGTAGAGGCAGTAACTCCATGAGAAGACTCGAAAAAGTCCACTATCCTTCTAGCCTTTGCACCCTCTTGACCCG 
TTTTTGCTGCATCCCTTATAATGTGGAAAATTTCAGCCTCCGCAGGAGGCACGCCCGGCAAGCGTTCATAATCCTCCATTAAAATGTCATTAGATACAAT 
ATCTGTCATACCAACATCCCCTAACAATTCTGCGTTCCTATCGAACGTACCCTTCCTTTCGTCCAAAACAGCAGCAGGTACCTTTTCTTGTCTTGTGGAA 
TAAGGAGTGTAAGCCGTACTCCTACGCATGGCACAAAATTTATTAGGGGCCAACATAACATTGGAAGATATATCCCATATATTTTCTGTTGCAGCTTTTA 
GGTTTATCTTTCTTCCATTAGTTTTAGTTTCTCCGATAGTATTTAGTATTCTAGCGTCTCCGGATTGAGCTCTCAGGAGTGCTGCAGAGAGATGAACATC 
GTTTTTAGGGAGGGTAGTCATGAGGGGAAAACGTGAATGCCTATTTCCTCTGCTAGGTGTTGCCGCTATAGGGTCAGTTTTGTTCCTACCCCCTCCATTC 
ACTTTGGGACAGAATAAACCGTTGGTGCAATTACAGTTCTCCTTTTCTCCCAATTCCCCGATTTTATCGAATATGTTTTTACTATTAGGATCTTGCATCT 
CAAAAATACTCTTTTCTCCAATATTGGCAGGAAGTAGCCGTGTTTGTTCCTCTACATAACTCTTTATAGAAAATTTAGATAACGAAGAATGTGATTGCGG 
GCCCAGGGGTTTGGGGACCATCATTGTAGTGTACTGAGAGAGGGCACCCGAGCACGGGTATCCGAAAGATTGGCCAGTTTCTTCCCTGACACAAGGCGGG 
GAGTACACGCCGTTCAAGTAGTGGCCGGCCGTGTGGGCGCTCAGACCCCAATCTGCTCTGAACCGATTAGGGATCAATTCTGGCCTAATAAACGTTTCAG 
AGGAAAGAGTCCTAACCCTGAGTAGCGACTCAAAGAGAGCCATAGGAGGAGGGATGAAATTGAGGGCTCCTTCAGrTTCCTTGACTTCACATTGTTGCCA 
TAATTCATCCCCTTCATGGAATTTTAGAGGGACTATTCCATGCATGGACCCGACCGTGCCGTTCGAGACTGATTCrTGGACACTCTTTTGAATGAGAAAA 
CTGGAAGGTATGGGTAAAGGAAATCCCCCTAAAACTCCCAAAAAT6ATGTAGCTTGAGCAGCCGCAGACACTCCCGCAGGAGGAGCGAGAGGCTTAGAAA 
AGGGTCGTGCTGGAGCGTACGAATTGAAGGAGGTTTTAATGTCTGTTGTCGAATTTAGAATAGCTCTAATGTCTCCCTTTGTTACTCGATCATAAGTGTT 
GTTGATGGCTGTCAAGTTGTCGGCCGAACCAGGGTCTCCTCTCGCGTTGACCGGCGATCTGCGTTCGAACCAGGTGTGTTCGGGGAGAGAAGATGCAGGA 
ATTTCCATAGGTCTTACAAGAGGTGAAGATGCACCATTCATTAAAACTCCATCTCCAGTGAGGATTGCTCCTGCAAAAATATCGGGCTTTTTTGATAACG 
TTTTTGCGACCCTTTCGGCCTCGCGCATACCCGTACCTTCTATTCCATTATTTAGTCCTCTTCCCGAGAAAAATGTCCCTTCATTAGAAGCTATGCGCCA 
CGCAGTTGGAGGGTGAGCAAACATTTCATATGCTTCTTTTTCCCCCACTTTCTTAAACACTGCAGATGAACCAGCCTTGTAATGATCAATTATTTTGGGA 
GAATTTGTAGACAATATTTTCTCACTGAATCTTGCAAAGGTGGAACTTTTCTTAGCACCCTGCGCTGTAACAGGTTGCAGAATAATACCTCCAATTTCTT 
TGAATTTGGCATTGTTTGGTTCATTGTATATTGATCTTGTATTGATGGGCAGGCCAGACCTCTCTTTTCCTTCGCCGTTCAAGTTGGGGAAAAAATAGTC 
CAGAAGTAAATTTGTAACGTCGTCAAAACTATTACTAGAATTTGTAACCATCATAGAATTTAATGTACTGTTCGCAAAAGATTGGTCTCTTTCTTGTGCC 
GTCAAAAGAGAGTGCTTGAAAATTGTCTCGATGGGAGTGTGGGCCATTCGCTTAGCGGTAGGTCTATAGTGTACCAAGGGCAATATCTTATTTACTGTTT 
TATCATTTTTAGCAGAAGGTTTTTCCTTTATTACTCTATGGAGTGAAGGAACGACGTATGCTGTTGCGCATATGCCAGAGTCGTGAGATTTTCTCAGTTT 
CTGTAAGGGGTTATCTTTGTACGAATCGTACCCTGCCTTGTCAGAATCTTTAAGGAAGGATGTTTCTATGGTCGTATCGTTGTGCGCATCCCTTCTGTCA 
GTCTTTACTCCTGAAGGATCGCCTGCTCTCATGTGAACCGTCTCGGGGAAGATTTTACCCTGGGACGCTATTATGCCCACAGCATGATCTATGCTATCGA 
TTCCTAAACCGGGAATATTGCTATGGGGTAAAGGCACGACCCTGAGCGATGCGTTTGGATGTTCAGTAAAATTTCCCCCTCCAGCTCCAACCAGCCCATT 
ATCGTTTAGTGCATCTCCGTTCCAGATGTGAGCGGCCACACTTCCTGTTCCACTGTAGGGCACTACACCTCCTACAGATTCACTGACTGTGAGTTTTCCT 
TGTTCGTCAGAATCTTCAATTCTTGAGTGCCAATTGCCGGCGGCGTCCATGGCAGATTCGCGCAATAATCCTCCCGTAATCAGATGGCCTCCGTTAAACG 
CTGTAATTTCAGGTGCTCTTCCAAAAAGAGAAGCGGCATAAGAATAGCCGAGAGCCGACATCCTCGTATCGTTTCCTGGTCTAGGGAGGCACGCGTCCTT 
AG TAG T ACC ACCT AAAGTT TCGT AC CT TTCTGTT ATTTG C AGCAAT AAT T TTGC AC C CT CT AC AAAGGGT TGAGC AT AATCTGTTTT TGCCGCATCGGG A 
AGATCTATACCGGCCACTTCTTCCAGTCTGATTTGTGTACTTTCATCACAAAGACAGCCCTTTGTTCCAGTATCTCCTACAACCCCATACCTGAACATGA 
CAGGGCAGCCTAGACCTTGTTCCTTTTCCTGTATACCACCTTCTTTCTTCTCACATTTTGGGGCGTTGGTGGCAGGGTCTCTCTGCTGGGAGTAGCCAGG 
TTCGCACAAACAATCCATAGTAGAAGTCATACTGCTTATGTCGAAATCGATATCGTCCACATCATCAATAATTGAGAATGTTGTAGGGTGTATAGGTCGC 
CCCACTCCTTTTCCTCCACACAGTACAATATCTGTGCACCCAAGAATGGGCTCTAATCTTGGATCTTTATTGAGAAGGGAAGGATCCTTTCGCACCACAG 
ACGAATCTACACACCCTCTTTTTTTGGTGATATAAGTCGTTCCCTTTTTTCCTCCTCCAGTTATGACTCCTCCAGATTCGTAGTGCAAATAAGCCTCCTC 
GACAGTAGTAAAAGGGTTGACGTTTCTGTAGGCTGCATCTGTAGCAACATCATCGTCGTGCTTGTCGAACGCAGAAGAGGCTGAAAAAATAGTATCGGCG 
AGAGGGACACAAACTCCGTCATATTCATCATCATCTTCTATACCGTATTCAGTTTTCAATTCTCTACTGGATTTGACGCAAGTTCCTCCCCCGAGAGAAC 
GACAGCTAATGCATTCAAAATCGTTGCCTCCTACGCGACATTTTTTAGGAGTTTTAAAACAGTCAAATGTGTTATAATGGGAACTAATTTCCCCTGGTAA 
TAGAGATTCTACAAAACCTTTCCCTATGATGTCTGACTTGACTCCAGCCATACCGAAGGGGGCGTCCGTATCGACATATAAGGGGTCGAAAGGGTTGGGA 
ATAATTCCCCCATTTTCTTTTAATCCTTTGACGCGTAAAGAGTTTCGAGACATGGCTCCTGTATCTACAAAAGAGCTCGCTTAACCACAGTTGCACTCAC 
CCGGGTACAATAAGACAAAAATTATAAATTGGGAAGACCGATACAGTCTTTCATGACAAAATACCCCGAGAACAAAAGATTGTTGTCTAGGAACAAAGAA 
ACATTAAAAATGGTTGCCCGAAGCTCCAAGACCAAATCCCGCCGTGGAAGCAAGAAGAGGTCCACCACTGCTGGACGCATCTCCAAGCGGAGGAGCCCAT 
CAATGAAGAAGCGTGCAGGAAAGAAGAGCTCCACTGTCCGTCGCCGTTCCTCAAAGAGCGGAAAGAAGTCTGGAGCCCGCAAGTCAAGGCGTTAATTCTT 
CCCTGTACAACAACTATGTTATTTAATTGATTTTTTTTCTTCTGAATAATTGGAAATAATAAAACATCCATTGAAACTTATGCAGTATTTTTATTCAATT 
TTTTAACCAACTATAAATCCACATGTGGTATAAAAGTTTAAGGGTACAGATATATTATAAGATGATGTAACAGATGCACGGTCAATAACAACAGGGCCTG 
AAGAAGTGTCTATGAATTTCTTGTAGAATTCTTCCCTGTCTGTACTTGACGCTACATTGGCAAATACCAGAGGGTCTACTAGTACACCTGAGCTAGTGCC 
GGGTTCTATTTTAGACATGACATTTTGAGGGATTTTAACACATAATACAATATCATCATGGTCGAATGTCGTCCTGTTCCAGAGACGGCGATGTAGTGCT 
GAATTTCTATCCACTRAAACTAGACCATCGTCATTTTTCAAATGCTTCTTTCCTTCATTTCTAGCTACAATTTTCATGTAGAATCTTACTCTTTCTGCAT 
ACGACACCACCTTGTCTTGATTCCCAACTATAGCAAATAGTAACGAGGAAGTAAATTCATTGTTTGCACCATTATTATTGTAGAACCCGTTTGCACTACC 
AATTCCTCCACATCTAGGGTCTAGACGTACCAGAAATTGTCCATCGGCTAACCGAGTAACAGGGTCGCCGCTAGGCCATGTGGCTAGCCGAGTCTGTAGA 
GGGGCAACTGGTTGTCTCAATTGAGAATAAGAACCCACTGCGCGTAGGCGGCCAGTGTTTCTCCAGTTCGTGTATCTGTCTGCAGGTTCGTCAGGGTATT 
TATTTGCTCCAAAAACAACAATACCCCATTTTCCTAAAATAGCCGCAAGTTTCTTGTCTTCTTTATTTACCAGAAGATTTTCCTTTTTGGTTCCAGACTC 
GTCTACCACGTATGTAGTATTTGAAGTGTGCACAATTTGGTCTAATGTTTCTTCAGAAACGCCGTCTCCCATGTACATTTTAGCGTACACTTTCTCCCCC 
TGTGTATACGCTTCACTAGGTGCTGGCAATAACTGTCGATGACGTTCTTGTTCCTTATCGATTTGTTCCCTCAACCAGTTTTCTATTGAAAATTCAGTTT 
GAGTATCTGTATTGACAATGTTTGTAGTATCATTCGTTCTTATGGAAGAAATGAAAATGTGCCGAGGGTACTTTATTTCTCCCCCGTAATCAGCGAATGC 
CCATTCAGGTATGCTTGTAGGTAGGGGGACAAGACTAGTCTGTTTCATGAAAGGGTCCCTTATCCCTCTATTAGAAATGGGAGGAAGTGTAGTGCGACAG 
TAGACTATGCCCTTGTTGGTTATGTCGGTGGCGGGTGTGTCCCTTACAGGTGCCGTAAAGGTTGAGGGATCATTCCATATAGCCAATTCGTTAGATGGTA 
CAGACATGTCTGAAAACTGTCCTTCAAAATGCCCGTCCATCGTAAAGGGCATCTGTATGCCTTTATCGTCACCTCCAAATACTTCCATCTTTACACTGAA 
AACCCTATCGTCTCGTGCACCGTCAATAAATATAGGGCGGCTGGTTATTCTCGAAGAAAGAACGTTTTCTTTTGCAAGTATAGAACCAGACACGGGATCG 
GAGACTGTGATGTGGTAGCTTTTATCTTCAAGGACAACATCATCATCATCATCATTGACGATAACATCAGATACGGTCAAGACAGTAGTTTTGTTCCTTG 
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ACGTGATACTGTGAATGGGTTGGAATGGGTTATCACTCGCTTTCACTAGGCCGTTATGCGAATCTAAAACAGGGAAAACAGCGTACAGTTTAGTAGGTTC 
TGCGGGTGCCTTAAATGTAGGCTGTTCACCTCCCAGTAATGCAGTTTGAACGGGGGTAATAATATCTCCACCTCGTACTTCAGTCAATGGACCAACTGAA 
ACGTTTAAAGCACGAGGCAGTAGAGATGTTGTGCGTTCACCGGCATATTCTGTGGCGGGGGAAGAAAGCACAGCCGTTATTCGAGGTTCTCTATTATCAT 
TATCAGTTTTCAAGATGATACATGGTCTTAGTTGGTAGGAGAAGAAATCTGGCCCGTTCGTGGGTCCAGGCCAATCTGGTGCAGTTGCAGCATCTACTCT 
AAAGTTTACGCTAGAACCGAAGCACCCGACTCTCGAAGGTGTATTATGTACAACAACTTTGTTCGTGTTAGCATTCACATCAGTACTTATAACCGAGAAC 
AATTTATCAGATTGAGTAACTTTTGATGCTGGAAATACAATATTGTACCCAGAAGATGTGGTTAAAACGTCAGACGACGTTTTGAAGCCCACAATCCAGC 
CCAACATTTTCCATTGCGCTTCTATTTCACTTTCAGGTACAGACACACATATAAAGGTCCGAGCTGGGTTAGATGGATTTCTAAGGTGGGTTATCTCTGT 
GGCGTTTAGAATCTTGTAACCCTCGGATTCATTTCCCTCCATATTCTTTAACGTTTCATTGTATAGTACTTTGTTTAATACTGGTGTGTTAACAGTGTTT 
GTTTTCGAGTACACGAACGACACTGAAGGTTTGTTAATATCTGAAAGTTGATTTTCGCGAGGTAGTTTAGCTGCTAAATTGGACGCTATTTCAGCTTGTC 
GGGAAAGACGGGGAAAATTAGCAGGGTAATCTGAAACATATTTTTCATTCAATGTATCAGGAGTGTATTCCACATTATCATAGTTTATTGCTTTAGGCAT 
ATCAAGCCTCATGAGGGCTACATTTCTACAAGCA7VAAACTTCCGTTATTTTTGGTATGGGTGAAGATATTGTTACCTGAGAATCGAATAACACGGTCTTG 
TTTTTATCCTCTGTTACAATGGAAATTTTCAGTCTATTCCCAGTGTTGAAAATACTTCCCTTTGTTGATAGTGTTATACCGTCAGCTAGAGAATrTACTT 
CCACGTGTTTGAATACGGCTTGTTGTTTATTGCCATTGTTGCGAGAGAACGCATrCGGGTCATTGGTGATAGGAGGCCAACCTGCTGGGACAAATTCAGG 
GAAAGTTTCGAATAGAGACGGACCACTATCGCTGCTGCCACCCCCGTCTTCTCCTTCATATTTTTCATCCACTAAATCAACTTTTATCTCTGCCTTAAAG 
GGAAGAGGGACATTATCTGGTAATATTCCGTCAAGAATATGTATCCTCACGATGGTATCTTTTCCACCTTTTTCTAATAATTCTTCTGTACGGATATTAA 
TCATTCGAATTGGCAGCAATGATGATGATGATGAACCATCTCTATCAGGAGCCGATTGAGGAGAAATATCATCACCAATATCATAACCGATCTGCCGTGC 
AGTGCGTTTCATTATACTGTGTAACAATTTTGGATCCTGCACATCAGTGACACGTTTTAATTCCTCAGAGGATATCCTTTCAATATTTTTAGCGAGTTTT 
TTTGACGCCGGTTCTAACCCATTCTTCAAGTTCCTTTTCTTTTTCGGGTCTTTTCTAACAGCAGCGGAATCATCGATAGCCGCTGCTATATTCCCGAATA 
CAGCTTTTTCTATTTTCGCAGGTTTTCTTATGTGTATCACACCAGATCTTTCATCGGGCGTAAAATTGTTCTGGACAATTTTCATACCAATATAGGAAAT 
AATAGAAATAAACACAACGAGAAGCGTGAATTCAAAGGCACCCCACCACGAGCCAAAAATACCCGTGACTGCTGTTTCTCCAAACATTGTTATTGCTGTT 
CTTAACGGCAGGGTTGGTTGTTCGGCAAATCAAGGAAACAAAAAAGTTACCCGGGCATGGCAGGGAATAGAACCCAGTTCGTATCGTCCCTGATTGCTAA 
ATGTATATCAGACGTGGAACAAGGCATGGAGTGTTGTGGCAGACAAGCACAGGATGCATTAATGACTCGCCTAGCCAACTTAAAATTGGGCGATTCTCTT 
AAAGAAACTGATGTTAATTTGGAATACTTGAGATACGCGTCTACGCCCCTCCTTGGGGAATTAAACTACGACAAACAACAATATGCGGCAACAGTTGACA 
TCAACCTAATGGCTCATTTCTCCTACGCTGCTTTGGGTATAGAAAGTATACTGAATTCTATACGGAGAGTTGTAGTGGCTAATCATCAACGTAGAAATAA 
TGGAAAAAAACCTTCTGAACCAATCTCACGCCCTCACCCGCTGGGAGGGGTAGAACCTCCTCTATCGTCAGAGTTGGCAAATGCAATAAGGGACAAGTTC 
ATCAGCATGGGGGCGTTGGACAGATTGAATTCAGCAATAGTGACAGCGGCCTTGGGGGCTATTGCCAGTGAACGTGAACTATTCTTACGTGAAAATGCTG 
TAAACTACATGTACGATGTAGAATTTGCAGAAAGAGATGCTGCTACTACAGATACAGGGAATGTAGTCTATCTTTCCACCAAAATGGACGAAGATGAAGA 
TGACATAATAAAGCGTTCAGAAATATTAGATJy^GGTATCAAAACGACCCGCAAAGGAAGGTATAGACTGGCGCCCCACCCCTGACAATTCGTTCCCTTAC 
CAATTGATTTGGGGCGATGATTCTGTAGATGATACTGTTCTTATAGATCTCATCACCAATGCGATCGTGCCTAATATTTTTATGGCAAAATTTATCCTGT 
TCATATGTAACCATTTAAGGGCAGTTATTAGGAGTATGAGGGAAATTTTATACGGTAACATTTCTTCTTCATCCGATAATTATTTTGAGGATGGACGTAA 
ATGGTGCTTCTGGTTGAACCTGTACAATAGACTGGAATGGTTCATGTTAGTAGTTAGATTTGTAATTTTCCTCCACTCAAAAAAGGAGTCCTTTTCAGGA 
GCTGACAATGTTAACGrGAAAAGACTTCTGGTGGTAGTTGTGGAGAGTTTTCCTCCCGTTCTCTTGGACACTGAATGGGTCAAGACTAATATAACGTCAT 
GGCCTGTTATTAATAACAGCAATAATAATAGTACACTCCCTGTGACAGAAGACACCTTAATGAGACTAGCGATAAGGACGAGTAGCGGTGCCCGACATCC 
TATTTTCGACGAAATTAACTCCTTGACAACAGCAGTGACCAACCGTATTACCTTCCAGTCTGCAGAATTCTGCACAAAGATTTTGCTCGGGCGAGCTCTG 
GACGAAGAAGAAGCTGGAACAAAAATGCTAGTAAAATCAGTCAAAGAGACGGGAGAAGAAAAGGATAAGAACAATACGTTCTCTTCATTTGGTTTATTAC 
TGAAGAACACAAAAAATGAAGAATTGGAAATAAACATAGGCGATAACGATGATGAGACTACAGATGTGGCTTGTTGGGCACGTACTTCCTCGACATCCTT 
TATCCGTAATAGGACATATGCGTTTAAAAAAATATGGGGCCTTGAGGATGCAAGTGATGTAGTCGAGCTGAAGCGAGAGAGTGACGCCATTACATCCTTT 
GTCACCGATAAGAGCAGTCCTCTCCTATTTCCGTATGTGTCCGACTGGAGTTGCTTACTATTACATCCCTGTTGTAAAGCACCGGCCATAATTAAAAGTG 
TGTGGTTACAAATCCTGAAAGATTTTTCCCAGGAAAATATAAAAACTATAAATGAAAAGGTACAATCTCTTTCATCTGAGATTTGTCAGAAATCAAACGA 
CCGTTTTAAAAATAAAAAAATTGCTGCCGAACACGTTCGCAGTGTAAAAAAGTTATTAAATACGATAAGCAACAGGGAGCAAGAAGCAGCACTATCTACA 
GAACACTGTATTTGGTTAACGATTTTGTGGAAACAAGTCGTTCAGAACACTCTCAACCTTCTGGAGAATTTTCCCGTATAAAAGGGGAAAGTACTCAACA 
TGTTCACTCACTTGACCCGAGCCTTCCGCAAGATGAACAACCTAGTCAATCGCAGCTTCATTGACGTTCACCGAGTGGTGGCCGAGTTATCCTATCCAGA 
ATTCGAAGAGGATGTAAAGAATCCGGAATCATCCATATATAGAACTCCGATATCCCTCTTCCAAAACAAGGATATTGTTACAATAGTTGGTGATTACATC 
CTCTCTCCGAAGACGGACTCATTCCAAGTTCTATACCCAATCAAGAAGGTCATCGAACACTTCCCAGTAATCTTCCACTGCACCCACAATAATGCCCCTC 
TCTGGGTACACCTTCTGGACGAACGCCATCATCGCCTGCTCCAGAGCCTGCTGACGTACGAGATTGTGAATGCCAAGTACAGGGGTATTGTTGTCATCCC 
ATACTACAGGCGCCCCATCAACTATCARACTGGGAAGAGTCTACTGATGAGCAAACTGGCGTCCGTAAAAGTTCTGGACATTTTAATGAGATGTGGATCA 
TACAAATTCATCTCATTAATGTGCATGATCAACAAGAAGAACAACACCAACTTTCTTCACTGCTGTGCAAGTAAATGGGGAGAAGTTGGAAGCAAGATGA 
TGCTCCACATTGCTGAAATGTTCTTTGCCAACCCTACTACTAGCCAACACCTATCCGACGCTAGTAGTTTCCCTGATGCTGCAGCAGAGGACGACAAGGG 
GAAAACACCTGCCCATCTAGCAATCCAAGAAGATAATGCTGATGCACTCCTGTTCCTGATCTCCCTCTACGGCGCACCCTGGTTTCTlAGATAACAACTCG 
TACATGAAATCTGCCCTTGAACTCAAGTCTAACAAGTGTGTCAAGGTACTATCCTTTGCAGCTGACAAGTACGAGATTTTACCCAACATTAACAACAATC 

aactagaaccagataccatgtgtggagtgtgtgcaacatctgtggaagaagatgaaaatgaagggaaaacaacaagtctttcctggtaccagatgaattg 
caagcattacatccattgcgaatgcctgatgggaatgtgtgctgctgctggcaatgtacaatgccccatgtgccgtgaggatgtgggcgacgaagtactg 
gaaagatgccctcctacaatatttagatggttaaaactggctgagagatctgaacacaatcgtgtactttttgaagcaaaaaagcaagaattctataagc 
agatggaagcaatgaaacctcccagagttgttgttcctcctcgcaggacatttctcaccccagccagaagaggcgaacgagccatcagaatcgcaagaga 
aattgccaccaacgccatcgctgaagccacagctcaaggagatgtcaactcctacttccctgttctcattgacgggagcggagaagaatatgaagaagag 
ggagaagaattcttcaattctgaagaggaggcgcttgcttttggaagaccatttctggaagatgaggaagaagccagacaaatacagatgcgccagtttg 
ctgaactgtctagacgaggcgtttctgtcaatattattaacaatgataatcctcatcgacacatctctacagtaaatattgtgcaaccagtttatggagt 
tgaaaagtcacctgctgcttccttcatctacaacatgctcaagaatgacgtctttgagtctatacgctcaagagatactcgagttggaggagaaagagtg 
cccgtcatgaacctgtccaatgacaagagggcattattccacgcagcttcttccatgctttgtgactttgccacagaaacaaactctcaaattgttggat 
tggactttcaagcagtgtatgatccccatcacatatccaactatatcgagacgtttggtagtcctcttcacgcctacccaggagccgtcacttttctgga 
cggggcccaggactattatgcagagagtatcagatacgacaatgatattgtctcattctcagaaatggcaagtgagttgcacatcaccgaagcattagat 
gtctttgagggtagtttattatccccactgttcaagaaaatcaggactggaaaatcttactctaactggaacgaccatttgaggcgtagaaattatgctc 
gagatattgctgaggaatttgttagggtatgtgaaaactctctagcttcacgcgaacacccccctgttcatgtacatccctttagagatggagcaatccc 
cattctcattgaatatatagtagatttcatccaccactgcatcacctggtctatgcaagttaatgcactccattgtatgagaaagtacattgaacacgag 
aatacaaatgtgcacctgttaaacttgcgtcctactgatgaaagggtggaagttttaagggtatctcaactcagatggagccgcttgttcaatgaacaat 
acaacactagaatgtccctcagcaccaaaagattgagcctcatgaagatcttcaaccatgatttgggtgtgtctaaatttggtgtatacaaactcctaga 
tattattgaaatgtactgttttactttaatctaaacaattw^aataatgtaaagaattccatgtttttcatccctcctcctctgtggtgtgtatatataa 
ggcgggtagcctcctccacaggcacacttgtccagtagccgttgtcgagtgaacatcatggcagcagacctcctagagttggctatccaggaaacaatcc 
agtctgaattggaagaaattgccgatactgaattcctcaattatcttccccataaaactggcatctgcgaagaagctgcagctaatggacggccatatct 
tcctacactagaaatgaggaacgaagttgaccatttctggtcccaagataacaggaagctgaaactcttggggcatttttgtggcaacttgtatgtggag 
gcatttatagctggttctatagatgctgaaacgtgcgtgggctttttgaggtcgcaagcaactggactaggataccctctattgaaaaaactagccctga 
ttgcccgtgaggataaatcaaatacaactaattacaacttgtacattgatagaaattcaatgatgaaacaagtttttagtgctgaaattgataagcgtcc 
ttcatcaatacagaacacttctcacacaaaatcttcccccgtgtacttgaaactgatcgacagaagaaccgagtgtcttgccctggattggctggacgca 
tccaagaggacggccaaggaaattggagcagccagaaaggtttgtttcctccagaaccrgattgttgccatattaatacctgcatacacagaaacgtttg 
ttcttgatacaggcaacgaactagaacagcaagtattggatgatgcatattttaatgcggagaataaagataaagtggargaaatgtgtgtagtggccat 
attgagtactttgcacaatttatttgttaggaaaagtcttccccatcatttgtacaatgcacctttccgtcttcctccctttggacaacaccctatcatc 
aacattgaaaattcctcattctttaatgaagatacgacacctattctagcgtcaatttctataccatcaagtatggtaaraaaacaccatacgcgaaaaa 
atagtagatggagatgtcccaataacttgatgactgcagcagaacggtctatatttttacgtggagttttaacagtcagtggagattacggatggttttc 
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TGTAATTGTfiGGATCTACTATAATGCCTAGTGTACTTTTCTATGGCGACCGAAAACATCTAATAAATACAGTCAAATCTAATAACTTTTCTGCCATAACC 
TGTTCTTATTGGAACAAGTATATGGATTGCAGATCTTATGGGTTTGAGATTATRGACACACCTGAAAATAACTGTGGTTTTCGTATAAGGGCTGCAATTG 
ATTGCTCGAACACAGATTTCCATTCACCGGTAACGCGRGTCAACAAGAAGAAAACGAGCATTATTAATGCGGTAAAGAACCCTTTTTTTATTAGACACAC 
AGAACCTAAGTGGTACAACAAAAATGCCATGTGTGGTGAAGTATTGGAAAATGTTGGCGTGACCCTCGAACAACACGTCCGTGTTAGTGATGAGTATATG 
GACAGATTTGGTAGTCTATTACTTGGACGAGAAAAGAAATGGACGTGTAATTATCTAGATAGAATAAAGTCTCTAGAAACTATTTCTAACAATCTCAAGG 
GT AAAATTGACACC ATGTGT AAAATTC T AG AAACC AAGT AT AACT AC AAATCTTC T T C CT TA T ACT AT AAGC AAAT AAC TGC TAG GTCTGATGAT C CT AT 
AAAGATGAAGATTATCGCCTCTATAAACAAGAGGAGGTATTTGTGTAATATTCTAGAATTTGCTATAATTTCATCAGAGAAAAAGGATGAGGTGGAGGAA 
GATCACACTAAAACCGGCAATGGTGGTTGTGCGTTTTCAAAGTATAAGAAAAAACAACTCGAACCTAAACAACATTTAATTGTTAAAGTGAATAAATACA 
TTGAGGCTTTTTCGTTAATTAAGATGCTGAGAAATGATTGCGAACGCAACAAGTGTAGGTTTAAAGAGGCTGAAATTAGAGAGTGCGCCAACGAACTGGT 
ACGAGAACTGTATAGGGCTTCGGCCAGAAGTTATGTCCACGATCTGGTATTGAAGCGAACTAATGTCCACTTGACATGGCAACGCCCTTACGATGAAAAC 
GCTAACACTATCATGTCTTTAATACCTAAATGTAAGCTACACACAGTATTGTACGATAAAGATTCGCGCGATGTTAAGTTGTTGAATTTTCTGAGAACGA 
GGGATGGAAATTATAACCCAATAAGACACTCTATGCTGGAGTTAGTATACGGAGAGGAGTACGCTAAAGATGTCAGTACTGTTACCTGTTTTGAArGGTT 
AAAATGGTGCTCGAAAAAAGGTGTGATTAAATATGAAGACTTTTTGGATCGTTACGAGAAAACGGGGGAGGAAGATAAAGACGAAAGGGAATTCTTTAGA 
CTAAAAAAATGTAGTAGAGATCACACTAAGGATATAAAAAAAATAGAAAATGTACTAAATTCTGATACACTTTATTCTTATTCTCTCGATAAAAATGTGC 
AAACCCACGCATCTTCTAGTACAGTTGTAAAAAATGACACTGACGGAAAAACGTCTATGGTGGGCTGGGATTATATTTTTTCAATCGGTAAAGGAGAAAA 
AACAACCAAAAAACGAAAACTGGAAACGATAGATATATCGAGTAGTGACGACGACGATGAAGAAGAAGAGGAAGAAGATGAAGGAAAAAGAATGAAAATG 
AATAACTGCAGCAGCAGCATCAAGAACAAGAGCAAGAACAAGAATGGGAGAATGTGTTGCACAGATATTCTCAATGTTGTAGAACCTTCTCTACCTAATA 
CTTTATCGTTCAATTGTGTAAAAAGTATGGATGTGTTGAATTTGTTATGATGTTAAAAATAATAAAAATATTGTACATTTTATTACATTTGTTTGTTTTG 
ACACCTGAGCGGTTTGGGCCTGTTGACGTGCCTAACTTTTTGTCACCCTCATGAATACAATTTGTAAAGGTGCTGAAATGTACTTGTTTTTTATCCAAAT 
TTCCTGTACTGAAGAATATTGAAAGAAGACTTCTTGAAGAGGACCGATAAAAAAATGGCCACCTTCCAGACTGACGCCGATTTCTTGCTGGTGGGGGATG 
ATACTAGTAGATATGAAGAAGTGATGAAGACTTTTGATACTGTTGAGGCAGTCAGGAAGAGTGATCTAGATGACCGTGTTTACATGGTGTGCCTAAAGCA 
GGGATCTACTTTTGTCCTCAATGGAGGCATCGAAGAATTGCGTCTTTTGACTGGAGATTCAACGCTGGAGATTCAACCCATGATTGTGCCAACAACAGAA 
TAAAATAAAGACGGTGACGGGAGACTAATATCTTTCTTAGTTTCCCGTCACGGTGAAAATGTTGGTTATTTCTTCCCTATGTTTAAAAATTTGTCTTGGT 
TAAAAAAATAAAACGAAAACTGTCAATATATTGTTTTATTGATATACAATATCCCTTTTTACACAGAAATGGCATGTCTCTTTACACGCTGTTGTTCTTG 
AGTGTTGTTTCCATCTTCTTCTTCTTCTAC6TCGCTATCAAATTCATTGTCGTTTTCCCAGGTCCTTTTTACTCCCCTTTGCTCCTGTTCACTGGTAGAC 
TGTTCTCCATTTTCTCGCTTCTTTTGTTCCTCCCTCTTCTTTACCTTCTCAAACCACTTTTCGGTCTTGTTTCTCGTACGAGCTTCCTTTGTGGTCAAAT 
GTCTTTCCTTGTTGTACCTAGTGACAATTTTTGATGTGTTCTCCATCATGGAATGGAGCTTCCTTTGTTGTTCAATTTCAAATTGGGTCATATTTTCTTC 
ATCAAGTTCTTGAGGCGAAGAGATGTCTATCCTCTGCCCATCTTGATAGACGGTGCCATCATTCATAAAGATTCGGATATCAGAACCTGTCACTTTTTGT 
ACAATACCCTTCAAATCTTCATCACTTGGAGGGGTCTGGAAAACTATACTTGATCGGTTACATTTTAATACCTTGAACTTGTTTTGGTGTTTGATACTGT 
CGTTGCAAGCGATAGGTGCGTCACAGCCAGTCACGGCCATGATGAGTCTGGGCCTCATACCTGCAGTATCCCTTTTCTTGTTTATGGGGGAGAAAATGTA 
CCTCAAACCATTTTCTGTTGCAGTTTCAAAGGAACCACTTTTGTCTGTCCTGCACAACAAGTAGAATTCTGTAGAGGATATCGAGAAGACGAGACAAGAG 
CCGCTGTCGTATCTGAAGGTCGCATTGGAAGATTTCTTGTGTACATTTCTGAGAGATTCTTTTTGTTCCTCAGAGACTTTGAAAAGTTGTTCAGTTTCGA 
T ATA C GAG TT TTTGT ACTC C TCTTTCTTT ACT ACTTGTGTTTCTAGTTCCTTTTCAACTTTTGTGGTCTT TACT ACT TGTGTTTCTAGTTCCTTTTC ARC 
TTTTGTGGTCTTTTCGACCTGGTTTTCTAGTTCCTCTTTTATTTGTAAATTTTTCCTAGCCAACTCTTCGCGTATCTGGCGTTGCTTTTCATCACTGTAT 
AAGATAACAGCGTAGGCTAGTGTCGAAAGGTACATTTCAATGCTTGCAGCAGAAGGGTGTGCGCTCCTTGCAAAAACGCCCGCCACCATAAACAAGGGAA 
CAATCAAGCAATGTGAAGAATGGCTCTTGAAAATAGGATTGGCTATATATTCTCCTTTGTAATGGGTACGGTGACAAATTTCAACCAATGGGACCATCAT 
ATCCTTGAAACCATGTTTCTGCATAGTTCTCTCCATTGACTTTAAAGCGTGCGTGACGTTCATTGTACTGAAACCTAAACCGATTAAGAAATCGGAACTT 
ATTATCATACAATCGTGTTTTTTTTCTACTGCTTCAGAAAGACTATTGCTCATGTACCTGTATAATTGACTGAATTTAAAGTCCTTGTCTTTTCGTTTTA 
ATTTTTCCTTGACTATTGTATTGATTTTACGCACGTCTTCTTCTTGCGTTGCAAGCCATTCCTCGTCAGTAACGTGCCCAGCATCATCTTTGAATAAATG 
TCCAGGTTCAGGAGGGGCACAACTTATCCGTTCAATTCTTAAAGAGTCATCTTCTACCAATTCTTGTAATCTTTTTAACCCTCTTGCGAATTTGAGTTGT 
TTATCTTTCTTCACATGTACAACAAATGTGTCTTCGGACAATTCGCTTTGTCTTATAGATTCGATAGCTTCATAAACACCCATGACTTGCTCATATTTAC 
TTGTTTTCTTCCCGATCATCAGGAATGACGATCTTAGTTTTAACTCCTTTTCTGATACACTTTCATTACTAACAGGGTTATTGTTTTTCTCTCCAATGTA 
TTCCATATTTATCTTCTTCAAAACAACAACGGAAAATACAGTTTTCTTTTAAGAAATATAGGTTACAATACTGTGGATTGATAATTTTACTCACAGCTTT 
GTTCAAAACTTAACTACGTGGTTCGGCCCATTTTGCACCAGATCTGTTTTAAAGGAACTGGGTAAAATACACATGAAAACACTCCTTTTGAACAAATTCA 
GATTTATATGTATTTGTCGATTTTTATTAATATTAATAAAGTATATTTTTAAAAATTAGGATGGGGACAGATGGTGTAGATTGATACAGTGACCGTCCCT 
GTTATTTTACCTTAATTTTTTTTCCCTATAATACAAATAATTGGTATCAGTTTAGGAAAACATATTTTTAAAATGAAGGTTATCATATCAACTAAACACT 
TTTATTTACCCCCTTTTCACCTCAGAAATATCCTCACATAATCGCATGCAAGACTTGCGCATTTCTTCCCTGCCGTACAAATCAATAGCAAACGCTAACG 
CCAACAGATACGTCTCCACTTCTAGAGATGGGGGTTGAACATTACGGGAAAACATGTACCCTAGATACACCAGTGGAATAACTAGGCAATGGGAATGATA 
ACTCTTGAAAATGGGGTTTACAACAATATCACCCTTGTATTTTTTGAGATGGCACACTTCGGCCACAGGCACCCATGTATATTTCAGACCACTTTTCCGC 
AT AT TTCCC T CT ATCGCT A T T AAT T T T TC C AT G ACATC ATCAGCACT AAAACCG AGAC CAAT C AAAAAAT CT GTGCT AAC AAC AA AACGTTCCTTTT CC C 
CGAAAGGTATTTGAGGAAGACTTTTTGTTATATACCTGTAAGTTGGACTGAAATAAAACCCTCTAAACAACATCCGTGTCTTTTCCTTTACGAGCTTATT 
TACTTCCTGTACGTCTTTGTCTTGAGAAGCAAGCCACTCGTCATCAGTAGTTGTAGACGTGTTTTTAAGTATAACGTGTCCTTTCTCAGGCTTAATAGAG 
GATAATAATGATTCTATCTTGAGGGAATTGTCCCCTGTGAGCTCTCTTAATTCTTTCAGTCCCCCTTCAAAAGTAATGTGTTTATTTTCCTTTAATTGTA 
CAATGAAAACGCCGTCTCTAAATTCACTCTTTCGGATAGTTTCAACTGCTTCAAATGTACTCATTACTTCTTCATACTTTTCATATTGTTTTTCTCCAAT 
CATTAGAAAAGAAGAGCGTAGCTGTAACTCGTCTTCTTCAGGTGTCTCATCCAGGAGATTTATCAGTTTTTGTTCCCCTATATACTCCATGGTTTATTTA 
GTCAGTTGATTTATTTTTCTCTTTGACACAGTTTTTATGGTACTCTGGGATTACTTTACTCAAAAATTTACTCACATGTTGTTAAAAGTTTGAGTGTATT 
CTTTTGTGGTAAACTAACAATGAAGTACATTTCTTAGTAAATGAGGTAGGTACTGATTCGTGATGGATAACGATGGCGTCCCATGTCTAGT6ATGTGAGT 
CACATAATAAAAAGGGAAATAAAAATAAGAACTAGTAACATATTTTTTATTTTATCCTTCTCAATATGTTGTTTGATTTCTTTCTAAAAATGGCTGCTGC 
AAAAATGGACGCTATCCTTGCGGATATTAATGGGAATGATACAGATTTATCCAAGCTAATCACAGACGTGATTCAAAAGAGAGCCAAGGCTGTCATGGAT 
AGAAATAGGGCTAAAATGGACATGAATAGAAGAGTAGATGAGGCTATTCAGGAAGCCGTAGCGGCCAAGAAACAAAAAGCATTAGTGGTATTTGATAAAC 
TCGTGGAAGAAACTGACAGCGGACAAAGTGTCCCTCCAACATTATCGGGATCCGATTACGACGCGTGGGTAGACAGAGCCATGCCTTCGCATATTGAACT 
TGTAGAGAGTGTTGAGGGAGATTCTTTGTATGATAAACTCCCTCCTTTTAACGTACTU^GACATAGACGACCAAATCGGTGATGAGATAGATACACCAATA 
TCTTACCTTGCCATGGTAGTGGTAAAAGTCGACTGTGAAACTGGGGATATCGAAGAAGAGTACAATCTTGCTCCTACCTTTGGTGTGACACAAAATAATA 
AAATATACAGAGATGAAAGAGACCAGATTTTTACAAAGGCTGATAAATCTGTGCGTATTTTTAAACTTGCTAAATTGGATAGTATATCAGGTAAAAGTAG 
ACAACTGACGTATGCGGTAAAAAATAACAATGAATATACAGAGTTTGTCTGTAGCGTCTTTGCAGAGTTTGAATCTGACAGTGACACTACTAAATCAGGT 
ATAGGTATTCGTGAATATGACAAACCTAAAAATGAATTCGAATATGAGGAGCGAGAGATCTTTACATTTTTTATTCCAATACAACCCGCAGGGACGAAAT 
TATTGTTATACTTTTTGGTGGACGTCAGGTCCCGTATTATATAGGACTATAAGGCACAACACTCTGTTTGAAGAGAAAAAATGTTCGTCATAAGCATAGC 
TACGTCATTGGTGTTATTCTTCTTCCTTCTTTTCGTCTCAATAACAATTTTAGATGGTGCAAAAACAATCGACTCTCAACCTTTTAGGAAAAGGAGGAAG 
AGGAAGAGATATCGTACCAGTGAGAGTGGTGACGGCATAGACGGAGGAACTGGAACAACAAACGGAGGAGGAGGAGGTGGAGGAGAAGGAGGTGGTGGTG 
GAACAAATGGAAATGGAACTGGAACAACAAACGGAGGAGGAGGTGGAGGAGAAGGAGGTGGTGGTGGAACAAATGGAAATGGTTCTGGAACAACAAACGG 
AGGAGGAGGTGGAGGAGAAGGAGGTGGTGGTGGAACAAATGGAGGAGGAAATGGAAATGGAGGAGGAAATGGAAATGGAAATGGAAATGGAGGGGATACA 
GACACAGACGATTTTGAGCCTACGCCAGCCCTTCTAAAAGAACGTCTACTTAATTCCATATCTTCAAAACCTAAAGAATACTATGAAGCATTCGTATCTG 
CAGAGGTAGAAACTGCCTTACAATTATCTCGAGATGATTCTACCCAAACTATAATAATTGATGACGACCAATTAGAATTGGACGCCTCAGACACTCTACA 
AGGAAAACCAAGGGATTATTTGTTCAAGCTCGCAGGTGTTTCATCGGCCTTTTTAGAAGGTACCACTATCCGCAAAGCGGAAGACCGTGCCCGTAATATA 
AATGAGG AAG AAATTGC AC AAA C AAT ATT AAGTC AACT AAGAGAAAAAC AC ATC AACG AT GAAT ACG AT G G AAAAT ATGC CAC AC CC GAGG AAAG AGCT G 
ATTTTTCCAATAGTCTTAATCTTGTTACTAAATACACAAACCATGAAGTGGGTCTTTTAGTAGGAGAAACTATTGAAAAGGCTTTCCCTCATGAGATTGA 
GTTTGAGAGATGCATAATTCTAGTAGAGGATTTTAATAGTGGAACTATTACTTCAAACACTATGCAGTACAGGTCCAACGCTTACAAAATCAGAGTAGTA 
GAAGGATCAACAACAGATCCAGGGGAAGTTGTCCCTGATGATTGTTTGGTTTTTGCCGTAGTAGTAAATAAGGAACAACATTCTCTAGAAATATCTGCAA 
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CTAACAGATGCCAAGACATATGCTTTGTAATTATTCCTCGTTTATCCGCAATAGGAAAAAATGCTACCATGGTAATAAGGAAAGGTGATGAAATTAAACA 
AGAAACCTATCTGTTTGTGGCCAATAAGAATGACACTACTCATTTTTCAATCATCACAGACAAGGATGAATCTGTTGGAATAGAATTAAACATGCTCATT 
TTCTCAGAAAGAATTCTCCCCACTTTGAGTGACCCTGCAACCGTTCCAAGACCTTTGACTGACGCCAACGTACTGTCAGCCTACGGAAAGCGCCTAGGTG 
TTGGTGCCTTTACAGACAAAAATTTATTGTCCAGCCAATAAAAAAAATATAAAAAAAGATTGATTGTTTTATTTTCAAAAAAAATAAAACACCATGGGAG 
ATAAGCAAAAGGTGGAACAGCTTTTAAGGGAATTAAAGGCAGAGGCTAATGATGATTGGCTATCGGCTAATGTTGATCCAATTGTTGAGAGATTTGTCAC 
AACTAAATCTGATGAGACAGCCCAAGTTGTTAAACAGGCCGTTGATGAAAAATACGATGAATTATTAGAAGATAAGGTTGAAGAAATGAGACCGGATATA 
ATCAATGAAGCATCCGAAACATATGATAAACTTGCTGCTGATATGATAAGAGAGGTAGACACTAGTAGTGTTATTGCTCCTGCAATAGCTGGCACAGTGG 
CAAGAACTATCAATAATTTAAGAGATAAAAGGAAAGAATATGAAAAGAGGCTATGGACATTAGCCTACAAACCATGGAGAAGATATGTACAAGCAATTAC 
AGTGATGGAATTTCGTTTATCATATAAAGACCTGACTGTCCATGCCAATTCCGATACATATTTAACATTCCCTTTTTTAAGAATAAAAAAGATCGCCTAT 
ATTAATAACGACCGCGCCAGCCCCGTGAACTGCTCCTTGTCAGTGTCTTATCCCAACAAGTCCGAGTGGGGAAATGATAACGGAGTTGGACGTAAGGTTG 
ACATACACATCAGAAGAAATGATTTACAGGAAAAGGATCTTTACCTGTCTGTTATTTGTATGTTAGATACTGATTTTTCAGGATATGATAAAGCTGTGGA 
AGTTGATGCCCATAAATTTCATTTTGAAGCAGGGAACAGGACTATGTTTTTACCCAAAACCTCTAATCTATTTAATCGATCGCATATAGTAAACTCTAAA 
ATATGCACTATAGTGTTCCCTCCCGCTTCTGCCAGTTCTGCTTCTACAACAGAACTGGATAATGTCTATTACAGGATCACATGCACATGCTCCTAGAGGG 
GGGGGGTGGCTGTGTCCTTGTATAAAAAGCGAGGTTGGCACAGACTATCAAACATTCAGCCGTAAACCATGGAGGAAGAAAGTCAACGAGTACAGAGGAG 
GATCGGAGTCCTCCCAGAAGAGGCTGCCTCCCAGATCCTGAAAGACACCAAGTTAAGAGTATCATATCTGGGCGTGGGACATTGGGGATACTCTGTTTCA 
GTTATAAAAAGTGCTCrCCAGAAAGGATGCAGGAGGAACGATGAGGATATAACTGCGTGGTCAATCAGGGAGGCTTATCTCTATTACCATCTTGGATTGA 
ATTACATTGAAAACGTAAAACCTGCTGCAAAATCATTAAATACAAATATGGTAAACAGAATTAAAATCATAGCTGTGGAGGATACCAGTCCTAGAAGTAT 
GGTGGCTTCTAATGAGTGTGTGAGAACTCTAGAAAAATATGAAAAGGGGAATTTTAGGCAACCCAGTTACTTGATGGATGCTGCCATGAGGCTAGTTCAC 
GCCTCCAGTTCTAGAGTGTGCTCCCACATGAGAGCTTTGTGTTGCAAGGAAGAGGACAGTGATAAATTAGGGGGTATTTATTATGCTAATTTTAATGAAC 
TTGAAACTCAGTGTGTTAGTGCAGTTAATTTTTCTCCCATAGAAAGAATTAAACACGTCTTCAGGGAGATTGAAAGTGTAAAATTGGGGAAGAAAAGTGT 
ACAGTTATTAAATTTAAGAAGTGTAGCAGCTTACCATGTGTTAAGATATTATGGGGATAAAGTGAAAGATACAAATAAGAAACATAGTGGACCATTCAAG 
CGAAAAGAGTTTGAGCAGTTTTGGGGGTTATGCTTTAAATTTGTTACTCAGCACGTAAAAACAGACCCTGAATTACGTTGTTATTTTAATGAGTTGACAT 
ATGCCATAAATTGGAGAAGGGATTTTTTCTGCTCAAAAGGGTTCTTTAGAGAAGAAAGTCTATTCCTTACATCTATTGTGGAATTAATTATAGCCATGTG 
CATAGGTGACCGTAAACAGTTTGCCAAGATTCAGAAAAGGGATTTAAAACGTTTCAATAAAGGAGAAGAAGGGAGAAAAGAAGAGGCTGCTACATTTGAT 
TGGATAGAAGGACATGTTAAACGGATGCCTCAAATGCCTGTATGGGTTCTGGACAAACATACGAACAAAAACACACATGGAGTATCTTTTGCCCTAGAGA 
GTAGTATGGTTTCTGGAGGAGAC/^GCGCTGGTCCCCAGGGGTGTGGCTTCATTCTTATACAAAGATGCGTCTAGATTCTCCCCCTCCCCCAGAAGTGGG 
CCAATTTCTGGATCAGGCTTTTAATACATTGAAGCGAGAAGCTGCTAGTCATTGCGTAACGAGGAATATTTGTACTACTACAGGATTTATAAAGGCTTCT 
AGTTTTACTGCTAATATTAATTCTGAACCGATGGAAATTAAGGAAGAAATAAAGAAAAGAAAAATTGAGATTAAGGATGATAATACTACTGCTACTGTTA 
CTGTTAGTGCTACTACTAGTAGTTCTATTACTTCTACTCCACCTCCTACAAAGAAACAAAAAACAACTCCAAGTGGAAGCAAT7VAAGTAGACTCTATACA 
ATTGAATAATCTACCAACTTTAAATATGGAGGATTTAGATAGAGTACTAGAAGTACACAACCAAAACTCTAAAAAGGGTGTAGCTGCTACAGTTTTAATG 
AAGGATGGAAATAAAGTTGTGTTTAAGGAGATGAGAAAAAGCTTTGGTTGGGGTTCTCATCAGAATTTTGTTCAAGTACTAAAGGATGAAGATGTGTGTA 
AATTGGACTATTTGTTGCCATGCCCTGATAGCGGACCATACAGAGGATTATACAGATGCTATTTCAAAATAGTAAAGGATGAAATTTCGAGTACAGCTGC 
TAGGATAGAAAAAGTGAAATGGGGAGAAAATGCCATGTGTTATTTCATCTCTGGGTGTGTGACTAGACAAGAAGGAATTGGGAAAATTATAACAGATGTG 
CGCCTTTCACATATGGGACCAAATAAACAATATGTGTATGATAACTATAGGCAATTAATACACATTTTAATTTTTAGATTGTTGACTGGTGTGAGTGATA 
CCAACACATCAAACATTCTCGTAGGTGATGGAGGAAACTTGTTCTCTGTGGACGAGAATTATGTGGGCGCAAAGGATCCGAGGACAGCTCTCGAAAATAG 
AAAGATAAAGGAACTACAATTGCTACTGAAGACTTCCTTTAAGGTAAACAAGGTAACAAAGGAAGATATAGATAGCTGTCTTCCATCTTGGCTATTTGAT 
ACATCAAAAAGTGATAAAATAATGAATGGTGTATGTAATATTGGTAAAAATATGGGAATTGGTCCCACTACTTTGGATATTGTAAAGAATAATTGTACTT 
GTATTTTAGGGGTCGTGAATGACCTGCTTTATGACAACAAATAAAGGAATAATACTTTATACAATTTTTCATTTCCTTCCTTTTTAAAGTGTATAAAATT 
TTTCTAAGACAAAAGTGAGTTTGAGGGTTGGTCATAATCTAGCACATTTCATCCCTTTTTCCCCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTATTTGAA 

agaccctggttaggtccctgctaagggtaggctgtaacctaccatatacactagagaattttactcttcctccatctcaaaaacttttaaaaaatttttc 
tgggtcactcgagtptagagggtggaccgctgggtcggcctgatgtcaagttcggagggtggaccgctgggtcgggccaatgtcagattgcaccagaaac 
gtctgttgctccagaaacgtacattaaatttctactgtttctggaacaggcgtrtctggacacaccagcacccagaaagggtaggctgtaacctaccata 
tacactagagaatxrttactcttcctccatctcaaaaacttttaaaaaatttttctgggtcactcgagtttagagggtggaccgctggctaggcctgatg 
tcaagttcgaagggtggaccgctggctcgggccaatgtcagattgcaccagaaacgtcrgttgctccagaaacgtacattaaatttctactgtttctgga 
acaggcgtttctggacacaccagcacccagaaagggtaggctgtaacctaccatatacactagagaatttttactcttcctccatctcaaaaacttttaa 
aaaatttttctgggtcactcgagtttagagggtggaccgctggctaggcctgatgtcaagttcggagggtggacctctgggtcgggccaatgtcagattg 
caccagaaacattgcgtactactcccatgtccagaaatatctagatcctttctggttctcgcccaccaccagatatctgggaagccttaggcgagtcatg 
tttcttgcaccatgtatttctaggaatttcgctgacgtcaataaactggtttgtcatcctgttttatccagtttgctgacgtcaatagaccatatttggg 
catctcgcccacacagaaacaggatatgatatcatagatttctgggaagagggtgttttttgtgccgatctgggrgtataaaagagcgctgcggagaggc 
agaaacatcagacagacttgatctgtacagctagcagcagcagtagcagcagccaagagaagatcggacgcaaaccattctcgcagccatggcagcagca 
gcagtctcaggagaggggagaatctctgcagatctactcctgttggaacaactcaccccggacggagacgtgataagatacgactctgagcagtacacca 
aacctaggaagatctttggtgacaaaagtgtgatagagactattggacattttctcatccacaaccacaaccaaggtgagagttaccaaatcgcatcttc 
tgtgttggaaaaattccccgctctactcaattgcatatggaatggagagtcgggaggaatggctctatggaaggcattgtacagggctaaaaagtataga 
cttctcaactcacttttggtccacaagataaaaaattggccttcggttgccgtgatccctatctacgggtcagtgtgtgacagagaagaaaggcccatca 
tcatgagtgagattattgacaaggaaactcttcagaccatatgtaagagtgatatacgttctcttcttggaatgatgaacgccaagcatggcacattggg 

AGGTAATTTTTTACACTTCTATGCCCGCTCAACTAAACCGTTTGAAAATTTCCAATATGAAGCAATGGGAGCTAATGCAGTGCTAATGGCAGCTGAAGCT 
ATTTATGATGGATTCAGAGACCATGGCTTAAACCCATCAGAATATACTTTTCCTGGTTTGGAATCTGCTGATGTGTACGGAAACAATCCAGTGGAAATTG 

caatatcaggagatgatgacaatatgttgttgaacctcatctgcaactatggtgtatcttatgaaaaaactcgcggtcgggtaaatagatccttgttaga 
ttttttaaaaatgaacacagcttcaaagtgtctcagtgttttaaagtttgttgaaaaacactttaaaattgagtcaaatacaccaaaaggagaatttgaa 
gaaaaggccgaaacatgtgttaattgtcttgatagaaataatgtgttgactaaagggtctgaacaagaatcttacaagctttcttgtggacactttcttc 
atgtaaaatgtttgaggaatatttgtatagtatcacaacacctaagatgtgaaaaatgtctaaaaagatttgatgagagtattttgagaaagtgtacacc 
taacttgaattggtggttgactatgccggcaggtgctggaaatgaagaagaaatatgtttcatgagaaataagaaactggttgatgatttcagaaaattg 
ttgtcccctgtctcaattcctcatttcttcaaaaatagtagacagcgaaatcttgatatgttgtgcccctacagtgaccacacgataataccaaataaag 
aagatccaaagaaaaacgaagatggaaacagagtgagggtcaaccacacagccatcagtgaaaagcagaacaaggaggaagaagacgcgaggataaagcg 
tgtagccgtcaggacatttacagccatcagagaaaagcagaacaaggaggaagaagacgcgaggatcaagcgtgcagtcgacatggctgtcgcagccatc 
aacgaaaagaacaaggaggaagaagacgcgaggatcaagcgtgcagtcgacatggctgtcgcagccatcaacgaaaataacaaggaggaagaagacgcga 
ggatcaagcgtgcagtcgacatggcggttgcagccatcaacgaaaagaacaaggaggaagaagacgcgaggatcaagcgtgcagtcgacatggctgtcgc 
agccatcaacgaaaataacaaggaggaagaagacgcgaggatcaagcgtgcagtcgacatggcggttgcagccatcaacgaaaataacaaggaggaagaa 
gacgcgaggatcaagcgtgcagtcgacatggcggttgcagccaccaacgaaaagaacaagaaggaagaagacgcgaggatcaagcgtataattgacttga 
ctgttgatatgaggattcaacgtatagtcgacatggcaattgcagctgccactaaaaaggacaagaaagaagaagagaaaaggacaaaaagggaacaaga 
gttaagggctgatctgagaagggcaatggatatggtgaacgaagtacagaagaaacttgaagacatggaactagaaaaggggtgtaataaggatgaagcc 
aagaatactagtaatgttgttagtagcagtagtgttgttgcctattctaaagaaattgtaccttgtttaggaaataataataatgctgtcattggtatga 
ctagcaccaactattctgccaacaatactaagaataatgtatttggttcacctcataaattttccttcaacgatgcatctagattctccaatattgtaga 
aactcccaaaatgtctttcaatttctcgttcaagacataaaataaaatgtgtaacatgcaatactgttttttgtaaaacccttgtgtctggataaaatac 
cacacatctgtgaacatttcttgcacacgtaacactacgacttttcttcaactcacacatctattttttgcacatgcaagtaaaacatttcttcaaacct 
ctacagagagatggtatataaaggctttgtatctccatcatttatcactattcgtacgatgacgtccaacagaccaacaacatcccctctttcgttctct 
gaaggcttctcgctttcaggggataagtatgatacgtatgaggatatcttacttgaacaattcaactgtttcaagacatcttctccttcttctgctcgta 

AAAGTGAAATAGAGGATAAAACTTTAATCTTTCAACTTAAAGAAGGAGAAAAATTCCATCTTGCAAAGGGCATAGAAGAGCTCCGTGAGATTCTAGACGA 
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TAATTCTGCAACAATTGAACCTATTATTTCTCCAACGACATTCAATGACAGAAACGAATTACTAAACCACGAGGGAGATATATCCTCAAGTCCCCTATAT 
ACTCAGATAATGAAGCATATTTCACCAGAGCATGATATTTATGAATTGGACCTTATTGTTGGCACTGATTTGCTTTTTGGTCTAGGTGTGAATCTACGCA 
ACGTTTCTAAACTGATGAAGAAAATATCGTATGGTACTTTAAATGTAGTTGATGTGTGCCACAGAAAATTTTTCAACAATAGGATTATAGTTAATCCCAT 
TTCTTCGTCATTCTCAAAAAATGTGTGTATTATTCCTCTATTTTCTGCAGCTGAAGAATTTTCGTCTCTGGGGGAATGCAGGGATTTATTCAACGGTATT 
TGTGATGACGTAGAGAGATATATCAACTCTTATTTTTTCTACCCTGAAAATACTACTACTACTACTACTACTGCTCCTTCGTCGCCCGAAATGGAAATTG 
CAGATGAGGAAGAACAATCCCCAAAAACTATAAAGAGAAATGACAACGCAAGTAGAAACTGGTCTGGTGTCTGTTTGATTTTTGAAGTATTTAAAAACAC 
GTACTACATTATTAATAGAGGAGATAGAGGAGGTTCTTTTGAAAAGGCTGTGAAGAGTGCAATTTCTTCTATCAAGGAAAAGAGATGTAAAATCACAGAT 
ATTAATGGTAATAAACCTCGATTGGTTATGGTGATAACTGGGTGTTATACAGAATTGTACTTCAAAGATGCACTTAAACAGATTGGAGAAAACAGGCGCA 
AATTTTTGAAAATGAATGGGAATTACTTTTCTCTGATTGATGAACAAGCAGATCTAATCGAATTCGCGATGAGTGTTTCTGGTGCCGGGGAGAGGATTTT 
TGTTAACGGTTTGGGGATGTTCCAGAACCGTAAAATGATACCTGTAATTGATCCTCTCACATATGAAAATGTTGTATGTGGTGAGCATGATATACAAAAA 
GAAGATGCTATTCTTTCTGTAAGGAGAGCTATTGCAGACTATAATGACTTTGTAAGTAAGAACAAGAGAGGGAAGAAACGCAGCGCAGAAGAAGAAAATG 
AAGATGAAGATGCAGACGCTAGCAGCAGCAGCAGCAGCAGTCCTCCTCCTTCTTCTCCTCCTGCACATAAAAAATCACGTCTTCCGGATGAAGGCGAAAA 
ATGTACACTCTGTTAATTTTTTCAAACAATAAACTAACCACCTTGTATATTTTTGTTTGTTTGCAACACCCTTTTTATGATACCATTAAGACACGATGGC 
GGTAAACTTGGATAATGTTCTTGTGAATATCAACAACAAGGATGAAGATCTTACAAAACTCGTATCCGAGGCAATAAAGCGGCGAGCTAAAACTGTATTT 
GACACTAAAAATCAAGCAGGGTTTGACATGAGACGTCAAGTTGAAGCTGCATTATATGAAGCAATATCCAAAAAGAAAGAAAAGGCCATAAAGGCATTCG 
ATGAGCTCATACAAGAAAGAGGTGATGAAATTACACCTTTGACTACAATGCAGTATGAAGAGTGGGTAAACCGTACAATAACTCCCTCATTGACGACTGA 
AAATTTATTAGGTGATGTTGAGCACGCCGATTTTTTACTGGACCGAATGACACCCGTAAGCGAGGAAGATATTGAAGGTTTCGCTGCTT^CTACTTTTAAG 
GAGGTATCAGATTCAAAAACTGCAACAGTCATAGTTAAGGCAGATTGTGAAACGGGGGATATCGATGAAGTGTATAATCTTGCACCATCATTCGGCGTCA 
CTCAAGAAATTAAAATATATAGGTCAAACAATTCCTCGGAATTGGATAATGTCGCAGATTCTTTCCATATTTATAAAATTTCTGCAACAGATAGCGACAG 
TGGCAATACTAAAAAATTGTTGTATGGGTTAAGGAATAAAAAAGCAGGTTATACGTGTTTGTGTAGAATTTTTGCAGAAATTGAATCAGATGGGATTATG 
GCCAATACAAATATCGGTGTCGCTGAAAACAACAGAGATGAAATTGATGAAAACGAAGAAGGTAAATATGGTTTTTTAATACCCAAACAACCAGCTGGTG 
CAAAATTGATCATCTACTTCTTTTTAAATTGTTGGACATAAAAAATGATATTTTATACAATGCAACCCTTCCTTGGGTTTCTGGTCTTTTCTGTGCTTAT 
TGTAATTGTAATGACTGTATTAGCTGTATACACTGCACCTCAAATAAAGAAATCGAAGAAGAGAAA/^TTGAAGATGAAAATGAGGAAGAACCCGTAAAG 
ACTTTGGAAGATTTTGTAAAGGGTCGGCTCCTTAACGCTGTCAAGGAAAAACCTGCAGAGTACTTTGAGTTGCTAATATCTGCAGACACTGAAGCAGCAT 
TAAAAACTGCCGAAGAAACAGCCCTTCGAGATTTTGTTATTGAGAACGACTCTGTCGAAATAGATGTGGAGGAAGTACTTGAAGAGAAACCAAGAGAATA 
TGTCTTCAAATTGGCAGGCGCAACAAGCGAAACGCTAACAAACACAATCATCGCAGAGGTACAAAAAAAGGCAGCATTAATAACAGAAGAAGATATCACT 
ATTAAAATGTTAAAACAATTCAGGGCTGCGAACAAAGATAATAAAGACGGGGAAGCAACTCCTGAAGAAAAGGAAGATTTTACCAATAATTCAGATCTTG 
TGGGGTTGTACTTGAACGAAGTAGTAGAAAAAACAACAAATATTGTCATTAACAAAATATTCCCTCATGAGATGGTTTTTGAAAGATGTGCTATTTTAAT 
TGAAGATTTTGATACTGGTGTTGTGACTGATCAAGCCATTCAGATACCCTCCAACAAATACAAAATCAGATTAGTCGAAGGGGATGAACCTGAAGTATTC 
CCTGGTGACTGCTTGGATCTTGCAGTTTCAGTTGATAAAATAAACCACGTCTTGAAAATTTCTGCAAAGAACGGATGTGAAAACAACTGCTTCGTTATTA 
TTCCACGGTTCTCTCCTGTAGGAAGTGTTTCTTCCATGATATTGGGCAGCACTGACCAAGTCAAACCTAAAACATTCTTATTTTTAGCCAACAAAAATGA 
CAGTACACATTTTCAA'TTCACAATGGATAAGCAACATTCTGTAGGGTGTGAGTTGGACATGTTAATTTTTTCAGAAAGGAACTTGAGGAATTTACCCGAT 
TCAAAACCTAGACCTCTAAGTGATGCAGACATATTGGCCTCATATGGGAAGCGTCTAGGAACTGGTGTTTTCACAACAGAAAATTTGGTAGACGATTAAA 
TAAAATTTATAACCACTTTATCTGTAGACTTTTATGTCCTTTGTGCAACAAAAATAAACATGTCTTCTTCGTCTTCTGAAACTCCTAAGACTTCCACCGA 
TACTGGGGAAGAAAGGATTAAAGACATTGTAAATGCTCTAGATAATAATGGCGAGTGGTTGTCTTCCTATATTGATCCGATTATCAATAATCACATCTCA 
CGAAAAACGGCAGAAACTGTCCAAAAAATCAACCAAGAAGTTGATG/IACGGTACGATAGAAAAATAGCCGACAAAATCAACGAAATAAAATCATCCATCT 
TTACAAGTGCTCAGACTATGTATGACCAATATGCAATAGACACATTTCAAGAAGGAAAAGGAGCCAACGGGACTGGACCAGTCATGGGGCCAGTGAACAC 
GGTTATCGATACAACTTTAAATAAAATGAGGGGAAATATGCTCGAATACGCTGAAGATATGTGGGACGGAGATGACTGGAAACGATTTTCCAGTTCTATG 
ACAACGCTTGAATTTGATCTAAGTTACTCTGATTTAACTATGATGCGTGGTTCTGACGGGTATTTTGCATTCCCTTTCCGTGGAACAAAAAAGATAAAGA 
TGGACGGTTCAAGAAAGAAAGAAGACCCAATTAATTGTATCATTTCAGTAACATATCCAAACAAAGTAGGGGATGAGTGGGAAGAGGGTAAAGAACGTGA 
AGTGAATTTTAACCTAGAAAGAGTAGACGACTATGAAAGAGATATCCATGTTTCAATTTTGTGCATGTTACATGCACAACTTGATAATTTCGAACAAGCA 
TTAGGAGAAAATGCAAACTCTTTTTATTTTAAAAAGGGGCAAAGAGTCATGTTCTTACCCAAGAAATCTAAACTGTTCAATAGACCTACTGTAGAAGATT 
CTGATATGTTTTCTATAATATTCCCACCTGCATCTGACCAAGATTTTGCAGATGATATTTATTATCGAATAATTGTAACATGTTCATAATAAAATTTGTT 
GAAATAAAATACAGTAATACATTCAAAATCTTTTATTATTTTTCCTTTCTAATCCAACATATATAATGGTGATACATTTACAGGTGATACCTGAACATAT 
TTATAAAAATCAGGAATATACATAAACATGCTATTGTAAGTTGTATTCATATCCCTGGTGATCATCTCGTGCCAGATGCCCCTCAGCGAGATGGAGCTTG 
CTGTAATCTTTGCCCTGTGGAGTGCTTGTTTGTATGCGTCTTGAGCCCTCTTCATCTCATGTGCTGTAGTTTCATCCCCCTCACAAGGCTCCATGATAAT 
ATTCTTTCTGGCCGAATTATACTTGTAATCAATGAGGTATTGCTTGAAAGGCCCAGTCAAGATGTCGTTGGTTGTAGTCTGGATGAGGTCACATTTCTTG 
GCAACATTCTTAAAGGCGGCAGCAGCAGTCTTGGCAGACATTGAATTCGCTTCAATGCTTCCAAAAATATCCTCATTATTCTCGTGCACCTTCAAGTAGT 
TGCCACTATTTTCTCCCACGCCTCCGTCAACAATAAGGTTGCGAACGTATCTGCGGCTGTGTGTCTTGTTCTCAACAACTTCCCTCTTTAGAAGATTTTC 
AACCATCTTAGTGTACATGTCAGATCTCCAATTGTTGTTGATGTCGTTATGTTGCCCAGAAAAGTTCCCTGGCAAACTAATTCTGAAGGTCTTGTAGCTT 
CCATAAGGAGTAGTGTTATTAATGTGGTTAATGGGCATAGATTCACAGAATGGAATAGCCAGTGTATGGACAGTAGCGTATACCTTGGCAGTCACATTAT 
CTTCACATAGAGCATTAATAAGAGCACTCTTCTTGCTCTCACAATAAGGCTTCACAGACTTGTCGAAAGAGGACCAGAAACTCTTATCATAATCTACCTC 
CACCATTGTVAGATTCGTATGTAGTGGAACCCATAGAAAAGATAATAAGATTGTCCACATCGACAAGTACTGAGTAGTACTGGCTGTTTTGCAGATCAAGT 
CCCTGATCCTCCTCATCTTCATCCTCATCCTTCTTCTTTGTAGACTCGATGTGCACACAATCCTCAGCAGCCTCAGTTCCGGCAGGGAACAACTTAAGAG 
ACTTGAGGATAGTGGACATGTTGAATGAGTTGGTGATGCATCCAGAGGAGTGTCTGCATAGAGATAGACAGACAGGAGTTGATCTGTTCTTTGATAGGAG 
AGAAACATCCATCTCTTCCTCAGATTGAGGATATTTGGTCACTGAAGCAGATGGTCTGAAGCTGTTGTCTGAAGCCAGGTAGAGAGGGTTCCCATCGCTC 
TCGTCGAATTCGCACTCTGTTACATCCCCAAACACAACTTCTGAACACTGGGAAGTTGCACCATCATCATCCTCATCTTCTTCTGAATCTTCACTGCTAC 
TGCTGATAACACTTTCCTTGTCATCTTCATCCTCTACTACTGTTGCTGTATCCTTATCCTTATCCTCCTCATCCTCTTCCTCTTCTTCTTCAGCAGCAGC 
TACAACAGTGACAGATGAAGACGAAGAGGAAGAAGAACACACTGATGAGGCGTCGTCATCATCATCATCGGAGTCTTCTTCTTCTTCCTCTGACTGCTCA 
CCCTCTTCCTTTTCCTCCTCTACATCACTGAGACCGATAGTTTTTCTAATAAGATTCTTGATGACATTTCCAACTGAACTTTCAATAGTTTCATTGTTGT 
AATCGTAATCAACTTCCTCCTCGTTTTCATCTTCGTCTTCATCCTCACTGTCGCTGTATGCATATGCTGACTTTTCGTCATCCTCTTCTCCAAATACAGT 
ATTGTTCAGTGACTTGAGGTTACTCTCCCTCTTGGCCT^CCACTTGTCGTTCTTGAAGCAACAACATCCATTTCTCATAGATTCAGTGACACTTTCTTCC 
ACAATTTGGTTAGTGGTCTTGAAATGTGTTCCTGCCTTCTTGGTAGACGAAGAACTGGGAGTTGACATGACCAATCCAGTTGACTTGATTGCACTATCAA 
TATAGTTTCCTCCAAGTTCCTTAAACAGTGACTTATCTCCATTACTTCCAAATATCTGCCTCTTCTTCTCCAAATCCATCTCGTCCACATATACAGCATC 
AGAGAGCTTCAATGTGAAGAGAGGGCCTCCCTTATCAAGGGCCTTCTTGTTGAGTGTTTCAATAGTCACGTGCTCATGGAACTGGAGATTCATACAACCA 
AGGAATAGATTGGCCAGTTTTGTAGTATCAGTGGCGATTGCAGTTTTCTTCACAGCTrCTCCCTCAACCTTGACAATGTTGCTAACCAGATCCTTTCCTT 
CAAATGCCGACTTGACCTTTGAGACACACCCATCACAAAAGGTGGAAGGGGACAGTTCTGTGCGTCCAGAGTTCATCACACTATTGAACAGTACCGATTT 
CTGCGCCTTCTTATTCTTCTCCAATTCTTCGTCCTCATCTTCGTCATTCTCCAGAAATTCAGCAACAGATTTGGCCAGAACACTCACGTCAAGATGAGCT 
GGCACCTTGCCCATAAGAGCGTCCAGTTCTTGAAGAGATTGCTGCTTCTTCTCGGCGGGTGTGATTTCTGGAGGAGTCTGCTGCTGCTGTTCTTCTTCCT 
CTCCTACAACTTCTTCCACAGTTTCCGTGTGTGGACTCCTGATAGTGTTCCTCCTCTGCTTAGAGACCATTCTTCTTCCAGCAGCAAACTTGGCCTTGGA 
AGGAGTTGGTTTCTTGGCAGAAGTAGTAGTAGTATCCTCAAATTCATAAACTTGATCTGGTTCCTCAGAGAAAGTGACAGTTGGCTGCTGCTGCTGCTGT 
TCCTTGATTGACTCCATTTTTAGAATATTGGGTTATTAGGATTCAGTAGTTCTGTTATTCCTAGACAACACGTCTTGTCTGTACGATTAGTCGAGATGAT 
GTGATGTAGAGCCCTTGCCCGTGCCTCAATATATACCCCTCAGTAGACCTTGATAATCACGGTTGCCAAACAACAACGGTTGGAAATCTTAGTAGAAAGA 
ACCAATGGAACAAGTCAGGATAATAATATACAGTATAAGCTTTTGGGGAACAACATTCATTTATTGTCATAGCGCTAACATTAGAAGTACAAGAACAGGT 
GATTCTGCATCCAGCTCATTTTGTTCAACCCACTTCATTTGCAGGGGACAACACTCTTCTTTCTTGAAAAATATGGATGAAGACACTTGGAATTCTTCAA 
TAGGCCCTGGGGCGTTGCTGGAGGAGGACGCAGAAGAAGAAGAAGAACAGCTGCCGTTGAACGAGTTTATAGTTTCCAGAAGACAACATAATTTAGCTGT 
AATTGGACGAGGAGACTTCTTGTAATGAGAGAACCACAAAATTCTTCTTCCCGTCTTTTCTGGTATAAAAGTAGTCCCATAATATGTTCCTTTCTGGACT 
ACCACATTATCTTCGTTTAAGGGAACATATCTGACCGACTTTCTCATCAAGGGCTGATCTCCATTTCTTGTTGTGCTTAGAGCCGATTTGTATTCCCCGT 
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CGTTCACGTTATAGTTGCATATAGAACAGTGGGATGAAAGGAACTTGTCGGTTACGTGTTCAGTTTGTACAAGATAACTGGACGCATTTGCTAGTTCTGG 
AGGGACGTCCTGGTTTTTGTTTATTGTTGAAGTAGGTTTTATCTTTTTCCTATTTAAGCACACACCCTTTGAATTAATGTGTATGCGATTCGCATTGATG 
TATATGCAGATTGTGAGTGCTGGAGAAAATGGTATCTCAAAATCGGTAGCTGGTTCAATAATTTCTGGTTCAACTTCCTGTTGTTCTTCTTCCTCATCTT 
TCTCTTCCTTATTTTTGACACCTAGAACTCTAAGGTTGGCAGTACTAATTATTCTGGCTGCAGCCGTGCTGGTAGCAGAAGGGGAAGATTGTTGGGAGAT 
AACAGGAGACATGTTGACCGTGCCGTGTCGTGAGGATAAATGGCTAAAGGTATATACGGCATGCCCTTTTTTCCGTTACTGTGTGCGAGGTCGTTTTCTA 
GGACCGTAGATGACCATCTGCCTTCCTCTCTTTTTCTGGACAACCTTTAATTGCCATGGTGTTATGAGAGGTAAACTCGCCTCATAGAATCCGCAAACTT 
CCTCTAAAGGGAACAAGTGTTTCCCCATAGGGGACAATTCAAAAATATAATTCCAACATATTAAGGGTGTTGTTTTATTTTCCTTTTCATCGTACTCCGT 
TATTGTTTTACTAAAATTTGGAAGGTGATTTTTTGATAGACTATCGGACATTATATCCGCTGCAAGTTGACGTTTATGTTCAATGACATAATCATTCGAA 
TCAAAGTTGTCGTGTGTTGACACCATACCATTCCATTTCCATTTGGCTAGTTTGGAGAGGGATTTATTGTTGCGCAATTTTAGACGTGCAGAAACGTTTT 
GCCATATTATTTTAGAAAGTACAGGCATlAACAACATCCCTCACGTCAATACrGTCCATTAATATATCAGTAGAATCATTGAACCTTATAACATCTCTCTG 
TATATCTTCGTTTGATGGGGCCAATCGAACTAGAGGAACTAAATCCCCAGTrATGAGATCGGTGAAAATAAAGCTCTCGTCATCAACAATTTTTTTGGAG 
CCTATGTACATCTCACACTTACTATAAAAGTCATTTTTATACCCTGGTTTCTTTTCCCACCTAGAACATAAGAGTATTAAAACTCTCGACAAATTATACA 
TGGCACCTACTCTTCCTTCCATATTACTGATAGCTCTATACATATTAACCACGTATTGTATAGGTGCTTCTACGTACAAATCTGAAAATCCACTAGCGGC 
AGTTCCTTCACTAAAGGCTAGAACTTGTTCCATAGTCTTCTCTTCTGAAAACATTTGCTGAATTATTTTAGTGGATAGTTTGTGCCGCTTATTGAATTTT 
GTGCCAGGGTACTGGGACCCGAATTCGTTGAAATGTTGTTGTTGTTTTTTCCCTACTTCTTTAGTTTCATGCCAGGCGGGGTGTTCTTGAAGAGCAGCCA 
CCTCCAGGAACAAATCACAGATGGCTCGTACTCCATCACATGTGCATTTTACAGTTTCAGGGAAATTTAATCCTGTTATTTTATAAAACAACTCTCTGTC 
TATTATTTTTCCCATTCGTAGGCGATTATTGACATTAAACCCACAACACTCGCCTCTGATTCCCATTATAAAGGCTAAAGAATATACTATAATATTTCCC 
GCACTTGAACCCACGAGTCTAAGAGTTGTATATTCTCCATCGCAGTTATAGAGACATCTTGCCAATTCGTCCATTGATTTCCTGCCCGTGAAGCGCACAT 
CCATGAGTCTTGCTCCTCCATATCCAATTTCAGCGTTTATTGAAGTCCTCCAACCTTTATCCTTGTTTAGGCGCTGTATCTTTTTCGCCTTCTCTGCCAA 
TACATTCATAAAAGTGTAAAATTTAACGTCTCTTACAGATTCCATAATTTTTTCCTCAAGTGTTAGCGGCGGCGGTGGCGGCGATCCTTCTTTGTAGCCT 
TCAAAGATAAAAGAGTGTTATTCAACTTACAGTGTTTATTGGCCCAGCAATTCTTTTCTTTCATCAACTGCATAGTAGAAGGTCGATCGGAAAGAATGAC 
CAATTTGGAATCCAAATCCCTAGGTTTATCTAGTCCAGCTTCTATGTCCATACTCTTTCCTCCTATTATCATATAGTCTTCAAACAGTGTTTTATTAGCA 
GACATGTTAATCATTTCAATTACAGTCTGGAGGCATGTTTCATGTCTATTGACCAGATCTTCATCATCTTCATCCCATGTCACACCCACCAGAATATTTT 
CTCCTTTCTTGAGTTTGGATACATTTTCTGTAATCTTTTTCATTATCCGTTGTTTCTTGGACGCCACACTTTCGTAGCATAGCGTCGGTAGGGGAGAGTC 
GCAATCCACAACCAAGTATTTTTTCATGATTAATTTGTTGAGGATTGTGTTCTTTGAATTGAAGAAAAAAATCATTCGTTCTGTACCATGGAGACAACTA 
TGGATAATGTCGTTCAGAATAACGACGTAACAAAACCAACACCAGATGTTGCTACTGTTACAACAGCAACTGAGAAACGTCAGTCATGCAAAGAGAAAAA 
GGATCAACTTAAGGCCGAATGTCCTCAAGTACTGAGAGCACTAAAATTGTCCAATACTTTAAAGGCAAATTTTGGAAAATCCATGTCGGCTATTTTTGCT 
CAACATTTAGTGGACATGACAAACGCTAAACACTTTAAGGACCCAAAGACAAAGAAGATTTTAGAACTGGATGGAAGTAGTAGCAGTGACAGTGAAGAAG 
AGGAAGAAACTAGTTCTTCATCCAAACGGAAAAGAGGTAGTGGTGCTAGAAGTGCTTCTTCAAAGAAAGAAAAATGCCCCAATACTATCAAAAATTGGCT 
CAATGATGCTCAAGGTGTATTCCGCCAGTTTGCAGATATCATCATTAATCTTCCCTCTTTTGATGATCTTAGAGACGAAGTAAAGGATGAACAAACTGAG 
CTAAAGACCATATATGACTTGTATAGACAGGACATGGAAAAGGTGGTGGAAGAAGTTTTAGGGCGCCAAGACCTGTTTGATCACAAGTCAGAAATAGCCA 
AAGGTTTGGCCCGTTTCGATACCCACGTCTCGTTGCTCCCTTCGGATAGGTCGGCTGTTCTAGACTCGTCCATATCCAAAGAGTTGG7U\AAAAATAGCAA 
GGGCCCGAACAGTAATATTTTTGACACACTAAACACACTCAAGGAAGAAATCAAAGAACTTTTATGTCATCATGTCAAATATTTATTGCAA7VATCTTACA 
CCGGAGGATGCAAATTTTGTGTTCAATAGTTCTGTAAAGTATGTAAAGAAATCATATCAATACTACATACAAACATCAGAGATGGAAAGTGATGAATTTA 
AGTCCCTTCTCACTGGAGTCAACATTAAAATATTGGAGAAGATAATCTCGTCCGATAATAATGTTGCTACTCCTTACAAACACATCACTAATCCCAGGAA 
CATTATTTCGTCTTTACAAAAAGTACGTGAAACTAAACCTGTTTCAAAGGATTATCCGTTCAGAGTGGATACGGCCAGAGATATTGTACTACTTCCAGAG 
ACGGGTGGCATTTCTGATCTCCCTATCAAGCCCGTTACATTATTGCAGTTGGTGTCCTACATCAACGCTCTCTTTTCCCTCGAGCGCCGAAATGTTTTCA 
CCGACGGCTTTTTTAATGCAGCGTGCGTCCTAATTTCCCAGTGCCTAACGAATGCGAACCTATTATCTAACGACTTTCCTAAACCCATTGAACTGGCAGC 
TAATGTTACTCGCCATAATCTTCTGAGTATGAAAATGCTTCAAGAAGGTTCATCCAGTGAAAAGAAGAGTAAAAAGAAGGAGAAAAAGAAGGATAAAAAG 
AAGGGCGGTGGTGGTGGTGACGATTCTGATTCAGAAACAGATTCTTCCTCATCATCATCATCTTCTTCTTCTTCATCCTCCTCCTCCTCATCCGAAGACG 
AAGAAGAAGAAAAAGGAGAAGCAGTAGAAAAAGGCAAGAAAACTAAACGCAAAACAAAAAAGAAACCATCAAAGGACGACGATTTAGATACAATTAGTAA 
ACTGATTCTAAAAACAGGAGGTTACTTCCACGACACGAGTGAACTCGGCAATAAAATTAGAAATTTAATAGACAAGGATGATTTTGCGGGCGTAGCCCAA 
T ATGC AGTAACA AT C AC TG AG ATG C AATCT ACGC C AATG AATC AAAG ATT AGTATCT AGT C TTT T AG ATT TGAT AATG AG ACT AAAAG AAC AAGT AAAAT 
ATAGTGTTGATACTGAAAGTACTTCCTCCACCGCCAAATCTAATAATGCTTTAGATAGTGCTAAATTGACATCTCAACAAGTGGTTACAATGATGGTGGA 
TTCTGGAGCTGAATTGGCAAGGCTAGCTGCCTTCTTTTTCGTGGTGGTGGATAATACTGTAGTTAATCGCCATGAAGCATTCATTCTAACATCAAAACTT 
TTACCCTCGAATGAAAATAGAGGACTTAAAACTGTTGTAGAGTCATTCTTTAAAAATTTAACAATTAGTAACAAAGTCTCTACCTCGAATGAAGAAATGA 
TGTCGGTGATGCCGTTCGAAGACGAACAACAACAACAACAATGCCCTCAACATGAACAACAACCGGATTTGAAAAGAGTGGTGGGAGAAGTATTTCTAGA 
AATGGGAAAATCAATAGTGAACTCATTCCCTTCCAATAAGAGTGTACAATTAACGGCTGACGCGTTCAAGCAAAACTACTCGCCTATGGGAAGACGCATA 
AATTTGGCGGCCAAGATAAAAACGGCTATATCTATCGGATCTAATATCTCGCCCAATATACTATTTTCTAACCTCCCAGAATCTGTAGGGAATAATACTG 
TAACTGGTCTAAGGCTAACCAACCTATTGAAAAACATATCCCAAAGTGCCCAAGCTAATAATATAATCAAAAATGCCAACACGCTCGTAAACAATACGAT 
GGACCAACAAAACTCAGCAGCCATGTCCATACTACTCTTTCCTCCAACATCAAAAGAAACTTCTATCTTTCCCGGAAACGATCCTTCATCTATAAAATTA 
CAGGATATGACTACAATGTCAAATCTGGCACGAGGATTTTATTCCATCGCAGAAGGATGTATCGGGGTCGTACGCTCGAGGGAATTTGATGAAGGAGGAG 
TTAAAGCGTACACTTTACTAGTGGACTCAAATACTATGGACATGGCAGTTAATTTTGCCGCTCAGTCTCTGGAAAAATCAATGTCTGAAGCCTTAACAAA 
TAATGCGAATATGAACCCTTCTAATGTATTAGAAGGAGGATCTTTTGTAGACGGTGCTCTTTCTTACATGTTTGAAAAAAATGGCAGTGACTGTGAACCC 
ACCCCTCTAGCAAAATATACAATGAAAGATGTATCAAATCGATACTTGAAAAAATTCAACAATGATAAAAATACACAAGATTTATATAAAAATAGAGCAG 
AAAG AGC ACTCGTC G AAC AGGTT AC AAAT AAGCCT AC AAGTGTTGTGC ATTCCC AGC T AGC AAACGC C AT GGGGGTAGC TGTT AT AGGCGCAGC ATC AAT 
CAAGCTCATGGAAGCAGAAGCAGCGGAATCTGAAATGAGAGCAGCAAATTATCAAGCAACATCAAAATCTACAAATGCTATTAATATAACAAATACAATT 
GGAATGATACGTAATACTACTCACCTGTGTACGACCATCGCCGTGAGTGCAGCCGCCGACATGTCAAAGCTCGCCAACAACCATTTTATGAGTGTATTAA 
ACACTGCAAATAACAGTCATAGCAGCAGACGAGGAGACAGATCGTCCCTGATGTTGCAGCAACAACAACCAACACATTCAGCATTTCTGGAACAGACAAG 
AGGAAGAGGAGGAGGAGTATTAGGATCAGGAACTGAACAGACA7VAGGATCATGTGGAACGTATGAAGAGAGATTGGATATTAAACATGATATCTCCAGAA 
6ACAAAAATACTACTACTACTACACCCAGTAATGCCGGCCGGACATTAGGATATGGATCTAATATAACTGGCATAAACACGATAAAACAAGACGACAAGA 
GTATGATGGATAAACTTTCTGAGATGTCTAGCTTCAGAACTTAAAGTGCCAATTTTCATATAAAAATGTAGTATTATGTGGTCGTTTTTAAACTTTAAGC 
GTTTGTGGTTTTGGGTCGTTCACGGTCCAAGATGTAAGAGTCCCCTTGTACCCAAATTATAACAAAGTGTCAGTATGGTCAATGGAAAGTATCTAAAAAT 
CCACCCGTCTATCATTCAATATCCCAACTGAACCAACCAACCCCAGAACCAACAACATGTCTCTTGTTGAGAATAATACCCAAGAAGAAATGATTCTGGA 
AACTACTGTTGAAGGTGTTGTGGAGGGAGCAGAAGTTGCCCCTAGAGGTGTCAAGAGACCCCTTCCTTCTTCCTCATCTTCTTCTTCAGCTTCGGATAGT 
GAAGATGATGAAGGAGGAGAGCAACCACAAACGAAACCCCCAAAGAAGAAACGTAATATCAATAGCGGAAAGTACTGGAAAATCGAAACAATCGAACCAG 
CATCTCCAGAAATGCTCAGTGCTGTAAATGATATTGATAACGTGTCCAAAACTATTCCTCTAATCGATAATAGTTTTGGCGTTCAGTTTAAAAAGAGTGT 
ATCTGAAGAACAGATCAAAACGCTACTCACTGAAACTATTGCCGTGGAATATGGAACAATCACAAATGTTAAATATTCAACCTTTAATCAACTCGAAAGG 
ACCGGAGAGCCTCTTAAGAAGAAGCGCTCAAACAATGGAAATAACAACTACAGATACTGGCAAATTCGAATTGAAGCCGCTGCAGCAGAGAATGTTACAC 
AAGCTGTTTTGGACGCAATCGTTGAAGGAAATGACACCGTGATCAAGGCAATCCTCCTCCCTGAAGGGGAGGGAATCGGGCTTCAATTTAACAAGAGTGT 
TAGCTCTCAACAAGCTAAAAATATTGTCCAGGCTGCCGATATTGAATTTGGACAAGTTGCGCACATGAAGTGTAATTTGTTCCACAAGATGGAGAAGGCC 
GATGAATCTTCTAATTCTTCTGGTGAATCACCAAAGGTTAAGAAAGTAAGGAGGAACAAGTCCCAGCCTACAAATTCTTACTACACTTTCACTATGATCG 
GGGATTCTCTTCAAGAGCGTATTGATAACGCAATCAAAGTTATCGAAATGTCTCCAGTTAAGAGGCCATTCTCCAATTCTGCTGCTGCTGCTGAAGAAGA 
CACCACTACTACGACCACTACTTCCACTGGTGTTGTTAATCCACGAGGGATTAAGGATATTCACTTTTTCGATTCATCCATTTCTAAGGGATGTTTCACT 
GTGAGGAATATTGTTGCAGCAAATGGTGAAGTTCCACAAGAAGAATTCGTTTCTGAGTTGTACACTAACCTTCTAAAGGTTGAAGAGAAGGTGGATCATC 
CTACTTTCAAGAAGCTTATTCATGACCGTACAATGAATCGACACATTAAGGCATGGTACTGCATCTGCCCCTACTACACCACAGGAGGCGTCCCCCCTGC 
AGCTGATAAGGTTTCCGCTAAGGGAATTGCTACCTACAGAATATACGAGGACAGGACTGGAGTGTTCCAATTCGATGGGGCACATACATCTACTACCCCA 
GCACAGGCAGCCGAGGCAACTGGTGCTATTCACAAGTCTATGCTCTTCCAGAGTCCTGGAACTGATATTCAAAAGTTCCTCGATGCTAAGAAGGCAGAAG 
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GTTTGGAGCCTATTTCATCGGGCGAAATTGTGTACCGTTCAAAGTGGAGCCCCAATGATAGCAGGGCAACRCGCTGCTTCAAATTTTATTCCTCTTCAGA 
CGAGAAGATGAACATTGCTGATGTTTTGTCTATTGTTCATACTGATGGACTTTTCAGTAGTGTACACTTTAGAAAGGATACTATGGAATATGGTGTTGCA 
AAGAGCAAGAGTAAAATTATCCCCAAAACTATCAAGATTAAGAAGGGAGGAGATACTTTCCATAGCGAGGAGGATATTGAAGTGCCTGTAAAATTCACTG 
CAATCACTTCGGAGGAACTTAATAGGGAATGTAACACCAAGGGAATGAACAGCCTCCGAGCACACAAGAAACGAAAGAGCAACTCTTCTACTACTACTAC 
TTCCACCACTTCCACTTCAACAACAGCAAATACTCCAAAGAAGACGAAGAAGAGCGCTTCTGCTGCCAGTGACCCATTTGCAAAGCTTACTTTAGATTAT 
GTTGATAGTACATCATTTGTATTTTACAATATCAGTAAGGAAATGGTGCAGAGAATTTTGGCTCAGGAGAGAGTAAAGACGCTAAAGGCAGTCAAGAACG 
AAGAGAAGATGGAAATTGTAGAGGGAGAAGAAGCACAAGAAACTTATAGAGGAATCGTAAAGATCAAGACGAATGCAAAGGCATACAATCTTGCCAACAA 
AACATCTGGAGTACTTTTCCCTGCTGATAAGGTGTGTCTAAAGCACACATTGGAAGATTTGGGGGATGTACTTGATTTTGATGTTGTAAGGGAGGATAAT 
GTTAATAAGACGGTCGCTTCTACTACTACTACATCATCTGAAAATAAGGCAAGCGGAGGAGATGATGAAGAAACTCCAATGGAATTTGAAACTGACGGAG 
AGAAGCTGTTGCACGAATTGTTGAATGAATAATGTTGTTGTGTTGTATTTTGTATATAGAGAAAAGAATAAAATAAAAGGTTAATATAATGTTGTCTTTT 
ACATTTTCCTAAACTTGATATAATAAACTATATAGCAAATATTAATGCTATTGACACATGTTACTAGTCCCTAGTCAACGTGTAATTGTACATAGTAACA 
TGCCTTTTCTTAGGCATTTGAGGCAAATAGGAAAAGAGTATTTACGATGGGTGACCGTCGTGATAACGCCTACACCAGGTGACAGTATAGAGGGTTTCAG 
CAGACTGCAAATACTCACCTAACAGGCCTTGGTATAGAGGAATCAATAAAAAATCAACAAAAACCTTAAAAGTTTGGTGTTTCATTTTTTTACATCGGGG 
ACAGAAATGGGGTTAAAAACAAATATCCGAGCTGCAArCTGACATTGGCTCCCTACCCAGAGGTCCACCCTCCGAACTTGACATTCAGTCGACCCAGCGG 
TCCACCCTCTAAACTCGATCGAGCTGAAAAATTTTTTGAAAAGTTTTTGAGATGGAGAATGAGTAAAATTCTCTAGCGGAAACAGTAGGTTACACACTAC 
CCTTTCTGGGTACGTTCATATCCAGAAACGCCTGTTCCAGAAACAGTAAAGAGTTAATGTATGTTTCTGGAGCAACAGACGTTTCTGGTGTAATCTGACA 
TCGAGTCGACCCAGAGGTCCACCCTCCGAACTTGACATTCGGTCGACCCAGCGGTCCACCCTCTAAACTCGATCGAGCTGAAAAATTTTTTGAAAAGTTT 
TTGAGGAGGAGAGAGGAGGTAAATTGATACAGATAGAGGTATCTTTAGTAGAGGTGGTGGTCCTTCTAGGAGCTGGTGTCTCCAGAAACGCCTGTTCCAG 
AAACAGTCAAAAGTTAATGCACATTTCTGGAGCTATCCCATTTCTGGTGCAATCTGACATTGGCCCTACCCAGCGGTCCACCCTCCGAACTTGACATTCG 
GTCGACCCAGCGGTCCACCCTCTAAACTCGAGTGACCCAGAAAAAATTTTATAAAATTTTGAGTCGGAGATAGAAGGGAAATTCTCTAGTGAATACAGTA 
GGTTACACACTACCCTTTCGGGGCGCAGCAAGTCTCCAGAAACGCCTGTTCCAGAAACAGTCAAAAGTTAATGTGCGTTTCTGGAGCAACAACCATTTCT 
GGTGTAATCTGACATCGAGTCGACCCAGAGGTCCACCATCCGAACTTGACATTCGGTCGACCCAGCGGTCCACCCTCTAAACTCGAGTGACCCAGAAAAA 
TTTTTATAAAATTTTCGTCAAGGGAAAACAACACTAAGTTGCTTGTCAATTCATACACTAGTGCAGGTTGGTTGAGATAGAGAGTGAAGGAGGCTATCCA 
GATATCTGGATCAACCAGAATCCAGAAACGTTTCTATCCATTTCTGGAACAGCCATTTCTGGAAAGGGTTACATTTTATTATATTTGGTATATTATTTCT 
GGTACCACTTCTGGCACATTCGTGCAATCTGACATTGGCCCTACCCAGCGGTCCACCCTTCGAACTTGACATCAGGCCGACCCAGCGGTCCACCCTCTAA 
ACTCGAGTGAGCTGAAAAAATTTTTGAAAAGTTTTTGAGATGGAGAACGAGTAAAATTCTCTAGCGAAAACAGAAGGTTACACATACCCTTTCTGGTTAC 
GGTCATGTCCAGAAACGTCTGTTCCAGAAACAGTCAAAAGTTAATGTACGTTTCTGGAGCAACAGACGTTTCTGGTGCAATCTGACATTGGCCCGACCCA 
GCGGTCCACCCTTCGAACTTGACATCAGGCCGACCCAGCGGTCCACCCTCTAAACTCGAGTGAGCTGAAAAAATTTTTGAAAAGTTTTTGAGATGGAGGA 
CGAGTAAAATTCTCTAGCGAAAACAGAAGGTTACACCTACCCTTTCTGGGCGCAGCTATATCCAGAAACGTCTGTTCCAGAAACACACAAAGGTTGATAT 
ACGTTTCTGGAGCAACAGCGTGACAGTAGTTTCTACTAAGCCTTCACTTGTACACATTTCACTTAGTATAATTTGTCATCACATGGTCGACAAAAATTTT 
ATAAAAATTTTTCTGGCTCGCTCGAGTTTAGAGGGTGGACCGCTCGCTCGGTCTGATGTCAAGTTCGGAGGGTGGACCGCTGGGTCGACTCGATGTCAGA 
TTGCACCAGAAGTTGTTGTTGCTCCAGAAATGGTGGGGGAGTATTCAGAGTGTGTTTCTGGAACAGGTACACATACCCAACTGATCAGATAAAAATTGGG 
TGTATACATTTTCTGATCCGAGAAAGTAAACCAACTGAACGGTGTAAAACGTACTGAAACATTACATCACCCACAGTCGTCCTCCTCCCTTCTAAGAGAG 
AACATATCCCGTACCCAACTACCTCAAACAACAACAACAACAACAACAACCATGACATCCATTATGGACGATCTCTACATGAGCTGTTCTTCTTCTTCAT 
CGTGCTCAAGTTCATCTGAAGAGGAGAATGAGGTAGGAGTAGAAGGAGGAGGAGGCAGAArCGGACCAACAGAGGCCAAGAAAAAGATCCTCCGTAAACG 
AAAGAGATCTTCTGTTAAAAGTACATCATCTTCTTCTTCCTCATCATCATCATCAGACGATTCTGACTCTGATAGAGAAGAAAAAGAAGGAAGAAAACTA 
TACGTGGATATTGCAGATACAAGGAAACCGCCCAAAGTAAGAAAACTGGATACTCCTTCACAAACTTTAGAGAACGACCTCTACATGTCTAGTTCATCTT 
CTTCTTCCTCCTCATCTTCCGACTCATCATCGTCTTCTGGTGAAGAAGAAAGTGACGATGATGATGATGACGATTATGACCCAGATAATGTCCATGTTCT 
TGGTTGTAAGAAGGAAAAATCTCCCCAAGACATAGAAGCTGAAAAGGAAAAGGAGGAAGAGTATGAAGAAGAATTCAAGAGAATGGCATTACCTTCACGG 
ATAAACACATCTGTAGATGATTGTGTTATACCTGATCGGATTTTAACCCTCTTTTCTACACTTTTGAAGAAAAATAGTTTCCAGTTCTCTCAGCCAGTTT 
CTTTCCTCCGGTTGGTGATGAAGCAAGTAAACGAGGCCATGAATTCAGCATTCTCTTCCATGTTATCCAGTTCTGGTATGAGGTTGGTGGAGGACTCTTT 
GGGTGACACATCTAAAATTTCCTCCTTTATAACACCTCAAACGGATACTAGTAATTCATCTTCATCATCGACGTTCGTCAATAATTGTACAGATGAAGAT 
ATCAAGAAGCGTAACATTGCAATGGGAAGAGTAGCAGAATTGCTGTCAAATATTGCAGCCAGTTCTAATGAGGAGAATAATTTCCGCCCTGTAGTTTCCT 
TAATGCGAGGACCAACTTGTGGAGGTTCTAATGCATCCAACAAGAAACTCAATAGTAACAGGCAAACTATTCCCCAAGTATTAAACAAGGTTATATTTTT 
TAGAGAAATACACAGTGTTATCGCGCTATATTTATCGTCCGTCTGTGTGCAGCGAGCfiATGAATAATGACAACACAAATTCTAGCGGATACGCTGAAGGG 
ATGGTTACTAAAATCTTGAATATTATTGGTAAAATTCCTTATAATGAAATGAGTAGAGAAAAATTCATATCCGTTGGAAGAGATGCACTATATTTGTACC 
AGAATGTGATCACGGATATGACTGGCCCCAAACATAACAAGAGACTCCGTATCCCTCAACAACAAGCTGATTTTTGTTACATTATAGCCATGTTGGTTAA 
TGATGTTCCCATTACCTCAGATTTACTTTTAACTGGAAAGGCGACAAATTTAGTACAATTTGCTTCTGCTATGGTGGATCCTGCATATCGCCTGGCTGTC 
CATAAAATGGCGTCTGTTTTCAATAGTAGTTATTCCGTATATAAAGTCCTAGATCTTGACCATAAAATGTTATTAAGGGCTAATTTAATCCTATCTATTT 
TATCAGCTAG7VAATAAGTGTCTCAGTGAAAGAAAACCTAGAACATTAACTCAGAGCGTGTATTTGTTCCTAAATCATCTTTTGCGGAACAAATTGAGGTC 
TAGTGGTTTGACCAGTGAAGAGAGTTCTCTAGGAACAGCCGTAAAATTGGTGTCACAGCAATTAATGTATGAGGGTGTGACTCGTCAAACCATCGAGGAC 
GGATGTAGCATGATTAGCGGAAACTTTGAGGATGAAGACGGTGTAACACTGAAATGTTTGGGAGCCGATGTTAAGGACGTGAAGACTGTTGGACTATCTG 
CATTGCTATCTGATAGGCTTAGAAAAAACATTAGAAGAAACGTCCCTTTCTACTGAAAAAATATGGAAACTACTACGCTCGCTTTCATTGCAATGGGTTT 
GCTTTTCGTGGTGTTGATGGTGTACATCTTTATACAGTACACGTCATCAAATAAAAACAAGACAGAGGAAAGAAAGGGTAGTAAATTATCATCATCATCA 
ACTTCTTCTACTTCTGGTCCAACATTTAAGAGAGAAGCGTCTCCTGAATTAAGAGAAGACATACCACTTTCTCCACGATCTTCTGGTCCAGAACTGAATC 
TAACTGACGCTGAATTTATAGACCCTGAACTCTTTTCTGAAGCCAGGGTCAGTCCCGACCGTTAACGGAAAAAAGGTGCAAACCGGCAACTCTTCTCCCT 
CCTTTCGAGTATAAGTAGAGGCGTTGCACCGAGCCAAGCATCATTCGGGCTCATTCAACTGTGCTGACAACAACTAAGAATATACATTGAAATAACCCAA 
ATAATCAACTTTTCAAGATGTTTGGAAGTTCTGCTAATAACTTTAATGGTGACAAGAAATCTTCCTCATCATCATCAGCTGCCGCATCATCTGACGATCA 
GCAGTTAGGTCCCCTTGGACTGTCTACTGCTGATTTCAAGAAAGTTGCTGCCATTCTTGCCAACAGAACAGAGTCCCTATATCTGTTGCCAGATTCTCCC 
AACTTTAAGAATGTGATTAACAACCCTAACCAGATATCTATTGTGCCGTTTCTGGGTTCATCGAAGGCAGCTGAAAGTGGTAGTGCAAACAAGAATGAAA 
ACCAGGCTGAAAATTCTTCTAAAGGAGGAAGTGATGGAAAGAAATCATCGCAGCAGAACAAGTTTAACCTATTGAACAAGGTAGAGGCTGAAGAAATGGC 
CTTTAAACGTGTGGCTGAACTCATTGCGGATACTCCTCCCTCTAAAGATAATCCATTGAGGGATGACCCTGATGCTATTCCATCACGCAACCCATGGGTC 
AAATTGACTCAGAAGAATCTGGAATATCTTTTCTGGGAAGCAGTAACAATTGAAGTCTCCAATGATAGGAGTATCCGTAGTGGAAGATATCTTCAAGCCA 
GTGAAGTGGGGGAGAATCCATTCCTAATGACCATCAGTGTTGACATTAGAATCCTGCAAAGAATGGCACTTMTGTCGTGTGGTTCTTCAACAGATTCTT 
CCGCATGGTTTCTGGACTTGGAGTAGAAAACAGGGCCAATTCCACCTATGTAGCAACTAGCGATGCTATTGCCCAGATCTGGGTAGAGATGCTCCTCAAG 
AACTTTATTTCTGGAGAAAATGTGCCCCAGGCACTGAAGTATTTGAAGGAACATTATGAGCATGTTTATAACAAGATCAGTAAATGTGGACGTCAGCCAT 
CCTATTTTGTTGTTGAATTTGAAAGAGTGGACAACACGATTGGATTTGTCAATTCTGATACTGAACATAATGGATCATCATACATGGAATACAGGTGCTT 
TGACACAATCAGGAAGAATGCATCATCTGGGCCAAGTGGAGGCGGAAAGAGTGGTGTCCTGTCTTCTGGAACATTCTTCATTGATAACGAAATGGGGAAT 
AACAATAGTAGTGCAGCTGCAGCTTCTGCCCCTGCTGTTTCTGCTGGAGTTTCTCCATCACTTTCTCCATTTAGCAGTGATGGAGATGATGATGACGACG 
ATTGTAGTGGTGATGACGTGTGGGGGAAGAAGATGATATTCAACACATCAGGAGACGGATCAGGAGAATCTTCTGGGCAGAACGGTGGTGGTGCATCAAC 
TTACAAGAGATTTAGGTGTGGAGAAAATACTGCTTCTCTTTCTCAGAAGGAAAATGTTCGTCTCATGGCGATGCCAAAGGGAAATGAAGATAAACAACTA 
CTCAAGAACATTATCAACTTTCTGAATAGTGCACTTAACTCTGTTGAAAACCATGTAATGTGTACAGATGAAAATATCTTCGATGAGGATCAAGCCGAGC 
ATTATACGTCAAACAAGGAACTATACAAGGCTATTGTTTGCTCCAACCCAGCCAATGTGTACAGAGTAATGGTTGAACTGTTTGTCAACCTTATTCTTCC 
CCGTCTTAGGAACCCAATTGTGAGTGACATTGAAACTGTACAAAATCTTCCTTCAAACAATGGATCGGTGAGAACAAAGAAGATGGTTGAACATGGATGT 
ACAGATATGCGCTATGACATCCCTCCATATGCAAAGGGAAAGATTCGTCTTAGTGCCAAGAGAGCTTGTGAGTGCAGAAAACTGTGCAAGGACGTGAGGT 
GCTTTGACAAGTCTAGAGAGGCAAATCTCACCCCTAGCCAGAAGGCAGGAAGGGAGGTGGAGGAACCTTTTCCCCGCAACCACAATTCTCACAGGAGTAA 
CGCTCACGACTTTACTTTCTATGACAAGTACAGGGCAAGGATGAACAAGCTCAAGAAGGATTCAAAGTIAGAAGGTAAAGAAGATTGACACCTTTACAACA 
ACAGACGATTTTCTCCTGCAAGATAGGAACGCTTTTGATCTACTTAGAAAGTGCTTCCTTTCTGCCTCTCTTCATCACATTTTCTGTCCTGATGTGCTTA 
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TGGTGCRTAGAGGAGACAGCTTCAATATTAACTTTGCAAACAACAAACTCGAGTGCTATAATGAflCGTAACGGAATTGAAGAAGTTACTTCCTCACAAAC 
GGTCAACGCCAAGGAAGCACTTGAGGATATTACAAAAATTAAAATGAAGAGAGGGGATGATATTATAGATGTTGTGAAGAGTAAAGGACTTTCTTTGAGG 
GAATTCTCTAAGAAGGTTAGTAAGATTGTGAGAAGGTTTAATGAGATCACAAACCAACTCTGCAACAACTGCAACGTTAACTCTTCTAATGGAGATGTGG 
ATTTCCACGTCTTTACTTCTGTGTGTGTCTACATCCACAACATTATTCCTGTGCTCGAAGATATCTCCATTTTTGCAGAATTGGGTG7VAGAATTGACCAA 
GCTTGTTAAGGAGTGTAGAGACGTGGCTGGAGAGGACAAGACATATGATGATATTATCCGCAATTACGAAATTACTGTAAAGTACTTTAAGCTCTTTAAT 
GCACTCGTT7VAATTCTGTCACAGGAATTATAATGTGGCAGTAACCTCTGCCATTAACAGGAGAGGGTACATGTGCATGGTGAGCAACCTTGTCGGGTATT 
ATTGTAAGCTGTCTGATAACGCTATCCAGTATCACGAATCACTATGCTCTTTGCACTCTAGCATCTCTTATGCAGRCTATTATACGTCTCGCAATAACAA 
TTCTGAAGATGGAGGAGGAAACTCTTCTTCAGAAAAGAGCAATGCAGATGTAGCCAAGACTATGGCCTCTTTCTATGACCAGTTCGATAAGAGTGAAGAC 
AGCAAGAAAAATAAGAACAAAACTTCAAATGAGATCCTTATAAAAATGTTCCAAATGGATAGGGTTTTGGATGGCATGGATGATGATGATGATGAAGATA 
GTGATAGTAGTAGCAGTGAGAATGAAGAGGAGGAGGAAGAGGAGGAAATTGTAAAGAAACCAGCAAAGAAGAGGAAAGTGGAAGATGTTGATAGCAATAA 
GAAGACACTGCCAAAGGAACCTGCCGTTAAGAAGGTGAAGCAGGAAGAAGATGTGGAGATGGAGGAAGTGAAGGAAGCAGCAGCAGAAGAAGAAAAGAAA 
GAGGAACAGGAGGCGAAGGAGGAAGACGCTACTGAGTATGACGACGATACAGAAGAGGACGAGAAAGCAGTAGCATCTGATGAAGATGAAGATGATGAAG 
ATTCTAAAGCTATTTTCTAAATATCATGTGTATAAAATGTAGTATTTTAAAGGTATAAATAAACACAATATAAAGTTAAACATTGTTTTTTATTTGACAC 
ATGT AC ATC ATCAAT TT C AC AT GGT T AT ATTTCT T GT AGCT T GC T CGGC AGC AAT AAGAGC AAG AGAAG AGTC AT AT AAAAC AG ATG TC T GTG AAGC AGC 
CATGATACCCCTCGATATATTATTAATAAGGGGAGTTGCTGCTATATTAACAGCATAATAAATCAGGTTCTTCCAATCCCTAGAAGCCCACACATTGGGC 
ATTTTTATATAAAAATCAACGTCTACTATTGACCTGGCAACAGCCACTATACCTAGCTCATCAATGTTAACAATCTTCCCCTGAACACCCATTACAATTT 
TTGATTTTTCTGAATTCTCTCGTATAGCAGCAATACGGCTAGCTTCATTATTTAACCATTCCAATTTTACACTTGAAACATAAGCTGGGGATTTTTCATC 
AGTTACTTGAGATACAATATTATTCCATTGAGATAAAAGTTCCTCCAATTGAGTTGTAGTCTTGTCCGCTTGCTCTTGGAAACTTTGCATGACGATAGCC 
GAATCTCTTGTTGCAGTTTCGATGATGGAAATACAGTTGAGCCATTGCATGTCAGTAAGGATATGGCGCGGATTAACTGGAACAAACAGATTCTGTGCAG 
GAGAAAAGTTCAAGTCTTCTGGAATCTTGTTGAGGTTGATATTGTCCTTGGTGTGTTTCCATGGGATAGGTTTCTGTGCCTTTGAAGCAATAAAGTATGA 
TGGGATTCTACTGTATTTCTTTAGTGTTTCTGAAATGTTTGTCTTTATACGGTCCTTTCTTCCATCATCCCTAATGTAATTCAAAAGTTTAGATGTGAAT 
AGAGAAGAGTCTCTAArTTTTTGCATTTGTTCATTGTGAACCTTTATATCCTCGGGAACAGTATTGTTATTATTAGCCTGCCTCTGTTGACGCTGCTGCT 
GTTGTTGTTGTTGAGTAGTACTTCCAATGGTATTTAATTTGTCCAATATTGAACGTAATTCTACAGGCACCCTTCTACCAGTTCCAAGAGCACCTCCACC 
GCCGATGGCCGCTGCAGCTTCTTCACTAGCTTCATCATTTATAAAAAGTCTCCCCGAAGAAGCATAGCGTAAAGATCCATACCCTTCATTTCCAGTATCA 
ATAGTAATCCCATTTTGTTCAGCTGTTATCCTTTCCAGTATTTCTTCAATGGCCTTTCTAGGATCTAGTGCAACATATAATCCATCGACCATAATATGAG 
ACACGTGCTCGCACACrTCTTTAATAATTTTCATCTCTTCGTTGCTGACAAGCATCATCATTCTAGATATAATAGTAGCAATAACATTCCTAAATTTGTT 
CTGAAGAAATAGTTCGATCTCTGTGAACGTAGGCTCGAAACGGGCAGGCACATTAACTGGAACAACCGCTTGTGTTTCTTGTTGTTGCTGCTGTTGTTGT 
TGTTGGATCGCTCTTCTTGTTCCACCACCTACTATATTATTACTGCCTCTGAGTTGGTCGGCTGCAATAGCAATGATGTTTTGTACACCTTTACCAGATT 
CTGCAATTACTTCCATTTTTGAGTCTTCGCCTGAAATTGGTACAGAAGGAATCTTGGGTGACAACCTTTCTAGAGTTGTGGGTAATCCAGTATCATACTG 
AAGAACTTTCCAAATAAATTCGAATCCAGTTCCTGCCACTGTGCCCACTTTACTGGGGAAAATATCGATCCTCCCTGTAGATAGAGGACTAACACATTCC 
TTATAGATCCTCATTGGGCTTAAATGTTGTGTCGATTTAGTTTTTGAGGCTTCATAAAGCACAGCAGACGAGAGAAGGGGTTCAAAGAATACAGTTTCCA 
TGCCCAATTCCTTTACCGAATATTTGACTTGTTTAGATGCATGAACAGGAAGATTTGTGGGCTGAGAAGTCGTACCAGGGAGGAATGCGCTAACATGTTC 
AATTTTCCCATCAACCGAAAAACTTGTCCGTACACCATCAACGAATGAAGAAGAAGAAGAAGAAAGGGATCCAGTGTCGTCTTGATGGTTATTAATATCA 
CCAATTTCCATATCCTTTATACCATCATCGCGTTCAGTATTCTTTACAACTTTAAAGTACGACACCAATTCACCTTTATTTTTCAATCCAAGGAATCCAT 
CTTC T GCGATATTC T GACC AT ATG T C AC A T TTCTTGT T AGAG AT T T AGGT GGT AC AT AAT T T AAG AACG ATTTTG AAATGGT T TC AG AAG AAAATGG AG C 
AGT6GCAGTTGTTTGCTGCTGTTGTTGTTTTTGTTCCTTAGAGGATATTAGAGGTACATTAATGAAAAGTTCCAGATCCTTATTCATCTTTAAAAACCGT 
CCTGTTGGTACAGGAAGATTAACTACCCTAATAGGGGTTTCTGATGAATATCTGGAGGAGGATGATGGAGAAGAAGCGACTGAAGTGGCAGACTTTAAGT 
TTTCGGCTACATTATCTCCATTCAAAATACTCTGAGGTGTATCGGTCAGTGTATAGTATCTGAGAGACGTATCATTTTCTACAATAAAGGGAGGATCTCT 
AGGGAAAGCAAGGCACACGCCTCCATTATCGGCTGCCGAATGAATAACTGCATAATGTGTAACAATCTTTGCAATTAGATCCATCACGGGCTTCTTGAAT 
AATAAAAATTTGTGTTGATCAGGATCGGCTGTATATTTTAGTAGCACTTCACGTACTATATAATCTGTTAGTTGTCTAGCCTCCTTTACAGCCCATACGC 
CCGATTTCAAAACGCAATCAAAAAATGGAAGAAGAACAACTTGAAGTACCTGCTGGTGCATGTACATCCAAGCGGTCTCATCCACTTTCACTGGCTGTCC 
TCTCATAATATTTCTTGTCATTGTTTCTAAACACGACCGATAAAGACCCACCACGCAGTTAATGGGATCGTACCATCTGTACTGCCGTCCATAACTTCTT 
CTTCCGGCCAGTATAGGCTCCCCTGGGAATAGTATGGAATCGCAGTTAGATTCTCTTCCAATATTGTATATGGATGCAATCTCTCGAGCAATATCTCCTA 
CCGTAATAGCAGATCCGCCCGAGAAAAGTCCCACAACAGCATCAGAGACATTTGAGAATTCCCTCCCTTCTAGATTTACATTATTATAATATATCAGAGA 
TGCTATTATAGCAGCAGAAGATTCCCCAGAATCGTCTCTCAGAATTAATTTTAACGTAGGAATATCAACATCTGGTATAAAGGCTGTAGTATTATTTTGA 
CCGAATGTAAGAAACCTTCCAGAATCGTGTATTTCCGACACGTTACTTTTTCTGTCGACAACGAGTGTTTTATTATGTTCAGCGTCGTTAAGAGAGGTTA 
AAAAAAGGTGCATTTCTATCCAGTCTGTTTTCTCCGATATACAGTCCATGATTGAAGTGTATTGAGGCCCAAAGAAGCAAGTAACTGGGTTTTTCATGGA 
AATGGAAGTAGGTGCCCCTGAAGGGACAATGTTGGTATCGGTAGAAGTTTTTTGCCTCTTTGTACGTTTTCTATTTTCATCGTATTCCTTAGAAAGGGTG 
GAAGATACATCTCCTAAAGATTTTCCTTTAGACGATAAAATCGATTGTGCGATTTTTTCCTTTTGTTCGGTCGTCAATGAGGCGGTTGAGGAGGAGGAGG 
AGGCTGCTGTTGCTGTTGCTGTTGGTTCATAAAGTTTGGGTTGAGGTTTATTTTTAAATGATAGTTGATAATACCGTGTAACCAAATCCTTTAAAGAATT 
GGTATTTTTTCCAAAAAAGCACTGGGCCGTAAAGTAGTGCATACCAAATTCAGTCGAGTTAAATTTTGACAACTCTGGTATGATAATGGAAGAAACTGAC 
TTGGAGTTTGAAGCCATGTCATTTAAATCACGTAGCCTTGTAAGAGAGGGATGTTGATAGAGAATTTTCTTTGCATTTGGGTTCTTACTGGTGTCAGGAA 
ATGCTATCTGGAGGGGGGTGTTTATCCACACCTTCACATCCATTAGACGAAATATCACAGGGTGTTGTAATAATTTCCCTGTGAATCTAGAGGTCGTAAG 
AACAATATCAAGTATGCCGTGATGGGCATAAAACACACCATAGGCGTCTTTCAATACTATTGAGGAAGAAAGTCTTTCCATTTCCCATTCTTCTTGATTC 
AGATGGGATATATTGACAAGTTCTTCCATGCCGTCGTTTCCTTTAAGATCTAGCCGTATTTCTTTCTCATCATGCATTGTCCAGTCCTTAATATTAATGA 
CAACAGGCAAAAGCCCTGCCAATCTCTCTAAAGACGAAGCAGTTTCTTCCATGTTCAGCTTGATGTTGTTAATTTCGCCTTCGATAAACCTCTCATTTTT 
ACGAGAATAACATCCTCCTCCTCCTCGAGGAAATTTTTGTTCTGTCAGAGATGAAAATCGTCGTGCTTCTCGTGCTCTTTTTTCCACTAATGCATTTAGA 
GATTTTGTAGATATTTCTTCAATAGTCATCTCTTTATTCGTCTCTATAATAAACACAGACGGGTGTATTTTTACCCTTATTTTTGCATTTTCTGTTTCAC 
TTCCTAAACGCCCCTCGGTGAGGTTTGCAAGGGTAGAATCATAACACGCAAAAGTTCCCGCATTTIACAAGCGGTAGAGAAATTTCGCTATCGCCATCTTG 
TTGTTGTTGTTGTTGCTGTAGCCCTTCAGCCAATAGGCAAGCTAAACGGCTATTGTCTTCGTTGTTCTTACCAGGCAGGGGCCGTTCTATTGTTTGCGTT 
TGCGTTGTTGACATAATTTCCTCCTCGACGTTTCTCGTCAGCCTGTAATTCGAGTAGAGTTTGTTTCCTTATACGGGTGTGAACCAATAGTCGGTAAAAC 
ACCAATCCACCGGTTACGATGATGGATATCAACAATATATAGAATAGGGCCTTGTCACTAGCCTTGCAGCATTGTCTCTTTCTGCAGCTGCAGTCAAATC 
TTTGTAGTCTGGGCCTAATTCGCTCTGTACGTGGTTCCGTATTCTCTCAGACACGTCCATGTTGTTGGGTGGATAAAGTATAAATACTCAGGTGTACATA 
AAAACCTCTCACATGATGGCAGGAATCGGTAGAAGAGATAATAGACCCGTATTACATCTGGACATTGATCCAAAT7VAGGAAATACCCTATAACGTTCCAC 
CAACCCCTATTATCTGTGAAAAAAACCCCTTCGTGTTTAACATGCAGAAGTGCTCAGACTGTGCTCCTTTCCCTCCCTACCCCGGCACTGAGAAGCCTTT 
CCCTCCATACCCTGGTACTGCAGTAGAAGAGGAGGAGAAGCAAAAGGAAATTGAGGAGCTTCTGGTTGACCAATCTTTCCCTCCCCCATTCCCTGGAAAT 
AAGCTGAGAGATATCCCCAGAACCTACCCTCTCGAATTTCCCGAGAAG7VAGGAGAAGGATTTCCCTTGCGTTGACACTACCGGTCACAGCGATATCCCCT 
TCATCGATCTGGAGAAAACCCCACCCCGTAGTGACGTTAGGCACGGTTACCACTACTTAATCAACCCC7VACAAGGTTGGGGAGCTTAACCATATCGTTGG 
TAAGCTCACTGAAAAGCAAGAAAACCTGAACAAATTGGTGTTGGATGTTGATGACGTTGTGATTAATCTGTCAAGCACTTTGAAGGAACTTGAGAAGCTG 
CGAGCTGGCCTGTGCAAGTTCTCAAAAAACTAGACGGTAAAGGTATGACCAAACCGGTATAACCAACTTTTTATATAAGCCTAATATTATAAATCAACAA 
CTAACAAACGCGAAAACTAACAATAGGAACCATGGCCTCATCATCATCATCCCCTGTTGCCCTCTCCTCTGTCGCATCATCCGTGATGATGGAAAGAGAT 
GAAGAAAATACACTGTCCCTCAGGAACAGGAATGTGAACAAACCAACACCTGTTAGCGCCGCCTGGGTGCCTGTTGATGAAGAGGATGAAGATAGGGAGG 
AAATGAGAAGACTTGAAGATTTTTCTTCAGATGAAGAAGACGATGATAATAAATCATGTCATTGTGACCATAGCGATGACGATGACGATGACGAGGAGGA 
TCCTTCATGCTTTAAGGGTTTTTCAGCTGGCCTGTGCTCTTTTGTGAGGGGTTTCTTTGGCTTCCTCAGGAAGTCACTTACCAAGAAACAGGTGTTCCTT 
CTTACAAGCGCAGCCGTTGCTGCTATTTTTAAGACTAGAGATGTGGCTAAAACTGAAGAAGGCGCAGCAACCATGGAAGAAAATTCAACAGATGTGATTA 
CTGGAGGAGATGGAGATAGTGGTATTGCTGCTGATGTTGTCTCTCTCGCTAGTGAGGGAGAGGGAGAAAATGGATCTCTTTTGGAATCTATTGCAACAAC 
ACTCATCAAGACTACAATTGAAAATCTTGTAGATGGTGGAGAAGAAACCACAGAATTGTAAAATGTTTTTTGGTATACAAATAAAAGAATACATAATTTA 
TATAATATTTTATTGTTTCCTCTTAAATGACAATTTCCTTATGATACTTTTAGGTGAAGGAATAAAGGTAGGAGAAGGTGTGGAAGACACAATTGTAGTG 
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TTTCCATACRCATGATTCTCCCACATGTTATTATTGTTAGGTGGGAATTTCTTAAACGTAACTTGGGTTATRCGTGGAGCATTGTTGTTGTTGTTACTAC 
TGCTCCTGGTTAGGGTGTTTTTCTTGTGTTTCATCTCCTTCAATTCCTGGATACACTTCTGGATAGATGGAGATTGTTGAGGTACTTCTGGCACGTATTC 
GTAGACATGTTCTTCCACATTATCATCCTCTCCCTCCTCATTCTTGATATTGAAGATGGGAGAAGGTGGGTTTGAAGGAAGCTTCAGATGTCGTCCGTTT 
ACATATGACACGTAATCACCGCCATTGCTTCTAGGTGGAGATGGCGGCGGTGTTGGCAGAGGAAGACTATTGTCAGAAGAAAATGACGACGAGCAGCTCG 
ACATTGGCATATAGACACTTTCTTCCTCTTTATTTGTACTTGTTGGTTGTTGTTGAGGTGTATTGGCAACTGGCCTGGGTGTAGGCGGACTAGCATAGGT 
TGGACCAGAAAATGTTACCAGTTTTGTGACTTGAGGGGTAACATTGGCCAGATCGTAAGTGGTAGATTTGAGCATGGGTTCAGAGATGTCCAGGTCCTTA 
TCTGGAGTGTTGAACCGGACGTGGCTGATGAAACTGGATTCGGTGTTGCTCCCGTTCCTGTTCTTCTTATTCTTATTCTTCTCCCACGGTGAGCAAGAAA 
TGCAAGTAGTAGTTCTGTCGATGCACTTCACGAGCATGTTCATGGCCAGTGCGAGCCCAATAGTCATGGAGCAGCCGACTGCCAATCCACTCAGGAAAGT 
GACGAGGATAATGTTGTCGTTGGTGATAAGATTGTCCATCTCGAGAGATGTTTCTTCCTCGACTACCAGACGAGATATGATGCTCTGGGGCCCTGCTTCG 
TCCTTTTATACCGTCACTTGGTTAGAAAAAACGATAGTGTAGTTCACAGTATTTGTCCCTTCTGGTCTTGGGATTGGGGTTCTGATACTCTGCCCAGCCT 
CCAGATATAGGTGGATAGACCGAGATTCTTCCCTTGATGCAGTTTCTATTCCArCCAAGATATCTTCACCAAATAAGTCCTTATGCAAGGACACCAGAAT 
ATTGAACTCGTGAACGCTTGACTTGTTGATGATGCATAATTCACCTTGTTCGACTGTTATACATACTCCTTCAATAGAATCGTATAGCCCGGGGTTATTG 
GCACGGTCCTCTTCAGGTAAGAAAACTGCAAATATTGTATTAGGCACT^CATCTGGCACGTTTACAGAATGACAGTAATGTTTATCTATAATGTCACCAG 
AAAAGTGAAAATCTATCATCCGACACTGGAGAAAGGTTGAAGGAAAATCTTTCCGCATATTCTTGAGTTCCCGCAATAACTTCTTGTTTCTGGAGTTTGA 
CTTTTTAGGATTATTTGTTGCCAGACTTACAGCATCGAAAATACTTTCTCCCATTCCTTCAGCGGGATTTTCCATCTTACTTTTGTAGTGGTTTTCAGTT 
CATTAATTGTGTTTCAGGGAGGCTATAAAACAAAACTCTTACGCATCTGTTCATTCAGTAGTGGACCAGACGTCGAGATGACGAGACATGGTGTGCTTGT 
TCCAAAAGGCCGTTCTAGGCATGTTATCCTAGGAAATGTCGACTACACTTTCTGTACTACTGACAATAATTGTGTCAGTCTAGACATCGATTTCAAGGAC 
AAT ATC AC AG AC C AAAAC ATT C AGT T ATTG AAC AAGAAAT TGGGT AAGAA AACAG C AAAG AAAAT AAAG AAGGAAGATGC AC CTG AAAC AAAGGA AAAT A 
GTGACGAAGACATATACGCCACCAAGGAATTCGAACAGACAATAAAAGGTCTACAGACAAAAAAAGGTGCCACCGAGGAAAACGCCATCGCGGCCGCAGC 
TGCCGCTGCCACTGCTGCTGCGGTAGAAAAGGCTATGCTATCAGAAAGTGAAGGAAAATCAATGGTCATCAACAGAGCTAGAATGGTGCTTTCTAAGCGA 
GACACGTCCCAGAAACAGTTCACTGCATTGAAGAACAGGGAATCTTTCTTCAGTGTTTTGATATTTGAAACTGGATCAGTGATAGTTGTCGGGCTTCAAG 
ATCCTTCGCTTACAAAATTGTGTGTGATTAAAGCCACGACTGATATTGCTGATATTCTACAGAAAAACATCAGTGTGGCTAACGTGTCTATAGTGAATAC 
AGTGTCCACTTTTAATAGATTCCACTTGAACTTTATTCGACTCGGGAAATTCTTCGAAAGAAATTGCATCTCTTACAGTTATAACCCAGAAACGTTCCCC 
GGTATGTTTTTCAAGCTGCGAGTGCCCGCAAAGCCTCTCTTGCCTGGAGAGACTATAGGGGAATACTACACAAAGGTTGCAATGATGCGCGATAGTAAGG 
ATCCCAATTTTAAAATGTCTGACTGGTTGAGGATAAAAACTGCATTAACATTTAAAGTTGGGAAAATTACTGTGCTCGGAGAAGGAGAGAGTGGGTGCGG 
TGATGTTTCTGTCGTATCCAAATTACTATTTGGTTTATTCCATTACTTTATGGACAACAACATTAAAATGTCCCCCAAAGAAGCACAAAGAGTCAGAGAA 
AAATACGGCATCCCGCATCTAGAATGGTACTTGTACATTGACATGTTGCTCCACTCCTACCCGTACGTCAAACCATCGGCCGAGCAAGTGAAAAGGGCGA 
TGGTGGACCAACAACATATTTCTGAAGTGGATAGGACATACTATGGAACAAAGAACAGTATGGACGCTGCCATGTCTGCAAATTTAGTGCCTTCAAAAGA 
AGAGAGTATCTCCTTCATTAAAAAAATAAGATCACAACAACTTTTTGGACATTTGTGTAAACCTTCAAAAGAAACTACTCGACGTGCTATAGACACACTT 
TCTTTCGATCCrATAAACCAAGACAGGTGGTGGAATAAAAATGACCAATATrACGGTAAAGAGAGATGTGACCCGTTTTCTGTTGCACGTTTAGTGTCTG 
TTTCTGAAAATACAAACAGTATGATGAATAGTCGCATTTCTTGTCAAGGGAAATGGTGGCTAGATGAAAACGAATACAAGGATAAACTTGATCATATTGT 
GGATTTGTGTACAGAAGAAATAGTGGAGGAATGTGAATCAAAGGGTTTTATrGCCTCCCCATTTTTGAGGAAGCACCAGAAGGAAAAAATACCAACGCCT 
TATGTTTTATTAGCGAGAGCCTGTAATCAAAAAAATGGTAACAAAATGAGTATTAACAATAATAGTAACTATTTGTCGGGTTCAAGTAGGGCGAAGAGGA 
ATGCAAAACTACAGGAAAAACACCGTGTAACTTTAGCCAGGTTGAACACGATGATGGCGTCGTACCGATTTTTGAATAATTACATCTCAACAGACATTGC 
ACCTGATTTTGCCAAGTTATTTGGTAATGATGTATATAGTTTATTACATTTAATGACAAACTTGCCTAAATCCCGTGGACATGCTCTAACATACAACGAA 
AGGGCCCTTTCAAGTAATGAAAGTACATATAAAACACCTGGAAATGCATACTTTAGTACTCTATTTGAAAAATCCATTATAAATAACCAAGAAACTGCTA 
ATAAAGGTAACAATAGAAAACGTAAATTTTCTCGAATCGGACAAGAAAAGAGCTCTTTTCTGTGCAACGCGTGTGGTGTCAATTTGAACAAGGGTAGTGA 
TGAAATCATAAAGGGTATTTGTACAAGTTGCGATCAAAATAGTACCAGTTACATAGAGAATGCATTATCTGACATTAACAGAGACAAGAAGATTAAACGT 
TTTAAAGCAGCTGCAACCCATCCGCCAGTGAAGCAAGAATTGGTAGATTCTTTATCCTCCTCTTCATCTCCTTCTTCTTCTTCTTCTCAGACGTCTAACA 
AGAACAATAGATGCACCCCTAGTGATTTTATAGATTATGTGTACAAATTCACTGACGAAACAACAGGTGCTCCAAAGGTGGGCTTAGTGTTTAAAATGTG 
TGATATTCTTGCATCCTTAGCAAGCAGGAGAGGGATGGAAGATCGTCCCACAGCCAACTATAGAACCTCCTTACATTCAGCTACTCAAAATAAAACCAAT 
TTGAATAAACTATTAGTTTCTGCTATCAAGGAAACAGGAGCCACTGAAACTGAAGCACAGATATTCAACAAGATTATTGGTAGTGAAAAGGGACTATCAA 
TTCTCTGTCAACTTGTGGAAAGGAGGAACAAAGACAATAATGTCTTCGACTGATTTGTCTAAGAATGCCTTCCATGACTGGGTGGTATCAAAGACAGATT 
GTGAGGTGTTTGATGTACACTGTGAGACGGACAGAGATTGTGGCGCTGCTTGCGAGAACACGTACTCTGTTGACGGAAAGGAGGTTACAAAATTCTCTTG 
TAACCAACAGTCGGGAAGATGTGCCAGGAGTGTTTATAGTGCGTCTTCTCTAGAAAGAGCAGCCAATGATCTTGGCCACATTATAGGTATCATCAAGAAA 
AATCCAAAATTGGAGGAAGAACTCCCTGAATCATTTTTGTGGTTTATCAATCACAATGGAGGAGATTTGTTTGTGAATAAGCGAGCCGCGTACTACGACA 
CGATGCATCTAAGCATAGGGAAACTGGATAATGTGGACACTCTTGCCCAGGGTTTAGATAAACGGATGGCCTCATCATTGAGAGAGCACCTACTGAGGAA 
GTTGGACTCTATACTTTTACAAATTGATAAAGTTAAATATGAAAAGGCAAAGAAATGGATATTGGATATAACACAGGAGGCTGGCACCGAAGAGGACAAT 
AAAGAAGAAGAAGATGCGAAAAAAGAGGATCAATCTCTTAGCGTTAGTGAAATTGTGGATGTTTTAACGGGCACACATGACCCTATGCCCCTGAGGGCTA 
GAGGGTTTATCCAGAAAAAAATATATCCTTTGTCAAGAAACGAATTGAGAGAATTAGCCCTTAAGGAACTTTTCCCTGAAGAAACTACATCTCCTCAGGT 
TTTGAGTAGGCAACATGATGTATCTACGCGTGAAGATTTATGCAATGAAAGTATGAATGCAGGGAGGGCAGAATCCATTTTTAGCGACCCTGATTCTGGA 
GAGTACGTGGCTACTTGTGCATGTCTTTACTCGGAATATTTAACAGGGCCTGCGTGTAAGCACAAAACATACAGGTATGTTATAGACTACGACAAATGGA 
AAAGGACTGGAAGACCTGAATTTCTAACTGATCCTGTACTTCATTTTAAAAAGGCAGAAGCTGTGTGTAAATCGACAAATCCAAACTTGAGGGCAATTTA 
TAGTCCAGATAATAAAGGTTTCTTGTGTGCGCCTGTAGCTGAACTTGTAAAGACGGCATTAACTTTTAGGGGTTCACACGAACCGTCTCTCATTGTCGAG 
AGAGATATAAATCAAGCTGAAAATCTACCATCCAATTCATTTGGTGTAAACTGGCCCTATGTGAATCTCCTAAATCGCATTCAAGACCAGTACACGTAAT 
TTGATAAAAAAAGATCGGGGAAAATGTCTGCATCTTTAATATTGGACGAATACCTCAAGAAGACTGCTTCAGCCGTTCTGGATGTAGCTGACTCATTTGA 
GAAAATCARGGGAGAAATCCAATCACCTGAGGAGGCTGCGGCTCTTTCTGTTGCTCTCTATGGAGCACCTCCAAAACCTTCAGCTTCGGCTGTGGCCTCT 
ATCATCACTGGAGAAAGAACATCTTTAAACGACAAATATCTATCGGATAATGTCCTATTGAAAATGTCTGTTGCTCGCGTTGGACAAGAAAATAATCGCA 
AGAGAGCCGACCAGGCAGCTGATGAAATTAGAACCATCATGGAAGATATTACAGGGAGTTTGTCCGGTGCGTACAGGCAATATAGCCCGCTCGAGGAAGA 
AAATAAGGTGCATATAGGCATCATGAATAACAAAACGCCTAGCATTGTTTGTGGATATTATACAATGGACACATCTATTTCTTCCGAACCTCTTTCTCTA 
ACAGATTTTCAAAACCCCACTGTCATTGCCAATGTGACTAAGCGGATGGAGAGCATTTTTTCAAAGGTCGACTCTGCTAGGTCTACAAGATTCGACGCTT 
TTGTTAATGGTGTTGCGAATAATATGGATATAAAGTCATCAATAGATTGGGCAAATATGGTAGATVAATGTGATCAAATTACCAGATTCTACACCTAACCC 
TTGTTCAGTTGACACTATTGTGTCCAGAGACGCAAGTGTAGTTAAAACAGCAGTTAATGATATATACGCTTCTGTTGGAAAATCTTATTGTCGTCCTGCA 
ACACAGCTAACCTTTATGAGCGAGATTGAAAAACTGCGAAAGGCTGCAGTTGTATGTTTTGAGGCACTCATGTCCGATACTAGGGAGAGGGCATTCGTAG 
AGTTCCTATTTTACGTTAGCTTTAAGGAAGATGCATCAAATACCAATTCAAAATTGTTTGTTCAGAATAAGCTATCTTCCATGTCTGGAAACCCCAGACA 
GCCCATAAAATTGGTACGCCGTTCTGCTGAGGAAACACTATTCGGGCTCTGTTTCATGTTTAAGGTAATGCCTCCAGAATTCATGAACTGTATATTTAAC 
TTCCCTACCATTCCCCATTCAACACAATACCATGGTCTATATGGTACATGTTTAACCCCTCTACTTAGAAAATACGGTTCTTCATTCGAAAAGTCCTGGG 
CTCATTTTGAGGAAATTTTAAGCGAAAGAGCCAATGCAGTGAAAAAATTTGGTGTAAACGATACGAGGATAGATTGTCTAGATGCAGTAGCAAATCTCAC 
CGGACCTGTGTATGTTCTCATTTTAGATCTTGTACGTACTCTAAGTGCGCAGAGATCGTGTTCAACTAAATTTCTCCGTGAAATTAAGGAAAACTATCTT 
TTGTGGAATAGGTTTGTGTCATAATAAAAATGGCTCAGACATCAAAGATGGGAACTAACAAGAGGTGTTTTGAGGAGGAAGTGGAGGAAGAAAGGCAACA 
ACCTTTCACAAAGAAATCTAAATCGGAACCACCCAGTTTTGAAGACAAGAGTTCATCCACATCTTCTAAGAAGAAGAGCAAATCCAATAAACACACCAAG 
ACCAAGGAAGAACAACTTCTAGAATTCGTGAAGGATCTGGAGCGGAGCGACCCCACTGTTCCTGATGAGAAGGTCAAGCAAGAAGTTGAAGAAAAGTCCC 
CTGAAGCTATTGCTGAAATTTTTTCAATGTTTGGGATCGCTCAAGACAGCAAGTTCAAGAGCCTTCTTCCCATTGAACGCATAAAGAGCATCACTACTAA 
AATTGTTATCGATGCAATTAATCAGCCTGTGCGCAAGATGTTGGTTGACCACCTCTATCATTTTAAGGAGATGCAGAATGTTGTGGAGAAATATAAGGAC 
GATAGCGACGAAAAACTGAGCGTCATTCTTAAGAGTAAGAAATCCCCCAAAGAATTTGACCTCTCCTTTTCCGATTACGTTGATCGCCTTAACAGGATTC 
TGGTTGGTGTAATTAAGAGGGTGGCCGGAGCTATTGAAAGTAAGGAATTGTTGCAGAGTAACAGCATGATCATGAACAGTGTTCTGGGTACTGTTGTGTC 
CAACATTCCTTACAACATGAAGATTAATATTTGTGTGTTTTTGACTAACTTTATTTGTACATTTGCTAATGATGATTTGTACACATTCTTTAGGGATGAT 
GAGAAATTTGTAATGAGTCAGGTAACAAGATACATTTCAAAGGATTAGAAAAATAAAAGATGGTATAAAATTACTTGTATTTTTATTCAAACAAAACCTT 
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TATATTTGTACAATTCCCACTTTACTTCTTCTTGATTTCGTCCTTGATATCGATCACATTCTTGGAGGTGAATCCAATGTTCATTGTAGCTTCGAAGGCT 
GCGCCGTCTTCAAATTTGCTGCACACGTCAATGAGGGTAATTTTTGAAGATGAAGATTTAATGTCCTTGCTCGAGAAGTAAGTGTTCGACATAGTATTTC 
CCTTAATGTTTCCTGCAATAACAGTTGGCCGAACAGTAATGTCGTTGATCAAGAGATCGACGCCAGTGTTGTTGAGGATCTTCAAAGTCATGTTAGAGTT 
CTCGGCCCCAGTGCCAGCGGGATCAGTCCTTGGGGCTGTGACGGTAGAGATGACAAGATCAGCAGAAACATCCTTCATGTCCTTATCGGAGAGACCATTC 
TTCATGGCCACCTTTCCAAGAGGAGTATTGTAGGACCTCTCTCCACGAATTGACATTACCTTTGCCCTTCTTrGAATTGGGACTCGCATCATCTGATCAT 
AATTAGCGACGACGCTTCTTCCAACACGTGTGTTGAATACAATCATTATAACCATGATAACGATTAGAGCAATGATTGCAATGGACAAGATTGCAATAAT 
TGCAACGTCCAGGTTTGTTiiGGTTGCCAAATTCCATTTTTCTTTGTTTTAGATGGAAGTTCTTCCTTTACGTGTTGGTTGGATCCAGCTAGTTAGTTCTT 
CTGTAGCACCATATACCCAGAAAGGGTTTGTTTTTAAAATTTAACCCTACATCGAACCGGATGAAGTGTACCTTGCCGTGATCTAATTTCAGTAACGATA 
CCTCCCACTACCCCTACTTTCATCTCCATCTCTACCCCCTCCCTACTCCTCCCTTCACCTCCTCCTCTCCCCCTTTACTCGTATTGGTCGTCTTCATCGT 
CGTCTTCTGAATTGGATGAAGAATTGTAATATTTTGTGTGCTTCATTCTTTTCACTTGCTTTGCTAGAGAATTGACGCTAGATTTGTTATTCAAAAGGTC 
CATAAAAGTCCTCCGTCTAGAAGGTGTGGAAGATTCTTGGTCAATATACACACTGCTAGCGCTACTCATGTTGGAATTGATGTCATTTGAAAAAGAAAAC 
GTTCTTTTCCTACCAACATGAGAAGATACACTGCTAGGCACGTCCATTGCTGTTGCTGCTGCTGGGGTGGAGGGAGAAACATCTTGATTTTTAGAAGGAC 
GGTACATTAAGTTTCGCATAGGCACAACAGATGTAGATTTTCCGTGCTTACCCTTTTTATTGTATTTCTCAGATGCTTGATTTAGCTTATATTTTATGCA 
GTACACTTCAATGGACAATTTAGCTTTCTTTAAAAGTTTGATCACTACTGCGCTTAGCAAGTGTTTCCTCTTGTCCATGAAATTGACTAGTGTATTTAAT 
GCATTATCAAAAATAGGGGATCTGTGTTCGATGATTGATGAGCCTGCACTGTATGCTACACCAAACCTTCCCCCGCCTTGTGTCTGGAGATCAAACATAC 
TATTCTTGCTGTTATTGTTACTCACTCCAGAATACGCGTCACCGAAAAAGTGGTTTAATGATGTACGGTCATTACTGTCCCCGTATCTTATATTATCGTA 
TACTCGACGAATTGCTTCCTTCTTGTCCTTCTCTGAAAACATGTTGCCGTCTTCACTTCTTTTCTCTGCCTCACTTCCATTACCATCTTCATCCTCTTCA 
TCGACTGCCTTATCAAATTGTATACTGTTTAAGACGTAAATGGGAGGGACAGCCATTTTAGAAGAACAACTAGAAATACTATCCAAAAGAAGAACTTTAT 
AATCTTTTGACAAAGTGAACTCTTTTGGCCAATTTAATAGTACAATTGGTATGTTCAAGAAAATTGTTAACCTTCCCACGTCCATACATAGTTTGTTGAT 
GTGTTTCTCTATGATTGTAGAAATTGCCCCGAAACAAATTTCTGCCCCTATAGAACCTAACCCCATTTCTTCCATTATGTTTGCAGATTCTTTACTCATC 
TGTTTTAGTTTAGATGAGGTAGACAAATCTGATACATCATTTGTTTCAGTTTGATGTTTTTCAAAATCCTCTTGGGCTTTGTTGGTGTCTCCGAACAGTG 
CCACTGATGCCTTGGTCACTATGTCCGAAAGCATATTGACTAATCTAGACCAAGAAACAGCACTTGATGTTTTAAAGGATATGAGGTTGTTTAGGAACGC 
CAGTCTTTGACCAAGCTTATACGTTTGACATGGTTGATTATCGTCGTCTTCTTTCAGTATATTTATCAGTTCCATGTGCAAATTTGCTTGAGTTTGATTA 
CTATAAAATGAAGAATGCTCCACCACCTCTCCTCCTCCTCCTCCTCCTCCTCCTCCTCCTCCAGCAATAGACTCGTCGCCTTCAGCACATACTCCACCAG 
CAGATGAAGACATGGAAGTTTCATCCTCGTTTAATTCAGATATACTTTCAACCTTTTTAAGAGAAGGAGCAGGAGTCGATGGTCTTATTTTGCTAGCTCT 
TTCCCCCATTATATAAGTTGACGCGTATGCTGCTGAAAAAAGAGGGTTGGACATGATTTGCTTCCTCTGAAACAACTCCAATTTCTCATTTGTTTTCCTG 
AACAGCTCGGCAGATTTAATGTCTGTCACCGCAGCAGAATCTAAAAATGCCTTCATGGTTAATTCTGCAGCGATGATATTTTTAAGGCGCTCCATTACTT 
TCTCGGATGGTTTTACTGGGTTTTCTGGATCTTGGAACACAACTTTCACCAGTrCAGAGCATTGAAGGTTCTTTGTTGTTTCTAGAAGAGTTTTTACCCT 
TGTTTTTAAGATTATAGACTCCATAGTCTTGTGTAATCTTTCTTCGGCCTCTTTCCCACCTAATGTGTTGTTAACCTTTCCAACCACTGTACTCAAAAAG 
GATGGATTGTTGTTGTTGTTTGAAAATAAGGACTGATTTGTAGGTGGAGGTGGAGGGGGAGGAAGAAGAATATTGGTAGGTTTGAGTTGAGGAGGTTGAT 
GAGGTGGTGGTGGAGGTGGAGGAGGAGGAGGAGGCATCAGCATCGAATCTTGGTCCATGATGTCTCTACTTTCTTCTCCCATTTCATTGTCTGGAGGAAA 
AGTCACGATAGATGGAACAGGCGGTGGTGCTATATCGTTTAGTTGGTCATATTGrTCCTCACCTCCACTACCAAGAATAGAAAACATTCTAGGTGCTTTT 
GGTGAAGGTTCCTTTATCTGGGGGAGAATATCTGTCTCTTTCATGATTGGAGGGAACACAACTGCATCCTCGCACTCTTGTATATCTGGAGGGGGAGATC 
TTCGAACCCTGGGCGCATTGTTAAATTGATGCATAGTACTTATTTCTTCATCCTCGAGTGTGTGCCGAGATTCATCTTCATATTCTTCATATTCTTCCTC 
TTCTTCAATATATTCCGTGTGAGATTGAACAGACTCTTTAGTTTCTACATGCTCTTCAGATTTTTGTGCAGGTTCCTCCTCCTCCTCTTCCTCTCCCTCT 
TCTTCATCCTCTTCTTCATCCTCTTCATTATCATTGCCGCCACCTGCTATTGAAGAGAAGAAATAATCCATTCCATCATCATTACCTCCTTCACTCATAG 
GGCTGGATTCAACTCTATGAGAGTGCACGCTTTCATTTTCTATTGCATCTTCATCCTTCTCGGGTTCATTTTTTTCTTGTTCTATAGGTTCTTTTTCTTC 
CTCTTCCTCCTCCTGTTCTTCTTCCTCTTCCTCTTCCTCCTCCTGTTCTTCCTCTTCTTCATCCTCTTCCTCTTCTACGGGTTCTATTTGTTGGTTATCA 
TCTTCTTTGTTCTTCTGTTCTTCACCTGCTGATCCGCCCATGGCGTTCACATTTTCAAGTAAAAAGTTCAACATGGGAAGAACTTGCCTATTTGGCCCTA 
CTGGCGAGGGGGTGTTCACTATACTTTTCCACTGGTTTTCAGGTATAATTGTCGCATCTTCTTGTTGTGCCGCTAGCGCCAATAACGCAGGGTCAGCCTC 
ATCTTCTTCCCCTTCACCTTCATTGTCATCTTCATATTCATCTCCTTCTTCTTCCTCGTCTTCTAGATTTCCGTCATATTCACCCCCTTCTTCTCCCTCT 
TCATCTTCCTCTTCCTCCTCTTCCTCTTCTTCTTCATCATCATCATCGTACATGTCATTAGTGACATCTCTCACAAATTTACTTTGTTTTTTAGAAGATG 
GAGGATCTTTATTTCTTCCATTTCTAACTCCCGTGTCCTCATCTTCATCTTCGTAATCCTCCTCCTCCTCCTCCTCCTCATCCTCGTTCATTTCCTCCTC 
ATCCTCTTCGTCTTCATACAAGTTACTATAATCAACATTATTATTGATATCGTCCTCGTTGTCGATATCCAACACTCCCGCACCTTCAGTTATACGCAGA 
TTTGATGCCGCTTCCATGATGAAATTTTAGTATACAATATTGTGTTAACCGTGGAAAATCAAACACTACATTTACTAAGTTCCTTTCCTGAAAAAACGAG 
TTACCTCTATAGGGrAATTCTCAGGGTGTATATTCTATAGTTTGGTCTTGAACGTGCTTTTGGTTTTTGTTATAACTTTCTCTAAAATCTACTAACGGCA 
AGAGCATTGACAGTTGAGTGGGAAGGGGTGGTTCAGAGTAAGACCAAAAATAAAATATTTTTTGATTGTGTGAATTTCGAACCGTTTATATTACTATTTT 
AGGGTACTAGAAAACGTTGTCAGAACCAATCATGAGGTTTATCCCGATGTTGATAATTGCGCTTATCGCTGCATTTGTTATTGCAGCTCTTCTTACTGCC 
AATTCAAATTACCTGGATCACAATATCAATAAGGAACTAAATTTGACCCGTTCCCTTCAACTTCGGGGAACATTCACACCAGAAGATATAGCTCATAACA 
ACAGAATTCTCCCTTCCAAGCTGAGTGTTTTAGAAAGAGGGTCAATCATTTTAGCAGAAATGGACAAGTACAAAAATGCACAACCTACAGTAAATAATTC 
TCAAGAGAGGAGGAACATTTCTACTCCTCAACAACAACAACAGACCACCCCCTCATCCCAATCATCTTCCCAAGTTGAACTTTATIATTCTATAAGAAAAT 
GCTGATTTATGTTCCTGTATGTGACGAGTCTCGCACCGACGAGAAAGAAGATTTTAACAACGATGAGGAAGAAAATATATTAAAGGAGGAAGAAGTTAGT 
GGAGGAGAAAAGGTTGTAAATATCGTAGTAGGGAGTATTAAGATGATGAAACCTGGTTCAAAAATCAAMCAAAGTACAGGCTCTACTCCCTCCTTGCAA 
GTGCGATAGGGAGTAGAAAGAAAGCTGAAGAATATATTGAACGTCTATACAATTCCTTTTCTTCCATGACGTTGAATGAAGGAGGACGTCTGGCTTCAGC 
GATTTATTGTCCATCCTACAACAATAAAAGGATAAAAAACAATCGTTCTAGGCCTGTGAAGCTGATCCATGCCTCCAGGGAATTGTTATCTGAAACTACG 
GTGAGGGAGGAAATACTACGG7VAATCTCCCGCTTCTTCTTCTTCTTCTTCAACGTCATCATCCACTTCTTCGTTTTCTTCAATTTTTTTGTTTGTACCTT 
CTAATTGTACTTCTAAAACTGTATGCGATTTTGTAAAACACATACAGTATGAAGAAGATATTAACCGTTTAAGGTATAACATAATACATATATCTGAAGA 
AAACTATGCTTCAAGATTTTCTAAAATTAATAGATTAACTACATGTATACAAGGTATAAGTAAGACAATAAAGATTCTATAAAAATCAACATTTATTTGT 
ATCATTTATAAAATTCCATATTTTTGTATCCATTTCGTAGTCGACGGGTAGAGATGGTCTGTAGAAAGAAGATGCTGCTCCTTTTAAAAGGTTATTGGAC 
ATGGCTATCACATACATTTTCTTATTGTTTTCCCTATTGATGATATTTGTCATGACGTTGGGAGTGGCATTTTTTGTTGCTGGTGTATGACAAACGGCAA 
CATTACTGTTGTTCTTAACCACCTGAGCAAACATGGCTGTAGGGGATTATCTTTCTATGTCCTCTGTAGGGGAAGCAACGCTCGTGGGTTTTATGRTTTT 
AAATTTTATAAACTTTGTCACAATATTATCTTTAATAATATATGCCGTGACAGACGTTTATAGAAGATGCAAGCGGCCGTCTACCAATGGATATTCTGGT 
TGTACAACCAATGTGGTTTCTTCGACTTTACAAGAAGCTAATCTCGTTACCACTGAAAAAGATAAACCTGTTCAATTTGTTAGAGGTTTAGTCCCCAGAA 
AAATGATGGAAAAATATAGATCGGACTTGTCTCCTAAGAACGTGGGGGAATATATTTTACCTTCAGAAAAAG7VAACAGACAAATTGAAAAGTGATTATAA 
AAAGGGTAAGAAGGTTGGTCTTTTAACTGCCCTGAGTAATGGTCATGACAGTAACAAGAGGATTATAGGGCCAAGGGATCTGATTAGTAGAGATGATGTG 
AAGGACAAAAGTTATGTCTTTAAGAGATTGAGCAAAGATCCGCTCGTCTACTACTCTTCTGCAACCTCTAAATACGTTAGAAAATTTTCCCCTTTCAGAG 
CAAAAAAATTCATGACATCAACACAGTTGGGGAGTAAGCTCGTGTATCCTCACCCTATACGGTATGGTACTGCTTTTGTACTACCCACGGGATACGTGAT 
CAACAAAGCATACGGAATGGATAATGAGGATCTACACACTTGGAACCCACCCTCTTCTTCTGTGCTCGTCCCAGACTCTAATAATGATAGATTAACAGTA 
GAATGTGCTAAAACGGACCCAACACATAGAATCGGCATCTACGGCTTTGGAGGGTCTGATGATAATAGACGTGCAAAAGAAGAAGGTTATGTAGAAATGT 
TGTTATGTAATTGTGACAACCACAAGGACTTGCTTAAGGCTCCTCTAATTACAGAGTATTCTACAAATCCAACTGAAATTCAAGTAGATGTTGCTGCAAA 
ACGTGTTTTATTCCCTGCCCCTGGTTCCGAGCCTGTAAAATCTTCCCAAGTGACATCTGCTGCTCATCAACTAGACGGAGCTACTGGCGA6CACGATATT 
TCCCATGAGCCCGTGAAGCTATCAGATACGGGTGACTATGCAGTTGGATCACCCATTGTATTCAAGCCAGTTTATGGTACATCTTTAGTAAATCTTCCAG 
AAACAGGATCTCCTCTGGCATTGAACTGCCCCTGCACCGACAAGGCTGATGG7VATATATCAAGTCAATCAAAAGGGAGGGATATTATATAGAGATATGGT 
GGGGTATCTTAACGCCAACCCTGTGGAAGCTGCATCACTTTCTTCCTCGGACTCTTCTTCGTGGTTGACAACTGGTAACAAAATATCTTCTGTTACATGT 
GAAGGAGAAAAAATAAAGAAAATTGTGTAATCATATGTGTAGTTTTATTACCACTAAATTTATTGGTCTTCTTGGTACTGTTTACACTAGGGAATTTTAC 
TCTTCCTCCATCTCAAAAACTTTTCAAAAATTTTTCTGGGTCGCTCGAGTTTAGAGGGTGGACCGCTGGGTCGGTCTGATGTCAAGTTTGGAGGGTGGAC 
CGCTGGGTCGAGCCAATGTCAGATTGCACCAGAAATGACAGAGCTCCAGAAACGTACATTAACTTTATACTGTTTCTGGAACATGTATTTCTGGAGACTG 
ACACGTCCAGAAAGGGTAGGGCATAGTCTACTGTTTTCACTAGGGAATTTTACTCTGCCTCCACCTTAAAAACTTTTCAAAAATTTTTCTGGGTCGCTCG 
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AGTTTAGAGGGTGGACCGCTGGGTCGGTCTAATGTCAAGTTCGGAGGGTGGACCTCTGGGTCGAGCCAATGTCAGATTGCACCAGAAATGACAGAGCTCC 
AGAAACGTACATTAACTCTATACTGTTTCTGGAACATGTATTTCTGGGCACGGACGCACCCAGAAAGGTGAGGCTGTTAACACCCTTAATCCACTAGAGA 
ATTTTACTCTTCCTCCATCTCAAAAACTTTTCAAAAATTTTTCTGGGTCACTCGAGTTTAGAGGGTGGACCGCTGGGTCGGCCGAATGTCAAGTTCGGAG 
GGTGGACCGCTGGGTCGAGCCAATGTCAGATTACACCAGAAACACCAGAGCTCCAGAAACGTACATTAACTTTTACGTGTTTCTGGAACATGTGTTTCTG 
GAGACTGACGCACCCAGAAAGGGAAGGGTATAACCTACTATTTACACTAGGGAATTTTACTCTTCCTCCATCTCAAAAACTTTTCAAA7UVTTTTTCTGGG 
TCGCTCGAGTTTAGAGGGTGGACCGCTGGGTCGGTCTGATGTCAAGTTCGGAGGGTGGACCTCTGGGTCGACCCGATGTCAGATTACACCAGAAATAGCA 
GAGCTACAGAAACGTACATTAACTTTTACGTGTTTCTGGAATAGACGTTTCTGGTCATGGACGCACCCAGAAAGGGAAGGGTGTAACCTACTGTTTTCGC 
TAGAGAATTTTACTCTTCCTCCATCTCAAAAACTTTTCAAAAATTTTTCTGGGTCGCTCGAGTTTAGGGGGTGGACCGCTCACTCGGTCTAATGTCAAGT 
TCGGAGGGTGGACCGCTGGGTCGAGCCAATGTCAGATTACACCAGAAACGTTTGTTGCTCCAGAAACGATAACACACATTTCTGGAGCAGTAGTTAGTTA 
TTTCTGGAGCACGTTAAAATAAGTAGGGAGAATAATACATAACATTATATTTTCAATTCCATTTTATTGGTTCTATCCATCTAAAGGTGTTAGAGGAAAT 
ACTTTCATTGTAGTATAAGAATCCTTACTGAAATTTTTCAAATCATCCAAAGAGAATACAATCTTTCCAGTATAACTTAGAGGGTTCCTTGGTGTAGCTG 
TTTTCGTGTCCCTGGCGAGACGGTACTTATTATTGCTTCCCATCTTTAAATCTTCGCTTCTACTCCTCTTCATAGACCTGAATAAGGCTATGGATGAAAT 
AGGGGTCTTTCCTACAGGTTCGACAATAAGTCTTCCTATAGATGCGTTCCGTATCAAGTCTTGCTTGGCCTTCTTCACATTCCTGTCGGATGCGGCTTTA 
CTCCCACTAGTTTTCCATATCTCTGATTCCGTTTGCATCTCTCTAAATGCTTTCCAGRTCAATTTTTCTCTGTTAACATTTTCCACTTTGACATAATTAT 
TCTTCAACATGTTATCCACTAACATCTTGGATGAAAATGCAGATAAGAGAGGTAAGGGCCTCCCTGGTACAAAACTAGTGTTTGACGCATCAATCACATT 
TCTTCGTGCTCTAACAGCATTCCTTAAAGCCCRAAAGCTTGCAAAGGATAATAATGATGATGTATTTTCATCTGTTGGAACATTTCCCCATAAGTTATGT 
GTAGGAAACTTCAAATCTCCCCATCTTGCCAATAGTAAATATACTGCTCTGTCGAAATTGTTCGTTATTGAAGGATTGGAGGTGGCTGCAGCCGCGTACA 
ACAGGTAGAAAGAAAGGGCAGCAAACCCTTTAGATACATGGACATATTTTGCAGCTTCAAACACATACCCTTTCTTATCCACTAGGTGCATGTTACGCAC 
TTCTTCGTCCTCTCTTTTTTGTTTCATTTTTCCTTCAAAACCTATATCACCACCTAACCCTCCCCCTCTTAATAGACCCGAATCTGCAGAGAGTATCTCA 
ACCATCAACTCCCCTAACCCATTGACATTTTCTGCTCCTTTATTATTACCACCACCACCACCTAAAGCATCGAGTACTGCTTTCCAGTACGTCTCTAGTT 
CCTGTTCTCCCATAGACGGTTTACTTTGTTCCTTTAAACTGATGGTAAAAGTGTCTAGAGATTTTGTTATGAAACCAGATTCACAACGCTGGCAGACGAA 
TAAAGAATGCATATCTTTCAAAAACACTTGAAGAATTAACCTGCTTTTCACTGCAGAGCTATTAGTTTTCTGTGTTAGCCCTCCAGAGCATAAAAATAAT 
GCATTCCATAGCACATACATTGAAAATCGGGCAGCTGAGTCGGGATCAAGAACGCGGGTAACAGAGTAGTTGGAGAACATAACTTGTCTATTATTGACAA 
GAAAATACTCCAAAGTTTCTGGTGAAACAATATCGTTAAACTTGTTACATCTCTTGTATTCACGTTGAAGAGCTTGAAGGGCGGTAGGTGTGCGTTTAGG 
CCCTAAAATTCCGTCTATCGCCTTTGGAAGATATGGGTGGTCCCTAAAGAGATTCTCAGCAGTCATATCCGTAGTTTTAATTCGTTCTCTTTCCCTCTCG 
ACCCATTCTTTTTCCCTTTCTGTGACACTTTCGGATGGCATCTGCAGATGGGGATTGTTCACATATTTATCCAGTTTAGTTGTCAAAGGATTTGGTTTAG 
CGTTTCTGGTCCTGGACAGCACTGCCCTGTACTCTGCGTCGATGGCTGCTGATAATAGTACCATGATTTCCCTCCCTAAAAGAGCTTCATGGAAAGAACC 
TACCTTGGGGGGGTAGAATCCTTCCTGTATTACAGAAACCATGTTGGTTAAAACAGTCGATATAGTTGGAACAACATAAGATTGTCCATCTTCTTCACTG 
ACTGTAACTCCATCACTAAAGGACCCTTCAATAAAGTCTTTTATCCTTTTTTCAATCATGTCAGCAGCGCGTCCGTACTCTAGGTGAGGAGGCATGGCAG 
CTVAAGGGATCTAAACCCCTCTCTTTCATTCTAGAGGCGATAGATTGCTTCCATCTTAGAATATCACAAAGAGTATATCCTAGAAATAGAGCAATGGCTCT 
TTTTGCTCGTCCCATTGCCAGATCGGCCACCGTTGAATTGTTTCTGTTGGCCAACATTTCTGAATCAATTACCCATGAAGGTTCATTAGTATCACCTCTA 
GTGGAATAAAGAACTAGATTAGCTGCTGATCTAAACACACCGTCCATAATATTCTCAAAGGCATTCAAAAGGTTCAATAATGGTGCTTGTTTCAAAAAAG 
TAAAGTCACATAGTTCTTCCGCTGCAGTTTCGGTCAATCTCCAAGAGGGGTCAATTATATGCTCTCTAACACCTTCTATATCTCTGTAGACACTCCTCAT 
GACGGCATCTTTAGATAGGTGTCCTTTGGCTAATCTTAGTCCCGAATTAAAAAATTCTGTCGTTTTATTGAACCACGTGGGTCCGAATACGTCAGAAGAA 
ATAACAACCTTCCTTTGTTTTTCAGGTGTATTTTGTAAATTTGAAATGGCAGCAACTAGCCCTTCCGATACATAATTTAGTCGTTTTTCATCGGTTTTAG 
TTTCTGCTACTAATGCCAGAACATAGTCCTTTGACTCTGTCATGATTTTTGTGGTGCTGGTAAAGTAGTTTTTGTACAC6CTGCAATATCTTCGACCTTA 
AATGTCTTCGGTTAAAGGGATATTTATTTGTTGGTACTGTTTTCGTCAAGGCTGAAAAATAATTTCAAATATACAATGGAGTACATGGAAGAAGGAGACA 
TCGCTGAAAGGCGCTCAGAAGGTGTCGACTATATTCTGGACGAAAACTCTGCTTGTGTAGTTAATGTGAAGAGTATCCGTAACAGGCTCGGTGCCATGGA 
CGCCGAGGAGGCACAGTACGCACAGGACATTTCCGCCCAACTTGTCACCCATATTATCCGTCTGGCCCACTGCTCCGAATCCAACAAGATTAAGGATACC 
ATTGCCAGTATTGCGGGTCTTTTCATCAACAACATCTTTGACAACAATTCAACAAAGAACAAACTTAAAACGTATAATCAATTCAAGGCAGAGTCACAAA 
ACAAGTCTAGCGTTCTCAATATCTTTGGCTCTCTAGATCCTCTGAGTATGCTTTCTAGCTTCATGGGTTCTGATCCAGCAAAGAGTGGAGGGGAAAATTT 
GGACAAATCTTTGGGTGTGCTCTTTGAGGTGCTTCAAAATTACAACCCTTGCAAGATTGACGATATTGTCCTTCTGG7U\ATGTGCCCATCCAAGTGCGCC 
GCCTGCACCGGTCTCAAGGAAGCCATCCGCCAGGAACAACCCATGGAAGCAATGTTGTTGTTTTTCAAATGTATCAACCATAATAGGTTCAATTTTGGAA 
GCGACATAAAGTCAGCATACGCCTCTGAAACATGCATGAGATACTCTCAGGACGAACGCGCAGTCGTTGTGCCTCTGAGGAGTATCCTCCTCGGCTGCCT 
CGACAGGGATGATCCAGCTCATACTCTTTCCTCTTTCGGGGATACTATCGAGTATGCTGATTCAGATAACGCTTGGGTTTCTTCCCTGTTTGCAGCCGTC 
TCTAGAATGCCTATGGTAGACAGAGCCGTTATTGCTCACTTTTACGTGTACACAATGCTCAGCCGACATAGGCGAGTATCTGGAGACAGCTTCAAGCAGT 
TTGTCTATACCGTATTTGTTCGTATGATTTACTCTGCGATTGAAATTTTGTTCTGTGATACTGAAAATTCGTCTGTAGAATGTGATGGAAAGCACTTTTT 
GAGCTATGTTAATGCCATGGTTAACGTATCCGTGCTGGGTTCTACGTTTAACGTACTAAAAGCCTACCGTTCATGGGTGGTGGATCAAGCATCCGTCGCA 
CCGGTTCTAGACATTATTTCCGGAGGATGGAAGAAGAACTACCCCTCACCTGACCACATCAAGAGGGTGGCGTACGACATCTCTCAAGTCATCAATCATC 
TTGCATCACCTTCTAGAATGGTTAAAGGTAACAACAAGGCTAGCAACGTTACATCTGGCCTGGATAGTATCAGGTCTGTTCGTCAAGCAGAAAAATATAT 
CCCGTTTGGAATACTTGAAAATAAAGCAGGGTATGGTGTAATAAACATTGCCAAGCACAACATCAGCCGTCCAGCAAGAGAACAATCCAACGGAAGGAAC 
TTTAATTGCAACGCTTTACACATTCTACCTTCAATTAAGGGCTGTGAGGCACTTGGGGCACAAAAGGGGAGCGCAGATCAAACTGTCAATGTTTTTGATA 
ATTTTGTCGCATCTCATATGGATATTGCCATGAAAAAGCAGGGGTCGGGGAAGATTCTTGGACTGCTCACTAGCATGATTGACAGGCAAGGTCTGACTAC 
TTCATTCCCTAGTAGTGAAGCGGAATACAAGAAGAGAATCCATGATTTCACAAGATACGTCATCTTCTCTTCAACACCCATCAACGACGAACTAGTCAAT 
TCTCGCTGTATTCTTCCCCATTCTAATGTTCTGAACTCCCCTATCAGCTTGAGAAATATTGACCCAGAATCAGTCCCCGACACTCGATTCCACTTTCTGC 
TCATGATGTGGCAGCGGCCAAATATCGATGAACCTAATCTTTCTGCTCTAACTACAAGTCAGCTAGAATTGTTGCTTAGCAAGAACCAAAAATGGGACAA 
ACTCACCACCAGAGCGTTCTTCAATATCGACAGGATCAATTTCCAGATGGCAGACGCTATCATTAAGAACGTTTCTGGAAGCGGCTTCCTAGATGGGAGT 
AAAACTGCCTCTTCTTCTTCCTCAGCGCCTAACTTTTTCCAAATCTTCAGTGGTGCTGAATGCACTGCAAAGCAGCTCCAAAGTATTCGCAAATTCATTG 
GAGAATCTATGCAGCATGTACAAAAGGAATGGAGTAGTGCAGTAAACAATGGGAACAGAGGAGTAGAAAATTATGACGGACTCAATGCTCAGTTCTCTGA 
AGAACTGTTCGAGCTGCTCTACAAATTGATCATCGAGGAGGATATGCGGCCATCCAGCCTGATCGCCTCATCTGAATTCTTGAGCAACTACGTCAACGCC 
ATGGATGAACTTCTTATCAGAGCTAATGCTTCTTAGATAGGGTTTTTATGAAATAAAACAAATATAAATTATATATCATGCAATTTTATTTATACCTTTT 
TTTGTATGTGATACAATATTTTTAGTGTTATGACAGTGCAACCATTATATTATTATCTGTAATAGGTCTTGGTAAATTTTTAATGAAAGGAAGATTAGGT 
GCACAATCTATTGTCATAAAAATGTCCAGGTCAATTTCTTCAACACCTTCCTCAAAGTTATAGAAGAAACTTGTTTCATCATTTTTATTAGCAATAAACA 
AGTACGTTTTTTCCTTAGCTGTGCTTGCAGAGCTTATCCAATAGTCTGATGCCTTTTTAACATTATAACAGTTGGGGAGTAAACTACAAAAACAGTCCTC 
TGTTTTGTTTGTAGCGTATACCCGTACACGGTTACCTTCTACTACAAATGAAAACTCTATCCGGGCTTTTTCAATACGTCCTCCCACAGCTGCTCTACCT 
TGTTTTTGTAGCACGGGTTCTAATCCAACTTGTAGAGAATCCAGATTCGTTGCTGCACTAATAACATTAAATGTCCTCTGTTTCATGTTGACATGTACAG 
ACATTCGTCCTTGCCAAGTGCGCATATCACGATCACTGATAAAACCTCCTCGTATATTATTTATCACCAATTCCTTTGAGrACCGGGCAAGGTAATCAGG 
TGTAGGTTGAGGAGGTTGTTGTTGTCTAGCTAGTTGTGAAGAAGGTGTTGTTGTCAACATCACCATTTCTGCGTCTGGGTCTGTTGATCTCTGTAGGCTT 
TCATCCCCATTTTCATCAGAAGGATCAACTGTTACTGTACCAAACAATAAAATATTGGTCAAGTCTATTACGAGTTGTTCGTAAGTGATATTGGTAGATT 
TTTGTGCAACAACTTCATCCATCGTTTCCTTGACTTCGTCATGGTTCGTCCCTATGAGATCGAACAAGTTTTCATCAGCAATTGCCTGTCCGACATCGGG 
GACGTTTATGTCACCTTCAGGGATTGAAACTTTTCCATCTGTTATTTTATCCTTTATTAACCATACGTTTTCTGATTTGTATGTGTCAAATTCCTGATCT 
GCTAATTTGGTGAAGAATTCATCCTCAGCTTTTTCAAGAGTTCTTGCAAAATGCTCTCTCGTTTGACGTTGAAGCGCTGTCTCATTGATAACATCAACTG 
TTCCTCCCCCTCCTCCTCCATTATTATTATTATCGTCGTCAGTTTTTTCATCATCGTCAGTGTATTTAGAGTCTAATTTATATTTCTTACTCTTTTTTGT 
TTTGGCATTTGAATGTAGAGTTCCCTTCCCTACGCCCGAGATAGCAAAGAAGAGGGCAATAGAAGCTCCTATTACCAAAACAATAACCGCAAAAACAATG 
TATATTCCTCCATTGTTTGAAAGTGCCATTTTTAAACCCTTCAATTCTTATCTAAAAAACAAACAGATTGAAATCTATGTATAAATCATCACTCGCGTTT 
TCGATTTCAGGCATGTGAATTGTAATGAGAGAGGCCGAATCCTCGGATGGGTTGATTATGCCATTTGTATCATGACTAATATATATAGGATCATTTCTCT 
TTAGATAATCACCTATGTTATCAGAAGAATCACTATTTCTTAGATTTGATAATATCACTACACTTTTTGGTCTGTTTACAATACCAGCGAACGTGGATTT 
GCGGTTAAATTCTAACGTGAGAGTTTTGGCTGAAGAATCTATTTTGAACGAT7U\ACTTAAATCAATTAAATCCCCTTGAGCAGCTCCAGGGGGAGTGATT 
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GATAACATGTGTTTATCGTTTATTGGTACACTAAATGAAGTTGAAGAGTCGGTAAAGGCAATAGTATTGTACTTGAAAGAAAATGATGTATGATAAAAAC 
CAGGTATATATCTTCTCCACCGCTCCTCTTTTAAAATTTCAGTTATGAAATCTTCCCTGGGTTCCATGATCCGTCTATTCACATCGTATACTTTTTCCAT 
AACCACTGGTATTCCTTTCTCAAAAATTGCTTCACAGATAACAGGATTGAAAGCGTCTTCGTAAAGATGTTCAAAAGTCATTACATTAGTTAAATGGCTA 
TTGATTTCTTGCCTTACTTGATCCTCTATATATTTTATAGAGGCATTTTTTGCTACCGAATAGGCAATTGCACCTAAGCGATACTTTAATTGGTCACTCT 
TTCTTGTGATCCAAGAATTGTCATGGTGAGCAGCTCCCAATTGAGCCTCCACTGAAGCGATTTCAGCTTCAGTCACAGCAGGCTCATCTCCGTTAGACGA 
CATTATTTACTCCTCCTTAAAAGCAGTGATTGCAAACAATATATCGTCCCTTGTATCGCCTCTTTTTATACGTTTAACAGCGTAT6TTGCAGATTCTCCG 
TTTAGTATATGGGTGTGTTTATATCTAGATTTACATACAATATCAATATCTTTACTCTTCCCTTCAACAATCACATTAAAGAATATGACTACGCGTTCTC 
CTGTTCGATTAAATGCTTCAAAGCTTAGCACGTCArTAACTACGTCACTACGAGACACAAGAATCACTGCATCTACATCCTCATCATTTATGGAACGAAA 
CAATTTTGCCGTTATTTTAAAATCTCCCAAGGGAATGGAGAAATCGTACCTTGTAAATTCTCCATCGTCACCACCAATTCTCAAATTGTACGTAGCAACA 
CTACACCGCCCTACTTGACCAGTTTCTTGTTGAATTTTTTCGTCTATATCTGTGCAAGATATTGGGGGCCGATACCAATGTACAAGATTTTGCGTTAGTT 
TTCGTTTAGCCAAGTAATCAGTTATTATTTGTCTAACATCTATTCCTTGTTTTTCAATTGCCTTTAAAACACAATCCTTTTTTATGTCTTCTAGGCGAGA 
TCTCACATCCAGTTCTGCATCCTTCATGTAAACTCTGTCTCCATGTCTTACAGTGTTCTTCAACAATTCGTTTATGTGTTTGGTTATGGGGTGTATCAAC 
AATTCAGATAGATTAGAAACACCACGTCTTTCTACAATTTTATCAAGGATGAGGATCAACCTCTCATCGCTTATAGTTGCGCCATTAGACATGCTGTTGC 
TACTTGCTTTTGGTATACAACATCAACAGGTGTAAAAAACATACAAAAGAAATAATAAAAATACTTTAGGGGAAGAAAATGTTTAGAAGTATTTTCAACC 
TTTTATTGTTCTGGGGCGAAGATTATATACATTCAATCTTTTAACGTAGACCTTTCTTCGTCTTCTTCTTACTCCTACTTCTTTTTCTCCTCCTTTCTGG 
TCCTGGAACTGGAACTGGGGCTCGGTAACTGGAGAAGTTGTGTTGGTGGTAGTGACAGATATTACCTCCTCTATTTTTGGTATCCTGAGTAC7VATTTTTG 
GTTTTGGTTTTAATTGTACTGTAGAGGGAGTTTTCTGTGTTACCCTTATAGCCTGTTTCAGGAACAAGGGTAGGTGTCTATACCCAAGGGTAGCCATTGC 
TCAAGTGATGCTCCACCCCGTGCTTTTCCCCCTTATATACAATTTCTCACGTGGTCAATAAAGGAAACACAAAAAATATTCATCAATTCTTTTATTTGAT 
CAAAGAACATGTTTTACACAGTTTGGGCACAGATAATCAGGGTTAAGTCTCAAACCCCATTGGTTCATTACAATGTACTTGACAATATCAGAATGGTCAG 
ATTTTTGAGAAGCATTAGGAAGAGCTAAATTGGTAGAGTGTCTAAGAGCATGCAACGGTTTATTCACAACCATTGCCAGAAACATGGGAACAGATAGTGA 
AGAAACTTGATTCTCATCCGCAATATCCTCTGTATTTGCCCTCTTGTGGGATGAAGAGGACAAAAACACAGGGTATTTCAGATTACTAACCATCAATTGG 
TACACCATACAGGCTAACATTACAGCAACATAATCAACACCAGTTTTATCACTCTTTAGTGTTTTTAATGAGGAAAGTCTCCTCTTACATTCAGTATACT 
GGAGAGCCATTTCAAACAACATGTGTAAATGGTGGCAAGAATACATCAAATCCCTAACATCCATTGAACCCCAGGCATCACACAAATGTTTTGTTAGTGA 
AGAACCAACTCCTATTAATTCCCCTTCCTTATCATATCCTACAACATCCATATCAGGATTTGGTAAACTCATACCAATACAATTGTCAAAATTGATGGCT 
GGCCTATTCTTTGTGGCAGGAGAGTGCATCTTGATGTTAAAATCTCTCCCAAAGAATCTTGTGGTCTTAACAAAGGGCATTTCTACAAATAAAACTGAAT 
TGAGCATGCTATTTAGTTGAGCCCATGCGCCTTGTCTAGAACGAGCAATATTCATAGCCATTGTATTCAAAGATTTAGGATTAATCTGGCACATTTGAAG 
AGGTGATGGTAATTTATTGCACATGGACTCGTAATTTGCTGGTTTCTTACGGCGCAATTGTTGTTCCTCAGCATCACAACTATTGTCTTCTTCTACTTCA 
GTGATAGTAGCAGAAGAAGAAGAAGATTCTCCGTAATTTGTTGCAGCCTTAGCACAAAGACTCATTGAAGTGTACAAATGGAAAAAGGTGTTAGTATTCT 
CCTCTAAACGGGAAACTGATTTTACGTCTCTCAATTCTAGGACCTTGTTTCTGATCAAGTTTTTCATTAAAACTGATGGAAGAACGGATGATACGTCACT 
AATTCTATCCTTCTTTTTGTTGTCACTGGAATGCCCACTAAAGTCATTCCATGCTTTCTTGTGATGATTAAATTCGTGTTGACGCTGTTTTATTCCATGC 
TTTCCAAATTTGRTACATCTTTCCATCATTAAAGATGAATGGATATACTTCATTCCTCTATCCAAAAGCCCGATAATTTTCTTAACATCCTCATTCTTAT 
TCTCAAAGACAAATGTTGTATCACAAGGAAAAACAATAAGTCTTTTAGACATAGCTTCGTCGACAAAGAATGAGAAGATGTAACTGTTGGACATCATAAT 
AGGAGACACAGTAATGCTGCTTTTTGCGCCCTGTTGGAAAATGCCTCTAATCATGACGCTGTTTTGGGGTGAAGTCATATTCTTTAATAAACTGGAGGGA 
AATTGGCCCGGAAGCATATGTTTGGTTATAGAACTGGACACATCATTCTTTTGGGGGATGGCCTTAGCGATTTCGGTAATAATGGAAGAAGATCCGTCAC 
TGTTATGTTTTTGGAGTGCAGACACCAATTGCCTTCTTTTAGGCGTATCGACATTGAAACAAGCCTCTTTATTACTGTCTCCTCCACCACTAGAGTTAGA 
AGCAAAGGAAAATTGTGAACTTTGTTGCCTATTTTCTCCAAAAACACTTAAAAAGTTTGCAATATCACTGCTGTTATCGTCTGTTTTAGGTAAAGAAGAC 
GTATTTTTTGATGAATTAGAGGAAGAGTCGTCCTTTTCGTCAGAAAGCAAATTTTTATCCATACACATGCGAATAAAGGATCCAGAATTCTTAGCCGACA 
TCTTATCCTTCTCAATACGCTTCTTTAAGCAAGAAAGGACATTTTCGCAGTCAGAAGTGGAAGTTGACAGAACCACTGAACCATCAGAAGTAAATCCAGG 
AAAACTATATTTTGACGGTTTCTTAACCACATCGAATAAAGATGCGTCATAACGTTCGTACATGTTCATAAAGTCCAGATCTCTATCCACTAATAGTTTT 
GCAGCTTCTGGGTCCTTTATTATGGACGATAACTTGCGAGAAATAGTCTTGATGTTACGAATACTGTCAATATTAATATTCGACAGAGTAGGGCCCAGAA 
GAGCCTTCTCGGCAGCATCATCGACAGCTACAAAGTGTTGTTTCTTGGGAGGGGGAAGTAAAACTGGGTAGTTTGTTTTTGAAAGATCATCAGAAATTTT 
TCCAGTGTAAACGCAATGGTACGCAATCAATACACCAGGGGCGGATGTATTACCATCAATAACTGACTTTTTAACGCCCCCGTTAGTCGACAGGCAATTA 
TCCCCCAGGTATTTTTTAATTTCGAGCATGTCCTGGGCAGACAAAGATGACGTGATCATCTCGGAAAACCTTTCAGTGGCAGGGTTGCAGACTAAAACAG 
GCGAACCGTCGCGATCAAATGATGTATTCCATTTCAAAAGGCCATTGTTTTTCTGTATTATGTTCATAATCGCATCAACTTTGTCTTCAATTTTCATATT 
CTGCTTATTTTTCTTGTTCTTGCTCATAAATTTCTCATTCCATTCATCTATATCTTCTTTACAATCGTTCCAGGAATGAAGATTATTAACCTGGCAGTTT 
TCATCAATAATGGATTCAAATGATTCGTTATCGCATATTTTGGATCCAAATTTACCTAATATGCACTTCAAGGCAGTTTTTACTTCGGATTTAGAGAATT 
TGCCTTCATATTCGATAGGTTTCTTACAGTGCCTGTACACAGCATCGGTTTCGTTAAGCCAAGGAGAATCTGTGCAGTCTTTAATAGAGAAAGGCATGGC 
GCCTCTATGGAGCATATTATACACAATATTTTTCTCTTTAAAGGTAGTTGAAACAACACCAGAAGAACAATTCCACCAGTGAGCTTTTCTGGCAAAGAAA 
GAAATATTGATAGGATCCTTTATGCGGACACCATTTTCGTCATTTTTACCGTCGTTTCTCTCAATTTCCTCACATTCTACCCCTTCATCTTCAATTTCGC 
CACCTCTTTTACGTTTTTTTGTATCATTACAAGCATTTTTCCTATTCTTTTCGTGTTTGAGTTCAGAAATAAAATCGTACTCTTGGTCGCATACTTTTAG 
TTTTGATTCGGTCATTAAACGGTTAATATTACTAAAAACGATCTTAGAGTTTCCTCCGTCACCGTACGCGTTTCTTGCTTCATTGATGTCAACACTATTC 
ATTTCTCCCCCACTGCCGCCAAAATCGACATTACCACTTCTGGAAGCAGACTCGTCCTTATCGTCTTCGCTCGATGACTTCTTTGAGCCAAATGAACCTT 
GGTGGCCAACTTCGTCAATAATCACGATACCCTTTTTGCCAGCAAAGGAAGCCAAATTTGCTTGAGTAGATGAAGAAGAATCCCCAGCATTATTGAGAGC 
CGTAACGTTAATATTTGTAAACCTTTCGAACCAAGTCCTCTTACACAAATCGACAAGAGTGGTCTTTAAATTGTTTCCCTTTGTTCCCACAAATACAACA 
GCAGTTTTTGTGGCAATATTCAAGGCAACGGAACAAAGAGTTTGAACCATGTGGTGGGCTTTCTCTCCATCTCCTGAAGCAATGGCTGTGCACATACCTC 
CAAAGTTCATTAGGCCACGTAAAAGATACACGACATCCATTCTGGGCATGCCAGTTTTTTGGTCAACACATGAGATGTAACATTTTCTTAGCGCATGAAG 
GGCTGTAGTTGTAGAATTAACATCATCCTTTTGCTTCCTCTTCTTATTACTACTACTACTACTACTACTATGTTTTGAAGAAGAAGAGGAGGAGGAGGAG 
GAGGAGGAGGAGGAAGAAGCCGTTTCACAATCGCTAGTGGCTTTAGCAAGTTCGTATAAAATTGATAAAAATATATAATCTTCATCAGCAATAAAAATTG 
ATGGAGGTTTTAGAGAAGAACTGAATGCTGTTTTTGATACAGGTTCGTACTTTACAACGCTAGTATTTTCCGTGTTTTCTTCGTTCTCATCCTCATCTTC 
TTCATTTTCGTCATCATCATCCTCATTTTCATCGGTTTCTTCCTGATCTTCAGCCATAGACCAAAAGTTTTTGTTCATATTACTGACAGAGTCGGTATTT 
TTCATATAAGATACGTGTTCGATTAAACAAAGTCCTCTACCCATTGTAGTGCCTTCCATGGACGAAGCTAGAGCAGCAGAAATAAATTCGGTTGAGAATA 
AAGAATTCAAAATAACGTGTTTTTCTGCCTTTAATTTACACCTTGTATACATGTCCGTGACCAAATTTATCCCTTCAAGGCGTCCTCCTAGAAGATAAGT 
ACCATCAATCAAAAAGTACCTGTTATTATCAACGCTAATACAGGCAGTGTTTTCAGCTAACCTGAAGGGAATTGCTGTACCATTACTAATAAACGTTCTA 
GCATTTCCCATTTTTTCAACTTCAGCCATAATTTTAGAAATGAGAGCAGTTCCTGCATTTGTAATACAAAGACGTATATTTTTGGCTCTTTTGAGCATAT 
TCTTGGCAGGACACACTGGATTAGCTAAAAATGAGCATTTGTATGCTTGTTCCGAAAAAATATTCTGCTTGTTGTAGTCTGACAAAAAATCGGGACAGTA 
TCTATCGTGGAACAACTCGCCTTGTTCTGACTCATTGTCAAACATTGAAGGAGTTAATCGCTCGCTGAAACATCCCCCGTTTCTAATTTGGAGAGTAGAC 
TCGATCGCCTTCTTGTTATTCTTACCCTTTGATGATGAAACAATACAGGCCCAATCAGTAAAGAATATGCGGGCAAATACTCCTCCTGTGCCTTTGTTCA 
ATAAGAAATCGCAACACTTCTTAAAGATTTTTGTTCCTCTAGGAGCGACAAATTTACCCTGCACAGGGGAGTTAAAGAAGGAATTATCCAAGTAGAAACC 
CTTATCTGTTCCTGCTAAAAAGTTAAAATTATCCTTCTTAATGGCCAGACTAGATTCGTCCAGACTTTTCCCATTAATGACAGACCGCATAGAAATTAAC 
ATCCTCTTAGCATCCATTGAATCTTCTTGCAGAAGGCAAGTTTTATCTGTACCACCTTGATCGTCATCATCATCTTCATCGTCTGCGCAGCCAGTATCAA 
TGCCAGCATATAGACCAGAATGGTCCTTATTGTAATTGATGACACGAAGCACGCGCTTATGGCCATCAGAGTTGGTACTTAAAAACACGGCAGTTTTAAG 
GATACTGACAATTTCAATTTCTGATGAAACAAAGTTGGGACATTTCTTAGTTTCTTCTTTACCAATGAAACAGTAATCATTGTCTTGTTGCCGATAACAA 
GATTTGTTGAAAAAATCTCCrCCTCGTCTTATATTTGTCCCTGGTGTAAACAACACTGAACAACTATTAGTGATCCAAGACGTACAGAAATTACCCTCGG 
GATCTAAACATGAAGACAAAACTTCACGGGCGTAAACAGCCTTAGCC/^AACTAGCTGCAGAAATTTTTTCGGGAGAAGGATCAGAGG/W^TTATTGTCTTT 
AGATTTAGCAGAGAAAGTGCGGTGCAGATTCATAAGAGGGAAAGTGTCCTTCAGTAGTTGGCCATAGTCTTCCACACTCAGGAAACCAGAATGGTATACT 
GAAAGTGGAGGGTTTGAATCGGTAACGCCACATGGAGCATTATTCTTTGTCATTTCAAAGTGGCTATTCTTTGAACTATTGGGATGAATACATCCCCAGT 
TACACAAAACGCTAATGTGACGCTCGTACAATTTAATCCACAAAATGTTTAGATTTTTAAACAGCAATTTTTTAAGCAAAACTATCTGGTTCTTAGGGGA 
CATACCATTGTTACAGTTCCACATCTTACCCAAACTTTTATAGGTGTTGATATTGGATATTTTTTGGTGGGTAAAGTATTGCTGGGAGAAAGTCTGGTCA 
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AATGAAGATTGGTTAGAAGAGCCCAAGATAACGTACGCGTCTTTAGAATCGTTAACAATCTTAATGTTTTCTAATTTTTCAGTGGAAGTAGTTAGAGAAT 
CATCATAAAGAGACAATAAATCTTCATCATAACCCATAGGAACATTCTTTAAAGAGTCTTTTTTCTCCAAATCTTGAATTGAAACATATTTCACCGATGA 
ATTACTACTGCTATTAGAGTTATTGTTAAAGTTCTTCCAGGGTCTCTTTCCAAACTTCTTTTTTGACTTGTCCTCATCTCCCAGGGCCATATTCTTAATG 
TAACAAATTGGCATAACTTGAGACCAGTACGAGTCAGAAATATGATTGGAGAAATAGCGTGTCAGGACCTTTGTTTCAGCAAAACCTTTCCCCTTGTACT 
GGATAAAAGTTCTCAAGGGACGGCCAACTATTCTACCCTCATGGCCTGTATTTGCCAAAACACTGCCCTTTCCTTGCTTCTTTTCCGAAGAATTGACAAC 
TTTCTTGTAGAAGTTAAACCAAAACGAGCAATCACTTCCTTCATGTTTCAATTGGTCTTCAGAAATTTCATTGGTCTGGTTAGCTTCATTATTATTATTA 
GAGAGAAGGTGACCTGATGAGGATGATCCACCCCCACACCCACCATTTCCTCCCTCGCCAACAGAAGACATGTAATCGCCAGGACATATGATGTTCATAT 
CATAAACGCTTCCAGTATCAGTCAAATCGCGGCACTCTTTTCTTATCTTGTTAGTTTCTTCCTC7LACTGCCTTATTCCACCTTTCAATGGCATCCATTCC 
AGCACTTAACGCAGTTTCAGTGACACCGTACAAAGCATACTCTTTTTTCGAGAAAGGAACACAAATCTTAAAAAAGTTCTCAATGTCAGAAATTACTTTC 
ACAATGTCATTATTTATCTGATTTTGTGTGATCATGTTCTTAATTCGGTTAAGTACATAGCCATCGTACGAAAACAAGCCACTTTCTCTGTCTTCTTCGT 
TTCTGGCGTCCATGTGTATCTGGCTGAAACCCGTGCGGGATCTCGAGTCCTCATCAACTACGGATGATGATCCCCGACTAGAATCATCATTGTAAAAACC 
ACCCGCGTTATTAAAACCTCCTCCGCCAGATTTCTTTGATCGTCCCCTATTACTTCCACCCTTTCCTGTCATTTGTTTTGCACCACTATTATCCTCCTTG 
TTAATCTCTTCTAGAGAAGGATCGATAAATGCTTCTTCGTTAGCGTCCATTACTTTAGCTTGACCGGCAGGAATGTTGTGGAAAGAAATGTTAGTGTTGA 
TGATTCTAGAGGACTTTTCTACAGCAGCGAGTTGGGCAGTTGTACAGGAATTACTAACCAGATTGTAGTTAGTATCGATTCTAGTCTCTTGCAGTTCTTG 
GCGAGTCTTGGCGCGGAACTCGTCCAAAGATTTAACACTGACATCATTAACGTCCAAAATACAACAATCCAACAAATCATTTGAAGAAGCACGCACAGAG 
TCTGCCGCTTTTACTAAAACAAGCCTTGCAATACGTTTCATTACCTTCCCAACACGTTTTTCATACATATCATTGCTTTTTTCGCTATTTTTGTCGGCCA 
AAGAGGAAACACTAACCAAATTCTTAACTAGTGAAAGGGGCCTAGAAAGAAGCCTGCTTCGCCGTCCCATTAGGACGTGAGAAAATCTATAGGTGATAAA 
TTGCAACCAATTACTAGTATCAAATCCATTAAAATCAAACGACATGAGAATCTTTCGGGTCATTTCTTCAATGACAATATCATTACTTGCAACTTGTCCA 
ATAACATATGCGGCAAAACGGTAAAAGTGGAGAGGATCTTCTTCAATATCGTAAAATCTACCTTCTTTTATGCTAGCGTCGGCCTTTTTGGGAGAATCTC 
CCTTGCCTGTACAGCTGCACCACTTGCTAGTGATTTTAAAGCATCGAATGAGTGGTACCATTTTATCCACAGATAGACGCAAAGGAGCAGTTGCAATTTC 
TCGCACACTTTCTCCCTGCATCATGATACGATACATAGCGACCATTGCTAGCACACAAGAGAGGGACATTTCTGTCTTGAGAACCTCTTCTGTTGCAGCC 
ATCCACTCTGCCCGACTGCGCAAAGTAGACTTGTTTCGTGGATAATATTCGTCTTCAACCCAAATTTGATGCCTAACATCATTGCTAGGGGTGGGTTTTA 
TCTTCAATTTCTTAATTACGTTGTTATTTGTTAGCAAGTTGCTTGAATTTATGAAATTTTCAACAGATCTAATGTCGTTGTCTTTGAGAGCAAACTCATA 
GGCCTTGGCGACCTGCGCTTTCCAGTCCTTATTGTCGTCATTCAAAAGACATTCCCACGTATCAGCATCCAAATTTGTGTCCAGATCTTTACCAAAAAAT 
GCCCTCGCGCTAACGCCCAGAAATTCTGATAGCTCAAATACGGCGCCAGATTCGTCCACATTTGACACACCTGCCTTTAAATTTAGCATGTCACTACCGT 
CCCAATCCTTAACTAAACTTCCCCAATCAATAGGAAGGCTGTTATTAAAGCACTCCCTAACAGTCTCTAGAATTGTCCACAGTCTAGAAGCTGGACTATT 
GGTTCCAGAACCGTATCTTGTTATCAAATTTGAAAAGAACTCAGAATCATTACTTGTTTTTCCATTCAAAGCTGTACGCAT6ATACATTCTTCCATCATT 
TTAAAGCACTCGTATATGTTAAGTTTTGAAACGTTATGGTCCTTGAAATAATTTGCTGCATCTAAACACATCTGTTGACGTTTTTCTTTAAGTTCCTTCT 
CAGAAAGCGATTGATTAAACACTTTAGTTGGTTTTAGAGGATCAACAACTGTAGAACCCAGCACAACAGACTCTAAATCTTGATTCGCACACAATGCCCA 
AATACCAGAAAAGAAACCATTATTAACACTCTTATCTGATACACCATCACCTCCCGATGAACATGCCAATCTGGCTAAACTTCTAGGTCTAAACGTGTTA 
ATGAGTTTGTTATCTTCACTGATCACATTTTCATAGATACCCAAAGGTAATGCTGAAGATAGACATTTTCCTGCCAATTTACACTGTTTTATGGCATCCC 
ACCTCTTATTATCTAAAATAGACCTAATATTACCATTAACTTTGTCCTTTTTCACAAATCCTAAAGCTGCAGTAACATCACTAAAGTATCTCCTTTCACA 
TGCCATTGTGTTGAAACGGAGAGCTACAGTCAGATCTTTCAGATGTTTAGAACTAGGAATTGGGAGTAGTTTTATGCACTTTTCGTTAGACATGGTATTG 
TTACTACTAAGGGGAGAATCGCTCTTGAAGTTATTAAGAGAAGCACAAGAACAAAAGGGAGGGATATTGTTGTTGCTGTTAGCGGTTCCGTCTTCACTAT 
TGACATTAGGCTTGACAATTTCTCGTACAGGCATAGTAAAAGGCACTACCCAATTCCTCATTCTCAAAGATCTTCTATTAATTTCTTTCAAAATATCACC 
AGATTCAGCTGCAGAATTGTCCAATGTTAGTACATCACACAAATTTTTCTCCATTTCCTCCACAATGACAGTTGAAACATACTTTTGGCGATGACATAAA 
CTGGATCTAAAGTTGAATGTTTTGATGAAATTAACATCCCTAAACACATCATCTTGTACGTTATTAATACTACTAGCATTGTCAGATGGTACATTTTCAT 
CATCACTAGAACCAAATTCGCCTAGAAAACCATAACAATCATCTTCATCATCACCACCATAATAATCACCTGTTGCACCTCCCTTGGCAGCTCCATAATT 
CTCCTCATCGTCACTCTCTTCATCACTTTCTTCATCGCTTTCGTTACTGGCCACATTGTTCAGGATCTGGCGAGCAGAAACGTCGTCGTTGTCACATTTT 
CTTAGATATTCTTGTTTTTGTTGATAGAGTTTCTCCCTCTTCTCCGACTTCTTGTCTGCCAAAACAGCTTCCTTATCAGACTTGACGCTGGCAAAAAGTC 
GTTGGAAAGTCTCCATGGGGGCGCATTCATACATTGGTCCAAAAGACTTGTCAGATTCATCATT^TCAAACGTTTTAAATGTATCGTTGTTCAATTCGTA 
GACCATTGTGCTTTCATCATACTTAATTTTGCACAATTTGCTGTTTGTCTTGTTCTTGATCCTCTCCATGGAATTTTCGTATGCTGCAACTCCCATTCTG 
CAGACAGATTGCTTTTCAAGCACACTTTTGGGTACATATATGCTAAGTTCTCGGTTAATTGCGTCGCACATGTGTCTAAACTTCTCAAAACCTCCTTCAA 
ATTCAAAATCTTCACGAATTATTTGCTGATAGTACTCTTCTGGATGAGAAGTGATGTTGTACAGCGCACAAAACCGCTCAGCAAGAGTAACGACTGTCCA 
CGCGAATATGGTGTCGTTCTTGTCAGCACCCGTGGCGAATGTTGACGTGATGATGGACAAGAGCTTCTTGATATTTTCCAAAGAAGCTATCTGGTTTTCA 
TC AG AT GAG G AAGTGGATGTTGCACCTTCTTCT AG AC T AG AAACC AT AT CTGGTGTTGTTG ATGATGTTG ATGC AGC AGC AGC AGC AGC AGC T TC T TCTT 
CACTTTGAGTAGGTGGAGGAGGAATACTCTCCAATTCTTCAATTGTAGGAGGTTTTAGTGATTCTTTGTGGGTAAATTGCCACATTTCTGCAAATTTGCC 
CTTTGATAGTGCATCCCATGTGTTATTCTTGAGTCTTTGTTGTAGAATGTCCGTATCTCTGGGCAATTTCAGGTTTCTGTTCTCATATCTCACAACAATT 
TGAAGGATGGCGAAAGGCATGATTGTGTCAAACTTGATGGTGTTAACGCCAGTCAAATTCCTGATAATGTTCTTTGTTTCTGGTTTCACGCGTTCAACAC 
AATTCTTCTTGCTCATGGCTTTCTGGGGAAGAAGAAACTCCATTGCCAGAAGAGGAGAGTTGTCAATGTCACTCAGAAAGAAACCGTGTTTTCTCTTGAA 
GAAACACACAATCTGGTCAAGCTTATCATCGTCATTGATGATGCTATTATAGACATTGGCAGCAAGGGGAGGGATTTGTTTAGGGGCTGGAGGAGGCTTG 
TTGATGTTGTTCTCGTCTCCAGAACCGCCGCCAGATTTCCCTCCAGATCCTCCTTCCTCATTATCTTCTTCCTCGTCGTCTCCATCTCCAGAATCGTCCA 
ATCCTTTACACGCCATCTTCTCCTTTAATAGGGTAGCATTTGTCTTGGTCAATAACAAGCCACCTCCTTCTGCCAGAAACTTGTAACACATGCCAATTGT 
ACAGAAATCTTGATCCATAATCAAACTTTTGTTTCTGTAAGGCATTTCTGGCCCAAGTTCAGAAAAAAGGATCTTTCTTCTCCAATCGTCGGTATTGGCC 
ATAGAACCCATGAGGTTTTCGAGATATAGACACTTTTTCTCGAAACTTTCCAGTGTTTTGCACATCCACTCTTCGGCTCTATCTGCCTTCACGACACCCA 
GAGAAGTCACCATGTTTTCGTTGCTGATGGAGAGAATGTTCTTCTTTTCTCCCACCAACTGCGCATGGTGATAGGGGAACGCAGTGTCGCTCACTTTACA 
ATGTCCGGGAAGATTCCAGTTATCACTCCCGCCAATATCAATCTTGAAACAGGCAACAGTTTTCTTGATCATACTTTCAACACGTTCGACGATTCTGGGC 
ATACATTCACTCAGTAATTCAAGTTGTACCTTTAAGATTGACTTTAACAACACCACATTTTCTGTGTCTGAAACAGCTGCAAGACGTTCCTCTTCAAGGA 
TGGTTGGGTAGAATATTACCTTGAGCCCCATCAAAGAGGCATTGGTGATGAAGGCAACCTTAAACACTGGCACGTCTGTTGCTGTTCTTCCACAACCCAG 
AAACTCTCTGAAGGCAGGAGTCACACATAGTTTCTCTGCACTAGAAGTTTCTTCCTCATAAAGCCTCATCATAATAATAAGTTCCCTGAGAAGCTTTCCA 
GTTCCAACGTGGGGTCCTTCTTCATCCATGCGCCTATTTTCATTCTGGCAAGAACGCTCATTCAGCTTGTCCAGAAATTGATCACCAGTAATTTCTGGAA 
CCTTTTTTACTTTGTACATGCCTTTCTTCTTGGATTTTGCAACATCCTTTAAAGTTGCAGGTTGACTGTACATTATCAATTTTCCATCGTTTCCTCTGTT 
CACTGTGTCATATTCAACAACATTACATCCAGCATACAGAAGATTGTTGGGGTTGACACTGCCGTCTCCGCCGACTCTATCTTGTACAGCTAGATGCCCC 
ATTTGTTTGACCAGAGGGTTGCCATTCTTGGTCAAGTTATCAGTGACTAGAATTTGTTCCCCTrCGTTGAGAAGGAGAAGAGGTAGACGTCTGTTGTCGT 
ACTCGGGCAGTTTGTTGGAGAGACAGATGCGTAGAGTCGTTGGTTCCTCTTTCATCTGGTTGGCACATTTTATTAAGTGATCTCGCACATCACTTTCAAG 
AAACTTGCCTCCTCCTAGGTTGAGCACATTTGCCTTAAACATATTGACGAGTGCAGTGTGCTGGTTATAAGTTTTGTTTTTTCTTGGTTAGTTATTTTTC 
TGAATGTAGGATAAAATAACGTAACTTTTCTTGTGTCGTTGTTCACCAAGGAATCTTTTGATATCAATTACAGCTACAACTGGTAAGGGGGTTGTCCGAC 
GACCGTCTGTATCCTACGGTATCCAGTGTCGGACTGTGAGGCAGAGAGCTTGTGAGGGCGTATTTATACAAGAATAGCAGCTGGTGCCAACTAGTGATTT 
TATAGAACAGAAATCATGGACCAGTACCCAGAAGTGAGGGATACTCCTCAGACAGAACAGGAACAGGCGGCGGCGCAACAACAACAAGCAGCAACAACAA 
CAGCTGCCGCTGCTGCTGCCGCTGCTCCTACGCAGTACAGTAACACTGTTTCTGCAGAAACTTTATCCGCCATTTCTGAAGATGGAAAATTGGAGAGGTC 
AATCGCAGCTTCGTGCTGGATCAATAACCTTAACCCTGATGAAAAAATGGCTCAACGTGTCCAATTCCATCCACTTAGTTCCACGACCACATACGATTCA 
GAAAATGTGAACCCTGGTAGTTCTGTTGTGTTTTTGAAGCCTAGAGCCCTCCCCACCGGGGGCACGTGTTTAGCCCCCAACTACATTGCAGTGCCTACTC 
TTCGTGCTGCGTCAGAAATTATCGATTCTATTGCATCAACrAGTCTATACCAATGTTCAATGTTCAATTCGTGGAATCTTATTCCCATTTTTATGAGTAA 
TAGCAAACATAGTCAATTCGGTGATCGTGTCATCAAGAGATCAATGATCCGTAATTGTTTTTCCAAACAAAAAAATGTAGAAAATCTTTTAAAGGAATTG 
CGCAGGAGAAAGGTTAACGCCGCCAAGGCATTTTCTCATGCAGTTCAACAGAAATCAGCAGTAAATACTGCCCTTGCAGCATGGAACGCGGGAAGTGCTG 
CAAACCTCGAAAAATTAGTAGATTTTTGCAAATTGAAGTACAGTCCCGATAGAAAATACAAAGCAGGAGGTCTTTTTAGTGCGTCAGCAACAGCTCAGTC 
ACAATCAGGTACTTCTTCTTCCTCAGTTGAACATACTTCTAATGATTTCTTACTCGATATCCTAAAACGTCACAAGGGAACGTCCCTTGATTTAGATTCA 
GCAACAAACACATTTGATACTGCTCTTTCTAGGGTTTTTACCGAGTTTAAGGAACAGGCTAGAGCAGCCGTAGATGCAGCCGCAGATTCAGATCACCTAT 
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CCGCGTCGGACCCTATTTTCTCTATTGTTCGACACAATAGCAGGCGTGAGGGTATTCTAGATTCAGTGCCAAATATCGGTATGTTGGCTCCACGATCAAA 
ATATTCTGTCGCAGAGTACTTGATGGCAGACAGAGACGAGTCTGCAGATATCGCTGCTAAGATAGGGACAAAGATTGCAACAGATTTTGAAGCATTGAGA 
GGAGATAATAATAAGAGAAGAGCAGATACTTCAGTCGATGATCTCAAAGAATCACTAGCGGATTCTATTGAAAAAACTTCCATTAAAAATACCGGCGATA 
TTAATTCGGTCACAAATATTCCCACAGATACAGAGGAGTATGAATTTTCTCTCCACATCACACAGCTTTTCGCGCAGGCATTTTTGGAAACTATGGGAAG 
TTTATTGAGCTGTGCTTTCGGTGTACAGTTCCCCTTCTCAGATGAAGGTTTTGCGGCTATCGAAAGGATTATACGTAAAACAGATCCTGATACTGGAAAG 
GTTTCAGAAATGGACCCTTCCTCTCTTTCAGATCAATATCTCTTGTTGGTAGGAAATTTCCAGGTTTCTCCCTTCCACGTTTCAGATCCTAAGGATATCG 
TTTTCGGTAGACAAGTTACGCCTAATACGCCTATTTTGAGTATCATTACTAGATCTAAAAATGATAAGAATGAAACGTCTACTATTATTAATTTCAGGGA 
TAGGCTTCTTGTTAACGATACCGTGTTGAGAGACGCCACGCAAAACGTCTCAACTTCCACCCCTTCTCAAAGAAGAGTCCCCACCGCCGCAGGTGAACCA 
AAAAAGCCAATGCTCTCTGGTTGCCTACCTATCATCAGGGGACCCCAGGTAGTGACCAGGGAAAGTGATGATATGATTAGTGGTCTTGTGGGTGATTGGT 
ATATCTCTTTGGGGGTCTATTATGCCATGGGTTCATCGGCTGCAGCTATTGCCGCTGGCCACCAGAGAGCTCTCGCTTCAGCCGAGTCAATTT^TTCACC 
CATGATGAAAAAATTCTCAAAGAAGGGAGGAAAGTATACAGAGGAGGAAAAGCGAATCAAGAAGGCTATGCGTAGGAATGCAGATCGTTCTGCTAGGATT 
TTGGCTTTGTTGGGGCAAACTGATGCCCAGTATGGGTATGTCGAACATAATTCTACCCTAGATTCGTTCTGGTCTTCGAATGCCGCTATTCGTGCAAAGG 
CAAAGGAGGATGCTCTTAGCCGTGCAGAAATCTTGGCAGTTAGGAAACAATTGGACGGAAAATGTTCCTCTTCAAGGGATGAATATTCCATGGTAGAGAG 
ATATCTTAGAGACTCATTTTTCAGGTCAGTTAATAGGTCAGGAGGAGGATATGAAATGTTTGATCAAGGTTTTGATATGGGAAGATTTGCCGACTTTTTG 
AGTGACAACTCTGCTGCCAGGAACGCTTGGCAACAGTACGCAGAAGTAATGAGAGGACTTTCTAAGCATGAGAAACGCGTATTCAACATTGAAGGTCTTT 
TCAGTGCTCTAAATTCTTTCAAGTTCCCCCTTGTTCCAGAACAGGGGCGTAAAAAGACTGTAGGTGGAAGGCATAGGCTTAACAATTTGAAGGCGGCCAA 
TAAGATCATTAACGGCATCACAGAGATGACTCTCCAGTCAGCCATCGATGGTACTGGAATCTCTGATATTATTGGTTCAGTTAGCGATGGGTGGGGAAAT 
ACTACAGCTCAGCCGTCTCGCGTCAAAGCTCTTAAAACATTGTCCAATTTCAGCGGAAACGGAAATGTTGTGTCAATTCCAGTTAGTCGAGCAGTAAAGT 
GTGCCGCGGGAAGTCGAGGTGGGGAAACTCTCAAGTGTGTGGACATTCCTTCAGTCATTATTGCAAACCTAATCTCTGATAAGAGAATTCTAGATCAACT 
TTGCGGAGGAGGGATGAACCTCGCTCACGAAATCACAAACTTTATCGAGACGATTGCAGGTAAAGAACATACAGGAAAGGAATCCGTTTTCTTGTCTCCT 
AGATTGTCTGTCATTCTTTTGCGTTATATTTGGTTCAACGCAGCGGTTGTTTCCCTTACGGATAGCAACATTAAAATGCCCCTCAACACAATGTCCGAGG 
GCACTGGCGATGACATTTATAGGGACTATTTGGCCATCAGAGGCATGGTTAATAATTACAACAGTAGTCTCTCCTCTATTTCAGTCAAGGCTATTTCCGA 
TAGGTACAATTGCGGTAGTGGAAATACTTCAACCAGrAATAAGAATGTGACCATTAAGACTCAAGGTGAATTGTTT^CTGTCCTTCTUVCAGACTGCTAAT 
GCCTTGTCCGCCTTTACCAACAAGGGAGGCGTGGGTGCAACCCCCGATGCTGCCAATATGGCCAACGTTATTTCCCCAATTGCTAATGCGGATGTAGTTA 
AGAACACCAACGTGGTTGTTTCAGGGCTAGATAGGATCACTGAGACCATCAACTTCTTTTCATTTTTGTCTCAGATCAAAACAATGAACGAGAACATTGA 
AGAGTATCTTAGGAGATATAGGCTAGGAGAAGGACTAGATAAGAAAGAATTGGATAATTTTGTGTATCCAAATATTGCAGCTATTGTTAAGCGAGAATTG 
GGGGTAAGTGGTTCCGCATTGTCCAGTAATCTCGATACTGATCGTCCAATTACTATCGACCTGAACACTGAACAGCCTTTGATCGTAAAGGCTAGCAAGG 
GTTATGCCTCTAACCGCTACGCTAAATTATTCAACAT^AACAACAAGAACAGCAGCAGAACAAGCTCAGATGGAGCAGTATAATGCACAAATGGCTGCCAA 
TACTATTCCTCAATTAGTAAACAGGTTGACCATCCCTGGATCCATCACGGCAGACRCTGCCATCAATGTCGTTAAAGCTTTCACAGAAAATGGAGAATTT 
AGTAACGCAGAAACACACTTGGGGGTTATGGGTAACGCGATTAATGAAATGCAACCTCTTTTCACGGACGGATTCAACGTTGCGAACAAGCGTTTAACAG 
TTAACGTGGGTTCAGTTAGTAAGCTGATTCAGAATGGGTTAACCGTATCTCTCATTCTTGCTCACTCAAAGGCTAGCCCCTATGTCTTTAAGCCTCTCGT 
GCAAGATTTCGCTAAGCTTTTACTGGCAGTCACTGCAGAGACTTCTCTGGTTGTCTCTAGGTCCCAGAAGAGTTTCTTCCCCATTCCTCCTTCAGTATTT 
TCTTCAGGTGGTCTTTTCAAAATTGATAGGGAAATGTTCGATAATATGAAGACAGATTATGTAGTGGAAGTAATTAGACAGCTATCTAAGAATGCTACCG 
CCGCCATCGAAAGGTGCAATGATTCCGATTCAGCTGCTAGGATTGCCAAGTCAGGTGAAATTTATAACAAGGATGTTGCATCAACCACTGCAGCTCCCGG 
AACTTCTTCCTCCGCTTTAACCTTGTTCGCCAATAATCTCCAGAACCCTGCAAAGGTATGGTCTATGGGAGCTCTCCCCCATTTCGATATGGCCGTGGTA 
CCAAAACTTCATGGCATTTCTCACGATCAAATGTTCCGCCTATCTACATATTATCAGGGTATTCATAAGATGGAACTTAACAGCGATTGCAAACCAGAAG 
AATGGGATAATTCTCTrCCTGGAAATAGGGCTAGCAAATTCTTTGGCCTTTCTTCGGTGAGCGATAACAACCGTTCATTCAATTTGGCATTGGATACTCT 
TTTGGCTTCACCTGCAGAGATTTGCGATCTGGTGACGAGGGAAATGGTAAAGACCAGTAACGATATTGTGCATAATATTGGATCGAATTCCAACACGGAC 
GC GCTTCA7LAAGAGCCTTCAAGTTGGTGCCTCAGCAGTAGAAAAATACGACGAGTCTACTCTTTCTACTAAAGAAACTGACGTATATTCCCTTGTTTCTG 
CTCTGGCTAAGAGCAAATCTCCTCTATCTTCTTCCTCATCTTTGTCGTCTGAGGGACATCTCACCTCTAAGGAGATTGATAGGACATGGAACACCCCCGC 
TCTTCTCGGTACCGCTAAAACTACATCCTATTCTGTTTCTGAAGACGCTCTCAATGCTCCTCTTTCAGCCGTGTTGGACTTTAGAAGGAATGTTGTGGAT 
GCTACTAAATCTCTGTACGAAGTTGCAGCTGTTTGTAGTGTCATGAGTAAAGAGGAGGATGTGCGTTCTTCGAGTAGAAAGATTATGGGAATGGTGGAAC 
AAGAATCGCCCGTTATGCAAGACATTGGCATTGACCGCATTGCTAGTCTTGTTAGTACAGTTGCTACCCCCAAACAGCATCGCAGATTCTTACAGACAGT 
AAACGATTACAAAAATrATCTCATTAGAAAAGTTGCATCGAATCCCCTTCTCTCTTCAAGATTGGGAGGAATATCCCCTACTAGTGGTAACACCGATTAC 
AACCTTAAAGCTGTATATGATGGTGTTGTTTCTTCCTCATCATCAATGACCCCCTCGTCCATGTCTGTCTCTGACAGATTCTGGTCGGGAGTATTTTCTC 
AGTGCCTAGAGACTGGCCCTTCAATGTTTGCCGATGCTGGTCATGGAGGTAGTAACATGTTCCAAATCACTGCACCTAAACTTTACGGTTCTAGAGTCAA 
CACCTACGCAGCTCTGAGCTCTGGCGTTGAGCGGCTGAGAGACTCTATTTCTTCTGCGACTCAGGAAAGAAAGAATAGGATTGCAAAGAGCATCGAAGCT 
CTGGAAACGTTCGTAACCGATGTGGTGGGGGGAGATACTTTGGATCAATTGCGTAAGGCCCAGAACATGTACAACAAACTGTCAGATATTACTTCCAACT 
CTATCTATAGTGATTTCGGAAACATTGACTGCGCTAAAATCATGAAGAATGTGACGAGCAAGAAAATGACCGCTAGACAACAATCAGATACTATTCTTAG 
CTCTCTCTTGCACGAACTCGCTGGCCTGGTACACAAACAACAACCTCAATTGGCTACTCAATTTGCTCTGGCGAGCCATGTTATCAAGGCAAAGTATGTC 
ACTAATGACCTCAATAATATCCACGAGAAGGAAACATTCAGTCAATTGATGGCCGTGGCCGGTGTTGCCGATTACTATAATGTGTCGGCAGCTGCCATGT 
GTCAGCGTCTAGTTGCTTCCGACGTAACAATGTTCCTCGGCGGAACCATGCTCCAACAAGGCCTGTTCGTTTCATTCCTTCTTAACAACGTACTTTTCTC 
CCAGGTTTCTGATAATATTAAAATGAACGAATTGAACGATGAAACAAAGTCTCTTTTGGTTAAACTGGTAGGATTTTGCGGTACAGTTTCAGATGCGCTA 
GGATCTAGGCACGTGTCTTCAATTAGACGTGTACAGAACGAAGAGGATAAGAAATTAGACAGGAGTTTTGTTACATCACTTTATTCAGCATACAGAGATT 
TGAGGAAGAAGACTGAACTATACAGGGAAACTGATACTATTAACAAACTTTTCGGACATCAAAACTTTATGTCTTACGAATCTTCCATGCTCAAGAGGAC 
TTCCTTGGTACATGACGCTGTTTCCGGCCCTAGGCCAAGAAGGTACAGCACCCTTGAGGATGTACTTGAGGCTCCTTCCACGGTTCACAAATCGTTCATG 
GTTTCTTACCCAGAGAGGGCAGCTGCTTCTAGGCGAGTGAAGAGGGCTGGACTCAGGGCTCTGGCTGATAACAGGATGGAATCTCTTTACGGGGAAGAAG 
TCTTGAACGATATGAGGTCTTCGGCGGTCTCTTCCGAAATGATGGATATAGAGTATGGTGAGGGAGGATTCATGATGATGATTAGTGATGATGAGGATGA 
TATTGCCTTTATTGATTCCGAAGAAGAGTCTGAATCATCTACTGATTTCTCCTCATCAGATGAATATTCCGATTCATCCGATGAGTATGATTTTGATGAT 
GATAATAATGGCCAGTCTCCTTATTCAACTACATCTTATTCGTATGATGCTCTAGACCGTCTGAATTCTGCCGCTAAGCCTCTTACTGCCATCTACGGGT 
GCAGGGGAGAAGGTGAAGACGATGAGGAAAATGACCTCTATGAAGAAGAACAAGAAAGGAGGAGACGTCGCTCATCAAAGATGGGGAAGATCCTTAGAGA 
TCTTCATGAGAGTGATGATGACGACGATGACTACTTTGATGACGAATTTGATGGCGAACGTTCAATGTCAGAAACTATTGCAACCAGAAGAGCTGGCCGT 
ATTCAATATGGTCCAGGTrTCCTATCTCATTCTAATATTCTTAACCGTCCGGCTAAAGCACGCGCTTTCTTGACACGAGGCAAGAAATTCAGGCCTTCTG 
CGTACGATAGATTCTTTATGGAGGATGACGATTCCCTCCTCTTCTCTGACGAATCTACCACTTCTTCTTCCTCTTCCGATTCTCCATTCTCCTCCTTCAG 
CAAGGGGAGAAAATGCAAGCGCCGAACAAGCGAGGACCAATGTGCCTTTGTTAAGAGAGTTGTACGTGCTTTTGTGCCCACCAGAGTAACAATGATCAAT 
GGTCGAGTTAGCATGATCACCCCAGTGACTAGTGAAAATACAGTAGGATTCTATGAAAATTACCAGAAGGCCAACAAGAGGGAAAGGGCGCGTCTGATCG 
AAGAATACAAAATTGTTAAGGGTGCTTCAGCTACCTTGCCCGACGAATACGTAGAGGGTAGAGCATCTAAACAAGTGTCTCCCAGGGAACTGAGGAGGTC 
TCTTATCAAGGCAGCTGCTTATGTTGCCCGCACCCAAGAAAGTAACTTGAATATTATCTTTGACGCTCTCACCACAACATCAAACGCCACTCTAGTTAAT 
G ACC C AT CT ACTCTTTTGGGT G AT AC ACTT TT GTTCGCC AAAC AAC T AG AGG C AATTAC C G AG AG G AGG AAT AG GC T AATGAAAG ACC T AACTG AAATCT 
CTCCTTCACTTTTCACATCATTCGGTGATGCAAGTAAAGACACCCAAATGATGGCCGATGCCAAACAGATCGTTTCAGGAGGAAATTTCAAGTCTGCCGG 
CTATCTAGGTGTCCCTCTCAGAACTCTTGCTTCATGTATTAAGGGCACTAATACAGTTGATCGTCTTTTGGCTACCAAAAATAAGAACCATCTCGAATGG 
ATGACCACAGCCGCTATTGTTTTTGCACGTTCATTCAACGATACTACTTTCCATGCACTCGAAGATACACTAAAAATGACCTCCGCTTTGACAGACATGT 
ACAGCGCTTTCACCAACCTCGTCGGATCGGAACATTCTCAGCGTCTAAAAGTAAAGAGTACTCTTTTAGATTCTATTTTCAACACTAGGATGGCTCACAC 
TGAAGCAGTCATGGGTCTCGTATACCCTACAGCGTTCATCAACCATGAAATGCCCTCTGATTACACACAGCGCAGAGAGATGCAATCACTCGCTCTTAAC 
ATTCTTAGGGGAGTTAATTGTAGCCAATTGCCACGAAAGGATATTGGAGACACTGCTGGCCTGTTGACCTTTATTACATCACGTAAATTTGCAGGTTATG 
GAGGAGAAAGGGGAGGTTTGTCTCTATACAGAATGTCCATTGTTGATGCTCTTTCTTGCCCCTCTGACAATCGGCTCAAGGGAGCAGTCTCTCTAGAGGT 
AGGAAAGTGGCAGGATATGGGAGAGGAAATCTTCTACAAGAGGAGCAACGATCTGGTCGATTTTTGTTCAAAGAACAATATTTCTCTGGAAAATGCCGTA 
GGTCCTATTGCTAGGTTTGXTCCCAATGGAACTAACATGGCTGATATTGGCATGACCGATATCATTTCTAGAACAGTCAAGGATGACGCTTCAATGATCA 
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GGCTTAGGCGCGCAGAAGAGGGCGCTGGCGCAGCAGGAAAATTCATTACAGCCTCAGCCATGGGTAATTTGTACGGAGGTATTGATACCGTTGTGAACCT 
AACTGAAAAACTATACGACTCGTTCGTTCTGCTCCAAGATTCAGACTCGTTCAATACACCAACAGAAATGGCCACTGCTATTATCTUVCCGTATGAAGTCG 
AGGAAACATAAGGCrCTCAAAACACCATTCGGGGGAGATATTGCCACCTATAAGAACTTCCCATCCTCCTCTGAAGCAATTGTAGTTAGAGCCAAGGAAA 
TGCGTAACTCTATTAGCACTATCGTGATGGACATTTCCAAGTCAAGGGGAATCAACTCGTTCTCATCCCGCAGTGGTTCTACTTTGGCCAAGATTTCCAC 
ATCTGAATTTGAAAGGATACTAGAAACATCTGCTGTTCTTTCAAATACAAAGGCCAATCTGAGAACCATTGAGAATAGACTCGCCGAACACTACAACAAA 
CTAAAACAATTCAGCCATATTAGTAATGATGGACTTTCCGAGACACGCGCAGTCGTTGCCGTAATTGCTGAATCTTTAACCCCCGTGTATGCGGATGACA 
CCAGCGAGAGAGGAGCATCTGTTAGTGAACTATTGACAGACAATACTCTCCTCAAATTTATTGTTCAAAATGAACTGAAAAACATTGAAGAGGCAAAACG 
TCACGTGACCGCCGCAATTGAAGGTTCATCCCAACTGCACGAAT^AAATGTTGAGCCTGCTCGTTGCCTCAGCCGACATCAACCGTATGTCCGCCCAAAAT 
AACCTCGAATGTAAGAAATTGACTGAAGGAAATAGTAACTTTGTACCAATGACTAACGACCAAGGTGGTACATTCATAAAGCACAAAGAAACAGGTATCT 
GGCTGAAGACCGATGAAGAAAATAACACCAGTTCTATCAAGGACAATGATCAGCGTAGAGTAGCTAAAACCATCCTCGCAATTGTAGAGGACAATAGAAA 
TGCAACCATCCGTTCTCGTCTACAGTCTCTTTGCTTTGGAAAATATGCCATGAACGACATTTTTGCACTTGATGATGCCGATATTAAGAATATGGACAAA 
CTCATTGAAAAACTAGGCGAAGCACTCGCAGAGAAGGCATCTCCTTCTAGCTCGGCCATTTCTTCTTCCTCATCATCTAACACAACATCCTCCrCTTCTT 
CTCCCAGTTCTTCCCCATCATCATCATCTTCCTCTTTCTCAATGGACTATTCAAACAACCTTGCCAAAACTATCCCCTACATGCCTATCGTCTTCCAAAA 
CAAACAATCTAATGTCAATTCTTCTGACGCATCATCCTCATCACCATCATCATCTTCTTCTTCATCTGCCAATATTGATAATGTTGAGCACAAAAAAGTG 
GCTCTCCAACAACTTCAAACACAAGAATCTAACGATTTGAGTAACGTACTTTCTGTTACCACCAAGCACAGATTTGCGTCTCATAATCAAGCTGCAACTG 
TTGGCATCTTCAACGGAAGGCAACACGCAGAGACAGTTGTTGCTATACCAAATGCAAACAAGGCTAATAATAATGCCACCGTTTCCGCAGGCCAAGGAAT 
TCTTACCCGCTTCTCAGCCCCTGAAAATGTTTCCTCCACCAGCATGCAATTGCCTCCATCATCATCATCATCATCAAATGGAGATGATAATAAGGTACCA 
GTAACTGTCAGGCTTAACCAGTACGCCAACTCAATCTTATCATCTATTGAAAACGCATCAGAATTTAAGGACTTGAAGGAAGCAGAAAGGAAAATCGATC 
TGGCCATCCAGGCAGCTTCCACCACAGAAACAAAGGT^AATGGTCACCGTGTCTAAGTGCCCCTCTGCTAACCAGACTGCCATCACTGCCATCTCTCAAGC 
TAAATCCCTTAAGAAAAGTGCCCTCGAATTATTGGAAAGAGTTATCAAGGCAGTCGAGGTTTACACCCCAGATTCATCTATTGCAGCCGTTTCTCTTCCC 
GTTAATGGAGATTCTATGGTTTCTTCTTCCTCGGGATCAGGATCTGCTCCCTCTTCATCATCCTCCTCCTCATCCTCATCCTCTTCCTCTAATGTGACAG 
ACTATTTCAACTATGCTTACGGAAAATTGAAGAACATTGATGAAT^TACTGAAGAAGGGGCAGAAACTGTCCAGAAAAACATGGTCGAACAAGATGCTGC 
CGTTCGCATCCCTCTTCTAGTATCATATGCTCCATTCAGCGAAATGATGAGACGTGCTATTGACAAGTTGAACGAATACTACCAACTGATTGATGCCATC 
AAAACAAAGATCGTGTCAGACACTAAACAGGCTTCCTCATGGGCCATCAAGGAAACGGACAAGGAGCTTGATATGGACAAAGAACAGGTGATTTCAAAGA 
TTAATAACTTGCAACAAAACTTTTCAAACGAATCAGACAAGATAAAGATGGCTATTAGTGTTTTGGACAACAAAAGGAACGAATTAGAGCTTCAGAACAA 
CAAAACTAGGAGCTTTATTGAAACTACAAAGAGCCGTATCGAGGCTGGTGGAGGAGATGTAGCAAACTTCAAGGAGATTATCGATTACGAAAACACATCT 
GAAAATGACAACAATCTCTTCCAGAGCCTGAAAGCATTCGCTGCTGATAACTCGGGGACAGTTTACACCCCCACTGACATGAGCAATGGAAGAGACACAA 
AATC AG AC AGT A AAT TT GTC G AC ATGT AC AAC AAAC AGATTC T C G AGGGAGG AAT C AAACTC AT C AATG AGGGA C AAAAT AC TGT AAAGGT AGAC TT TT C 
AAAGGCTTTGGAGGCTTTCCCTAGACAATCCAACGGTGCTTCAGAGCCTGTATCTTCTTCAGTTGTGGAGAGGAGACAGCGAGAACGTCTTCAGGCTGTC 
GAGATGTTTATGGCAATAATGATGGAGCGCACCGAGAGTCTGAGGAAGAGGTTGGCAGATTCGGCTGCTCAGTGGAATACTGTTAATAATGTAGAAGAAA 
CTGTTAATAGTGGTATGGTTAACATCAAGAGTGAAAGGCTCACAGAGATTAGGAATCAAGCACAAATCGCTGAAAGCACTGCACTAAACTCCATCAACGA 
CGAGATTGTAGAGTCTCCTCTCACCCTCTCTTTGGGAGCACGAGTCGACCAGCTCTTGATCAAGGTAGATAGAGTAGGAAGTATCCAACAGCAGCAACAG 
CAACAGCAGCAACAGCAGCAGCTTCCCAAATTGACAGCTACAGAACAGAGAAAGGAACAACAATACGCTGCAGATAGGGTTGTTTACGATCCTTCATACA 
CCTGCTTCCTGCAACCTCTTCACGAGACAATTAAACGTATTTCTTCTGTCTATAATTCAAAGAACAAGGGTCCTCTCAGTAACACACGTGGTGTTCCCAC 
TAGCGATGCCGACTTGCAACTGATGACCATCACTGACTTGTCTAGGTCTGTACTCGACTCTTCTTCCACTTCCTCCAAGAAAATGCTGTACGAAAATGTT 
CCCTCATCAATTGTTCCTGGACTCTGCCAGCAATGCGCAATGATGATCACCAACGTCCACGAAGCCACTCATACTTCTCCTCATTCATTCAATTTCGAGA 
ACAAAAGATCCCTGAAGCAGCTGACAGAAATGTTGAACGCTGCCACTTCATCCAGTGACGGTCCTGCCGTGAGACACGATGTACTAACAATGTTAGAGTC 
C AAC AATGGTT ACGT C AAAG AT TTTGG ATT C ACTC ACCGC C AAAAGGTTGCGTGT ATC AC CC C T GTT AAT AC ACT TC TGGGAGGT ACTTTC AGT GGAAAT 
GTTGCACCTAATACTGTTATCCTTCCTACTTCTGAGTTGTTTAACTGCCCAGGAGTTGAAAATGACAAATTTAGATCCATGGTTAACAGGACAACCGACA 
AGAATGTGGCTGACGCACCCAAGTCATCTGCAAGCATCGTGGAGACTCTTGCTCGCACGTCTCCCAACGCCGAGCACCTTTACTTCCCCTTCAAGGACCA 
GAGGCGACACTTCAACTCCATCACCGACGCCATCATTTCTGGTATGAGCGGCGAATCTTCATCTCAATTGAACACTACTTGTGATCAAAATCTGGTAAAC 
ATTGATCAAACTACTGGCTTCCCAGTGTTTACAGGAAGAAAGCAGGGCGAAAGAAGGATTGTGCACACTGAAAACACTATGGAAGGAGCTCGCAAGGACA 
AGAACAGTGGCATCCCTTCATGTACAAAGGACCGTCAAACTTATATCGATATGGGCACAAAATTCATGGTTGCTCCAGGCTCTCTTCTGAATGCTAACAA 
GGAAG AAAC TCTCCGTCT AAAC AGGCTTTC AGAC ATT AAC AACGTGAGAC ATT ATGGCACTG AT GTTC ATGT GGCAGGCGCAAACTCTGCATGGAGAATT 
GGTGAGGTGGTGAGAGCCGCCTCCTCATTCCCTGACGGAGATAAGGAATCGGCTATGAAAAAGATGCTTCTTCTAGGATCTGTATCTGCCATCTCTGCTC 
AAAAATCTGCCAGTCACATTAACGATCCTACTGCTTTGTTGAGCACCAACACGTCTATCCAGAATCTGGTCAAGGAAGCTTTCCCAGACCCTGTTTGTTC 
CTCTAATTACTTGGGGTCTGCTGAATCTACGTTCGCCACTCAACTCGCCTACCGCCAGCGCCTGTTCCCTAACGGAGATGACGAAAACGTTACAACCGTT 
TCAAATATCTGCCCTATGGATTTGATGGGAAGTACAAAGCGCTACAATGACGCTTTCAACAACATCTTTGGCTCTAAAATGACATCTACTAACAAAAAGG 
GGTCAAATTGTGAAAATCTACTGAAATCTGCCATGTCTAACGTTCCTGCTATCAACACTGCCTTTGGAGCCTTTGAAGAAGCTTCATCTTCTGTCAGGAA 
TAGGCTTTCTCCCCTTTATGAAGACAGCACCAAATATTCGTCCAACCAACTTGCTGTACAGGCCATGACCGATACTGCTGTGGATGCTTTGTCCGCTGTT 
TCTACTGTTGTCGGTCGCCAGAATGGCAGAAATACTCTTCTTTCTCTCCCTACTTCTATTACTTCTATCGCAACCAGTGGCCGTCCATCACTCTCTTATT 
CTTCGGACATGAAATCTAACCTCATCAAGACAATTTCCCGCATCAATAGAGACGCTAGCCTCCTGTCAATGGGAGACAGCCAAGTAGCTGCAGGTTCTTC 
CTTCTTTAACTCTTTCCTTCGTTCTTCTTCCATCCCTGTCACCACCAGCCAGGATGGAAATGTTGCAGCAGCGGAAATTGTTCTGGGGACTATTCTCGAC 
AAGACTGTGGAGATCAACAAGAGATTCGAGATGCTTGGAGGA6GAAAAATGGTCGCCGGGAGTCCTGAAGCTCGTGCCATCCAGCGCAATACAATGTCCT 
CTATTCTCCAGATGAACGAAAATGAACTCGCTCGTGACTTGTGCGAAATTGAAAATAAAATTGAGACTAGGCAACTGAGGGATGCTTTCCAGGATTTGAA 
GAGGTCTATGCTGATGACTCCAGGAGGCGTGGGAGCCATTTCTTCTGGAGCAAGTACCAACAATGTTCCCCTTTCTCTTCTCATGTCACGTGTCGATGCA 
TC C AGCGGTCT T C T GATG AAC AAC AACAGT GC CAATGT AATGG AAGCTGTGGAT AGTTTC AATAC TACTCC TTT GCTC GTTAGGC AC ATGRT GT TGGAT A 
GTGGAAAGTCCCCCGTTCCCATGGCCAAGGAAATTAGGAGCATGCTGACCCAACCAAGAGCTCTCACCGCCCGCGCTCTACTGAGCGAATCTTCCCCTCT 
TCTCACTGAAATCTGCCTCTACAACACCCGCGACACTCAACCAGAAAGGGCAGTCGACAGACTACTAACTTCAGCCTATCTAGTAAAACAAGCTAAAAGA 
TTCGACGGAGTTGACCCAGCCTTCCCTGCCGCCCTCACCTGCGCTTCTCACCTCATGCTTTCTTCCATGGATTCCCATACAAAGTCATCTTTCATGGACA 
ACATCAAATTGCACATGACTGATACTCAATGCTTCTTCAAGAACATTGAACGATTTGAGAAATTCTTGGGAAGATATGGGGACGAATACGCCATGTCCCA 
CAAGCAAAATTGTAACTGCCCCTTCCATCTCCACCACACTTTTACTCCCTCAGATAACGAGCATCTGGTATCCTCTTTCGCATTCGCCCGCCCAGAAGTC 
TCCATGGAAGAAATTAGAGCCACACCCTATCAGGCCAACAAGCTTATTAGTGACAAACATTACGTGATGAACATGTCCAAGATCGATTCTAGAGTAACAG 
GATCTTCCCTCCTTAAGAAGGTTAGCGAATGGACTGAAATGAGAATGAACTCCAACTTTAATGGAACATTTGAACCATCAAGACTCGCCCTCTCCAACTC 
TGGCATGACAACGGCAGGAGTCAACCTCGACGrTATTGTCAAACCAAATAATGCAAGAAGTGTACTAGGAATATTGGAATGTCATCGCCAGCACGTGTGC 
ACCGCCGACGCCAAGGGAACTGTCGCTTCAGCCATGCCAGCCGTCTTCCAGGCAACCGATGGAAACGGTAACGAATCTGAACTGATCCAGAATGCTCTGC 
CAAGGAACAGATACATCCAAAAGAGCACAATGAACGCTCAAACTGTCGTGTTTGCTAATGTTTTGGAACAACTTATCGCCGATCTTGGAAAGGTTATCGT 
GAACGAACTGGCCGGCACCATCGCTGAATCTGTACCAGAAAGCGTATATGAAAACACCAAGGAAATGATTGATAGACTAGGCTCTGACGACCTCTTCAAA 
TCTAATAATAATGGAGGAGTAGAATCAATGGATTATGAAGATAGCGAAACAACATCCAACAATGGTCCCGTCCTCATCTCAGAAGCCATGAAGAATGCCG 
TCTATCACACACTAATTTCCGGCAAGGCAGCTCGCCCGGAAAATGTACCATTCGCCTCATGCGCCAGCGGCCCTCTCGCCTTTGATTTCCTTCTGTCAAA 
GGGAGATACATTCGAAGAAAAGAACGCCGAACAAGGTGCAGCAGCTGCCGTATCCTCTACCTATTCTTCCTCTTCTAACACTACTCTTCGTAAGCATTTG 
GCTCGAGTTTTCGAAGCCATCTCTAAGCAAGTAACTGATGCTGAATTCAAGGATATCCTCAACGATATCGAACGTAATATTTCTTCTGACTATACTAACT 
GTCCACCAAATACTAACCAAAATGCCTTTGCTCTAGCTATCAAGAGAGAATTCAGCAGAATTGTTTCCTTCTTAACCATTCTTCGTAAGAACATTACACC 
CGCATTAGTCGACCCTAAAGGCGCGTTACACGAGAAAGTAGCCATCTATTTGACCCTTCTTTCAACCAAATCAAAACTAGAAAACTTTTTCCAATACGGT 
CTCAGTAATTCGTCCTCAGTTGATCTTAGCCATCTAAAACCCATTAATTGTAGCAACAATGTCAAGAATATTGAAGACACATTCATGTACAGAAATGTCC 
ACCCTATTCTTATTATGGCCCTCCCAGAAAATTTCACAGCTCTCTTGCAACAGGAACAAATGGACCCCGATACTGCCATTGAAAGCAGACGCTCCCTTAC 
CACCTTCCTTAATCATCCCAACACTGCTTCAATGGCGAACGGTGCAAGAGCCGCTGTGGGTGCAGGAGGAGGAAACCCAATGGGCTTGTATCTTTCTTCC 
CACATTCTTCACGAGTCTACCGTCACAACATCAAACCCCGTCACAGACACCACAGAAAACGTCAACTATCATTCCTCTGTTACACAAGATCCTGTTATGG 
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TAGTGAACCCCTTCAAGGATTCTGCTAGGTTGATCGTTAACAACAACAATACTGGAATTGATGTCTTGAATGATAAGTCGTGCAACTACTTGCAAGTATC 
CATGCCATCTGAATCATCTGGCCTCGTCACCAATACTGGATGCTCTTCTTCTTCTTCCTCATCTTCGTCTGATACCTTCAAGTACGTCflGGAGAGACAAT 
ACGCCTGTGAATCTTCCCCGTGTCACACCAGCCGTTCTCTGTTCTGATGCTTCCTCTAATCTCTTGGACGTGTTCTCCAGGGCAGATATTGTCCTCGAAA 
ACATGAACGTGAGATTTGGTTTCATGCCCGAGATTATTGCTGCCGTCTCCAAATTCAAGGGGCTGACCAAGGAAGAGGTTATTAAGCAAATGGTTTCTCA 
GAACAACATCAACAACAACAGCAACAACAACAACGGAAATGGGAAGAAAACAACCGTCGATCCAGTCACTGGGGATATTGTTATCACCAATGCCACATTC 
CCCGACACTCGTCCTCTATACACTGCAGCAAATGGAGGAACATCATCATTCAAATGGGGAGATATCAACGACAGAAAAATGCACGCCAAGGCTTTCCCCA 
CCTTCTTTATTGGTAACCCAACCGCCGCCGCAACAGCTAACGGAGTGCCTCTTACATCTGAGGGAATTTCCCTCACTGAAGAAAAACGCAAGAAAATCGC 
AGGCATCTCTGAAGGATCAATTGGCACGGGGGCTCTGCGTGCAGCCGCCAACACCCGCCTCTCATCCGACATGGAACCTGTCATGAAGGGATGGAACAAC 
ATTGTTCAGCTTCAACAAACATTCAAGAAAGCTTCAGATAAACTCACTCATCTTTTGAGATCGGGAGGAATTCCACCCAGAAGCCAAGAAACAAACGCTA 
TTATTAACAAGATGCACGACAGCTTCAAGACATTGGAGGAATGTCGTAGGGTGATCCAAGACGAGGCTGCTCTGCTCGTTGCCACCAGCGATCTTTTGAC 
CGGTGGCTACGGAGGAGATGCTGCTCTGGCCATGGTTTCTCCAGTACGTCCAGAAATGACTGGTCTTATTGGTGCAATCTCCGCGCCAGTTAGAGGTATT 
AGCCACTTGTTGAAACTGGGAGGTGTTTCTGCTGCTAACGCAGCTATCCGCAAGCGCCTCAACCTACCTACATCCAACGGGAAAACACTACCAGAACATG 
GAATCGTACACAAATCAGCCAAGACACTTTTGCTTGATTCAGACTCTATTAGCAACCTATACAACACTGATCTTCAAGACGTTGTCTCTAACGCTAGGGA 
TAACAACAATTTGGGAAGAATTATGCAATCTTTGGGACTTAAGGGGAATAATGCAGGGGATTTGGTTTATTCTGCTAGACAACTGACGGACCTTATTACT 
GTACCAGAATATGGAAACAATCGCGATCTTACCAAGCGTCAAGCTATCCTTAAAATGCTCATTTCTAACCCTGAAATTCTAGAAAATGTTGCAGATACCA 
TTTACCTTACAACAGGTAAAAATGCTCTCGCACCGGTATCTGCTCAGGAAATGGCTTGTGCGTCTCTGACAGTCGGAGGAAGTGGAGGAGGAAAACTGTC 
ATCAGACGACAATGTTCAATCCCTGAACCGCCTTTATTTTCGGGTCTAGACTAGTGTAATATTGTATGTTTAATTTATTGTGTAACAATTGGTTTATATT 
TGTATGTGATTTTTTCGGACAAATAAAAGAATTGGAATAAAAGACTTTGTATTATTTACCAAATTTATTGATTTTTTTTAAACACCTTTTTATGCCACGG 
GGTTGATTGTAGCATTTCCTCCCCCTGGAGATTTCATAGTGGGACATCCAAAGAGTGTAACTCCTAGTGCTGATACACCCAAACCTAGGAGGAATCCGGT 
AAGTGCAGGAGTAACAGAAAGCATGAAAGCGACCATGCTCAATGCGATGACGACAACGGCCAAGCCGACGATGATGTTTCTGGTCATATCAGACATGATT 
TCTGGTATCTGTTTATAGGGTTTCACCCAGAAGTGTTTTAAACCCAAAAGTATAAATTTTATTCTATACTGCTCATCTCACTCTTCCGACCTCACAAAGG 
AGAAAAGAGTCAGTCTTGCTCCAGCCACCACCGAGGCAGGAGCTCCTCGGACGCTCCTGTATTTTTGTCCAGAAATGGATAAAGTTTGTGTTATATCAAA 
TACTAGGGAGCGCACGTTTAAGGTACCTGCCGATCTACTGTGCGTTGCAACAGAACCGGAAATCTCTACCAAGGAAGAAGATGCAGGTATAGAAATCGAG 
ACAAGAGTGGTGGTGTTTTCAAGGTGCGTCTCGGTCCAGGAACTACATACAATAAATCCAAATGATGAAGGATTTTCTGTCCAACTTTTCAAGGACTACC 
TGAAATTGCAATCTGCACAAGGAAAAAAACCCATTGGTTTGTACATCCAAATAAAGGCTGGAGAGGATCTTGAAAGGAGATTAATCAGTGGAGGAACTGC 
ATACCTGGATCCGGCT^ACACACCTTTTCTATCTTGATTTCTCCCTTTACCCTAATTATTCAATATTCAATGACATTTCATCCCGTCTGAAAATCATTGAT 
GAAGACACGTACAATGGTGTTGTTTTCTCTAACAGTGAAGAAAAAGAGAAGGATGCACTAGTGCTGATAAGGGTGACTTTTTCTACGCATGAAAAGGCAA 
TTGAAGCAGCCATAAAAAAAATAATGCTAAGGAAAGTGTTTTTCAAGGATGGAGATCTTGATTTCGGGTACTTACGTATACCAAAATCTAAACTGGACAA 
ATTTACTCCCTATTTTCGGAGTCAATACGGGATAGTAAATGTTG/VAAAAAATATCCCTGGTTACATATGGGGAGAAATTATGAAGCAACGAGTGCGATGT 
TCCAGATGGTACCTTTACAACACCGACTCGGAATGGGAATATAAAAATGTGGCCGAAGAAAGAGTTGGACCTCGCCAGTTAGTGAAAAAATATGGTGCCA 
AGTGTGAAAATTTATGTTTTAGGGACATAGACCTCAGAAAAAAGGAAGCAAAGGAAAAAAGGGATATAGAAAGAGAAACTGAAAGCAGATATGTGGTCGT 
AACACTAACCCATAAGCATGAAATGCCTGAAAATATGCCCTATTTTGGACCA/AGTGTTCAGTGGTGAGGTTGGATGAAACTAGAATACTTTTATGTTTT 
GTGGATGAAATTTCTTATAATGATGAAGATGTAGACGAAATTTTGTCTGAGAATAGATCACTAAGAAATGTTTCTATTAGACATAAGGAAAATGTACCTG 
TACACACGTTATTAAAAAAAGGTGTGTCTATTCATGCTAGATTTACCCTTAATGGTTTGGATGATGCTTTAATAATTTTAAAGAGAATACCAAAAACTTA 
TTTTGAAGATGAGGAACTACAAGCCGCTTGTGCGCATGTTAACCTTGAACAGTACGAATGGCTTTGTTCTAATAATAGAGGGAATAAAGTAGAACATGTA 
AAGTCGCGGGTAGTGACTCGAGCAGTTAAGCGTAGGAGAAAATGTAGACACTGGATTTATTTTGATAAAGACACTTTAAATTTAAACTACAAATACTTTG 
ATAAAAAAGTTACTGCTAGTATGGCATCTAAAATATGTAATGCAAAACACGACTGTTTAGTTTTCCATAGAAAAATGGAATTGGAAGATTTGACTGAGAG 
CGCATATTTCAAGGTAGAACCTTCCCCAATAAATTTTGCCAAGTTAAAATCTTGCCCGGATGTTAAATATGTGCAGAAAAAAACAGATGGTACATTTTCT 
GTTATAAGATTCTTTAGAAACATGACAAAGGGTGATCTTATTCAAAGGATGGATCTTTTTTGTAGGTTTATTCCCGACTCACACACTATTACACTTTTGA 
GT AGGGCGG ATT TTT ATGC ATGT AAAAG AGGAG AATC T ATGC AT ATGTGC AC AAAC AAAC AC CG T AT TCTTC AC T AC AAATTC TC C AAC GCTCCC C ATGC 
GGCCATCGAACAAATAACCAATATCATCAGTGArACAAGAGGACGTAAGGGTATACACATAGAATACGCGATCGAAAATGTACAAGAAATGTACGAAGAA 
GATGGAAG AAG AT ATGAAGCT AAAT AC ACTGGAAC TT T AACC GAGTAC AAAAGAAATG AGG AC AAAACCTTC AAATC TC TTC TT G CTC C T C ATTT;U^C AC 
CTGTCAACAAACCATATAATATTAACCATTTGTATGAGCAATATGGAAATTTTGATGAAGAATTAGAAGACAAGTTGAGGAGTGGTTTCATTTCTTATGA 

cacgtatgttactgcaaaagacaactggggcaggtgtgcaactggaaagggggcgtgcatctaggacccataactcaaagtcaatactggtatgtgtttt 

TTGAGAAGGTGTCAGAATGTGCACATTAAAAACATACAAAATGACTACTTCAACAGAAATATCAAAGAACCTTTCAGATGTGTTATCCATCAAGGCAACT 

ggagattggtgcagtaatatcaagacggtattttcacccttcacagaaggcaagggaaatttaccaaacagtctcccgtttacgagaagtcccaatacaa 
catgtggttcaagagaggcggcaaacgccacagagcattttatcaccgtctttgcaaaggacaaatatgagcggaaaagagtaaaacgtacaatcggatt 

CACCCTCGACAACACAAAGGAGTTGACGCCCAACAGATACTTGGTAGCAGATGTATACTCTTGGCAAGAAGAGAAAATGGTGTTTGAAGGATTTTGTGTC 

ccaccaggaaagtcgggaacatttgtacgctacrctaatgaagataaaagttttctactagcagataccggaagatatatgaaaaagaagtacgatgatc 
cagaaaataagaccagtagtgggggtgatgatgacgatgacgacgatgatgatgatgacgacaacaacaatgttgacgtgtatgaagaaaacgaccccag 
aaatgtattcgaggtcgaaaaggatgaaaaatatgcctgtactttttcaattttggtcta7agagcaatgaaaaagtctcctcctgtatgtagagggtta 
ttagtagagacagatggaccctcatctcaccctaaacgggccccgtcagcatttaatccattcggaggaagttctatgttgaacggttatggtgcaggtg 
cagatgcactagaagaagaggatgaagttgatggagttcctgaaagagagaggattacaaattttgctctcaagagaggacctgcaactggccagaactt 

TGTATCTGTTAAACTGGAACATGATGGATCTAAAGCAGACCTGTACT^CGTCACGTGCTTCTCCAAGCAGCGTGGAGTATAAAAGGCGGGACGCATCAGC 
AAATGCAACTCATTCTTTCTCATCATCTAACCATGGCTGGTCGTGTAGAGCTCGTCACTGGACCCATGTTTGCGGGCAAGTCTACCTACCTGAAAAACAT 
ATACCAACAAGAAAATGGAGGCAATAAACATTGCCTGTTTGTCAAACACTCCCTAGAAACTAGGTACGGTTGTGGAACTGGAACAATAGTCACTCATGCC 

ggagaagtgattgaaggttgtactacagtttcttctatcaaggaactaatcagtgtgttaccagaagttgtggatgtgattctcattgacgaagggcaat 
tcttcacggatttggtgctagtcaatagactggctgacaaggggaaaaggattgtgattgcagcacttgatggaacttctgaccagcaaatgttcagtcc 

TATTCATAAGCTATTGCCTTATACAAATTCCATTGTTAAGCTAGCATCTAAATGTATGATTTGTAAAATTGATACCAAAGAAGCTCCTTTTACTGTAAGG 

tttggtaatgacaatgataataatgttatatgtgtaggaggagctgaaatgtacgctgctgcctgccgggactgttacaaaaaaatttvacaagaaaaaga 
acaaggggaaacttgttgtacttgaaggaggtgacaggtgcggtaagagtacccaagccaaactcttgttgaccaataaaaactcgcctctttatggagg 
agaatatatgtgctttcccgacaggagcagccatacgggtaaactcatcaatgattatttaactaagaaaattgaactagatgatcatgcagctcacttg 
ttattttctgcaaatagatgggaagtttgtagtaaaattaagcagttgttagacgatggaatccatgttgtgatggatagatattactactcggggattg 
ttttctctttagctagaggagtggataccgttgagtggtgctctgctagcgatgagggacttcctcagcccgatcttgtattgttgatgcttttagatgt 
tgaaaagtgttcaaatagggatacttttggtgtcgaaagatttgagacaaattccattcaagaacgtgctagagccctattcctagacctcgcaaataag 
gacgaaaagaatgtatggattaaggtagacgctcgcggcaccattgaggaggtgcaaactaaaattataaatattgtatataatattgttgaagaataaa 
gaattgtaaactgtttccatgatgtgtttctttcctagtgattttcacatgttacctagaaagactttgcccgacactgaaaatggttattttgtcttgg 
acgagtctcttctggagaaggtgtactatgataacaacaatgaactgattgtaagagttggtgggatttatatgcagatatgcaagtcaaaatacatctt 
ccatcacgatgatccagagaggttcttttatagtgtgttggaggattatcaccccatcaaagagattgttgaacgactagcagaagaggatggggtattt 
ttaggaccgtgggagtttttatcgcgcaaacaagtgaacctccaacacgggtgctacaaagctcttttgtcattgccagaggacaaatattgtaacctat 
tattaccccagcaaatgaaaaccaacctggaaaaaatggaagaaatacagcgtactagactcattcactctagaacgtacaatacaccccagatagaatt 
gtctgaccagctagatggatgtgttatatgttaacactttttcagtaccatgtaatatttttgatatcaaattgaaaggaaaatggttcagaaattggta 
gtaaaaactttattgtttctttcctctttcttttgtctgtcactttattcgtcagacataacaaccataatactgagtatgggtcttggatctttgttgg 
gacattttgggtgtgttaaaagtatcacaggaaaaatcgtctcaagaaaggcaatggataattataggctcgtgtcggtcctcggctctgtatgtggagc 
aggaattttctgtggataccagcaatattctacacctcagtctgcacccatggcgttgctgtctgtcttctgctgcactgctactgctgctacatcaact 
attgtaggattctggatatcgatgattatccaggcattgaaaaggagtaataaagatataatataaaatttgtgtgttttatttgtttgataatacaatt 
ttcaccttgtaccgaaacccattctctacaagcactgtgtctttggacacggggatcaatatcttggattttggatgaatgttcctgaccctcatatctt 
cttcattttcaaatacgaggagggatgttttctcagtattcctgttgtgtatttggaggtgaacagggaaggagaatagttgattgttctttggtgttag 
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ATGGAACTCGTATACCATGAATTGTACTCCTCCAATATCTTCAAGATAGATGCATGATGTATTTTTGTCATACCRTCTCGTATGGTCGTCAGTGCTCAGG 
TACAATCTCTTGTGGTGGTGGTTACACATCTTTCCACTGGTGGGAGAGGAAGTATTATGGTTCAATATTGATGGAGGATGGATTTTTATACCCTCATGCG 
AATGCGATGTGGAAAATTTTATATGTTTCCGACAAGGGGAAAAATATGAATAAACTGAGCATACACATGTCGTATCCAGATATCTAGGTCGGGAAGAATC 
CAGAAACGATTCAAACCATTTCTGGAACATTCATTTCTGGAAAGGGTTACATTTTATTGTATCTGGTGCATTATTTCTGGTACCATTTTTGGCACCTCTG 
TGTAATCTGACATTGGGTCGACCCAGAGGTCCACCCTCCGAACTTGACATTAGGCCGACCCAGCGGTCCACCCrCTAAACTCGAGTGAGCTGAAAAAATT 
TTTATAAAATTTTTGATGAGGAAAAATAGAAGTAAGATGCTTGTCAATGCATGCTCCCTGTGGAAGGTGGTGGATGAGATAGGAGGGAGAGGCTATCCAG 
ATATCTAGGTCGGGAAGAATCCAGAAACGATTCTAGCCATTTCTGGAACAATCATTTCTGGAAAGGGTTACAATTTATTGTATCTGGTGCATTATTTCTG 
GTACCATTTCTGGCACCTCTGTGTAATCTGACATTGGGTCGACCCAGAGGTCCACCCTCCGAACTTGACATTAGGCCGACCCAGCGGTCCACCCTCTAAA 
CTCGAGTGAGCAGAAAAATTTTTAAAAATTTTTTGAGATGGAGATGGAGTGAAACTTCCTAGGTGTAAGGTGCACTACAGAGGCTTGTGATCCTTTCTGG 
GCGGCTCACTCTCTAGAAACGTCTGCTCCAGAAACACATAAAAAAGATAAAGTACATTTCTGGACCACTCCCATTTCTGGTGTAATCTGACATTGGGTCG 
ACCCAGCGGTCCACCCTCGGAACTTGACATCGGGCCTACCCAGCGGTCCACCCTCTAAACTCGAGTGAGCTGAAAAAATTTTTATAAAATTTTTGATGAG 
GAAAAACACTACCAGTGGAGCGACCGTACACACTCCTCTAGCTAACTACTAACCGAGAGAGTGAGGCAGAGTCTGTCCAGATATCTAGGTCGGGAAGAAT 
CCAGAAACGATTCAAACCATTTCTGGAACAATCATTTCTGGAAAGGGTTACAATTTATTGTATCTGGTGCATTATTTCTGGTACCATTTCTGGCACCTCT 
GTGTAATCTGACATTGGCCTGACCCAGCGGTCCACCCTCCGAACTTGACATCAGGCCTAGCCAGCGGTCCACCCTCTAAACTCGAGTGACGCAGAAAAAT 
TTTTTAAAAATTTTTGATGAGGGAATATAGTACTCCGTAGCCAACATATACACATGAACACATGAGGCGGTCTACAACAGAAAGAGAACTGATAGCTGTT 
CCAGATATCTGGGTCGGCCAGAACCCAGAAACGTTTCAACTCATTTCTGGACAAGCCATTTCTGGAAAGGGGTACAATTTCTTATAACTGGTATATCATT 
TCTGGTATAATTTCTGGCACCTTCGTGCAATCTGACATTGGGTCGACCCAGCGATCCACCCTCCGAACTTGACATCGGGCCTACCTAGCGGTCCACCCTC 
TAAACTCGAGTGACGCAGAAAAATTTTTTAAAAATTTTTGATGAGGGAATATAGTACTCCGTAGCCAACATATACACATGAACACATGAGGCGGTCTACA 
ACAGAAAGAGAACTGATAGCTGTTCCAGATATCTGGGTCGGCCAGAACCCAGAAACGTTTCAACTCATTTCTGGACAAGCCATTTCTGGAAAGGGGTACA 
ATTTCTTATAACTGGTATATCATTTCTGGTATAATTTCTGGCACCTTCGTGCAATCTGACATTGGGTCGACCCAGCGATCCACCCTCCGAACTTGACATC 
GGGCCTACCTAGCGGTCCACCCTCTAAACTCGAGTGAGCTGAAAAAATTTTTGAAAAATTTTTGATGAAGGAATATAGTAGTATAGTGCCCGCCAAAGCA 
TACACACACTCGCCCATGCTCGTCTAGCTGATAGCTGTTCCAGATATCTGGGTCGGCCAGAACCCAGAAACGTTTCAACTCATTTCTGGACAAGTCATTT 
CTGGAAAGGGGGTACAATTTCTTATAACTGGTATATTATTTCTGGTATAATTTCAGGCACCTTCGTGCAATCTGACATTGGGTCGACCCAGCGGTCCACC 
CTCCGAACTTGACATCGGGCCTACCCAGCGGTCCACCCTCTAAACTCGAGTGACGCAGAAAAATTTTTGAAAAATTTTTGATGAAGAGATTGAGTAAAAT 
TTCTTGACGATAAGAGGAGGCAGTAGGTGAGGCTGCTTGTTTGATGTGTCAGCCACATCTGCGTCATACATTATATTTCCAAGAATTTTGCTGACGTCAA 
TGGACCATAAAAGGCTTTGTACGTCCAGAGACAAGTTTTAGTCTGATAGATTTCTTAAAAAAAAGAGGGTGGGAGGTTTGTTTTTGTGGGTTCTGTGTGT 
GTATAAAAGATAGGTGCAAAGGTAGAGAATCATCATATGGACAAAATCTGTCCATGAGACCTCAGTAGAGAACGCACCATGGTTGCTTCAACTCCGTGTC 
CAGGCCCAGGACCAGTTCCAACCCAAGAACTTCTTTCTACAAACTTTCTTGAAGCTCACAAGCTTGTCGTGGAACTTCTTCTCCCGTCCTACAGTAGTGA 
TGTAGTTTATTGTGACTCTGAGACGTACACCAAACCTATACCGATTTTTGGGAACAAGAGTATAGTTTCTACCATTGGAGACTATGTCTTATCAAACCCC 
AATGAAGATGTGAGTTACCAAATGGTTTCTTCCGTCTTAGAAAAATTTCCCTTGCTATTCCACTGCACTTATAAGACGAATGAAGAAGATAAAGGTATTC 
CTCTGTGGAAGAAGTTGTACAACAAAAGAAAATTCAAACTCCTCAACTCATTGTTGGTTCATAACAACAAGAACTGGACTCCTGTTCCAGCTATCCCGTT 
TGACAGGGAGAATATATGTGATGCTTCAGGAAGGAGTGTTCTTATGAGTGAAATAATGTCCACGTCAACTTTTCAGACAATTTGCAAAAACAACACACAT 
TACTTGTTTGATATGTTAAATATGGAACGTGGCAAACAAGGAGGGAGTTTTCTTCACTTCTTTGCATCTAGGAAGAATTCTTTTACTAACTTTGAAAATG 
AAGAAATGGACTCTCATGTGCTCAGTAACATAGCGAAATTCATATGCAATGAAAAGGAAAAACTAGACTCTTTCATACCTGCCAACGGAAAAATACCATG 
CCCTGATAAAACTAATGATGAAGGGTACATCCCGCTGGAAATAGCAATTATGGAAGACAATTACCCTGCATTGCTATATCTCGTTTGTAGGTATGGAGCA 
TCTTGGGCAAACACATACGGGGATCATAATGAATCTCTCAAAGCGTTTGCAATAAGAAATGATGCAAAAGATTGTCTGGAAATTATAGAGTTTATAAGTG 
ATCACTACAGTTTCAACAAAAATGTGACGAAGGAAGAATTTGTTAAAGAGAAGACTGTAGAATGTGTTGGATGTTTATATGATATTGAAGACGAGAAACG 
TTGTTACA7\ACTCCCATGTGGACATTTCATGCATACATTTTGCTTGTCTAATAAGTGTTCTAAAGCTAACTTTAGATGTGTTAAATGTTTCCAAACCTTT 
GATGACACAATTTTTAGAAAATGTCCCCCAACTATACAATGGAAAATGGGTATAAACCAAACGACTAACCATAAGGAAATGGATTTGTTCAATCGTGCAT 
TTGACACATATTTAGATTTTATTTGCTCATATAACGTCAAATTAGACA7VAAAATCAAAACCTAAACACAAACCTGAAAACAAAAAGGTGGAAGAAGAACT 
AGCAAAAAGGACAGCAGAAATTGAAGAGGCCATAAAGAAAAAGGAAGAAGAACTAGCAAAAAGGACAGCAGAAATTGAAGAGGCCATAAAGAAAAAGGAA 
GAAGAACTAGCAAAAAGGACAGCAGAAATTGAAGAGGCCATGAAGAAAAAGGAAGAAGAAGAACTCTCAAAATATAATAAAATAATTGAAAAGGGAAAAA 
GACGACTGAATGAAGAATGTGTCAAGCTGAGAGATATTTCAACTGCAGCCATAAACATGTACAAAGAGAAAGTGAGAATTAATGGTGTATTACTAAAAGA 
TTCCGATCAGGAGTTGGCTGAGGCGA7VAGAGAGGTTGAGGAAAATTTTATTGCTAGAAGAAGAAACAAAACTTGACAGATTTTTGTTTAGACCGAAACGA 
GTAGAAGAACGTATATTCCTAACTAAAGATGATGAAACGTTAGCCTTCAAGTTAGCCCTAGAAAAGAAAACGGAGGACATAATTGCGAAGAAAAACAACC 
AAAAAGGC AGTG AAAGAAGAGAT GGAG AAT AT AC T AT AAC T T C T CAT ATT GAG AAAC TAC CTC AATCC ACTGCTTTGGC T AG TGTGTGTGTGTT AAAC G A 
ATAAATAAAAGTATAA7VATGTAATAATATTGTTTTAATATCATTAATATTTTACCACAATGTCTACTTGTTCGAATTTGTTGTCAGTATTTGGTGGAGGA 
GATTGGACAACAACATTCCCATTCGACCTCGTCCATACACGTCAAGAGTGTGATAAAAAGAGAGAGCAAGACTACTCATTTTTCATTACTGAAACGTGTA 
AAGGAGAGAATATTGGTATACATTCGTATGAACACACGTCAAAGATTATTGACACGGGTAATAATGATTCTACCTCAATAGAGGAACTAGAAGTACTGAA 
TATATACAAAGCTATAAACCATTTAGAAAATATCCTAAAACTCAACAAAGGAGAAAAAATTATACTGATGGATGTAGAAACAATGATACTGGT^AACTCAT 
AAAATTTTAATGAAAGGGATTCTTCCCAAGGGTAAAAATGGAAGTTTCAGTACATGCGTACGCTTTGCTGTAAATAAGAACAATGAACGGCATTACTACC 
CTGTATTTGAAACAGAGAAAGAAGCGTTCAATTCTATACAAAATCTAGTAGATTATTATAATGAAATTGTAGCTCACACCAATGACCAAATTAAAATAAT 
AAAAGCGTGCGCATATTTCATGTACAACTTTCTAACTCTCCACCCTTTCAATGATGGTAATGGAAGAACAGCTAGATTATTGTATAGTTTTCTATTGAAA 
GGTAATGGTATCGTACCTCATTTTTCACCCATAACACACCCTAGGGATCAATTTGTTGATACTTTAGTGTATTTTAGAGAACATGGAGATGGACGACCTT 
TATTGTATGTTTTGCTGGAATCAATAAAAAATAAGTAAAATTCATTTTGAGGCATTTATTTCTTCATCAATTCATATGGCATCTTGCCAGTTTTCTGTAT 
TGTTATCTCAGCAATAATGCGTAACACATCAGCAGATGTGCAGATTTTATCCCCTTCCCCATCCCTCATGGTATTTCCTCCTGCAAAGTAATGGAGGAAA 
GAATCTCCAGTAATGGGGTTAGTTACATTCAATACAGCAAAGAATTGGCCAATATCTTTGTCGGCCAACTTGTACAAAAGCTCCTTGGTCATGACTTCAC 
TCATAATGATAGATTTTCCAGTGGTGTCACAAATGGGGGATTTATCAAACGGCATGGCGGGGATATTGGGGTGGAATGGGTTGGAGAATAACACCAGGTC 
GCCGAGGAGGGCAAACTTCCCGTGCTTGTAAAGCTCCTTCCACAGGGGCGTATTCGTCTGCTTGTAGTTGCAATGAAGTAGGACAGGGAATCGCCTGAGT 
ACGTGGGACAGCACTTTGAAGCTCATCTTTTCGTCCGAAGAAGAAGACAGTACACTCTCTCCAACATCGGTGACAATATTCTTCCCTCCCGACTTGTTCT 
TGTACAATGGAGACTCGCAATCGGTGAACATGGCGTCGGTAGGGAGGAGCATGGTTTGCATGAACATGCTGTCCGCGTTGAAAAAGTCCTGCTCGGCAAT 
GGAGAATATGTTTTCCAATTTGCTTATAAGTGAGTCCATTGCTGCTGCTTGCTGCTTGCTGCTGTTAGGCGTCCAGAAAGCGTGTGTTAATTTCTCGGCG 
GAGGACTGTTCTTATATACAAATTCGTTAAAGGGTGACGCCCACAGATAAGAATGTGGTTAGACTTGAATCTATTTGAAACCATGACAAGGCATTTTCTT 
TTGTAAGGGGTGAATAAGGACTAGACTCCTCTACCAGAATTCATGTGCGTCATTGCAAGCAAAAACTCCCTCTTGTTCCAAGAATTGGCTAGTGGGCGGC 
TGCATGCCAGTCTGTATGACATCATTTCCTTATGTGCAATGGATAATTATAATTTGTAATATTTGAAGTCGTCGGGTCCGGTCCACCCAGAAAGGGGGCG 
AATACACATAACATTTCTGGGGGTGTATTTAGTGCATTATATCTGGCATCCAGAAACGGATGCAAGCAGTATCTGAAACTCTCTCTCACTTTTGACArCA 
ATGAAAAATTGGACATCAACACGCATTGGTGTGGTGGATAAAAGGGGGCAACCGGCTATGACTAACAAGCATTTCCTGCCTAACCTGCAGCCAAGCAACA 
GCATATTACACAATGGAGGACCTAAAATCCACTATCGAGAGAGTATATGAAGAAAGAGTGGAGAATCTAGAACAATGGACAAATACTGTAGAGGAAGAAG 
AAAGGACTGTCTCAGCAATCGATTCTGTCCTGGAGGAACAAAAAAGGGCCCTGGACGCATGGGAAGCAGCGATTiAAGGAACGAGAAAACGACCTCGCAGT 
AAAAGAAGGGATATCTGCACTCGTTTTCAACGCAGCAGACGCCAAAACACGTAAAGAATTGATAAATACGTGGATAGCCGAAAGGGAAACGTCAGAAAAA 
AGAAGAAAGGAAGCAACCTCTACCAATAATCAACTGAAGAACCAGATGTCATCTCTAGTCAACACAACCAAAACACTCAAAGAAAAGTACAACAAATATT 
ACAGAAGAAGTGCCATACTCAACATGCAATACATCAATAACAAAAGGGATTATGAAGCAAGTCAATTTTGGGTGTATACAAACAATGCATAAGTTTTCAA 
ATAAATTTTATTTTATAATAAAAGGTGTTTTAATTATAATTTTTGTCCCTGATGTTGTGTTTTCTATATTTTTACTGCCTCCTCTTGGGGTAAGACATAA 
AAACGGTGGCGGCGGAAACGAGGAACAGAAGAGCGGACCCAGCCAGAAGCATCATATCCCTGGTCCTGTTCTTATATTTGTCCTCATCATCGTTATCGTT 
GGCAGTGTCGTCATCATCATCGGTGTCCTTATCAGTGTCAGGATCGCTGTCCTTCTTTGGTCCCATCCATACATTCATGACGGCCAGGACGAGGATACCA 
ATTGAGATAGCAACGATCAAAAAGAGACCCATGCGAGCCATAGACATGGAGCCTTCAAGATCTTCCATGGTGTAAGAAGGGCCAGCGGCCTGGGCTCCGG 
TCCTGCCGAAAGGAAGAGTGTTAGTCGTGGTGGCCATTTTTTTTCTGTTTGATGAAGAGGTACGTCAATGTAACGTAAAGACCTGTTGTTTTATCGTGTA 
CGGTTTTTGTCAGGACACCTTACGAACCTATTTGTCTTTTTAGACTTACTCAGCGTTAGTTGTCGCAGTTGGTCATGTTCCGTCAATTCTGTTCACTATA 
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TCTTCTTCAACGCCGGGTTAACGACAATCTTAGATCCACGGCGTCCGCTTCGGCAGCGGCCTCATTAT^GGAGATGGAACTGAATTTATAACCGGAGAA 
CCACCTTCTCATAAAATGAGGGGACCTTCTTATAGCGTTTTAGGACCTGATCCGTGCGAGGACCCAGAAAGGGTATATGTTGATATTGTAGTGTCTATTT 
TGCAGACAAATAATATACAGGTAACAAAAGAATGGGAATTGTTTTCCGATAAGTTGAGAAAATTGGGTCCATGGATTGATAGGAGCGGAATTGAGAATAA 
TGGCGAAGGAGAAGAAGATGGAGATGAAAATGAAGACGGGGGTGGAAATGGGGGAAGAATTGAAGACAGAGAAGCACATCGACGAAAAATGATGAAGAAA 
TTGTCCTTTGTTGGAAGAGAAGATCCAGTCGCTGTAGATTTACCCACGTGGCGAGAAAACAGTACAGAATTTGCACGTCGTTTAACACTCAAGGAATTGT 
GCGATTTAATAGTTGAATGTGGATGCATCAAATCAAAAGAGGAACTCTTTGACTTCATTTTTGAAGAACCGTGGGAGATTAAAGAGGCTGCTGACGTTAG 
GGGTATGGCAAACAGGAGTAAATTCACCAAGGAATCATTAATTGACTGGTTTTTTGAGTTCGACACATATAGTAAATGTGTAGTATTTTTTGAAGCAGTC 
AACTGGTACTTGAAATCTCAAGCGTCTCCAATTTCATTGGTACTAGATGATATATATTGTTGTGTCTTTTCCTACATAAGACGCCAAACCTTTTTAACTA 
GGGCAAAAAACCCATCTTTAACAGTGGCTTCATCCTTTTCTCCCACGCCCGACACAAAGCTTTTGGCTATCGACGAGTGCGTGCAACACTTTTTAAAATC 
AGACATTAATATTAGCCAGATGGCATTAACTGAAAGGGACTGCTTCTTCCCTCTTTTAACTGAAATGCCCCGCCAACAAAAAAAAGTAAACACCTTCCTG 
GACACAATGAAGAGACCTACCTTATCTCTTCTACCTTCCACCTCCTCCTCCTCTTCTTCCAACAACAAGAGAAAGAGAAATACTGCCGCTGCCAATATTC 
TTCTTCCAGTGTACAGGAGTAACTTTTCTACAGCATCCAATAACAAGAGACTGAAAACTGATGATGGGGAAAATGCATCAGCCTGTATTCTTATCGAAGG 
GTATGCGAATGGAAAAATAAGCCCTATAAGGATTATGGTAAGAAAATCAACTATTATTCCAGAAGTGTTTAACCATCTTTTGTTCCCTGTCTTTGCCTCT 
AAAGACACTGGTGCGAATATCTTATTTTTTATCAAAATGAAATCCTTTGCAAGTGCATCTTTACTCCTCCCTGGACTTTTTAGACACCCCAAACAATTTC 
TCAACGGGCCGTGCAAATGGATGACTCTAGCAGAAAACAACATCAACGACAACAACATAAACTCTTCCACGATGTGGAGTTACACGCTAGCAGATTATTG 
TCCTCTGGGCTATTACACCCAAGAGAGCCCTCAACCCTATCAGACATGCGGCAATTTTACTTCGACTACAAACAAGAGACTACAAAACGTGCAGCCATTA 
TACTTTTAAACACTCTTTTGGAATACTACAGGACACCTTCAGAAGAGTGGGAAATTCCGTTTAATCTCTTGCTTAATGTGATGAATAACAAGTGGAGTAC 
ACTCATTCCAGGTGTCAAAATAAGTGCAGGTATCATATCGAAACTCCCATGGACCATGAAAACAATGTACGAGATTGTTTCTTCGCCCAATAATAATAAT 
AACAACGGAGACTACTATTCTACATGCAGGCGAATGGTAATGGAATATCCTATCGGGGGTTTATTGCACACGCCTGCCATAACTAATAAGTATCCACGCT 
CCAGAATGGTCACCTGTACAAAGGGCAAAGACCACCAGAAGCTATATGACATCTCTAGACAAATGTTTGATATAATAGAAGCAAATGGACAACTCTGATT 
ATTATTATTACGGTTACGAAAATTCCCCAAAGACTAAAAAATTCAATACTGCACTGATGATTGGGTTCATTTTATTTATTTTATTAATCATCGGAATTTG 
GGGTTTATTTAGATCTCGAAATGCAACCACCCAAGAGAGCAAAACTTCTTCCCCAACAATCTCCTCGACCCCAACTACATATTCTGGCAGCTCAACCAGC 
AGGGGTCCAGGTTCTGGATCTGGAAACAAACCCAAAGATGACACATCCGTTGAAGGAATAGACCCTGGCTTACTGTAACAGAAAAAAGAGTAAAAGGCGA 
CAGCTCGCTTGCCAATTGTCCTGTTACGTACTCTGTGGTTTCACGAGGTTGTCATCACCAAAGGTAACCTTTTTTTTTGTCCTCGCCGACAAAACGACAT 
CTTAATAACCAAGCAACGTTCGATAAAGAAAAAAACTCGTCATGGATCTTTCTTTCACTCTTTCGGTCGTGTCGGCCATCCTCGCCATCACTGCTGTGAT 
TGCTGTATTTATTGTGATTTTTAGGTATCACAACACTGTGACCAAGACCATCGAAACCCACACAGACAATATCGAGACAAACATGGATGAAAACCTCCGC 
ATTCCTGTGACTGCTGAGGTTGGATCAGGCTACTTCAAGATGACTGATGTGTCCTTTGACAGCGACACCTTGGGCAAAATCAAGATCCGCAATGGAAAGT 
CTGATGCACAGATGAAGGAAGAAGATGCGGATCTTGTCATCACTCCCGTGGAGGGCCGAGCACTCGAAGTGACTGTGGGGCAGAATCTCACCTTTGAGGG 
AACATTCAAGGTGTGGAACAACACATCAAGAAAGATCAACATCACTGGTATGCAGATGGTGCCAAAGATTAACCCATCAAAGGCCTTTGTCGGTAGCTCC 
AACACCTCCTCCTTCACCCCCGTCTCTATTGATGAGGATGAAGTTGGCACCTTTGTGTGTGGTACCACCTTTGGCGCACCAATTGCAGCTACCGCCGGTG 
GAAATCTTTTCGACATGTACGTGCACGTCACCTACTCTGGCACTGAGACCGAGTAAATAAATCGTGCTTTTTTATATAGATAGGGAATTTTAATATTACA 
ACAATAAGTUVAATAAAACAATTGAGGAAATTTATACCATATTTTATTGACCTACTTAACCTTCTTGCTATACAATGAATGTTTAGGTGACTGGAAAAGTT 
TAGCAATATTATCCTTGAACGGGAAACATGCACCAATTACAGGCGCAATTTCATACGCTCTCGGCCTATTGGTCTTTTCCTGGTCATACATTTTAGATAC 
AATAGACAAAAATGGAATGTTTGTATAGATAGAATTGGCAGACAAATCTGCAGTTCTCTTAATCAAAATGGACAACATGTCTATTAACAAATAAGCCAAC 
CCAAAAGTCATGGCAGTTTCTGAACACAACTCACTGTTAATAAATTCAGGAGCTGTATGAGGATGGTTACTAAAGAACCTCTCATCAGTTCCCCAACATT 
TAAAATTGTAGTACTTTTTACATGGTACAATTAAACCAAAATCAATCATCTTAGGTTGACCAGTTATTCCATCAATTACTATATTGTCACTTTTTATGTC 
CGGATTCACTAATCCTTGTTGAGACAACCGAGTGACAATATTTACAATTTCTACCAAAACAAAGGGCAATTGGTTTAACATTCTCTCCCTCATTTTTCCA 
ACGATAGCTATGGCTGAAATTGAATCCGTAATGGGTTTCTTGCATTTAGATTGTAGACCTTCAGGCAGGCGGCCAGTAGCTTGAAGCATCCTAACACCGT 
ACAGAGTATCCCTCATCCTACACAAGCTGCCATGATCTTGCATGAGCTCAGCGGCTGGCTTGGAGGAAGTAGTCGAGGGGTTAGGTTGCTGTACAACATT 
CTCGTCCATTTTACATTTCAGGTCTCCATTTAATACATCAATCACAGAAATGCCTGCAAAATCCATCTCGACACAAAAACCTTCAACACACATCTTTCTC 
ACACCTACTGCGCCTTTCATTCTTCCCTCAAGAGCGTGACTAAAAACAGTTTCAAATACAAATTCACAGTACATTTCCTTGTCCGTCATAAACTTGATAA 
CTTTTCCAGTTTCAAGCATGTAATATACCCCGACTTTAACTTCGAGATATAATTCAGATGGTCCAATCATTACGACTTGAGAAGCATATTCTGGTTGGAC 
GTGTGCTGGTACAGGAATATTTCTGCGGTAACTTTCCCATTTCTTACATTGGGCAGTGACCTCCCCCAAGAGTGTAGTTTTCTTCTCCAATATAGATTCA 
CATGCTCCAGGGGAAAGCTTAAAAACATTCTTTGCCATGTGATCGATGCTCATGGTTTTTATCCACCCTTTACCATTATTCTTGCATAACCTGTTCCAGT 
TCAAGAGAGGGCTAGTGTCTAAAACGGATAGGTTTTTACTACCAAACCCAACAGGCCACCAAGATGAAGGGGGTTGTTTAGTACGGTTGTTAGACCACAA 
AAACTTGGACGCTTTTTCCTTTGCGTCAATACAGCTCTGTTTTATGGTCTCGTAGTTGGACGACACATCTTGCACGTCAGAAATGAGGCCTAATTTTGCG 
GCCTGTCTAATCAAATGGAACGGAGATTCTAGACCGGCTGCAATATTGGTATCAGCCTCTGCCACTGCAAAGTAACACGGGGGTTTTTCTAGCTTGACCC 
ACTTGTCAATGGAAGAAAAGTTCCTCTTTACTACATTTGTTTCAGTCACGGTATCACAAGAAGATTTCAAGTGTTTTGGCCCCTTGACATGGGAACGGAG 
GACAGACAGTGGAGTATAGTCTCTAGCCTTCAAATTGACAGTAGAGAATTTTGCACAATTTGGATGAGAGAATCTTTTTCTGTACTGGGTGTCGGCATTG 
GCGCGGTGCTTCCTCAACACAGACGCGGCACTAACAGAAGGCAATTGCTGCTTCTTGATAATGTCTGTCGCATACGTTGGATTTAATGTTGCAAATTGAT 
TTTCATCAATAGTAATAGGCCTCTTCTTCTTATTTCTATTAAAGTTGTAAGGTGTGCTCTCGTAGTCCTCGCGGAGCCTCAAAGTCACAGGAGGCGGTGC 
TTTTCCTCCCACCTCTCCATTATTCTTCTTTAGTCTCTTCATGGGCCTAAGAGAATAAGGAGAATCTGAGGGAGGGGATCGGCGAGTAGCGTTTTTAGAG 
CAACAAGATTTCACAGCTATGGCTGAAGGTATCTTGAATTGCCCTCCTCCTTCTCCTCCTTCTCCTTCTCCTGGCGTTTTCGATTGGTAAAGTCCCATCA 
CGGTTGCTGGCGTAAGTTTTGTCCGTTGGTCCCCACCCTCCATTTTAACGACGTTTTTTATACTCTGCTCGGGCTCTGGGTCAGAGGGGGCTATAGAAAT 
CACTCGTGGATAGTTCCTTTATATATTTAACCAATTTTTCAAACAGGGGTTATTAAAACTACTCTTCTTTTCTGTATATATTAGGTACAAACACACAATA 
CAGGTCAGGCCTGCAAAGGACTGGGTATTTACCAGTCCGAATATAACAGTCAAAAGGCTTATTTTCCTTGATCTAGAGCTGGAAATGAGTGAGTAAAAAC 
TTGCTGAGACATATACTGTCTGGCCGCCCCTTCCTTATTAGCCTGAATAGAATCCTCGAAGCTTACAACTACATTAAATTTCCCCATACCGTTCTGGGTT 
TTCAACATATTTGCACATCCAAACGTTACAGTTTTCATGGATACTACACCCTCGTCTCCTAACAAATCACAGTTCCTACTATTTTTGTCAGCCGCATTCC 
TTTGCCTCTTGACAGATTCTGTTACAGCGTCCTGGATTTTCTTGGCCCACAGAGCTTTAATAGATCTCTCGGACAATTGTTCACCATACCCTAAAACAGC 
ACCTCTGGCAGTAATGGGGCAGTAATTAAAACAGACAACAGAACCTCCTTCTCTCCAAATTCTGGTAGATAGATTGTCCGCTTCTTTATTCAACAATAAA 
CTTCTTAGACAGTTTGAAATAATGTCCAATTCTTTACTCTTGTGGGAAAAGGGTCCTCCAATAAATCGACCAAACACACCCGCACCAGAAGAAAGGGTCT 
CGTTAGAAGATGGTTCATTAAACATCATACAAAATTTGACATCAGAAGGAGGTTTCCCGCTCTTACAAAAATCCCAGAAGTTAGCAGCGATGGGTAAAAT 
CTGTACAGGGTTCACCTTTGAAGCTCTAACTTCTATTACAGCATTTGATATGTCAGTTTCCATGGCCTTCATTAATTTGGGGATATTCCTATTAACAAAA 
TCACACATAGGTATTTTCCATTCAGGGTGGATGTGGGGTGAAAGAACAAGCTCTAGAAGCACTTTAAAGATGTAATCGCCAGTGATTGTTGTGACTGGAG 
TGTTGATTGGGTATAGAGATTCTTGGTGCTTCTTTTCCCTTTCTGATGCTTCATCACAAACATCCATGCCTTCACAGTCATCGTCATCATCATCATCATC 
ATCTTCACTGTCCGACATTTGGTGGGCACTATTATTATTAGTGTTCTTGACACTGTTACCAATTTTCTTGTACATTTTTTCAATAGTTAAAACATTAGCA 
TTATCAGCACTCTCATTCTTCATTATGTTCTTCAGTGAAAGATCCCTCTCCACGTTGTTGTGTTTGGACACGTAAGCCTTGACGATCTTCCTCCATGTCC 
TTATAAAGTGTATGACTGATGCATATTTCTTACCCGTAACAGGTTTATGAGTCAGCGAGGCTTGGTAGAGCTTGAAGGCGTTTGTCCAGTGATTATATAC 
AGTCGTGTACAAGGCCTCGATGGTTTCCTTTGCCAGCTCAGGGGATGTAGTTTTACTCCCATATTTCGAATCCATATGTAGTAGTTCTTGTTTGTTGTCG 
AGAATGTCCTTCAATAAAGGGTCCAACGTACTCTTATTTAAAATTTTGTGTTCGAATTCATATATCAGCGAACAAGTTTTCATGTCTTTGTATGACACAC 
AACATCTCACAAGACAGATGACTGCAGTCCTCAGAAACGTATCACTTTTCCCACTTCTTTCTTTCTTCGTGTCAAGAAGAGGTATAACGTTCTTGTACCA 
GTAATCGAGTATAGATTCTGCATTGGCTGCGATGGTTTCTTTAACAGCTTGGGGTAAATTAGATACAGTTCTGTCCTCTCCTTCAAAAATCACTTTGTCT 
TCTTCTACCTGGCGGCATTGTCTTTTGTTTGTGTTTTTGTTTCCAGAGAGAGTAATTGCAGAATTTTTCTGTTCAACTACCAGAACTCTATAAAGATTTT 
CCACTTTTTGGAAGAGAGATTTCTGGCGGTCATCTTCAACAACTGGAGCCATGTCTGTACCGGAGCCTIACTTTGTTCACAACTAACCGCTCGGTGAATGC 
GTCTTTTAAAGCCATTACGGTCCATGCCATATCTCAAAGGTTGCCAATGACCAACATAAATATCGGCACAGCATACATTGAATTGCCTAATACTATAAAG 
GAAGGACTCGATAACCCGAAACCTATATACTTTGATCCTTTATCTAAAAAATATAAACCGGATACTTGCACTGACGTGCTGGACTCGTTGGACGTGTCGC 
TAGAGGACTATAGAAAAAGTATAGGATGGCGTCCTCGGGGGGATTCTTTACAGGAATAGATGACCTTTTCAAGACAGTGATTCAACAAGAAAAACAAGAG 
AAAAATAAACCAACTCAAGCACCAGAAACAGAACCAAAACCAGGGCCATCTCAAGCTCCAGATCCAGTCCCAGACCCAGTTCCTAAAACACCAACCAATT 
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TCTGTCCTCCTCCACCTAATCCTCTCCCTCCTCCTCCTCCTCCTCCTCCTAAGCCCTCAAGAGAAGAACGGCTAAAGACGTCAAAAATACGTTTAAACAA 
AGCTCTTAGTGATATTGTTGAAGCCAC7VAACGAGCGTGTTGATGCGTTGAAAGAGAACCAAGCATTAAATACAGAATATGACAAGAAGGATAATTACTTC 
CAGGTTTTAAAGTGCTCGATAACACCTTCTGTACCAACAGCTATTATAGGCGCACACGTGAAACAGGTGGCCAAAAGTAGCGAAATCGAACTGGCCGTGA 
ACGAACTCGATATAAAAAATAAGTGCTCTTTAGTGTACAACGAAAATGAGTCGTTAAAATTTTTCAGGGACCATGAGAACCTTATACTACAAATTGCCGT 
CCAGTTATTCTCTAGGCACGATAACACCAAATGCGTGGGGGCAGAAATATGTGTTAAAGGCAACGAAAAAAACAAGTTTGTTAACAAACTGGTGGTAAAA 
AAACTCCCCAATGCACCATCCTCATCTTCAACTGTGCTAGAAATTAGAGGCGCTACCAGAAATTTACTGGAGAATAATTTCAACAAGGGAGAAAArAACA 
CTGTCAATGAAAACAAGGACATTCCTCCTTCAGAACGAGCCAACCTGGACACGACCAAGGCAGAAATATCGCACGTCTTTTCCACTCTACACAGACTGGA 
CACTAAAAGGAAGCTTTTCTTTAAAGGCAACACTTTTTATCAACGAAAACCAACATTCGATAATAAATTCAGGTGGACAGAAGTTATAGGGTGGACAGAA 
AGTGAAGCATCAAAACAAACCACTAAATCGCTAGACAAGCCAACGGACGACAATTTATTCGTGCTACCCCATTCTTTCAATAATTTGGCAGACCACTTAC 
GTTTGAAATTTAAAAACGTCCTCTATAAAAATAGTACCGCACATCCCGGCAAACGAAATTACTACAAGACTCAAGAGACGCTAATAAATCCCCAGATTGA 
TTCGGCGAAAGAGTACAAGATGGTCTTTGCAGAAATCGACAAGTGTTTGGATGTTCTTTTGGCCATAGGGAAGAATGACAAATACACAAAAAGCACTGTC 
ATACAATATAGAGGAAAGTTTAGAAGGTATTTAATATTCTGCTACGCCTTTTATGCTCTAAATAAGGCAAAACATTCTCGCGCAGTATCCCCTCTACCAT 
TTAATTTCTTTAACCTTTTCTCCTTCATGTATTGTCATGGTCCGTTTCTCCATTCCGCCAGTTTTTTGTCCACATTGACGTTCGTCTATCAACACATGTT 
TTTCCCCATGGGCACAGCCGCCCCATCCGTCTCAGCCAAGCGGCTCATGGATATCGATTCCGCCCTAATGAAAGGAGGAAAGGGGGTGGGTGTGAGGGAT 
TTTGGTTCACCTTCAAAAACAAGTCTCCATACAAGAACATTGGTGTCTTTCTTAGGTTTTGCTGAAATGGCTATGGGTVACAATGACGGCTCTCTTATCTG 
GTGTAGAAGTGCGTGTATCTCCAGCTCTCCAACAAAGGATATCTAAATCCCTAGAAAGATGGTGTGATTCAGTCATCTTTATATATTTCACCTTTGTTTT 
ATTCCACAGATTCAGTGGTGCGAAAAAAGTATCACTCGAATCGGCGCTTCGCCTCATCATGGGGCAGACGCACGCCCACACAAATAAGGTGAGGGCCGCC 
AAGAGATGCCGAATAGAAGCAGCGGAAATGGAAGGTGTGGAAGAAGAAGAGGCGGGCCTGACACTCTCTTATGCCCATCTATTGGGTCTTCCTTACTCTA 
TACAAAAAGCCCTCGGATTACCTGTCCCTAAGATAAACCCTCTCATGACAGCATCTTCTTCTCAATACAATTTAGGGGATTTTGTAGGCGTGGAACAACT 
TCTAAAGGCTAAGAGAGAGTTTCCAGCCGAAGGAGAAACCGCAGGATTTCTCGGCATGTTTGATAATCTAGTGAAAGATTCTATTGACAAATACTACGGC 
GAAGGAGCCTTTTCAGACGTGGTTGAAAATGTAAAACAAGGCATGGAACAAAACACACCGTATGACACATCTTCAGCGTTGATGACACCTATCCCTAAAG 
CATTCTACGAAGAAGAAAAGGATGTTCCACAGCAGGAAGAAAATTCTACACAACAAAGATATAGATTGAATAGAGACGTGGAGGAATATTTAATGGCTTC 
TCCTATGAAGATGGTGTTTGTGTCTATACTCGATAAAACTAACCAAAAAGAACGTTTCATGTCTGTTGGGGATATTGCCCTTCTGGCCGTGTGGTGCAAA 
AGGAACGTACTGAAAAAGGATTGGAACGAATACGCTATCGCTMAGGCAACTACGAATGGCTTGGTGCTAAAATGTGCAACCATTTACTTTTAGCTGATT 
TAGTGAATTTTGGAATATTAGGTGACTTGAAAATAACCAATAAACTTGACACAAATACCGACACCTTTCACAGAGACAGTGATAGATTACCCTCAGTTGC 
AGATCAGAAAAAATTTATAAAAAACACATCCCTATCTGATCGAAAACAATTGGCCCTTGTTCACTCGTGCGTTAACGTGAGCACCCGAACCCACGTAGGA 
AGAGTGACTGCAACATCATGGGCTGTCGATGCGCTTCGTACCTATACAAGAGGTGATAAAGACATGTTTGCCGCCCTATCTTCATCGCTGGATATGTACC 
ATCTTGGGCACACGAATTCAGCTAATTTTGTTCCATATTTTAGTAGAAATTACCTATGTAACGAACAAGAGAATGGATTGTGGGGGTATACTCGCAGAAC 
CTCTGAAAAATTGGCCAAAGAAGAATTGGGAAGAGGACGTTTAGGGGGCCTGAATAAGGTAGGGGTGGCTAAAACAGAACTGGCTGCTGCAGCCATTGCA 
ATTTCTTCTGCCTTAGATATGGGGGAAGTAGAAGCTGTAATGGACGACTCTTCTAAAGTGAGAAAAATAGCCTCCACCTGCTTAAATGTTAATGCAGCCA 
AGGTCTCGGCCGCCAGAGAAAAGGCGAGAGAAGCTAGTATTAAACGTCTTCTTCTGGCCACTAATGCACCAGCAGCTGGTTCATCCAGAAACAGTAACAG 
GTTTCTCCTCAAAGATTTGTGGGGGTTCTTTTCTGACCCAGACAAGCGCCAGAAGCTTATAAAGGGTGAAGCAGTTTCTGTACTATGTCCCAATACAGGA 
TTTCTTCATGCTGCTGTTCCTGATTTTGTTATTGAGTATTCCTTCGAAAGTGAAACCTCTATAGTGAGATTACGTTTGAGACTGATTAAACCTGAAAAAC 
AAGACGAAATGGTATGCCCTTCAACAGCTCCCGAAGCTAATAAGAAGAGGAAATTAGTAAGGAATAATCAAGACGCTGTACTGACGTTGGATGATGAAGA 
TAACATCGTTAAATACAACAAATATGATATGGTTGAAGACGAGGAAGCGCGTGAAAGATTACGCCACCAGGACAAACAATCGGTTATTGCAGCCCGTATC 
AGTAAAGTGTGTGAGCGGAAAAATCCAAAGAAAAAACGTCGTTTAGAAGACCCTGAATTGCAAAGTGTGGATGAACAATTGATACGGGAACTGGCTGCCA 
TTGCCTACTGACGAGATCTAAGATTTTGTAATATATTATGTGTCTTGCAGACCTAATATCCTGAGGATTAATAATAATAACAACAACTGTTCAAAAATGA 
GAGACGATACTTTTAACCAAGAAACTGCGGTGAAACTTGTACGATGGTATACAGAGTACGATTGTTGTTGCCCATTGGTTAACCGCGTGGAGCGCCTTCT 
AGGATCGTTCGGAGGAGGCGTGGACGCCACGTCTGTGCGAAGCCGACCGGCTCTTTATGAAGAAGATAAGAAGGGAGATAAATGCATACCCTTTAGGATA 
ACGTCCCTTATTGAGGGTATACTTTTGGAAAGGGCTCTAACTAAACCCGATTTAGCTGCTGCAGCTTTTGATGTATCAGAAAAGCTGGTGTATTGTAGTT 
GTAATAACACTCAAGGCAATTTTGATGTCTCTTCAATGACCATATGGATTGATGGCAATAATAGTAAAAAGTATGAAGTTACATGCCCGTCATGCACTGT 
CGAGAAAATTAGTGGAGGTGCCGAATCTATTCACAAGAAACCCATGTCTCTTCTTGCCTTCTTTAACAATCTGGTAGAGAAAGAAGCCTTCGCCGAAAGA 
ATTGAACTCAAGAAATTGTACCTCTCCTTACTAACGGGCTCGGCAGCCGGAGGAGGAGGTATGTACAAGGACAGCTCCCAACAATCTTCCTTCAACGGCT 
CTTG G AC GTCGTTGT TGT TC C AC AC AT CT AATW^GG AC AAGACTC GT TT AG AGGCTG AAGTTTT AGTC AGTAAQAAGAT AAAAC AC AC ATC AAG AT T GC A 
GCCTAGGTGCGTCTGTTCCGATCTGTTATATGCCCTATGCTCCACCACTAACAACTCTGCATCTTACGCGTACAAGGCAAGAAATTTGTGTGTTATTGAA 
GGTGGGGAATTTTTATATTTTAAATACACAATCTTTGAAGAGAATGGACCTTTCGACTCCAAAACAGACCTTCAATCATTAGTTAATAACGAGCCTGTTT 
CTGAGACAAATTCATCGGCACTGGCCGCTTCTTCTTCCTCTTTAGAAGACGACGATGATTGTTGTGATGATGATGACGATGATGATGATGATGAAGACGA 
AAAAACTAAGAAGAAACAACCCAAGAAACAAACCAAGAAACAAAAAACAACAACATCAACACTTCCACCTATCAGCAAAACCAATCACGACAACATGTTG 
ATGAATGTACTTAAAAAAGGAGCTGTTAATGGAAAACGGAAAATGATGGATTCTTTGTCAGGAAAAAAGGGCCAACACTCTAAAAAATTGAAAACCTCCG 
CTGCTGCTGGTGGTGGTGCTTCATCCGACGTTGTTGCAGGAGAAAATGAGGAAGAGAACAACCCCTCTTCAGTGAGTCCTACTAACAATAGGGATAGAAA 
AGACTATGTGCTTCCATGCCCTCAAATAGAAGAAGTCACTATTTTTTCACAACACAGGATGAACAATAACAAGTTGGCAGAAAGTGTAGTCAAACATTCT 
GTTGTTATTAATGGAAATTGTTTAAACTTGTTTGTTACTCAACACAGAAAAAAGTATATCCTGCCTCACGAAAATATTCTTTTTTGCCCACCTTTAGTCC 
AGCATGTAGGATTTAACAAATTTCGCATTTTGACTGGCGTTTCTTGCTTTTTTGATAGAATTGAAATTGTTTTTTCCGACCAATCTGACTCTGTGGTGTT 
GAGTAATAATGCTGCCCATTCGGCTATTCTAAGGCTATTGTCATACATAAGAGAAAACTCATTGAAGCGAAGTGTGAGGACTGCTTCGGTGAAAGGAATC 
GATTTTGTCGTGAAATCACAGGACACTAATATAGGCATACCTTTAAGCAACAAGGAAATAAGAGAACGGCAATTATGCTCAGCTTCAACCCTGAGTATGC 
TAGCTGGTTTGGGAAAATGATCACCGACCCGGGGGTCATTTTACCCGTTTCCAAGGATGTTGTACTCTTTGGATCACGAGGACAATCAGATGTGGGAATA 
ATGACACTGGACCCTCACGATTTGGATATAAAAATCACATCTAAACGCATAGGTGTGGAAGAAAGACTAGCTCAATACAATACTCTACCTATGGATTTTA 
CACGGGCAATGGAAAAGGAACTAAATAATAGTAGAAATATGAAAGAGTCAATATTCACGGGAATATTTTTAGACACCGGTTCGGCAATCTTCGAAGACAA 
CATGTTCAACGGAGGAGGTTCAGCTTTGCGCTTAATTAGATCCCCCGCTTTGAATTCTGCTGTATTTTCAAGCAAGAACTACATCATCAAACAATTGCCC 
ACCATAACCAAATCTCTAAGGAGAAGTCAAGCTAGAGATAAGCAAGTGGATAAAACAAGAGAGAAGATAGTGGTGGATTCTTTCAGCATACTTAGTGCTA 
TAGCTGCTCAAGTAATGCACCTCACAGACGGAGAGATGACGTACGTCCCCGATGGGCACTGCGTTAATGTTGTCATGTCAGAGACCAATGCTTCGTCCAT 
CTACTTGATCATAAACGACCCCACTGGTTCGGGATGGATUUVTTATGCCTAACAATTTCAATAAAACACTTGAAATGAGAGACGGTGTAATAGATAGAGTA 
GAAACATTAGTGGAGTTTGCGTGCAAGTGCGTCGCATCATCCTTGATTAAAAGGGGCATGGATTTAGTGGATATGCAAAGAACTATAAGGTCTATGGATT 
TCCTCCCTCCAGCTTCTTCTACTTCCAATAATACTCCTAGAGTAGCGATAATGACATCTGGAAGTAGTACTACTACGGGCATTGGATCCTTGTCCATTCT 
TGCAGAAGATGGATCAACACACCACCAAATCAAGCTATCGGAATATAGGACTGGATTATCCATTACTGAAAATAATAGAGAGGTGTCTTTTACGGTAGAA 
CCTTCAATAGACGGCGTTCAAGCAGAGCATCCTCTATCCCCTTCTATTCTTCAGTGGTTACCTCCTCTAGTTAAAAGGCCAGAAGTGGTAGCAGCAGCAG 
CAGCAGCAGTAGTAGAAGAAGAATIACGGGGACAATAAACCTTCTGATAAAGATAACGAAGACAAGTACAGTGATACTGATTTTTGGTCTAATGTCCCCGT 

cacacctctaattacacccaagaaatggagagcgtgcaaaataaacgatcgggcaatgattagtagttggaaaaataatctagtgaaactccacaaatat 
gattggacaaataaaactacaaaggttgattattttgataagatggctgcctttgttgccctcatgacctttagaaaattccaagacatactagcggata 
actatgttcctcctcaaaccccttctcagggaagtgaatacgcagtgaccatgtctaacgtggctacactctttactgacgtgtacggttttgaatcgaa 
tggaaataagccattgtttgccctagaacagctagaaaatgaaaccgggattgaaagcatatacgtcctaaatatcataggaaattcccctgatggtaat 
tctgtcaggctcgtcagactggaaaaggaaatgagtttcctcttgaaggcgaagcagtactttacagaaatggccatacctcctattaatgaaaaatgca 
aatggacagataaggccccgtcatctgtaaaggagtacaagtatttttgtgatcttvacagcacccatttcaaagagacctagaaaagataacaacgacgg 
cggtgtggagcattctgcgttgacttatacacctaggtgcatataccacactgaacgttgtttagtccatctttactctgagccagaaaaaataacagaa 
cacgtatctttcaacaaggatttgaacatattagaaattggaaaaaatattaccaaccaataccaaacaaactacaaaagcatattcgat^ttgtggacg 
ttcccataattgtcgcctctatgtcatcaacaaaaacaatgactgtaaacaactacataatttcaacaccttctgccacgaccaagtttgttcaggatcc 
gccaaaaacagggaaacaacttctggcagttgaagaggtcagaaactttaaactcaaatctgttcttgttcctcctccttattttagggacaataagcgc 
aacacaactctttgttcacaaataactgaacaaaattgcccgtcatcttctgaaggtgggcgtttttcatgtccatcagagtcacttattctcaagtact 
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CTAATCTCTCTAAAAAGCGCGCACTGGAAGAAATTGCCCCAGAGACTGAGACTAGCATTTTGTCACTAGCCATGTAACGTGTCTCGATATATGACAATTT 
AAACAACCTCTTAATAAAGAGGATAATTATCATAATAGAAAATTAGGTAAAATGGGAGTACAGAAGAATATCCTGGTCGGTGGTGGTGGTGGTGTATCAT 
TGTTATTGGGTGTTGTTACACTTCTAGGAACAGTAACAGAAGGAGCACCGGCAGTCCCACCTTTTTCATCGTCTTCCTATTCCTTTACTCCTGAATCGAG 
TGTTTTCTGGGTCGAAGGAAATCGTGTTTTAAGTGGAACCAAGAAGGACACCTTAATCAACGTTCTGGGGAAGAAGATTCCTTATTATGCTAATTCCATA 
TTCAGACATGACTGTTCTGAAACTCGTTCTATTCAATGGCCAGAAACTTCCCCTTTGGGCTTGAACCTTATTTTCTGTTCATGTGCGAGTCATGAACATC 
AACACCGTACTCATGAAACAACAGAACCTGACGATTTATTGTGGGACGGATCAAGAAAAACTACCACCATAATTCTACCTAAAAAATGGTGGTCTGATGT 
TGTGTGGACATCTTTATGGAGGGATAACGACCAGAAGTGTGGGTGCGGGCAGGCATTTGTTTCTTCCTTTACTTCTACGCAGAAAGAAGTCCAAGGGGAA 
TGGCTTGCTGCTCACACTAACGGAAAGACATCTGAGGGTGATACTAATTCAGCCTACCTTTTCATAAGCCTTCAACGAACTACACTCAAGCCTATCATCA 
CTGACGTCACAGAAGATAATATGATGATGGGAAGAATGTCAGGCACACCCATGAACCCTAAGGATATGACATATTTTGTCAACGATTTTTCAGACGATAT 
AGGAAGTACTCCTCAGTGTCTTGTATCGAATTCGGACATCCTGAACAAGAGGGAAGAATGGATAGCTGTTTGGGGTGTTGCAGACTCTAAAGACCTCCTA 
ACTAAACATCAACTGGGGGAGCGGGAATATGGGAGCGAAGGAAGAAGAAGAAATCCCGGTGTTGAGGAGGAAGAAGAAGAGAGAGTGGAAGAAGAAGAAG 
AAGTAGAAGTTGCGCTACCTTACATTAAGAAAAGTGGAAAACTTATCGGACCTCGTAGAAGACCTTTGACAACAACAACAACCACTACTACTACTACTAC 
TACTAATCCTATTGTTAGAGAGGTTGTGGAAGATTTTGATTACGAGTCTTTTAATGAACCAGAAATCTTTGGCAGTAACTCAAAACTTCCCTTCATTAGA 
TTTCTTGATCAAAAAAATTGGAGACTTGGTATCATGAGCAGAGTTTCTTCTTCCATCGCCAACTTTAAAATTGAACAAGAGTCATCAAAAGCCTTATTTT 
GTTTGGCAGTCTGGGTTGGGGATGAACATACCCCTAAATTCAGACTTAGTGTATGGAAGAACTGGAAGCCTTTTACTTCTGCACCTATTATTGTCCAGAA 
TGTAGGTTATTCCTCTGATGTTTTCTGGCATGAAACTCTTAGAAGCAAAATTGTTGATCGGTCAAGGGACCTGATAGAAACAAAAGTGACAAAGAAAATT 
GGGGAAGATTGGGCTAATAAAAAACAAACTGTAGTTGCTATGTTTATTTCAGGTATTGTTTGTATAACAGTAACAGTTATTTCTATATTTTCAATTGTAA 
TATATTACAAAATAAAAATGCCTAAATTCTAATACAATGGTTTATTCATTCCTTATTCCTACTACTAGTGTATAGCACATGAAACAAAGAAAGAGACACG 
CTTCCCTTGTTCCTTAAACAATATGTTTGATACATTTTTCACAATAAATTTGTCAACTTTATCCATTACTTCATTACCTTCTCCTGATATCTTTTCCAAA 
TCCACCACAGTTCCAACACCAGTAAACATGCTCTTTCTGATATCCAATGTTG/iATACTTGGCAAATTCATGCCCTTGCATTTCAGTCATGGCAATGATTA 
TAGCACCATACATTTTACATTTTTTGAATAGAGGATATGTGGCAAAGTTGGGAGAGAATATCTTTGTGGGAAGACCTTCTTCTTCCTCAAAAGTACTTTT 
CAGTGCTCCTCCAGTTCTCTTTACATGCAAAGAAAATTGTACCCTTTTCATTGGATGAACTTGTCCACAGAACAGGACATCTTCAGACATTGAAAAGTGG 
AGATTGTTAGTAGGTGAAATTGTCTTTCTTGGTGCCAAAAGACTAATCGTATCAGATGGTGTTTCTGGGTTACTTTCAAGAATACTTTTGTCCTTAATGG 
GGAATTCACTAACACCAGAAGCTGCTGCTATTTTTGACATGAACATTAACATACAATATACATTAGCTTCGTCTGACATACACAGCCATCCCTTTTCTGC 
ATCTACATGGATATTTTCTCCCTTCTCTTGAGCTGTGCATGTGGACGTTACAATAGACCCATGTTTCGATAGACCACAAGTTTCCGTAGGACGACAAAAT 
TTACCAGTCCTACGCAAGTTGTGCTGCATACTCTTACATACAACATCCATGTATTTATTAACCTTATTTTGATTAGATGTGTCACTAACTTTTTTCAAAA 
TACCTTCAATGTTATTTGAAAACTCATCCGCGCTTGTTATAGGT^GACGCAATTTGACATCTCTTTCTCCCCCGTCCTTGCAATGGGCAAAATCTTTAGA 
CTC GT AACC C AG ATTTGGT AC ATG AGTAG AATTGGTGGT AAT TGATTTTGGGCGT AC AT ACT G ATTC AG GC ACGG AT TGG GAGAT GG C AAAAAGGCG AAT 
GAAGAATTCTy^AACGTTCAATTTTTCATACACTAAAAATATATAATGGACATAATVATTGGAACACAATTGTAGCCTTGTTATGGGGTGC/^TATCAAACA 
ACAACATTTC/UVTAGTGGACTCCTTTGGTGCCCTATTCACAAAAACTGGGTGTATTTTGATATAGTCTTTCTTGTTCTTTTGTTTCGTCTCATATTCTTC 
ACACAACTTTGAAGAGGGCTGTTTTACAATACCACTGGCTATAGAGTAGTGTGTGATTGAACGAACGATAGCAGTGTATAGCTCAATCATTTTCTTGTTA 
CATAATTTCTCATAATGTTTAGCCACATCTCCATTATTAACACCAAAGAAAAAGTCTCTAATGTTTTTAAGTTCTGTTTCTAGCCCATGTTGCTGAATAA 
CCTTAGCAGTGTTGGCCATCCAAGACGCATTAAACGATAAAGATTTTCCCGATCTACTTAGGATGAATTCTTCAGCGGCAGCTACCACATCTTTAGAAGG 
GTCATAGGCAGAATACATTGTTACTCCTCCATCTCCATTATAGTCATCATCCATTGCAGCGGAAGATTCTGCCACCTCTCTAGAGACACTAACTGTATCG 
TGGTTAATAAAGTCGCTCCCATTAAAATCAGCAATGCCTCCAATAGGCATAGTATAAGCCTTGTCTTTACTCATGGCGCATTTAATGTTACACAAGTTAC 
CCCGTTCAACAGAATGATTTTGAATAATGAACAAGTTTTGAGGATTACTATTGCGGGTTACAGCAACATACGCATCCTGGGTAGATGCACCAGAAAGGTC 
AACAATAGTGTCTCTAAAGAAAGTGTGCCCTTGGCAAGAATATATCGTCATGGCTTGACTAGATTCAAATGGAAGATATTGAACGTATACATTGTTCCCA 
AATCCTCCATTCTTGATATTTGGATTACCGATAATTTGTCGTCCTTCTTTAATTTCTAGGCATTTTTGTCCTAGCCGTTCCACGTACACAAATACTGTCA 
ATTCACCATTCTTGTACATTAAATTAGTAACCACTCCAGTGTCCTTAGTCACGAACCTTTCCTGAGTGCCATGGATCATTCTATTAGATGTAGTGAAAAT 
CACATTTTGGCCCTGATAAAGAGTAATAGATTTAGCCCTAGTTACCTTCTTTTTCTCATCCAAAAGAATCGTAGTTAATGCCCTAGGAGATTCGGCTGCC 
CTCAAAATTTTCCTGTCTGTAAACTTGTCAAAAAGTGGTGTACTGCGAGTTTTAATGGCGTTAATTAATTGTTGCCGGTACTTGTGAGAGATGGTTTTCT 
CGTGGCTAATGTCAGACAGATCCATATTCAAAACTACAAATTTTACTTCCACTCTTAATGCGGGCACAGAATTTTCAACTGCCGCAGCCGCTGCAAGTGA 
AGTAGAAGAGGAGGATGAGGAATTGTTATTGTTCTTTTTAGTTTCTTTGATGATAGAAGTATCCACTAAATTCGAGACTGCATTCTTTAAAACGTGTTTT 
TTAGTGAGTGTGATTGCACCCATCATCTTGTCAGTTTTGGAAGAAATTGCATCCTTTTCTTGTTTGGCTTTAGAGTTGGCCATGAGGCGTTCACACATAT 
CCTTAGGCTGCCCTTGTTCTTTTGTTATAAACTCTTCAATCTTCTTCCTCCTCTCATTTCCCTTTGAAATACTAGTCACCATTTGCATGGCCAACACTAT 
AGATTTTTCAGAGTATTTAAGTGAGTTATTGTGCCTATTGTTAATGTTGTTATTTTCAGTGTGAGCAGTTTGAAGCACTCTGGAAATCTTTAAACTACCC 
TTAACACCCTCCATCATTTGACCATTATTGAAAACATCTTCCTTTCTGTACATAAGAAATTCGTTAACCATTGAAGGGAAGATGTTATCAAAATATTGCA 
CTGGGTCAAAGTTGTCCATGAAAACACTCCCATTTCTGGATGATCTACCGGCCATTCGCGTATCTACCAAGTCTTGAAGGTTGGCAGTCATATCAAACAA 
TGAAATACCGTACATTAACGAATAAGCGTGCTTAATAATGTCTATTCTCTTTAACGACGACATGATATATGTTCTCGAAAGGGCAGAAAGATAGGAAGAA 
TATGCAGGTAATAACATACTCTTCCCTACATCTCCTCCTGTTGATATGGGCATAGATAATACCCTCAACCAATATTCGTCATGGAGAGGCTTATCTGCCC 
CTGCTCCTCCTCCTCCTCCTACTCCTACTCCTGTGCTTGT6GCCCATTCTGAGGGCGAGTGGAACAAGCGTCCAGACAGTGCAGATTTATCAAGAGAAGG 
GAGTTTTCCATTATCAAATTTAACCTGTTGCATTCTCACTAGTGTATACAAATACCACACACGGGCTCGTACCAAATCTGAAGGCTTATTTTCAGTGAGC 
CATTTTTTCCACACAACATGAAAGTCAACTAAATGACCTACACGATCTAGAGTAGATGCGTCACGGTGCAATTTATCTGGTATATCAGTTCTATATATCA 
TTTCTCCACTTTCTTCATCATCGTCATCACCTCCCCCACCACTACTTTTCATACCAGAACCTCCACTTCCTCCGTCTTCCAAATCCATTGCTTCTTCTTC 
ATCGAAATAACAATAGTCCTCGTCGTCCTCATAATAAGCTGCTGCTGCAGCTGTACGTGATGATGCACTATTATTATTACCAGTTTCTTTCTTCAAGAGA 
ATAGATCTTATATTTTCAGATTTCGCTAGTAGTTCTTTGTATGTTAAAGTTTCATTCTCTAAAAATGAAGCAACTGTACGCAAATTATCTTTAAAGTAAT 
TAGCGATAGGAGTTTCATCCACAAAGTCTTTTCCATCTCTAAAGAAGGCGTTCCTCACACACGCTGCCACAGCAGACGCACGGTTATGTGCAGAATTGAA 
TGGTACTGCATCTATACACTTGG6ATCAGAACCGTGCACATACTTGACGTTCAATATCTCTTCATCCATTAAGGAGGTCAGATGTGTATCCTTAAATCTC 
TCTGATAGGCTAGAAACAATATACAATTTTTCTCGGCCACTGTAATCAATTTCTCTCATGGCTTTGGCCATCATGTCGAACACAATACTGCATACATTTT 
CAGTGTACTTTTCTAGATTTGCATTGCTCATTTTCTTGAGTTCAGATATGTGAGCTAGCTCTGAACATATTCCTTCTTCTATTAGAGCGGAAAGGGGGAG 
AGATTTCATCATATCCAAAGAAGACTTCTTGTCTGATACAGAAGTTGTTCCACTTCTTATCTTTTCGTTTGCATCAGAAATGCTCTTGCAGCGTAAAGAG 
AATCCATTCAAAGCTCCCGATGCTCTGTCCTTTTCAAGTGCATTTCTTTCCATCAGTTTCAAAAATGCTTCATTGTTAAAGAAATCGTCATCTTCTTCGT 
CATCAAATTCGTCACAATTTTCTTCTGCCGTGGATTGGGCCGGGGAAGAACTTGGTGCCGTCACTTCCTTTTTATTAGAAGCATTACCACAGTCTTCAAC 
TTTAATAGCACTTGAGACTAAATTGTTGTTAACTGACTTGACGTTATTACTCTTCATTAAAATTGCTTCATGGCGTTTATTGACAATATTCCGCAGGTTA 
CTTGGGTTGCCAAAAAAGTCTCTAGAGCAAAACATTTTCAGTATCTTTATAAAAACTTCAGTGTATAGTTTTGCATTATCTTCTTGGCCAAAATTATCTT 
GTTCGACTGACCCTTGGAAGAAGGTACTCTTTTCTTCCTCTTTCTTCGTCTCAGCTTCATCGTCACTTTCTTCTTCTTCTTCCTCCTCCTCCCCCTCTTC 
TTCTTCGTTTTCGTCTCCCTCTTCTTCTTCACCACCTTTCTCTTTCTTCTCTTTCTTCTTCTTCTTGTTCTTGTTGTTCTTGTTGTTCTTCTGAGGTTTT 
TCTTCCTCACTCGGGTTCTTTATTTTCTTCTGGATATGGTTAGTGTTATCTTCCACTACTGCAGAATTTTGCCTGTTATCTCCAAGTAGTATAACAAACA 
CTCCTCCGAAAGGTATATCTGGCCTGTGTTTCGCTATACGTAAAACAGCATCAATAAACACCAATTCCCTGCAACTGGCCATTGTATACTCATCGATAAC 
AAGGAACCTTAAATTTGCCAACACTTGTATCACTTTCCATTTTAGTTTATCGGCCACAAATTGGATTTTTTCTAGAGTTATTTCATCTTTCCTAAATATA 
ACTGGAATCTCAAATCCTTTGTTGAGGGTAGAAAACACACTTTCTGGGTACATCAAGGAAGAACCTCCTGGTTTCTTGCATGCACACATGAGACTCTCCT 
TTGCAGAATTAGATTTGCAACACAGGAGCATGTCTGCAGATCCAGAATCAATCAAGCCGTGGTCGTTAAGAACTGCAATTAGTTCCTTGACCATTGTAGT 
TTTCCCAGCTCCAGGAAGACCACCCAGATGGAACACAGATATTCCTCCTCCAGCATAACTGACAGAAATCGAGTTGCACAGATACCTCACTGCCAGAGAC 
TGTTGCAAGGAAACAGTGCTCAATACCTTTCTGGCCTCCTTGCACGTATCTGGGAGCTTGACCGCAGAAGCAGAGGAGGAGGAGGAAGAAGAAGACCCAG 
AAGGGGATGCCGCTGCAGCTGCAACTGCACTTCTGAGCACCTGGGTTCTGGTAGCCAGAAAGGGGTATGATTTGATGTTGATAACTGGAACTGGATTTGA 
ATTATTGTTGTTGTTCTTCTTTTGTTTTTTGGTAACTGGCTCTTGGCCGCTGGAGCACGGACGTTTACTCCCCATGATGCACTCTCGGTGAATGTCGGAC 
ACGGGGGTAAAACCAATCCTTTTATACTAGTTCTACCTCCACGAGGTTGCCAGATGGTACGATACTTTATTACCAAAAATGGATCGTCCTCTTTTGCTTA 
AGATTTTGTGTGTAACCCTTATCGTTACGATTGTTACAATATGTCTTATGGCATTTTTTATACATAGCAATTCCTCATCTTCTTCAGGCGCCACTGCCGC 
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CGCTACCGTTGCAGCGGCAGGAGGGATATCTTCATCTGCTTTATTTGGTACCTACATAGGGTCCTTGGTGCAGTCRGAATTTTTTTCAGAAGATATCGAT 
TTACCAGAAAAACAAAAGAAAAAGGCTGATAAGGAAAAGGCATCAAAAACCGATAACAATAACAACAACAACAAGCGTCAACGCCGCCAGACAAGGAAGC 
AGAGCCCGCGCGCCAGAAACAGGACTTTAAGAAGGAAGAGGAAATTATCCAAAAGAAAAATACACGTCAAGAGAAAGAAGACTCTAATTAATCGGCGCAG 
TCTCAGAGGAGGAAGAGGATTAAAGCGAGCATCTAGAGGCAAAGGACTGCTTAGAGGGATTGGAAAAAGGACATTTGGATTGTTTTTCCTCCCCTTTTAG 
AAAGGATGTTTCTTTTAATTATCATGTTGATTATATTAGCCCCATTAATATTTGTTGGGGTGAATTTCTTCGTATACATAAAGAAGAATAACTGGTTCTC 
TGGGGCCATTAGTAATGACACTACTATTAACTCCACAAATTCTGGAGGAGTAGGAGGATTATGGGATAATGTTTCTGATTCAGAGTCAGGGACGTTTCCA 
TCTAGGGTACAATATTCCACATCAACGAAAAATTTAGTCAATATAGCCATGGATGGAGATTATGCGACATTAGTAAGGAACGGAATGTCCACAAATCAAA 
GGCCATATGAAACCTACAAAGACGTGGAGGATAGCCAGTTCTATTTACATTTTTTCCACGTTAGAAATTTTAAACCTTTAAACGGTGATGAAAATAAAGA 
CCACCTTGAAAGGGATGAAAGTTTTGTGTTGATCGAATCACCATATTATAATGGAGGGTTTTTATCATACAATATCAACAACCCAAATCCCATTTACAAT 
TCTACTGAAAAGCCGTATATTAACACGGAGATAACTTCCATCGTCAGCACCACTGGTACAGATGAAAGGTTCTTCTGCCTCGAGAAGGAATACGTCGAAG 
ATGGTGAAGAAGGAGTTACAGAAAATAGGTACTTTTTACGCCACATGGCAAGTAATTATGTTGTAAAGGCTAGTTTCAAATCAGTCATGCCCACTATTGA 
AATTAGTGATCTGACTGGCACATACAATAATAAGAATAGTGTTAAAATTAAGCCTGTAACTGGTACTTCTATAATTTATGAAAAATTACAAGGAACAAAA 
TAAAATACATTTTAAAGTATTATGTTTATTGTTTTATTTCTTCTAAAATTTTATTTTAGGTATCCTCTTGT^TATGGAACGACGGGTGGGGGGTTTTTTC 
TTACCCTTCTTTGGTTTTTGGTTAGTTTTCTTGGCCCCACCATCTGGTGTTTTTGGTCCTCCTTTCCCCTTTCCTTTTCCAAAGGCTTTCTTGAATGCAC 
CGCCCACTCCTGGTATTGATCCCAGCGCAGAAGCGATCGTGCCGGCCAGTGCTGTGCCCGCCAGGAGAGATGCTAAGAGAGGAGCCTTGATTATTATTAC 
AGCAATAACAATGACAATTATTGTTATTAAAAGTAAACCAAAAAGTATATTTTTTCCTCCTCCTAAAGACGCCACCAATGGCGAAGCCATCTTGGCTATC 
TTATTGTTATTATTGTTATTGTTATTATCCCCTTCCATGGTTGTTGCACATAATAATAGTGTGTATTATAACATTAAAAGAAACATATAAATTTTGGCTC 
AAGGCACCTTCAAGGCTGTGAAAATATTACATGCAGATAATGTTGATTCTGAATTCAGGGTTGTGATTTTTGGTCGCACGTTTACTGACGTCCAACATAG 
GCATATGGATAGTACAAATTGTCTTGAATTCCTCATCTTCTTTGTCCTCTGTACCATCCTGGGGTTTTGCTGGGAACAATGTCCTATATGTTCCCTTTTT 
TATATAGTAATTATTCGGGTTTGAATACAAGTAGTTATGGAAATGGAGATGCGAACCACTAAAAGCTTGTATATTTATCAAAAAATTCAAGTCAACTTTC 
TCGTCATCCACTCTTCTCATTTTTAGGGTAATATCTCTCCCGTTAATGACGTAGATAATTGTTGCATTCAAAGCACTGATGTGCTTTTCATGATATTCGT 
TTAAAACAATTTTATAATGATTTTTAAAGGGATAACTATAAAGAGACCCTTCTTCTAATACGATATTTACTGATGGTCCTGCTACTACCTTTAAAGTCAT 
GATTGTGAACCCTCTCAAAAGAGCTATCCATTTTTTTTCATCCGTAAAAGTATCGTCAACCCTCCGCTTCCGCTTACCCCTCCTCCACTCCCTCCTCCTC 
CTCCTCCTCCCCCAGTCTTCTGCCTTTTATTATCGCTTCCACCTAAAGATCTTCTTCCACCATTATTCGAAAATACAAAAAAACCTACTGCAACAAGAGC 
AGCTATTATCAGAAAAATCACCAGGATCAAAACAGGGGCATTCGTATTAGACATTCTTGCTACATGGTATTGGAAAATTTTCCACCCTTTAATACATAAA 
AATTTGGTTACAAATAGTTAAAAATATGTAACCATGCAGTACAATAAAAGGTGATCAACTGGTTTTTCATTTCTATATCGTAATGAAAATTTTGCATTAC 
GCGTATTCTTTGTAACTATATCGGAATATACTTCATCCCATACAGCAGATGAACCTCTTTTTTCAGGCTTAATTGCGATATCCTTTTCAGATTCCATTTC 
GTGGAAGATCTTTACAAAGCAAGATTTATCCGTTTGATTATTGAACAAAAAATCGAGCCGTTTTTTCCCATTATCTACTTCAAAATTCTCAATTGTTAGA 
GAAAAATCCCTCTTCCAGACTACAATATCTTGGGCCTTAGCCAGTGTTCTGATTAGCCCGTGATTTATTTTTCCTCCCATAGAAGGAGATAAATTGTAGC 
TGGTTTCATACCTCCCTTGGTCATCACATTTTACTCTAAAAATTGCCACTGTAAAATCACTCATGTGAAGATTTTCAATCAATTGAGGAATTCTCTTCCC 
TATATCAATTTCAACGCTAGCTGGCGTTTTTAGTACGTCCCATGCAAACGCATTGAGGCTCTTGTTCGGTACAAAAAAAGTTAATGCCGATTTCGGATCA 
GGGAAAACATCCGCAGCCACCTCAGTTACCCAGGCACTAATGTTGTCTTCAGACATTGGTAAAGTTGCTGCTGCTTTTTCATTTATTAGTGCTCTAATTT 
TCCCTCTGTTGCCCTGCATTTTATTATTGAATACAGTCTCTCCGTAGGGTTTAATATGTCCTCCATACAAATCTACCCCAGCCATGGAAGCAGTTGCAGT 
ATCTGACGACCTCGTGTCCAAAACATTTGTCACCCTAGAGGAAATTTCCCCATCTTTAAATAGTTCTGCAAGTCTCGCCTCCATTATTATTATTTTTAAT 
GTGT AC AAT AAAAAC AATT AATT TG AAAC AAAAGGT AAT AAC AAAAT AAAAT AC AT AAGAAAGAT TC AAAAAGT T XT AT T C ATT ATC C T T ATC AC AC AC A 
TATTTTTTGACGGACAAGTACATTAACATTATCAACATTCAACAGACCACTACCATCATTCATTACAACAACAGAAAATGGGGAAGTTTTGTTGCACAAG 
AACTTCCATACATCATTCTTCTTTACATGAATCAAGAAAGTATTCCAATCATCCCATTGCTCATCGAGCCTGTACCCCAACCATGGGTTTACAAATAGAC 
TATTAAATTTATCCCATGAACAGTATTCCTGGAGAGAAATGAAGGGGTCTTTCCCGTCCAGGTCATTTGGTCCAACATTTTCCACTACCTTTACTTCCCC 
GAGTCCAATTAGATTGACTGTCGTCTTTTCTTTTGTCTCGATCGCACACTCGCCAATGGCAGCCAACAAGGAATCGGGAAGATCACTCCTCATTTTCTTT 
GGCGGTGAAGAAGGAGAAGTGACCTGAAGAGGCGCTGGTCTCTTATTATTATTTTGTTGCAGTTTATTTCCTTCGGCAATTGCAGTGTTCTTGAGTTTCA 
AGAAATAGTCGATAATGTAAAACTCTTCACCCTCTCCCAGTTCTTCATACATCACATCACCAGATGTCCTGATGGCTTCTTCCACCATAGCCAAAGAAGG 
TGCTCCACATTCAATCAAATCCATCAGCAGTTTCTTCCCTCCTCTCCCCATTGCGACGAGTTTGTTGGCGTACCTGCACCAGGGACATTTTCTGTTCTGC 
GGCTTGTCCGGACCAAGAGTAGACATCATGTACTTGGATTTGCAGCAATCACATTCTACACATACAGCAACTGGAATATCAATATTGCCCTCATTCAATA 
GCTTGTAGAGAAGACRAGTCTTCCCATAC/KAAATCCCCACAGCCTTCGCTGCAGCTGAAACAGCAATTTCATTGTCTTTATCTGTTGGGGGAAGATTGAG 
AGCTCCAATGATATCACTAGCCATCTCCTTTACTTTGTCATCAATTTTCTTGGCATTCAAAAGATTGAGGGTTGTGGGTGATGCAGTTTTGGCACGAGGA 
TCTCTGGTCAATTTATTGTCCCAGTGATAGACCATTGGTTTGCCCATCCTGTAGTTTCCCTTGCATCTCAAGCACAATTTTTCATCATTATCATATTCCA 
TCTTTAGAGATTTTACACCACATCTTTTGCATATATATTTCAAGTCCTTCTCTGATAGGTCAAGTCGACGTACAGGGAACGTCTCATATTTCTCCCCAAT 
CTTTGAGTCGCCAGCCTTTGTCAAGATTCTCAGAGCCAAAAAGTGTTCTAGTTTATTGGTAAAGTTCTTGTCAACTCCTATCTTATTGGCCCATGTAATT 
AAGCACCTCAAAAATACTACACCGGAAAGATTATTCTCGATCTCTCCATCGGCAGTTCCTTCAAGAATAGAGGGCAA/UiTAATATCCAAATCAGTCTCGG 
AACTGCTATTCTGCTCTTGGTAATACTTTGTGAATGCCATTTTTGCAATAGAGAGGATCATGTTTGAAGTGTTGGAGGGTACTTCACAGGAGGCAGCCCA 
AGAGGCGGCTGCACACACAGATTTGATTGCTGCATAACCTCCTTCTGGCAGACTCAGACGTACATCATACAAAGTTCCCAATTTTTGAATCACAGTCTCC 
AAGTCGATAAACATCATTTCTGCCACATCTTCTCGTCCTCCAAATAGATCGCTGGCCAATTCTTCTCGAAGCACTTCCTTCTGGTCATCTGATCCACAAA 
AGACTGGGATGGCAAAATATCTGTTGGAGGTTTTTGACGTCTCCAAAAGCACATCTTCAACATTTCTAGTAATCTTTAGACATGATACCAATTTCTTGAT 
GAAATCAAAGCACTGTCCAAACTCGAACCTCTTGTATTCGTCCAGTGATAATAATCCTCCTCCTGTATATGAATTGGAGAAACTAAATGACTGGTGGAAG 
ATTTTGCCAGCTTCTTCCAAGGCATCATTGGGCACATAGAACCACATAAACATGGAGGATTGTTCAGGTGTGAGATCTGTATTGTTAATCACTTGGAGTG 
CATCATAGCTAGCAGATTTCATGTGCTTTCCTTCTTTTAGGAAGGCATATTCTGCCTTGTTTCCGCATTCAGTTCCCAGATCAGAAAGGATGGTAGAGAC 
CCTCGAGCACACTCCTGCATTATTGCCTGTAACAAAGGCATAAATAGCCCCT^TTTCATCAGGCACAGCACTGCAAGCGTACATCATTGATTGTTCCTTG 
ATCATCATTTTGAAGAGTTTTTCAGCAAACATCTTGGCCCAGTGTACGCGAGCATACTTGCGAGACAACTTGGTTTTTCTTCCTTGATCAGATGTAGCTT 
CAACACTTGCCATCAATTTTGTACAACCCTTACAATCACAGAGGTTTGTATGGTCAATTCCAAAATCGTCCAACGACTGGAAACACGCGTTCATGTGCAG 
TGCCAGAGTAGAGGCGGCATTTCCTCCCCCTGTAATTGACTGGGCCCTTTGAGTCACTACAGCCAGTTTGCTGATACCACCACCACCGCTAGAAGACTGT 
TGTTGTTGTTGTTGCTGCTGTGGATTAAAGGTTTGCTTAGAGCAACAAAAGGCACTATCATATGCACAATTCAAGAACACGCTACGCTTGTGGTTGTTAT 
ATCTTCCTCCTCCCTGTACAACTCTTCCAAAATCTAAAGGTGCCATATTTAAAACCCTAGACTTGAAAGATTCACGTTGAGC6TCGCCATACTTTTCAAA 
TCCTGCAGGCTCTGATCTTCCTTCACTTTCAATAAAAACATTCAATTCTTCTTGGTTGATTTCCCTCAAGATGGGGAAGGCACTAAATTGATCATTCAGG 
TTCTTGAAGAGGGAAATAGATTGCCTCATAATAAAGGTTGCGATATTCTTGCGCACATTGCACGATCTTCTGTCAAAGGATTCAAAAAGTTGCCATCTGT 
TGTTGTCGTATCCAGAAAAATTGATCACATCTTCTTGGCACAATTCACTCTGGTTACTGCCACTACATCTAGAGAAATTGAGGATGCGTAGCCAACAATC 
ATTCTCCACTAGCGCCTTCCTCATGTCAAGAATAGTCAAGGGTTTTTGTCCCCAAGTCTTCATGATGAATTCTTCTTCTGGTGCAGTTTCCGAATCGTAC 
GAGTTGGCACATAATTTTCCAGTGTCACTATTGGCAATACGCACATCCTCATCGGTGAACACAAAAATGAGGATGGGGAATTTTTCATCCCTGGTAATGC 
CATGCATTTTCTTGAATGCAGGAATCATGTTGTTAAACACGTACATCAGGGAAGAATGAAGAGCTTCACTGCCACCGCCGCCGCCTCCGCACACTTCGGT 
CTTGCGCATATGCTTCACAAAATCTCCCCCTTCCCACTCTGAACAGTCTGGTGCACGCACAACAGGTCCACTTTCAATGGAGTGTTTGTCGAAATCATTG 
TAGTAAACTATAGCATTTTGAATGCCTGGAGCAATCACATTGAGTGTAGCCATGATGCCAGAACAACTATCAGCCACAGCCGATTTCACACTTCCCATTG 
AACCACTAATATCCATCAACGAGATACTTAGACCAGTAGGTTTTGCAGTTGGAGTATCTGTATCAGCAGCAGCAGCTTTCTTAGTCCATCCATCTTCTAA 
AAGTTCAATCTTGGAGGAAGGTGCATTTTCCATCAAAGAGTTAAAAAAAGATGCAGATTTCACAGTAGAACTAGAACTCTTGCTCATGGTGGAAGAAGCG 
CTTGAACGGATACGGCCCAACATTGTAGTATATCTTTATGGTAGAAGTTGATGTTGACGGTAGGAAACGATCTCGACAACTGTATGCTGAGGTGCCTCCG 
CAGGCCGCTTTTATACCCCTTTCGTGTGACGTCAATTGACGCTCGTACGTATAAACAGGGTCTATTTTTAGAGATGACATTTTTCAAAAACGTTTATATA 
ATGGTGCAATTTCCTATTACTCAAACTAGTCATATCCGGCTAAGACAGGATGTTTTTTTGGCCAATACGTTTCTGGCGCAATCTGACATTGGCCCGACCC 
AGCGGTCCACCCTTCGAACTTGACATCAGGCCGACCCAGCGGTCCACCCCCTAAACTGGAGTGAGCTGAAAAATTTTTCAAAAGTTTTTGAGATGAAGAA 
GAGGGTGAAAAGTGGGTACGTACTATACACTCAGAGGGTGGCAGATCGGCCCTGTTCCAGAAATGGCTGTCCAGAAATCTGGGTCGGACAGATTCCAGAA 
ACGTTTCT/^CCATTTCTGGAACAGTCATTTCTGGAAAGGGTTACAATTTCTTATAACTGGTGCATTATTTCTAGCATATATTCTAGCCCCTTTGTGCAA 
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TCTGACATTGGGTCGACCCAGCGGTCCACCCTTCGAACTTGACATCAGGCCGACCCAGCGGTCCACCCCCTAAACTGGAGTGAGCTCAAAAAATTTTTGA 
AAAGTTTTTGAAACGAGGATGAGGGTGAAAACACCTGTTAGAAAGTGGTTGCTGGTCGGTAGCATCCGTTCACTGCTGTTCCAGAAATGGCTGTCCAGAA 
ATCAGATTCCAGAAACGTCTATAACCATTTCTGGAACAGTCATTTCTGGAAAGGGTTACAATTTCTTAGATGTGGTACAATAACTTACCATACATTTCTG 
GTATCTTCGTGCAATCTGACATTGGGCCGACCCAGCGGTCCACCCTTCGAACTTGACATCAGGCCGACCCAGCGGTCCACCCTCTAAACTGGAGTGAGCT 
CAAAAAATTTTTCAAAAGTTTTTGAATCGAGGATGAGGGTGAAAACCTGGTAACAGAAGGCATGTCTACGACGGCTTCTGGTTAGATCGAGCCGGTCCAG 
AAATGGCTGTCCAGAAATCTGGGTCGGCCAGATTCCAGAAACGATTCTAACCATTTCTGGAACAGTCATTTCTGGAAAGAGTTACAATTTCTTAGATTTG 
GTACAATAACTTACCATACATTTCTGGTATCTTCGTGCAATCTGACATTGGGTCGACCCAGCGGTCCACCCTTCGAACTTGACATCAGGCCGACCCAGCG 
GTCCACCCCCTAAACTGGAGTGAGCTGAflAAAATTTTTCAAAAGTTTTTGAATCGAGTAAGAGGGTGAAAACACCCTCTTGAAAGGCACGATCTGGATGG 
TGTCGTCCGTTCACTGCTGTTCCAGAAATGGCTGTCCAGAAATCTGGGTCGGCCAGATTCCAGAAACGTTTCTAACCATTTCTGGAACAG'rCATTTCTGG 
AAAGAGTTACAATTTCTTAGATTTGGTACAATAACTTACCATACATTTCTGGCACCTTCGTGCAATCTGACATTGGGTCGACCCAGCGGTCCACCCTTCG 
AACTTGACATCAGGCCGACCCAGCGGTCCACCCTCTAAACTGGAGTGAGCTCAAAAAATTTTTCAAAAGTTTTTGAATCGAGAAAGAGGGTGAAAACCTG 
GTAGGAGAAGGTACGCACGCATACTGGCGGCACATGCCTCCAGAAATGGCTGTTCCAGAAATCTAGGTCGACCAGAATCCAGAAACGTTTATAGCCGTTT 
CTGGAACAGCCATTTCrGGAAAGTGTTATATTTTCTCGCATCTTGTACATTAACTTGATTATGGTATAGTACACCTCTCTGCAATCTGACACTGGGTCGA 
CCCAGAGGTCCACCCTCGGAACTTGACATCAGGCCGACCCAGCGGTCCACCCTCTAAACTCGAGCGACCCTAAAAAATTTTTCAAAAGTTTTTGAGACGG 
AGGAAGAGGGTGAAAACCCTTGCAACAGAGGGGGTATAAAGGGAGGTACCCTCGCACACCTTAGACACACAACCTCATCACCCTCCGTCCAATCAACATC 
ATCACCCCATCTCTAAATAATCCATCATGTATATCTTCGTCGAAGGTTCCCCCCTCACAGGGAAGAGTTCATGGATGTCCAAGTTGATAGATACAGGATC 
ATGTGGAATGTCTTTCCTCAATTTTCTTCGTATGAACACTTCTGACTACTACAACTGGCCTGCCGAAATCGGGACAGAACATCTCCAGTTAGGTTTCAGA 
GAAACCAGAGTGGTGGATGGAATGTTTGAACCTGTCCTAAAGACCTTTGTCGACTCGTGGAAGAAAGAGCAAGGAAAAGAGAGTTTGAAGGAATATCTGG 
ACTACAACGGCCAAGTCATGGAGATCTACATCGCAGAATGGTTGAGACAAAGGCCACTAGCCTTCCACGTGTTTACCTATACAGATGAAGCTGTCAAGAG 
TGGATTCTTGAACGAGGAGGATCTAGATATGGATACTGCAACCAAGTGGATGGCTGAAATTATTAGAGAGAAGAGGGGCAATATTCAAGAAATAAAAGTG 
ACCCCTAGAGTAGTCTTCAATGGCAATGTTTGTAGTGCATGTTTCTCTAACACTAAGAGAAACTTGTATAACTTTGGAACAAACTATAACAATGTTGTAC 
ATTGTGATTTGTTGTGCCCTTTTGCAAGGCATAGGATTGTACATTTCTTATAAAAAATAAT^CAATATATAAAATTCAGTTGTATTTTTATTGCTCAAAT 
AAGTTACTACACCCAATTCTCCCCCCTCTCTAGTGAGAGATCCCAATCACTTTCTACTGTCACTGTTGCTGCTGTTTGTTGTTCCTCTTCTTCCTCCTCT 
TCTTCCTCTTCATCTTCATCTTCTTTAGTTCGTTCTTCATTATATGCTTTAATTATCCTCAATACATTTAAAATGGTGCGCTCTTCTCGGTTAAACTTCT 
GATTCTTTCCCTCACCACCAACATCGACCCTACGTCCATCCTGGTATACTGTTTCACCATCACCAGAAATUyVTTCTAATAGACGGATTTTTTATAGAACG 
TACAACAATGTCCACTAATGATTCTGGTTCTGGTGCTAACTGATCATCGCCAATAAAAGATAAAACAACTCCAGAAACACATGGATAATATTGCACATTG 
AATTGATCATATAAAGTATTGAAGATGATAGGGTAAGATGTTGGCCCGTAGGCAGACATGATCAAACGGGGCCTGAAACCACCCTCATCTCTCATTGTAC 
AATAATTTGAACACATGAATCTGATTCCTCTTTCTGTACCAGAATATAGGGGTGTGTCTTCATTGAATCTGTTTGTGAGGAAAAAGTCTCGTGCAGATAA 
TGCAAACACGAGACACGTACTTTTGTCTCCTATTCTTCCTATTTCGTTTCCATATTCTTCCCTTGCTTCTTCATCGATAGCATCCAATTTGTTGGTTTCC 
TCTAGTTTTTCTTCTAGGTATTTCCTCCTTGCGGCGTCAAGGTATACCTCGTTTGGCATATGGATGACCTTATTACGTTCATGCTTACCGTCCATTTTTT 
CGGCCGAATGTGTAACAACTGCATTCAAAAATAATGCCATGCATTGATCAAAATTTTTAAATGTGGGGTACATATTACTTGAAAAAATCTTCCCTAACAT 
ACTCACTGGTU^CAACTAAGCATTGGTCATGACAACTACCCACTATAGGATTTTCTATGCGGAGGCCTTTGTAGGATAATCGATGTGCTATATCGTGTAAA 
GGTACCATTGATTCGCCCCCTAAAATACTACCCATACTTAACAGTTTCTTTTCGACTTTGTTGTTGGTTATCCCACAACCCATCAATAAATCAGCAGTTA 
CAATTATGTACTTGCAGTTGTGAATTGCTGCATTGAATAGTCTATCAGTGAGAATTCTATATAGAGGGGAGAAAGTGAAAGACTTGTCATGGGGTAAAGC 
TTCTCTTTTCACTATATTGTTGATATTCCTCAAATCTTCTGGTAGTCTTTTGAGCCAATCAGATTCAGATTCAGCACCTTTCCTTGTATAGAAATTCCTC 
ATTTGTGAATGATGTTGAAGAAATAATGACGTTTTTTCCAAGCAGGGGTTGCCTGCTGAGAGGCGTATTTTATTCACAATTTCATCCACCTCATGTTTTG 
GAGTATTTTTATGCAAAGAGACGATATATACACCTTCAGCGAGTTCGCTTCTCCGTATACTTTTGATAGACTCGATAATGGACGGTATTCTTCTATAGTT 
TTTAACGATTCCCTTGATAGCAAACCCGGACGCCATCTTTCATGACTCATTTTGGAGGGTGAGATCATGCAATGAAAAAAACTTATAAGTGTGTGTGTTA 
ACGCACGTAAATACCAGCCATTGACGTCATATCCGCCTAGTTTTTATCCTCAACTTCTGAGAATTGCTCAACGTACTCATCTCTTTTCCTGGTACATGAC 
GTCAATGGAAATGCCGATTGACGTCATGTTAGGGGTATAAAACGACTGGCTCGAGCCCTCGGATTCACACTTTGTCTGTCCTGAGCAGTAGAGAAGGACC 
CAGTGAAGAAAATACTCCTCAACTTCCAACAAGAATGGGCTCTGCTAACTTGAACGACCGAACTCTACTCGTGGAGGAAGGGCAAGTAAACGACTACGGG 
TCTACCAACAAGAAACAACCAGTGTGCTGTCACATTTGCTATCTAGGAAGAGGAGGAAGAATTGCATGTGTCATCGCATCTGTTCTGGGCTGCGTCCTCC 
TGCTGGTGACTGTCATGACTTTGTTGATTGTGGTACTGGGAACTGCACCAGTTAATTGTGATGTGAGCCCACAGAGCTACTCGCCGCCGCCGCAGCCGCC 
GGTGCAGTTTCATCCTTACCATTCTTCTTCCACAACCACCACTACTTCCACTACTACTACTACTACACCAACTCCACCAGATACTAAAAAAGTTGACGAC 
GACTATGATGACGACGTCAATATTGGAGGGC7VATCAGTTACTGTGAATAATGGAGGTGTTTTCATCAATGGAAGAAAACTCTCAAAAGAAGAAGAAAAAG 
CAATGGGTATCAATACAGATAATGGAGGATTTGTTTGGAAGAATGGTGTTTTTTCTCAATATGGCAAAAAATAAAATATGGTATATTAAATATGTGTTGT 
TTTATTGTTTCTTTAAAAAATAAAAAAAAGAGTAACTCCGTCTACATAGTTTTTTTTAAATTGGAAGATATAAATAGAATGAGTAAAAGGTACTGTGCCA 
TTACATGTGGAGATCTTGTATTAGCAACATTAGAGAAATGGGGGACAATAAAGATTATGAAACAAGACTGATTCAAAGAATTAATGATTTGGAATCAGAA 
ATTGAAAATAAAACTGAATTATGTGAAAAAATCAATGAGCAGATGAAAAATACACAACTAAAATATGATAAATGTTTCGTAGAGGAGGAGACGGAAAAAT 
TCCGCAAGATGGAGGAAAGAGTTAAATACCTCAAAGAGCAGGGAATCCCTCTAGACCCAGAAGAAAGACGTACAATGTTGGCTGAAATTGACAAGAGTAA 
CAAAGAGTTAGATGC CC TTC TTGAGGAAAATGAACGT AT AAT AAAGC TCATTGATGAAGAGTTGGAAAGTATGAAATAAAATAAAAGAGTAGGAG AC TT T 
TCAGTTGGTGTTATAAATATATCCCTGCAAGGAGATACGGTCTCAAAAAATCCATGACTTGTCAGTAGCCACGGAAAACACACTTGAAAAATACACCAGA 
ATAGATGAGCAGACAGGAGAGATGCAAAAAAAATATGATAAATTATTTGAAGATGATAAAAGGTTCCGAGAAATAGAGGAACGAATCCTTCAACAAAAAG 
AGAAGGGAAACCCTCTAGACCCAGAAGAAAGACTTGTATTGTCGGCTGATATTGATAGGAGTATGAAAGAGATTGATGATTGTCTCGAGGAAATAAACCA 
TATAGAATTATCCATTGATACATTATTGGATGAATGTGAAAACTTGCATTATGGTCTTCAAACAACTAAATAACTCTTTATCGGGCAAGTGACTGTATTT 
GTACGTATGCAATTAGGTCCATATTGGTTTGCAAATAAATAATCCACACAAAACGTGTTGTTTTTCTATATCCTATAAATTCTTGGAACGGAAGTTATGA 
CTCACATAGAGTGTATAAAAGGAATTGGATGTTCCTATGTCCGCATCACTTCTAACTGAGACATCGTTGGAAGAGGAACAGAGAAGAGAGGTTAACAACA 
ACACCAGTAAGGAAAATGATACAATGGAAAAAAAGACTGAGACGGCTGCAACAACAGAAAAAGACCCAGAACCGTCTGTCAGTAAAAGGTCCAGAAATAA 
AGAACCCAAAACAACTTCTACTGTTTACACTTCTGTAAAGTGTTACCTTTCTTCCATAATCAAGAGTGAAAGTAGTAGAAGTAATGTCACCTCAACCAAA 
GAAAGGTTTGAGGAGAGGTGTAAATCCGTAAGCAAGATGATGGTCAAAGGTTCACTGTTTTTGAGGTTAGTAGTGGACGAGTGTCTGAGACGTTACAACC 
ATCTAGAAGACGAAATCGATAAATGGCCAGATATGACGAAGGATAACTTTTACGTCCAATTGTTGAGGAAGGGTTTAGACAAGAAGAAATTGAAAGAAGG 
ATCTACACATCCTGTrGTAGAAGATGTTTGGAATTCCCCCATCGTCCAAGAAACATTCCTATCCCAGCAAGGAGAAGGAAATAATCCCATAAAGAGACAT 
CTCATGGATTTCAATACCATCACCTACGCCGCCAAACAACTAAAAACTTGCTTCGAAACAAACCTACGCACCCATTTCCGGACACGACAACAGAGGGCCA 
TATCTGGATGGTTAGCTGAAAACGGGTTCGATAAAAAGTATACGAAACTCGTACAACACTGGATAATTGGATGTACCTACAAGAGTGATTGGGTGGACAG 
TGGTGATTTGGAAAGGGTAAAAGAAGGAACGAAAAATTTCGTGACTCTTCATAGGAAACATTTATGTGTTATTAGTGATAAGAAGAATGGTACAATTTCC 
TATTCACCTGAAGAGAAATATCCGATACCCTCAATACTAAATTATTACAAGTTTCTACAAACAGAGTATCCACAAAACAAGAAAATACAGAAAATGATAG 
TTGTCCCAAAACACAAACTAAAGATACACTATTGTACGTTTGACCAAACGACCATTCAAGGAATTTGTAAAGATTTGGGAGTGTGGAAGGATATGGAAGA 
ACGACACAAACAATCAGAAGATATACTTTACAAGCAAGGATGGTACCTATTATTCGACGTTAAAAAGATTAAGAAATTGCGTCCAAACTGGAACTTTCAC 
TCTATCCAGACGGACGGCGAAGGCGTCTCTGTACTATTTTCCAGAGAAGTGGAAGAAGTAGAGACTGTTTCCAAGAAAAGTAAGAAAAATAAAAAACCTA 
GAGGAGATGAGGATAGGAGAAATTACCCGCCCACTJ\ATGCCAAGTACGTAGTGGGTGTAGATCCGGGAAGAACTAATGTCGTTTCCTGTTCGGTATTTGA 
TACCCGTCAAAAAAGGGTAGTGAGAAAACACAGAATGACTGCGAAACAATACTATCAAGAATCTTGGATGACAGATAGAAGAAAGGCAAACGAAACGTAC 
AAGAAGAACAATAAAGAGTACAAAGAGGCGTTAGAGGAAATAACTAGGTACGATAATGGCGAAGAAATTATAAATGATGGTAACGGTGATACTTCTACAC 
CCACTAAAAAATTCGAAGCTTACTTGAAGGTAGTGAACGAGCACTACAGGTTACTGTGGAACGAAAAGGGAAAGAAAAAGTACAGGAAAAATGCCATGAA 
AGTATACTCTAGAA/VACAAAAGTGCATATCTAACTTTATAGATGAATTAATCCCTAAAAGGGATAAAATTGAAGATTACCACATTGCTTTTGGGGATGCG 
AAATTTGCCTGCACGGGAAGAGGTGAGCAATACGCATCACCTGCCAGGATTTTCGCCAAGAAGATAAAGGAAAGAGTCGGAGGTGATAAGAGGTTTACTT 
TCGTGGACGAGAAATATACGTCAAAAGTATGCCATCGCTGCAATCAACCTTTAAATATGCTGGAGAAGGATTGTTTTTCACCGAATAAGAAAAGAAAACC 
GCCGACAATAGTAACAACCACAACAACAACAACAACAGAAGAAGACGAAGAAAATGGAAAATGGAAGAAGGCTACACCTCTCAGAGAAAATAGAGATACC 
AGAAGATGCTCGTCCGAAAAGACGCAATTCGGTTACAGTTCAAACCGAAAAGTATCGACAGGAGATATCTCTATGGAAACGCCAGTACCTTCTTCCACTT 
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CCTCTTCCTTTTGTACTCCTACTTCCATTACATGTGTCTTGGGAGGAAAATTCGTCGACAGGGACTTCAATGCAAGCACCAATATTGTTCATAAATTTCT 
AGGGTTTTGGGATAAAAAGTTAATGGAAAAGZVAAGACAAGATGCCGTTGAAGTATCACTTTATTCGAGTTGCCTGAGAATAGGATGTTiAAAAACTCGTGT 
CACTAGTGAAATAGTGGCGTAGCCCTGAAGGATGAGAAGTATGCTATTTTCGACCTTACGGTCATTGTGGTATATCAACTCTTGCCCGATAAAGGGTTAT 
TCAACAACATAAAAATGGTACAATATTCTAGGTGTGTTTTACTATTCTTTGCTTGAAATCCCAAAAAGGTGACCCTTCAAGTTCACTTGATACACTAACT 
TCAACAAATGGGGATATTTCTGCCTTCACCTTTTGCACAACAGACTCTATGAATTTATTACACATCAAATTCAATTTGTAGGCTGTTATAGGTCCAATCT 
TCATATCGTTGTAGAATCTCGAACCCTCGACACGTGATTTTTCACGGGCAAGGAAAGCAATATCAGTCAGTATAGGGAATTCAAAAGTGACTTTGTTGAA 
TATCTGCAAATCACTGAACGTGCGGTCTGTAATGGTCATTTCGTTGTGCCTGAGAAGTGATCTCGGACGGAGGCGGTTAGTCATGTTTTCCAGTTGAGGC 
AAAAGATATACCATTTCTTTATA7VATTAGGCGAGAGGCTATTTCTTTCCTCGCTTCCGTTGCGGCTTTTTCTTCCTCGATTTTTATGTCTCGGACCATGG 
CTTCGGCAAGTCTTTCTCTCTCCATTCTGGCGACTTCGATTGATGAGGCGAGGATCTGCACATCACCATCAACATTTTCATCAATATAGACTTCATCTTC 
ATCGACTCTGGGGAGCTTACTCGGCGGGTGTGTTTCTACTGATTCTTCTTCTTCTTCTGTTCCTTCAGTGAGGTCCTCCGCCTTTCTCTTACTGGTCGTG 
GGAGAAACTTCTTTTGCTTTTGCTGCTGCTGCAGCTTTTGCTTCCATAGACCTGGTGGAAACCATTTTGAGGTTCTTGGATGATGTTGTATATGAACTTC 
TTAGTGTTCAAAGAGACTGAACAAGTGTGTGCTCAGGTGGGATCCTGCAGGCCTTATATACAATTTCAAGATGGGGAAGTGGGGGGAGAAAATAAGGTTG 
CCAACTAGTACCTTTGTCACCTACACCAGAAATCCACACATGCAGAAGGTTTGATGCTAGTTTTTGGATACATTTTCTGGGTCCAGTTGTGTGGGGGGGG 
GGTGGTATAAATATAGCCCACAGGAGGAGAGCCTGTGTCATTCTGTGTGTATGTCTTGTACTGAACATTACTGAAATGGGGAATAGTGAATCTCGTTCTA 
• GTGGTATTGAAATTGTACATAAAAATGGTGCACCAAAAAGGTCTCATAAAACTCTTTACTTGTCAAATAGAACTGAAAGACATGCCCAGATACAAAAGCA 
GATCGAAGAGTTACATCACAAAACAAATAAGCAATTTGAACAGGCTCAAAAGGTCCTCGACAAAAACGAAGAGCGAAAGAAGCATCAGCAACAGCAACAA 
ATAATAATTCCTCTAGACCCAGAAGAAAGACGTGCAATATTGGCTGAAATCGATAAACACATGAAAGAGATTGATGGTTTCATCGAGGAAAGTGAACGTC 
TAGGTTTACTTGTAGATGCAGAAATCAATAACTTGGAAGAAAAGGAGGTTGAAGAGGAACATCTTTTGAAACAAAAAGAAGACTAAACAATAAAACATCT 
AAAAATACATTATATGATTTTTTGGATTAATAAAAACAAGTGGAATAATTTTTAGGTATATTTTATTTGTTTTTTACCAACAAGGATCATCAATCATJVCT 
ATCTATTGTAACTTCAATAGAAGGGGTTATATCTTCCTTCACTTTCTGTACTACAGCCTGTACAAAATTATCACACTTGTGGAGCAACTGGTTAACTGTA 
AGAGATCCAATCTCCATGTCGCCAAACTCTTTTGGTGGGGGTGTATTACCCCACCTAGAAACGGCTTCACGGTTGATCAGGAACCCAATATCAGATAGTA 
C TCC A AA ACT AAAATT G AC AGT AT TTTGT ACCTC AAATT C ATGC T C AT AGTGTG AAACT GCC ATGTC ATGTTCCC TT AGGGGGT T TTTC AGTTTG ATGT T 
TTTGGTAGTATTGTCCAGTTGCACCAAAAGATTTTCCATTTTCTCTGCAGTGTCGTCGTCCACTCTGTCTTCATATTCTTCCTCATCCTCCTCTTCCTCA 
TATTCCTCTTCTTCACCATCGTCTTCTTCTTCTTTACATGGGAGGGAGAGGTGATGTGTTTCTTTGTGTTCTTCCTCCTTCTCTTCCTTAATTCTGGGGC 
GCTTGTTACTTGACGAGTCCTCTACCAGTTCTTCTTGTTCCATTTTTACTTCCACTTTCACCTTTACTTCTTCTGGCTCCCTCTTCCTCTTGGTGGGGAG 
AAGAGTAGAGGTGGCTGCTACTGCAGATGAAGATGTTGTTGATGTTCTAGAAGAGACCATTTTGGGGGTTATTTTTGGAACTCGTGGGTAGAACAAGTAG 
TAGTAGTATTAGACTCTTCATTGTTTGACTTCTTTGTTACTAGTTTGCGAAATACGGGCAATGTTTTATTATTTATAGATAATGTCACATATGGCAACTA 
CGCAGATAATGTTAGATTTGATGAAAATCCTGGATCTATAAAAAAGAAATCAAAACACTTAAACGAATTCGATTTAAAAATTTGCGAGGTTAATATAGCA 
AATAATAGTAGATTACCACAAAAATTTGTTAATGCTACAAATAGATTT^TATACATAAAACCTCCATCTTTACACGATGTTTGTAAAGTACATGGTTATA 
TAAACCCACGGAAATTATTATTAGATGCAAAAGGGACCCAGGAAATCGTTTATATGTTCCGCTCATTTTTTGGACTACTTGTATATCACGGATGGGATGT 
CGATGGTTTAATGAATACTTTCGGTTTACTAGAATTTGTGACACAAAATTTCATTAACCAGTTAGAAATGTACTTAGATAAAGACCCAGGTGAAATATTG 
TATCATGCAGAATATCTTATGAATACTTTAAGGAATCTCAGAGAAATTGAAAAAGAGCCAATTAGGGATTTAGGTTGGATTAAGTACAGGGCTAATTATA 
GAGGTTTCAGTTAATAAAGGACTGTTTTGTTTTTTTTTATTTTTTGTCCCAAACCCATTTAAAATAAATGCATGGTTAGCAAACTTTTATGTCTTTACAA 
GCCTTGTTCAAAAGGACTACATATTCATTCCCTTCATTTTCTTCTCTTTTCCTTTTCTCTCCTTTACTGCTGGTGCTGCCGTTTCCATCTCCATCTTCTG 
AAGTTTCTCCTCCACTTCCTCCATCTCCACGTTCTTCATTATACATGGAAATTGCCCTTGACGTATTGAAAATCATGCGTCGTTCTCGCCGCAAACGCTT 
CCGTTCGTCTTCGGGCAAGTTTTCTTCCCTTTTGAGTAACTCTTTGTATTCCTTAGATGTCGGATCAATCACATTGATTTCGGCACCATTTTGATATACT 
TTTCCATTATCCATGAAAATTTTGATGTCTGTTCCTGTTACCTCTCTGACCATTTCCATTAGTTCCCTTTCTCCTGGAGGGACTGCAAAGGTTATACTAT 
TGCCACTGGTGCGGGAGAAAAGTCTCGTCTTGAAGTGGTTTCGTACACTGTCCCTAATACATATAGGGGCATCCGATCCAGTTATGGACATGATCAAACG 
GGGCCTACGTTCTCCCACATCTCTCTTTTTATATTCAGGTGAAGAGACATACCTGATTCCTTCTTCTGTTGCAGAATAAAAGGT^CCATTTCCGTTGGTT 
CTGCATAGCAAGGAGAAATCTCGGTCAGAAAAAGCAAACACTAAGCAGGCTCCACCATCATAGCGGGATGTCGCAATCGATGATTTTTTGTGCACATCCC 
TCAAAGAAATCCTTTGTTCTTCAGAGACGGCACAACGTTTCCTCAATTCCTTCAGTTCTTCTTCCTTCATATTTACTTCATTCTCTGCAAACTTTATTTC 
TCTTCTGAAATTATCCATCTGTAAATTATCCCTTTCATTTGCGTATATGACGGTTGCATTGAAAAGAGATGACATATAATTTCCAATACTTTTAGCTGAT 
GGGTGTGTACTGTTGTAGAAAATCAAACCTAGTTTATCCATTGGAACAATTAGGCACATGTTGGAGTAACTGCCTGTGATTGGGTTTGCAATGTAGGCCC 
CTTTGTATGTTAACCTATCACATATATCATACACTGATATGCGGGAAGAAAAGCCACATTGTTCCACCAATAATTGTTCCATGTTACGAAGTGTTCTAGG 
GCAATGGCTCGGCGTGTAGCCACACCCTACTAAAAAGTCAGTGGTTATTATCATATAGTCACATTTTTTCACAGCTGCATTAGAAAGTCTTTCGTGTAAA 
AGCCTGTAGAGAGGAGAAAATTTAAAGTTTTTGTCACTAAGAAGGGCCTTTTCTTTAACGATTTTGTTGATATTTTGCATATCAATGGGGAGTGATTCGT 
TCCACTTTTCCCCATAATCCTCACACCCATCATCTTGCAATAAACTCCCTAACCTGGGGAGATTGTGAGTATACTTTAATCCGACTTGAAGAGTATTGTC 
GCCTGTCAATTGTCTCAGTCTATGCACTCCGTTTTTTATGTGAAGTACTTCCTTATCTTTTAGACGGACTATATATAGTCCGTCCCTTAATTCACTTTTT 
GTAACCCTATCAACAGATTTGAATGATAATAGAATATGTTCGTATTTGTCAGGGTTTTCACCAGAAATAGTGAATCCATTTTTTACCTTTAAAGTCTTGA 
ATTCTTCCATTTTGATATACTTAAATGTGTCATACTCGGGTGTCACAGATAATAAACCTACAGACCTTGCCATTGAACTGTGACACAGAGTGGGCCGATA 
CGCAAATTGCCACGTTGCTTGAATTACTTAATCTGTAGATGAAATCGGCAAAAAATTACGTCTTTAGAATGTCTCGGAATGTCATATTTCTTTGTGCGTA 
GTACACAAAATGTGGCACTTTTCAAGCCACCACCTCTACATCCCAGTGAGGCGGTGGAGGTGGACGTCATATTTAACTGGAATATGTGTAATAATATAGT 
CCAATTCTTAAAAAATCAATAGACCAATAAAACAACAAAAAACATTTTGTAATTTATTTATTCAACTAATiATCTCATCAAATAATCTTCTCAGCAATAAA 
CTTACATTTCATATCACAGTGTACAATACTATTCGTGCATTTACCAACACGGTACACGTAATTCTTACTCGTCGAGAAACATGAACTACAAATATTCCCT 
TTGTACACATTTCTGGGGATAACTTCAATCTTTACAAACAACGCTCTCTCCTCCTTGAGAACGTCAATAAACCAGTTATGTGCATCATTCAATTCTTGAA 
TAATTGGTTCCATTTTCTTGACATCTTCTTCCCTGTAAACCATCACATGGAAAACTGGCTTGTATTTGACCCACTCGGCGACACATGCTTTGAATTCTTC 
TCCATACATCTCCAAArACTCCCTCAATCCAATTTTTGATTCTGGAAACTGTTCTGTGTCGTGCCTCCACATGCCTGTGAAAAAGTTCAATGATGATCCA 
AACATTCCATCCACGAGCCTGGGACGTTCCTGGTAATCGGATGCACGCAAATGCTCTTGAATGGTCCATGGGAAGATGGGTATWiTAGTCCCTATAATTGG 
TGTACATGAAGTTGAGGAAAGACTGTTTGCCCTTTCCGGCAGTTCTCATGTTATCCACCCATGAAGTTTTCCCAGTTAGGGGAGATCCTTCGACGAATAC 
AATCATGTTGAAGGGTAAGACTTGTCT/mCTCTCTCGAGCGAGCTCTCTCTCTTTTArAGAGGCAAAAATGTAGTTGCCAGCCACCACCTTAAAATTCGT 
GCATTAATAATATATAATTTTTATATTTTGTATCATTCAACCGTAGTCCATCTAAGAGTCAACATGTCCTCCTCCTCGTCGTCGTCGTTCTCGTTCCGCA 
TCTCCACCTACCAGACCTTTCTCAAGGCTCTGGCTCACCCAGATCTGGTGGACAAGATTACCCAGAAATGTGACGAGACAGGAAGAAACCAGAAGTGCCC 
CATCCAGTTTCTGGCCGACATCTCGCACCTGATCCAAGGAGAAAGAAATGGAGGAAATCTGTTCCCTTTGCACCCGTTCAAGAACCAACCACATCTGGAA 
CCAAGAATAGTGGGAAGTCTTCACGGGAGAACATTGGACAATGACATTGAAGAATCATACTGTTATTTTGTCAAGGATCTGTATAATGGAGTATTTTCCT 
ATGTGAACGGCGTCAAGGAGTTACAGGGCGTTCTGGACAAGAAAATATCTGGATCTGGATCTGGAGAATCCTCCTCTTCTAGAGCTCCCCTGATCCCAAT 
AACTGATGTGGATTTGCTGTACATTTTCGGTACTTTGGTAGTTCTTCCCCCCAGATCTAAAGCGTACCGAGTCATCACTGAAGCTGTTCTAGCACTCCCC 
TTC AATG AATTC AGr AAC AACTGGC C T CCTAC AAATATC AAAGG AGC AT ACGTGTC T AG AG ATTT C AGG ATGTTT AATCTGTTGGCC GGT TT AG ATC AT A 
TAGAAGGAGAAGTTGGGGGAGAAAGTGAATGGGAATCCATACACGCATCTGTCGTCAAGCGAATGGTCACCATTATGCGCAACAAAGCTGAGAAGAAACC 
TCCATCAACATCTAGAATTTTTAGAGTGTATGTGGCTGAACCAGTTAATGATGCAGTGACAAAGATCCCTATACGTGTACTCAGTAAATTATTCGGTTCA 
AGACTCGCCGGTATTCTCCAGAAAGTGTACTCTTATTCAATGCTAAATCTTCCATATCTCCTGTCTTCAAATTCTATAGACATCAAGCAAGGAGTAAAAG 
GAATTACATTATCTATACCATCTGCAAGAAAATTGGGATTCTATTTACTCCAAAAGGATACAACATTACAATCTTCCCTATCACAAGATGTTGCCGACTG 
CATAGTTTCAATCAACGCTGGTATTATTGGTGATGATTTCTCTGAAAAAATACGACAGTGCATTGAGGAGAAAAACAAGCCAGAAAACTGTTGTATGTGC 
TTTTGTGAAATTGACAAGACGCCCGATTTTTCTTATAGTGAACATGTGGCAAGGCACAATTTCTTCCCCGTCCACGCATTCTCCTCATCACATGATGACA 
AGTGTTGTGGAGCAAAGATTTGTTCCGAATGTATATTCCCGTACATCATTTCCCTGTATGAGAAAATGACTGGTGTGGCAGGTGTAAAAGTTGTAGATTT 
GTTCCAGTGCCCTGGTTGTAAAAGTGGCATGCTCAATCTAAAGGGAAGATGTTACGAGTTTTCAAATTTGTGTAAGAGAATGATACTACCATACACATCG 
ACTCATTGTTCTTCTCTCTTTGATGCTACCATAAATCGCGCAGAGGCTTGTTTTTACTCCCTAGAGTTTCTCCAGTATGATTTTGAAACTGCGAGGAGAA 
TTGCGCATGGAGCTAAAGACATTCCCCATGTTTACAATAAGGTAGTAAAGAATGTAAAAGATTTGGATAGACTGTGTGCCTTGTACTGTTACAAATGCGT 
ATCTCCTGTCGTATGTGATGAGCCGAATGAAAGTACGGACTATGAAATGGTGGATGTAACTCCCCCTCTAATTAATCTTACCGAGATTGTTGATTCGGAA 
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GAGTATGATGATGGTCCCGGAAATCATATGTGGCCAGCAAAATTTACCTGTT^CTTTATTGCGGGTTCCAGTGGAGAAACACCCACCATTAGTACGTGCA 
GAGATGCTGTAACTTTTCTAGGAAGAGCACCGAGGAAGAAAATGGCAGGATGGGATGATCAATCGGCAGTGGGGCAAGCCATTATAGCGCTAGCCAACTG 
GAGAAAGAGTGGGGAATTGCCCAAAAATATGTTTGRTTTACTAGAAGGGGTAAATGCCGTACTTTATAGAGGCGACAGTTTCTTGTTACGTGCGATAAAC 
TACCCCTGTGTTATTGGTAGATCCATGAGCCCTAGTCTGGAACTCGTT7LAAAGAAAGGTGAATAAAATTGCTTTAATAAAGGCCTTCTTCCACGAGAAAA 
GGGTGCGTCCAGACGCATCTAAAAAGTTACTTGAATGGGCAGAACTATTAGTCAAAAGTTATCTCATGGAAGTTTTACTTCAGACGCCAGAATGTGTCAT 
ACACCGCGCCCATTCATTTGTAGGCAAAACTCTCCTCATTACTGACGAATTGGTTCACATGCGTCCAGATGATGCCACAAGAAACGCCTATATCCAGAAC 
CTAAATGCGGCCAGACAGAATGCGGCTGCTGCAGCCTCATTTTCTGGGTCGCTCCCCAAACCTGAATTTGTCCCCTGCAAAGAAAGGACGATTGAATGGA 
TGTATGAAAAGGACAATGATGATGTTAGAGTTGTAAATTGTCCTTCATGTAA7VAAGGCTATCCAGAAATATGGAGGTTGTGTGAATGTGTTTTGTGAATG 
TGGAACAAACATGTGCTGGATATGTGAAGAGAAGGTTTCTCCTGCTGATTCTAATCATTGTGTGGAGAAACACAGGATTGTTTATAGTAACTGTGTTAGG 
GTTAAATATGCCTTAGAAAGTATGTACGGGTTTGAGATTTGTACCATGAAAAATGTAGAAGAAGGAGTTAAAAATTATTATGTAATGGAGAATGGATTTT 
TCTTTGATGTACAAGAAATGGTTGCTAAGAAATAATAAAACACTTGTGAAATGTTATAACTTGTTTTATTGTCCATTATATTCAACAAACTTTAATTTCT 
TTGCATGCCTTGTTCAAAAGGGCAACATAATCGCCTTCTTTCTCCTCCCTCTTTCTCTTTGTTCCTCCACTGCTTGTGGCTACTTCTTCTTCTTCCCCAC 
GTTCCTCGTTGTACGTCTCAATTGCCTTAACTGTGTTGAAAATCATGCGACGCTCTCGGCGCAGACGCTTCCTTTCCTCCTCTGGTAATGTTTCTTCCTT 
TTTTATTATGTCCTTGCCTTCTTTAGAAGAGGGGTCAArCACTTTTATTTCTACACCATCTTGATATACTTTTCCATTATCCATGAAGATTTTGATGTCT 
GATCCAGTAACCTCTCTGACCATCTCCATCAACAACCTrTCATTCGGAGGATCGATGAATGATATTTCATTACCCTTTCCGGATGCAATGAAATGGTTAT 
TAAAATGGTTTCGTATACTATCTCTGATGCATATAGGTGCATCTGAGCCAGTTATGGACATGACCAATCTGGGCCTACGTTCATCCACGTCCCTCTTTCT 
GTAGTCGTCCGAAGAGACGTATCTGATTCCTTCTTCTGTGGCAGAGTAAAAGGAACCATTTCCATTGGTTCTGCACAACAAGGAGAAATCTCGGTCAGAA 
AAGGCAAAGACCAGACAGGCACCGCCGTCATATCTGGATGATGCAATTGAGGACTTTTTATGCACATCTCGAAGAGAAATCCTTTGTTCTTCTGAGGCAG 
CATATAGCTTTTTGGATTCTTTAAGTTTTTCTTCCAGAACACTTCTTTCGCTAACTAGAAGGTTGACTTCATTTTTTGCAAATCTGATTTCCTTTTCGAA 
ATTATCTAATTTTTGATTACTCTTTTCTAGCGTGTATACAACTGCAGCATTGAAAAGGCATGACATGTAATTTCCAATGCTTTTAGCTGACGGGTGTGTG 
CTGTTGTAGTiAAAGTAATCCAAGTTTATCCfiTTGGTACAATTAGGCACATATTTGAAAAACTGCCAACTATGGGGTTTGCGATACTTAAGCCATTGTACG 
TTAACCTATCACAGATATCATACACTGATACACGAGAAGAGGTACCACCGTGTTGCACTAACACTTGTTCCATATTCTTAAGAGTACGGGTACAATTTCT 
AGGAGAAAACCCACACCCTACTAAGAAGTCTGTTGrGATTATCATATAATCACATTTCTTAACGGCCGCATTTGAAAGACGTTCATGTATGATTCTGTAT 
AGGGGTGAAAATTTAAAGTTCTTGTCATTAACTAGAGCCTTTTCTTCCACAATTTTGTTCATTTCCTGGAATGGGCTTGGTAGTGATTCGTTCCATTTTT 
TTCCATAATTAGGGTCTTCATCTTCTAAGAAACTACCTTGTTTTGGGAGACTGGATGTGTATTTTAGTCCAATCTGAAGCGTAGGATCTCCTGTTAATTG 
TCTCAATCTTTCGTTACCGTTCTTGATGTGGAGTACrrCCTTGTCTCTAAGACGAACTATAAAATGTCCTTCCTTTAGCTCACTTCTCCTGACACTCrTG 
ACTGATTTGAATGTTAATAGTACCCGTTCATAATTTTCGGTATTTTCCCCAGACAAACAAATACCATTTTTTACTTTTAATTGTTTAAAGTCTTCCATTT 
TTATACCGTGTAACCCTAACAATGGTGTGCCAGTATAATAAATCTACGGACCATGCTCTTGATCTGGGAAGTGTACTCGTGATTTCTGTAGAAATTGCTA 
TTTCCTATTCATTATTTCCATTACAGGAAAAGATGArAAAATGAAGACGTGCGATGAGATATGTGTTTTTTCAATATCGTATATCAAATTTGGGCAAATT 
ACATATTTTTTAGCGTCAACACTCATCTTCATGCAAGACTTTTTGGAGTGAAATTACTAAAAAAGGCTATATTTCATGTACAACATAAAACACATGTGCA 
GACATTAAATTGGTTTGCTCACCCCCCCAACATCGTTTTCATCAACTCTTTCCTCACCTCTATATAAAACCAGCCTGCACCCCCTCGGAACCCACAGTCG 
ACTTGTACCCTCCGTCGAGATGACATCTCCAGCTCCATCACCCTCTTCCACCCCCAAATCCAGTTGTACCACTATTGTAAACCGATGTGGTTTCCTCCTT 
GACAACAACAAGGAAGTGGTCATCTACGACACCAATTCCAAATTCAAGTGTGAACCCAAAAATCTGGAACTAATTGGTGTACTTTCTGGAGTCTCTGATA 
ATGTTGTTACCCAGATATCCCCCGACCAGATATTTGTGGGAACATATATGGTCAAATATAACTGGTCTAAATCTGGTCATGAACGCTTCAGTGACATGAG 
TAACAACTGTCTGGACAATATTACACGCCCTTCAGAAGTGATTGAAAGTGTGATAAAGAAAACGTCCAGCGACTTTAAAATGAAGTACACACGTTCCTTG 
ATGGACCACACCGAGAAATACTATTTTTCTGGTGACCAAAAATTGAGCAAAATTAGTAGTTGGTGTACAACCCCTATACGACAGTGGGTATGCAACrCCG 
TCTAGAAAACTTTGTCAAGGAAGAACATGAAACTGTCGTGGTGCACAACCCTTCTGGAATGACTGGATTCAACATATTTAATAGTTCCCCCGTGTATrTT 
GAAGTGCACAATGAGATGGACGCCCTAATTTTTATGGCGGCTTTCTTGAAGCACAATAGTTTATGGGGAGAAATTAACGCCAATATGGACTTGTACACGT 
TTGATTATGCGGGTGCTTTTCTGGACGAAAGATGGTGCCACCACGAGAAGAGTTTTTCTGTCGTCCGAGCACAACTTATCAACTCGTATTACAAGTGCAG 
GAGAAAAATCATGCAAGCCCTGGACAATAACTACAACAACAAGAATAAGAAGAGGAAGAATGTTGGTGGAGCACCTGCGTTCACATTTATGAGCGGGGAC 
GG AG AGGGAGG AAAG G AAGC C C T AG AAGC T AGTTTCGATGTGATT GG GGG AACAAG AGG AGG AAG AT TTGGT GTTGATT C AACAC CATGCCCCC ATTCT T 
CAGCCATGCAACTAAAACTGGACAATGAAGGAAACTATGGATGTATTGCCTGCTTCGCATCAATGTTCTTTGTATTGGAGAACCCAGGTGATGAATCTTC 
CTTCATATCAACGGATGCCTCTAAAATTGGACAAGCGCAAGCATGGATAGATGAACGACTACGAAACAATGAAAATGGAGGAGAAGAAAATAATGTCTTT 
AAAAAGACCTTCCATATGCTGGCTGATATTACCCAAAAGGCTCATGAAACTGCCTATTCCAATACCATCCCACTTGGACCCAATGGCAGGCAGTGGAATT 
GGCCTACTCACACTGTGGAACCTATTGCCCATGAATTTGTTACCCATTCTCTAGTAAACACATTGAAAAATCTAGGGGATAGAAAACTTCCCCGATTCAA 
TTTT GAT ATC TTGT AC AACT TGCTT AATCC ATTTGG AAAAATGTTGC T AGTGTTT ATTC AAAAT TGTC AC ATTTT AACTGG AC AT AAAAAC AATGAAAAT 
GTGGTGCCTCGAGGTTCTGCTTCTGGGAAGTGGTGGACTATTAATTTTGTGGGTGTGAACATGTGGACTTTTCAAGTAACAAAATGTAAAGTTGAAAAGG 
ATAGAAAAATATCCGATTTGGCCTGTATGGAAACTCTCCCTCGTCTACCTAATCCAGGAAGCACTACCGTCGATGACAGAATAGTTTTTAAGGGATTCTG 
TAGAGGGGAAAATCTAGGGAGTGTAGGTGAAGTCGTATCCGACATTACACAGAGTGTCAAGAATTTTTGTCTCATGGTTGAAAATAGGAAATTTAGTGTG 
GATAAAGAAACTGGTTTCATCTCTTCAGAATCGATAGTGTCTGATCCCTTCTTTTCACTAGAAGTGACTGGCTGTAGATCTAATCGTGCCCAAGATACTA 
TTAATAATGGCCGAGTTAGTGCTCGTGTAATGAGGATCCTAAAGTCACGTGAAGGTGCTCGTGTATGGTTGGCCAAGGATGAAAATGCCATCATCTTTGA 
AAACGTTAACCACGATACGGCCATCTCTACGGACGCTATGGAGCGAGCTATAGGGCAGCACAAGATACTGTACTATGATATTGAAACAACAGATAAAGAT 
TTCACCGACAAAAAATCAGTCATCACATCTATTGGGTTCTGTTTGT6TACGGGAGGCGATATGACACATGGAGGAGAGAGAGGAGTATTTGGACTGGTTG 
CACCTGGATCCGACGTGGAAAAGGTGAAAGAGACTATTlATAAATTCGTACGATCCTGAAGAAAAGGAAGACATTATGAAACAGTGCCCTCAAGTGATTGA 

aattttcaccaacgagtttgaaatgttgcttggttttggaaagtatatagataaagtgaagcctcacgtgattagtgggtggaacaatgtagcttttgac 
gacccctttgtctttactcgtatcgtcaaacatttgagtgatcacaccaaagacatgtcttattgtgtagcagatgcatctacagcagaatctgtccttc 
ctagagcaacagaaggaggaggaggaggagaaactccatatagattgagcacccctcaagaaagaatacaactagcaagcactggtattttcaataaatt 
gggaaaatttgtagacaagaaaactggcatgttgaaacctgaaatgactgcagatttattggccggggcagaaagtcaggccaataccaagtttaaggaa 
cgcaacaagttatcctccagtaataaaggatcagcaggatggttccagaaaattattggcggtatgtgcagtgctattcggttggatctcatgaaagtgt 
gcgaaaaggcctataaagaatccctctctgaatttaatttgaacgccgtgctcgccaaagtgagtagtgtcggcgacaaggttaaaaatgtaaaagatga 
agtagacctacactttcatctattgggattcttgtvagctgaagaaggcccaggatcaggcaaaagtacacgtctattgttgcaaggatgcctacttgact 
ggtatagtttctacctccatcaacaaggaaggggagatttttaggctgtgtatggactctgctttaaccgaggcggtcgtgacagccaacctggccactc 
ctctatgtataggagaaggagcaatctgtagaaatatgggagaagaaagggcagatagaagaggtgtgggagtaagaagacactctattgccacagacac 
aaagggaggtatggtgagtcaacctatcgtcaatcatgttccctatcaaacgattgacatgacaagtttgtacccgatgaccatgtgtcagaataatctg 
tgcaccactacctttgtgacccatcgacaaattatgcaactgagggatagattggtacttgaaaaaatgaaaaacaaaaccaccgactctttattgttgt 
tggacgttattgacgagtgcaatcagattgtgttgtccgagtacagacccattgatattgcagtcgcatcatggaagaatagcaactctaatagacaaac 
tccaattactcgcatagaggaaagtttgggtctaagattcatagaaaatttggatgccgagaagacaaataataaaacgtggtgcaccaatacatctccc 
aatatgaatgtcactgccgcaggtatggattacttccccgagattgtgtgtgacattaatatgcagtttgcggccaaggtgaatgatgatatgcatatag 
ccccagcaagtttagagtatatgcttcaagtattgcccataatgttaatcgacagaccgtacattggcgcacacataacagctggaaaatgtcgtacatt 
ggaggatattctctcagaactcgaaaaggacttttctgttgaaaaagatgaggaaattataagaacccattggacatttaagggtcaaaaacaatacgat 
ttctgtcatagtcctgtgacccaaatggctcgtcacattattgaatctacgggaagaaatatccgtgattatgaaggcaatgaaaaattcgagaggttgg 
ttagcttatcagacagaatttatcgccgcgttggcgcatttgattcggccaatgatccagctgttagactgtggtcttctcgtctaatcaatgttggaat 
gttggttaggacatggaacgtaaaaactgacattcttaagggaatcatccctcaaatgcaagccacttacagagccgatcgagttgtgatgcagaacaag 
gccaaggagtttgccaagatgggagacatgaaaagagctggtctaaacaaagttggacaaaatattatgaagctcggtatgaattccatgtatggtcatt 
tggccctaagagcacgttcgagccgtaaagagtttgcgtctggatctgccaatactgcctcaagtatttccaacatgtcagccaccggaggaattggagg 
aggcacaaggcactcggtgacggccaatcagattacagaaaatgctcgatgtgtatttggcaatattggttgtggattacagatggctcttcctggtact 
aagcagacgtacggggatacagattctgtattctgcgtgcataatattgtaggtgatggaggaatgataccagaatatgatgaacaaactggcaaatatt 
attatgtgatggatattgctctaaaaaataaaatggctgcaattattcccatcctagtcaactcgttaacaaagggcatccagtttgtagagcgccgaga 
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CGCTGGTGTGGGCATGATGAATATCGCCCATGAACGTCTAGCTGTCGCTGGTCTTTTGTTTGCCAAGAAAACATACCATATGCTTCACTTTAATGAAAAT 
AGTGCAGCGTTCAATGACATGATAAAATTGAAATCAACCGATAACAATAATAAGTTTGCATCCTTTATCAAGAGACCTAGCCACGCAGATGGGTATGTTG 
TTCCCCATAATCCTTCATTGATTCTTAGAGCGGCCGAAGGACCCGCTGGTAAGAAATTGAAAAGCTTTTTGGAAGAGGAAGGAATTCATGACGAGAAGAG 
TATGGAGGAATGGTTTACCTCTTCACCTACGTGGATGGCCATGGATGCTTCTGTTATCAACAACTTGTATGCCTCACAAATTGTAGGGGTGGAGAAGGGT 
AACTGGATTGACGCCATGACTTCCCGCCCCATAGAAGCGGGTACAGAAATGATGGAGGCGGTGACGCAAGCGAATGCAGCTTTCACCCCTTACAAAAAGG 
GAGCCTTTGTGAAGAAGGGAATTACACCCACCACCAAACTAAAGGGTCTCCAATCATTGATTGCAAGATTTTTACCAAAAATAGAGGAAAAGAAAAGCTG 
TTATTTGGATGTGATGAAGAATCATGTGGAGAATTTTGCATCCCATATAACAAATCCTGCTATGATGATTACTAGTTCCCGAGTCAACAAGTTTGATACG 
TCAAAAGAACAGAGTAGACCTAATCCTCTAGCTCTAGCGATAAATAACCACCTGAACCCTTCTTCAGAAATTTCATTGGGGCAGAAATTTAAGACGGTGA 
CATCAGTTTCTrCTTGGAGTCTTTCGGCAGAGGAAGGGGAAGTCCCTGCTGGTTATTTTAACGCTGGTAGCGTGCGTTGGGATGCCACCAACATGAAGGG 
AAGTGTTCCTGCATTTTCAGTCAAGAATTTATCTGTTGTGCCCAACGCCATCACATCTGTATACAAGATGGTTGAGAGCGATAAGACGGCAATAAAATCC 
ATGATTGCTAAAAATGTAGAAGTGTTGTGTTCTACATCTGCCAATACTGGATTTTCTCTGAGAAGAGGAGCATTGTCATTTAATACAGGCGTCATTGTTA 
CAAAGGACGTGGCTATGGCTTGTATACGATCTCTAAATAATAAACAAATGTTATTGTTTGTTGGAGGGGGAAAGGATTACGGTGAAGACGACGACGACGA 
CGACGAAGAAGCAGAAGAAGAGGACGAAGAAAATGGTGAAAACGAAGAGAACAAAGGTGACTGTGTCACGGAAAAAAAGATCCCTGGACGAAGCACTAAC 
AAGGATGTTGGTGAAGAAACTAAAACAAGCGAGAAAACGGAGGGAGAAAGAAAGGGCTCTAAGACGGCAAAGGGAAAGACGGAGGAAATTGCTAGTTCGT 
TGAGTAAATGTGGGAAGAAAGATGCGAGAGATGTCATTCTTGACCGTTTACTAAAAGCAACACATTCTTCTTGCACCAACAATGAAGAGAGAACCAGAGT 
CTTACAACAATATAGCAATTGTACATTATCTTCCTATATAACTTCAGTCATGAAATTGGACCAAAGAGTAGCAGACCAAATGGAAAATTTAATATCTCAA 
TTGGATCAAATACGTAATCTCTCCAACAAGAAGAGGCAAGAAAAGGGAGGGCCTTTTAAGTCTGAATTGGACGCCATGGTTGCTGCAGTTAAGGTTAAGT 
TTTTCCCAGTTTTAGATGCGTCTAGAAAATTGACTCAAGACCATTGGAAAAAGTGCCCCGTGTCCATCCCAGAAACGCGTGAAGAAAAACCATTAATGGG 
TGTGCCTTTTGAAGTTGCACTCAATTCTCTAATAGGAAAACACAAGTGCACAGATACATGCGACATGGCTTGTTGTCAATCATTGTATTTTGTCCTCTTG 
TACACGCTAGCTTTAAAATTTGAGAACGAAAGATTGGCCCGGCAAATTGGCCTAGATGACTCTGTAGATTTGATGGCTGAGATGTTGTTCGGAGGGGATA 
AACTATTGGCCCAGGAAGTGTTAAAAAGGGTAAAAGATGCTCAAGATAGAAAGTTGGTGAAATCTTTATTGCCTTTAAATTATAACCATGACACAAATAC 
AATTATATTTTTGTTTGAGTCTTTAAGGTTTGCTCAGAAACCTGTAGCTGGTATGAGTGTTAGTGAAATAAAAGACGCTGTTAGAGGTCTGGCCTTTTCT 
ACCACTACAGGTACTGTGTGGAATTATACTGATGAAAGATTTTTTGGACCATTGTATAACATGGATGAACTTTGTAACGAACGTGTCAATGGAAATTGTA 
AATTGTCCTTTATAACTGGTATTTATCATACGGCAGCAGTAGAATTGGCTGCTGCATGTCTATCTTGTGTTTTGTAAGAAATGATAATAAAAAACATGTA 
TAATTTCATTCTGGTTTTTATTTTATATCCTCAAGAGTTGTAGTAGTAGCAGCAGCAGCAGCAGCAATAGCAGCATTATTTTCAGCGACAGCTTCTTCAT 
TTCTATTGATTGATATCTGGTCATTTTCTACAGCCCATGGTCGTTTAAGGAATATTCTTTGAAGCAGAAACACATACCCAGCAATGATTGCAATGATCAA 
TATTAATAGTAGCGAATTAATTAAAATAAGCATTTTGTCCACATTCTGGCTAAAATAAAAAAGTTTATCTCTCAATGAAAAATTTTTATTGACCAATCAC 
TTCGGGAACTTTTGCAGCATGACACACTATTACTCCAAAACTATGCTCACTCGTATCTAGTCCCACAACATCACGCGTCATATTTAAAACATGCACCTTT 
AAAGCCTGTTTTTTCTTATACAATTCTGTATTGATTATAGGGTTTACAACCTTTAGGCCTTCCTTTAGAGATACTATTTTCATGTATTTTACAATCGAAT 
ATGAGTTGTCTTGTGCACCAGACGGGA6CCTGTCATGAAACTCTACTTCTCTCCCACGTGTATTGAGCGAGAGATCATTAAGCACAACTTCTGAAGAAAC 
GAGAGGTTGTAAAATAGAGTTATATTTAGGTTTAATTAGGAAACATTTGTGGAAACTTGAACTGTTAATATCAATAGAAACTGAGCTATATGCTGACGAC 
ATGGCTTGCATTTTCAAATCATGTTCAAATCTTACTACATAACCCAAGAAATTATAATCTGCTTCTTCACCATCATCTTCTTCCATGGCATCCTCTCCTC 
TTGGCTCTTTTAGTACAATTTCTTTTGGTTGGTTAGTCATGTTGTATATGTTCATTGAGCTATGATTTCCCACAACATCATCAAAGAAAACCCCTTCTGG 
GATAATGCATGCACCGACAGAGGTGTAGAATAATCCTCCATATCTAATGACACCGACTGCGGTAGCGTTCTCTTCATCATAGTAACCGAGCTTGATTTTC 
ACATGACTCATGGGTCCGATAGAGATATTGGAAGCCGGTTCAGTAGCGGAAAAAATAGACACTTGTTGGAACATTTTGTCGTCAGGAAGAGTCTCGAATA 
AAATACCGACAGGGGAATTATCTTCAACACTATTAAAGTTTGTTGCATAAGATAGCCGCCCTGAACTACTTGTATTATAAATTGTTATATCGGATGCTAT 
TTCAGAACAAAGATCTCCTATAAAATTGTGATTAGGGCAGACCAACATCCCGTCTCCAGTGTAAGACGATACATGAAGTCCTGGTCCAGTTGGCACGAAA 
ATGGCAGCCGTTCGTGAAGGATTTCTCACGTCAACTCGTCTCACCTTCACTGGTACTTTTCCAAATCGTCGTGGATCTACATTGAAACTAAAATCTCTTT 
CGTGGGGATCACTTCCATCCTTTAAAGAGAAGAGAGGCACTCTGTAAACAGGAGACCAATCTGAATAGTACCTTAAAGCTGTTCTATTTTTTTCTTCTAG 
AGTGGGTACCGCTCTGCGGTGCATTCGGTACACCTCCAAATCCGTTTCCTTTACTTGAATCGATTTTGCTCCTTCATTTTCCCAGAGCGCATCAGAATCG 
TACCTGTCGTCGAACGAATCTTCACCTCCCATTTTATTTATTTTCCTCTATATGATAAAAACCACACCTAAAATGTCTAATATTTTACTACACTGAAAAA 
ATGGCGTACATTGACCAAGGGGCGTTTGGAGCCAAATCTGTGATGCAGCAACAATACCTTCTTCCACCCTCTAATAGGCTCTCCAAACAACAACCTCCAC 
TGGCATCATCTTCTCTTCAACCTTCCTCTTCTAATAAACCTAGGAGTACATCAAGAGTAACAGACATATTTGTTGTGATAACTTGTGTTGTTTTAATTGC 
TGCTTTTATAAGCAACTCATTTAGTGTTACAAAAAATGTGGTAAAACTTTCTAAAGAACAAACCGAGAAAATACTAGAAAAAGATTTGCCTGATAAGGTG 
TACAAATTGTTTGAAAATTTAAAGGATGGAACATTTGGTATAGGGGAAGATGAGGAGGAAGAAAGGGAGGAGAGGGAGGAAGGGGAGGAAGAGCTTGAAA 
CAAAAAACATAAGACTTAAAAGAAAGTGGAAAGAAATGACTGAACAAGGGGATCAAGGAAT7ULAAGTAAGGAAATTACATGGCCCGAGAGGAGAAAGAGG 
AGAAACTGGTCCAGCAGGAGCAGTTGGCCCTGCAGGCCCTCAAGGAGAAAGAGGAGCAATTGGACCGGCAGGAAAGGATGGAGCAGTTGGCCCTGCAGGC 
CCTCAAGGAGAAAGAGGAGCAATTGGACCGGCAGGAAAGGATGGAGCAGTTGGCCCTCAAGGCCCTCCAGGAGAAAGAGGAGAAAATGGACGCCCAGGAA 
GAGATGGAGCAGTTGGCCCTCAAGGAGAAAGAGGAGCAATTGGACCGGCAGGAAAGGATGGAGCAGTTGGCCCTCAAGGAGAAAGAGGAGCAATTGGACC 
GGCAGGAAAGGATGGAGCAGTTGGCCCTGCAGGCCCTCAAGGAGAAAGAGGAGAAAATGGACGCCCAGGAAGAGATGGAGCAGTTGGCCCTGCAGGCCCT 
CCAGGAGAAAGAGGAGCAATTGGACCGGCAGGAAGAGATGGAGCAGTTGGCCCTGCAGGCCCTCCAGGAGAAAGAGGAGCAACAGGTATACCAGGAAGGG 
ATGGCGTGGACGGTTCTGTGGGCCCTCAAGGAGAAAGAGGAGAAATTGGACGCCCAGGAAGAGATGGAGCAGTTGGCCCTGCAGGCCCTCAAGGAAGAAG 
AGGAGCAACAGGACGCGCAGGAAAGGATGGTGCAGTTGGTCCTGCAGGCCCTCAAGGAGAAAAAGGAGAAGCTGGTAAGGACGGTTCTATAGGGCCTCAA 
GGAATACAAGGCCCAAGAGGAGAGACTGGACCACCGGGAAGGGACGGCACTGCAGCAGAAAGAGGAGAAAGAGGCTTCCCAGGACCACCAGGCGAAACTG 
GACCACCAGGAAAGGATGGTGTGGATGGTTCTGAGGGCCCTCAAGGGAAAAGAGGAGAAACAGGACCCGTTGGACCTAGGGGTGAACCAGGTCTAGCTGG 
CCTCCCAGGAAGAGATGGAGCAATTGGCCCTGCAGGCCCTCCAGGAGAAAGAGGAGCAACTGGTCTACCAGGAAGGAATGGTGTGGATGGTTCTATCGGC 
CCCCAAGGAAGAAGAGGAGCAACAGGCCGCGCAGGAAAGGATGGGGCAGTTGGCCCTGCAGGCCCTCCAGGAGAAAGAGGAGCAACAGGTATACCAGGAA 
GGGATGGTGTGGACGGTTCTGTGGGCCCTCCAGGAGAAAGAGGAGAAACTGGACCAGCAGGAAGGGACGGTTCAGTTGGCCCTGCTGGCCCTCATGGAGA 
AAGAGGAGAAAATGGACGCCCAGGAAGAGATGGGGCAACTGGCCCTATAGGTCCTGCTGGTCCTCAAGGAGAAAAAGGAGAAAATGGACGCCCAGGAAGA 
GATGGAGCAACTGGCCCTATAGGCCCTAGAGGAGAAACTGGTGCAATGGGAAAGAATGGCGTGGACGGTTCTATGGGTCCTCAAGGAAGAAGAGGAGCAA 
CAGGCCGCGCAGGAAAGGATGGGGCAGTTGGCCCTGCTGGCCCTCCAGGAGAAAGAGGAGAAACTGGACCAGCAGGAAGGGACGGTTCAGTTGGCCCTGC 
TGGCCCTCAAGGAGAAACAGGATTAACTGGCAGCCCAGGAAGAGATGGAGCAACTGGCCCTATAGGTCCTGCTGGCCCTCAAGGAGAAAAAGGAGAAAAT 
GGACGCCCAGGAAGAGATGGAGCAACTGGCCCTATAGGTCCTGCTGGCCCTCAAGGAGAAAAAGGAGAAAATGGACGCCCAGGAAGAGATGGAGCAACTG 
GCCCTATAGGTCCTGCTGGCCCTCAAGGAGAAACAGGATTAACTGGACGCCCAGGAAGAGATGGAGCAACTGGCCCTATAGGTCCTAGAGGAGAAACTGG 
TGCAATGGGAAAGAATGGTGTGGACGGTTCTACGGGTCCTCAAGGAAGAAGAGGAGCAACAGGCCGCGCAGGAAAGGATGGAGCAGTTGGCCCTGCTGGC 
CCTCCAGGAGAAAGAGGAGAAAATGGACGCCCAGGAAGAGATGGAGCAACTGGCCCTATAGGTCCTGCTGGCCCTCAAGGAGAAACAGGATTAGCTGGGC 
TGCCAGGAAGAGATGGAGCAATTGGTCCTCAAGGAGAAAAGGGAGAAAATGGACGCCCAGGAAAGGATGGGGCAACTGGCCCTATGGGTCCTCCAGGAGA 
AAGGGGAGAGACTGGTCCTATAGGTCCTGCTGGCCCTCAAGGAGCAACTGGTCTTCCAGGAAGGGATGGTGTGGATGGTTCTGTTGGCCCTCAAGGAAAA 
AGAGGATTAATAGGGCGCACAGGAAGGGATGGGGCAATTGGCCCTGTAGGTCCTGCAGGCCCTAAAGGAGAAACAGGATTAGCTGGCCTGCCAGGGATAG 
ATGGAAAGGACGGTTCCGTGGGTCCTCAAGGAGCAATTGGACCTATAGGCCCACGAGGAGAAAGAGGAGAAACTGGACGACCAGGAAGGGACGGTGAGGA 
TGGTTCCACAGGCCCTATGGGCCCCCAAGGACTAAGAGGAGCTACGGGAGCTCCAGGACCGCAAGGAGAAAGAGGATTAAAGGGACGGCCAGGAAAAGAT 
GGTGAAACAGGTCCTCCAGGGCGACAAGGAAGGGATGGAATAATGGGTCCTAGGGGTCTTCGAGGAGAAAAAGGAGCACCTGGTAATGATGGTCTAGAGG 
GACCTGAAGGAAGAGATGGTGCACCTGGTCCCGCTGGCCCTATTGGACCTCAAGGAATAAGAGGATTAAAAGGTATCCAGGGACGACCAGGAAGAGACGG 
AGAAATGGGACCAGCCGGCAAGGACGGAATAGAAGGCCCTAGAGGTCAAGATGGAACAACTGGCGCTAAAGGACCTAGAGGATTAAGAGGTTTTCAAGGA 
AGAACAGGAGAAACTGGTGCACAAGGATCTAGAGGAGAAAAAGGCGATAGAGGGCTAACAGGCCCTCAAGGAAGAGACGGTCCACCCGGTGAAGAAGGTC 
CTCAAGGTCTTAGAGGAGAAAGGGGAGCACCTGGCCCTAGAGGTCCTAGAGGTATTCGTGGCCGTTCAGGACCTCAAGGAAGTAACGGCGTGCAAGGACC 
TCGAGGTCCCCGAGGAACAAAAGGAAGAACAGGAATACAAGGCCTCACTGGCATAGAAGGTCCTCGAGGTCCTAGAGGTATACAAGGAAAGGAAGGAAGA 
ATGGGGAAAATTGGACATCGAGGAGAAAAGGGTGATAAAGGAGACCGTGGAGAACAAGGCATCGCTGGAGCAGACGGGGAAAAAGGTCCAAGAGGTTTAC 
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GAGGAATTCGAGGCCCTATTGGTGCTCCTGGTAAGCCTGGCACGGAAGGGGTTRGAGGTCCTAGAGGGGTGAGAGGTGTTCCTGGCTATCCTGGCGCACA 
AGGGGAATTAGGTCCCCAAGGACCAACAGGTCCTCAAGGGCCAGCAGGTCCTCAAGGGCCGATGGGGCGTRCAGGAGATACTGGTCCCATGGGCCCTCCT 
GGAGCAGTGGGACCAAGAGGAGAGAAAGGAGGTAGAGGARGAAAGGGAAAAAATGGCCCTAAAGGAGCGGACGGAAAAGATGCCGTAAATATCATACAAA 
AATATTCAATCACCCATGCTCGTGCAGAGATAATGTGGGAAGGAAATGAAATCGGAGAAGCATACATTGGAAGATCTTATGGAACTGATACAATCCCTGT 
GATGAT AGAAAATAG AATAGGGATC AC AAATGAGG ACAAAAAAAACGAAT ATTGT ATACAAG T AATGAC AATGC ACTCAAT AAC AAC T AGAGGAAGAACA 
TCGGGTGTTTTTGTGGTAAGCAATAAGACAGATTATATCCTTTTAGTTACTTTACTGATGCCAGAJiAGTGTTTCCTGTAGAACAGATGTCAGTACAAATG 
CGAGGTCAGAGAGGGTGAATGCTGTTAGAGAAAGAGAAAGCAAATCGTACAGATTTATTAG6CCGTCTGACCAATCTATAGGTACTCATTCACGTTCAAA 
AATTGCCGTGGTAATGTATCCAGACGCAAGCATGAGTTACTCAGTTGATACATTAGACGCTGATGTGGCGCGAAGAGAAACAACGTCTGTGCTTTTATTA 
GCAGAAACCATACACGGGGAAAAAGATAGAGGTTTCTATGCTGATAGAGGAACTGTAGGGAGGTIGATGGTACCTCCCACTGAAGAAGAGTTATTGGTAT 
TGCAAGC 
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CT1035 

Nucleotide 

Genomic coordinates: 

Start: 130589 

Stop: 131444 (SEQ ID NO: 2) 



Amino Acid 

MAAAKMDAILADINGNDTDLSKLITDVIQKRAKAVMDRNRAKMDMNRRVDEAIQEAVAAK 
KQKALVVFDKLVEETDSGQSVPPTLSGSDYDAWVDRAMPSHIELVESVEGDSLYDKLPPF 
NVQDIDDQIGDEIDTPISYLAMVVVKVDCETGDIEEEYNLAPTFGVTQNNKIYRDERDQI 
FTKADKSVRIFKLAKLDSISGKSRQLTYAVKNNNEYTEFVCSVFAEFESDSDTTKSGIGI 
REYDKPKNEFEYEEREIFTFFIPIQPAGTKLLLYFLVDVRSRII 
(SEQ ID NO: 3) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

Q65326 <Q65326) DMA POLYMERASE (FRAGMENT) 36 0.31 

MYSB_CAEEL (P02566) MYOSIN HEAVY CHAIN B (MHC B) 34 1,6 

O02244 (002244) UNC-54 PROTEIN 34 1.6 

004327 (O04327) CELL DIVISION PROTEIN FTSH ISOLOG 33 2.1 

Q04009 (Q04009) MYOSIN HEAVY CHAIN 33 2.1 

002077 (002077) COSMID C48E7 33 2.7 



Coniinents : 

EST confirmation of the predicted transcript: 

An isolated EST has equence identity to nucleotides 1 to 7 65 of CT1035: this 
corresponds to nucleotides 13064 9 to 131413 of the genomic reference sequence. 

TaqMan Primer/Probe Sets: 

5'start=350 

5'stop=373 

3' start==405 

3' stop=^427 

5'primer-TCCCTCCTTTTAACGTACAAGACA (residues 350 to 373 of SEQ ID NO: 2) 
Tm5=57.95 

3 'primer=CCATGGCAAGGTAAGATATTGGT (residues 405 to 427 of SEQ ID NO: 2) 
Tm3=57 . 97 

probe 1=CGGTGATGAGATAGATAC (residues 387 to 404 of SEQ ID NO: 2) 

probelstart=387 

probe lStop=4 04 

direct ion l=Forward 

Tml-69.08 

SGorel=l . 91 

length=78 
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CT1037 

Nucleotide 

Genomic coordinates : 

Start: 131480 

Stop: 132941 (SEQ ID N0:4) 



Amino Acid 

MFVISIATSLVLFFFLLFVSITILDGAKTIDSQPFRKRRBCRKRYRTSESGDGIDGGTGTT 
NGGGGGGGEGGGGGTNGNGTGTTMGGGGGGEGGGGGTNGNGSGTTNGGGGGGEGGGGGTN 
GGGNGNGGGNGNGNGNGGDTDTDDFEPTPALLKERLLNSISSKPKEYYEAFVSAEVETAL 
QLSRDDSTQTIIIDDDQLELDASDTLQGKPRDYLFKLAGVSSAFLEGTTrRKAEDRARNT 
NEEE I AQT I LS QLREKH I NDE Y DGK Y AT PEERADFSNS LNLVTKYTNHEVGLLVGET I EK 
AFPHEIEFERCIILVEDFNSGTITSNTMQYRSNAYKIRVVEGSTTDPGEVVPDDCLVFAV 
VVNKEQHSLEISATNRCQDICFVIIPRLSAIGKNATMVIRKGDEIKQETYLFVANECNDTT 
HFSIITDKDESVGIELNMLIFSERILPTLSDPATVPRPLTDANVLSAYGKRLGVGAFTDK 
NLLSSQ 

(SEQ ID NO:5) 
Top Blast Hits 

Score E 

Sequences producing significant alignments : (bits) Value 

055514 (065514) PUTATIVE GLYCINE-RICH CELL WALL PROTEIN 120 3e-26 

Q218 35 (Q21835) R08B4 . 1 PROTEIN 112 6e-24 

Q43522 (Q43522) TFM5 GENE 110 2e-23 

053553 (053553) PGRS-FAMILY PROTEIN 108 le-22 

0654 50 (065450) GLYCINE-RICH PROTEIN 107 2e-22 

GRP1_PHAVU (PI 04 95) GLYCINE-RICH CELL WALL STRUCTURAL PROTE 106 3e-22 
Comments : 



EST confirmation of the predicted transcript: 

An isolated EST has equence identity to nucleotides 392 to 1365 of CT1037: this 
corresponds to nucleotides 131892 to 132865 of the genomic reference sequence . 

TaqMan Primer/Probe Sets : 

5'start-723 

5'stop=74S 

3'start-792 

3'stop=811 

5'primer-TGAGGAAGAAATTGCACAAACAA (residues 723 to 745 of SEQ ID NO: 4) 
Tm5=58.39 

3'primer=CCTCGGGTGTGGCATATTTT (residues 7 92 to 811 of SEQ ID NO: 4) 
Tm3=58.37 

probe 1=CAACGATGAATACGATGG (residues 774 to 791 of SEQ ID NO: 4) 

probelstart~774 

probelStop=791 

direc t r on l=Re verse 

Tml=68.98 

scorel=l,98 

length=89 
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CT1038 

Nucleotide 

Genomic coordinates: 

Start: 132993 

Stop: 133896(SEQ ID NO: 6) 
Amino Acid 

MGDKQKVEQLLRELKAEANDDWLSANVDPIVERFVTTKSDETAQVVKQAVDEKYDELLED 
KVEEMRPDIINEASETYDKLAADMIREVDTSSVIAPAIAGTVARTINNLRDKRKEYEKRL 
WTLAYKPWRRYVOAITVMEFRLSYKDLTVHANSDTYLTFPFLRIKKIAYINNDRASPVNC 
SLSVSYPNKSEWGNDNGVGRKVDIHIRRNDLQEKDLYLSVICMLDTDFSGYDKAVEVDAH 
KFHFEAGNRTMFLPKTSNLFNRSHIVNSKICTIVFPPASASSASTTELDNVYYRITCTCS 
(SEQ ID NO: 7) 

Top Blast Hits 



Sequences producing significant alignments: 



Score 
(bits) 



E 

Value 



RESA_PLAFP (Q26005) RING- INFECTED ERYTHROCYTE SURFACE ANTIG 
Q9X9G6 {Q9X9G6) HYPOTHETICAL 35.5 KD PROTEIN 
CAA21397 , (CAA21397) ORF74 

Q9ZFZ9 (Q92FZ9) HYPOTHETICAL 21.1 KD PROTEIN 

RESA_PLAFF (P13830) RING-INFECTED ERYTHROCYTE SURFACE ANTIG 
O9ZH03 (Q9ZH03) LAMBDA HOST SPECIFICITY PROTEIN J 



38 
37 
37 
37 
36 
36 



0.12 
0.15 
0. 15 
0.15 
0.26 
0.45 
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CT1039 

Nucleotide 

Genomic coordinates: 

Start: 133968 

Stop: 136344 (SEQ ID NO: 8) 



Amino Acid 

MEEESQRVQRRIGVLPEEAASQILKDTKLRVSYLGVGHWGYSVSVIKSALQKGCRRNDED 

ITAWSIREAYLYyHLGLNYIENVKPAAKSLNTNMVNRIKIIAVEDTSPRSMVASNECVRT 

LEKYEKGNFRQPSYLMDAAMRLVHASSSRVCSHMRALCCKEEDSDKLGGIYYANFNELET 

QCVSAVNFSPIERIKHVFREIESVKLGKKSVQLLNLRSVAAYHVLRYYGDKVKDTNKKHS 

GPFKRKEFEQFWGLCFKFVTQHVKTDPELRCYFNELTYAINWRRDFFCSKGFFREESLFL 

TSIVELIIAMCIGDRKQFAKIQKRDLKRFNKGEEGRKEEAATFDWIEGHVKRMPQMPVWV 

LDKHTNKNTHGVSFALESSMVSGGDKRWSPGVWLHSYTKMRLDSPPPPEVGQFLDQAFNT 

LKREAASHCVTRNICTTTGFIKASSFTANINSEPMEIKEEIKKRKIEIKDDNTTATVTVS 

ATTSSSITSTPPPTKKQKTTPSGSNKVDSIQLNNLPTLNMEDLDRVLEVHNQNSKKGVAA 

TVLMKDGNKVVFKEMRKSFGWGSHQNFVQVLKDEDVCKLDYLLPCPDSGPYRGLYRCYFK 

IVKDEISSTAARIEKVKWGENAMCYFISGCVTRQEGIGKIITDVRLSHMGPNKQYVYDNY 

RQLI H I LI FRLLTGVS DTNTSN I LVGDGGNLFS V DEN Y VGAKDPRTALENRKI KELQLLL 

KTSFKVNFCVTKEDIDSCLPSWLFDTSKSDKIMNGVCNIGKNMGIGPTTLDIVKNNCTCIL 

GVVNDLLYDNK 

(SEQ ID NO: 9) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 



Q9Y1H4 (Q9Y1H4) GAG 43 0.011 

RAP1_YEAST (P11938) DNA-BINDING PROTEIN RAPl (SBF-E) (REPRE 41 0.031 

SLY1_YEAST (P22213) SLYl PROTEIN 40 0.070 

P90603 (P90603) MUC.CL-1 38 0.20 

052224 (052224) GLUCOSYLTRANSFERASE (EC 2.4.1.5) ( DEX TRANS U 38 0.35 

Q20202 (Q20202) F40E10.5 PROTEIN 38 0.35 
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Comments : 

EST confirmation of the predicted transcript: 

An isolated EST has equence identity to nucleotides 1 to 1092 of CT1039: this 
corresponds to nucleotides 1338 66 to 134957 of the genomic reference sequence. 

TaqMan Primer/Probe Sets: 
5 • start=1255 
5'stop-1277 
3' 3tart=1353 
3* stop=1375 

5*primer=AATACATTGAAGCGAGAAGCTGC (residues 1255 to 1277 of SEQ ID NO: 8) 
Tm5=58 -71 

3'primer^CCTTAATTTCCATCGGTTCAGAA (residues 1353 to 137 5 of SEQ ID NO: 8) 
Tm3=58.18 

probe 1=TAGTCATTGCGTAACGAG (residues 1278 to 1295 of SEQ ID NO: 8) 

probelstart=1278 

probelStop=1295 

di r ec t ion l=Re verse 

Tml-69.03 

scorel=l . 96 

iength=121 
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CT630 

Nucleotide 

Genomic coordinates: 

Start: 249215 

Stop: 247358 (SEQ ID NO: 10) 



Amino Acid 

MALKDAFTERLVVNKVGSGTDMAPVVEDDRQKSLFQKVENLYRVLVVEQKNSAITLSGNK 
NTNKRQCRQVEEDKVIFEGEDRTVSNLPQAVKETIAANAESILDYWYKNVIPLLDTKKER 
SGKSDTFLRTAVICLVRCCVSYKDMKTCSLIYEFEHKILNKSTLDPLLKDILDNKQELLH 
MDSKYGSKTTSPELAKETIEALYTTVYNHWTNAFKLYQASLTHKPVTGKKYASVIHFIRT 
WRKIVKAYVSKHNNVERDLSLKNIMKNESADNANVLTIEKMYKKIGNSVKNTNNNSAHQM 
SDSEDDDDDDDDDCEGMDVCDEASEREKKHQESLY PINTPVTT ITGDY I FPCVLLELVLS P 
HIHPEWKIPMCDFVNRNIPKLMKAMETDISNAVIEVRASKVNPVQILPIAANFWDFCKSG 
KPPSDVKFCMMFNEPSSNETLSSGAGVFGRFIGGPFSHKSKELDIISNCLRSLLLNKEAD 
NLSTRIWREGGSVVCFNYCPITARGAVLGYGEQLSERSIKALWAKKIQDAVTESVKRQRN 
AADKNSRNCDLLGDEGVVSMKTVTFGCANMLKTQNGMGKFNWVSFEDSIQANKEGAARQ 
YMSQQVFTHSFPALDQGK 
(SEQ ID NO: 11) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 



097318 (097318) PFC0865W PROTEIN 45 0.002 

YBO0_YEAST (P38114) PUTATIVE 12 6.9 KD TRANSCRIPTIONAL REGUL 4 4 0.003 

Q08281 (Q08281) CHROMOSOME XV READING FRAME ORF YOL138C 42 0.010 

Q92271 (Q92271) 12.8 KBP FRAGMENT OF THE LEFT ARM OF CHROMO 42 0.010 

062235 (062235) F36F2 . 3 PROTEIN 41 0.018 

Q83970 (Q83970) (CPV) 41 0.023 
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CT631 

Nucleotide 

Genomic coordinates: 

Start: 264975 

Stop: 259164 (SEQ ID NO: 12) 



Amino Acid 

MGSKRPCSSGQEPVTKKQKKNNNNNSNPVPVINIKSYPFLATRTQVLRSAVAAAAASPSG 
SSSSSSSSASAVKLPDTCKEARKVLSTVSLQQSLAVRYLCNSISVSYAGGGISVFHLGGL 
PGAGKTTMVKELIAVLNDHGLIDSGSADMLLCCKSNSAKESLMCACKKPGGSSLMYPESV 
FSTLNKGFEIPVIFRKDEITLEKIQFVADKLKWKVIQVLANLRFLVIDEYTMASCRELVF 
IDAVLRIAKHRPDIPFGGVFVILLGDNRQNSAVVEDNTNHIQKKIKNPSEEEKPQKNNKN 
NKNKKKKKEKKEKGGEEEEGDENEEEEGEEEEEEEESDDEAETKKEEEKSTFFQGSVEQD 
NFGQEDNAKLYTEVFIKILKMFCSRDFFGNPSNLRNIVNKRHEAILMKSNNVKSVNNNLV 
SSAIKVEDCGNASNKKEVTAPSSSPAQSTAEENCDEFDDEEDDDFFNNEAFLKLMERNAL 
EKDRASGALNGFSLRCKSISDANEKIRSGTTSVSDKKSSLDMMKSLPLSALIEEGICSEL 
AHISELKE^SNANLEKYTENVCSIVFDMMAKAMREIDYSGREKLYIVSSLSERFKDTHLT 
SLMDEEILNVKYVHGSDPKCIDAVPFNSAHNRASAVAACVRNAFFRDGKDFVDETPIAWY 
FKDNLRTVASFLENETLTYKELLAKSENIRSILLKKETGNNNSASSRTAAAAAYYEDDED 
YCYFDEEEAMDLEDGGSGGSGMKSSGGGGDDDDEESGEMIYRTDIPDKLHRDASTLDRVG 
HLVDFHVVWKKWLTENKPSDLVRARVWYLYTLVRMOQVKFDNGKLPSLDKSALSGRLFHS 
PSEWATSTGVGVGGGGGAGADKPLHDEYWLRVLSMPISTGGDVGKSMLLPAYSSYLSALS 
RTYIMSSLKRIDIIKHAYSLMYGISLFDMTANLQDLVDTRMAGRSSRNGSVFMDNFDPVQ 
YFDNIFPSMVNEFLMYRKEDVFNNGQMMEGVKGSLKISRVLQTAHTENNNINNRHNNSLK 
YSEKSIVLAMQMVTSISKGNERRKKIEEFITKEQGQPKDMCERLMANSKAKQEKDAISSK 
TDKMMGAITLTKKHVLKNAVSNLVDTSIIKETKKNNNNSSSSSSTSLAAAAAVENSVPAL 
RVEVKFVVLNMDLSDISHEKTISHKYRQQLINAIKTRSTPLFDKFTDRKILRAAESPRAL 
TTILLDEKKKVTE^KSITLYQGQNVIFTTSNRMIHGTQERFVTKDTGVVTNLMYKNGELT 
VFVYVERLGQKCLEIKEGRQIIGNPNIKNGGFGNNVYVQYLPFESSQT^TIYSCQGHTFF 
RDTIVDLSGASTQDAYVAVTRNSNPQNLFIIQNHSVERGNLCNIKCAMSKDKAYTMPIGG 
lADFNGSDFINHDTVSVSREVAESSAAMDDDYNGDGGVTMYSAYDPSKDVVAAAEEFILS 
RSGKSLSFNASWMANTAKVIQOHGLETELKNIRDFFFGVNNGDVAKHYEKLCNKKMIELY 
TAIVRSITHYSIASGIVKQPSSKLCEEYETKQKNKKDYIKIHPVFVNRAPKESTIEMLLF 
DIAPHNKATIVFQFYVHYIFLVYEKLNVLNSSFAFLPSPNPCLNQYVRPKSITTNSTHVP 
NLGYESKDFAHCKDGGERDVKLRLPITSADEFSNNIEGILKKVSDTSNQNKVNKYMDVVC 
KSMQHNLRRTGKFCRPTETCGLSKHGSIVTSTCTAQEKGENIHVDAEKGWLCMSDEANVY 
CMLMFMSKIAAASGVSEFPIKDKSILESNPETPSDTISLLAPRKTISPTNNLHFSMSEDV 
LFCGQVHPMKRVQFSLHVKRTGGALKSTFEEEEGLPTKIFSPNFATYPLFKKCKMYGAII 
lAMTEMQGHEFAKYSTLDIRKSMFTGVGTVVDLEKISGEGNEVMDKVDKFIVKWVSNILF 
KEQGKRVSFFVSCAIH 
(SEQ ID NO: 13) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

Q9YTL7 (Q9YTL7) ORF 48 68 5e-10 

035788 (035788) CYCLIC NUCLEOTI DE-GATED CHANNEL BETA SUBUNI 68 5e-10 

NAB3_YEAST (P38996) NUCLEAR POLYADENYLATED RNA-BINDING PROT 68 8e-10 

Q07034 (Q07034) RNA BINDING PROTEIN 68 8e-10 

096229 (096229) HYPOTHETICAL 78.6 KD PROTEIN 66 2e-09 

096134 (096134) SER/THR PROTEIN KINASE 66 3e-09 



Comments : 



EST confirmation of the predicted transcript: 

An isolated EST has equence identity to nucleotides 1 to 830 of CT631: this 
corresponds to nucleotides 259304 to 260133 of the genomic reference sequence. 

TaqMan Primer/Probe Sets : 
5' start=2408 
5*stop=2429 
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3*start=2489 
3'stop-2509 

5'primer=GAGCCCGTGTGTGGTATTTGTA {residues 2408 to 2429 of SEQ ID NO: 12) 
Tm5=58.49 

3'primer=AGCGTCCAGACAGTGCAGATT (residues 2489 to 2509 of SEQ ID NO: 12) 
Tm3=58.56 

probe 1=CACTAGTGAGAATGCAAC {residues 2432 to 2449 of SEQ ID NO; 12) 

probelstart=2432 

probelStop=244 9 

direct ion l=Forward 

Tml=68,99 

scorel=l . 99 

length=102 



FIGURE 3, Sheet 8 of 160 



wo 01/38351 



50/201 



PCT/USOO/28888 



CT632 

Nucleotide 

Genomic coordinates: 

Start : 268485 

Stop: 267717 (SEQ ID NO: 14) 



Amino Acid 

MAGVDLYGGHIKPYGETVFNNKMQGNRGKIRALINEKAAATLPMSEDNISAWVTEVAADV 

FPDPKSALTFFVPNKSLNAFAWDVLKTPASVEIDIGKRIPQLIENLHMSDFTVAIFRVKC 

DDQGRYETSYNLSPSMGGKINHGLIRTLAKAQDIVVWKRDFSLTIENFEVDNGKKRLDFL 

FNNQTDKSCFVKIFHEMESEKDIAIKPEKRGSSAVWDEVYSDIVTKNTRNAKFSLRYRNE 

KPVDHLLLYCMVTYF 

(SEQ ID NO: 15) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 



O97036 (097036) PLC-BETAH2 


34 


1. 


.5 


045329 (045329) F09C6.2 PROTEIN 


33 


2 . 


.5 


P97868 (P97868) PROLIFERATION POTENTIAL-RELATED PROTEIN 


32 


5. 


. 6 


P70287 {P70287) RETINOBLASTOMA BINDING PROTEIN 6 (PACT) (FR 


32 


5. 


. 6 


AAD4 9229 {AAD4 922 9) EHEC FACTOR FOR ADHERENCE 


31 


7 . 


. 4 


CAB55629 {CAB55629) LYMPHOSTATIN 


31 


7 . 


. 4 



Comments : 



TaqMan Primer/Probe Sets: 

5'start=378 

5" stop=403 

3* start==452 

3*stop=477 

5'primer=TGAAACCAGCTAC;UVTTTATCTCCTT (residues 37 8 to 4 03 of SEQ ID NO: 14) 
Tm5-S7.68 

3'primer=CCTCTTCCAGACTACAATATCTTGGG (residues 452 to 477 of SEQ ID NO: 14) 
Tm3=59.43 

probe 1==TCAGAACACTGGCTAAGG (residues 434 to 451 of SEQ ID NO: 14) 

probelstar t=434 

probe lStop=4 51 

direct xonl== Forward 

Tml=69. 15 

scorel=l . 84 

iength=100 
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CT633 

Nucleotide 

Genomic coordinates: 

Start: 266790 

Stop: 266442 (SEQ ID NO: 16) 



Amino Acid 

MASPLVASLGGGKNILFGLLLITIIVIVIAVIIIKAPLLASLLAGTALAGTIASALGSIP 
GVGGAFKKAFGKGKGKGGPKTPDGGAKKTNQKPKKGKKKPPTRRSIFKRIPKIKF 
(SEQ ID NO: 17) 

Top Blast Hits 

Sequences producing significant alignments: 

Q02391 (Q02391) CYSTEINE-RICH FIBROBLAST GROWTH FACTOR RECE 

Q910ig (Q91019) MUTANT CYSTEINE-RICH FGF RECEPTOR 

Q69526 (Q69526) GLYCOPROTEIN B 

Q9XZ15 (Q9XZ15) HYPOTHETICAL 29.3 KD PROTEIN 

Q14113 (Q14113) AORTIC CARBOXYPEPTIDASE-LIKE PROTEIN ACLP ( 

P79922 (P79922) MODIFICATION METHYLASE (EC 2.1.1,73) (CYTOS 



Score 
(bits) 

36 
36 
36 
36 
35 
35 



E 

Value 

092 
092 
12 
12 
21 



0. 
0. 
0. 
0. 
0, 
0.21 
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CT634 

Nucleotide 

Genomic coordinates: 

Start: 283360 

Stop: 282673(SEQ ID NO: 18) 



Amino Acid 

MVSSRTSTTSSSAVAATSTLLPTKRKREPEEVKVKVEVKMEQEELVEDSSSNKRPRIKEE 
KEEEHKETHHLSLPCKEEEDDGEEEEYEEEEDEEEYEDRVDDDTAEKMENLLVQLDNTTK 
NIKLKNPLREHDMAVSHYEHEFEVQNTVNFSFGVLSDIGFLINREAVSRWGNTPPPKEFG 
DME I GSLT VNQLLHKC DN FVQAV VQKVKEDI T PS IE VT I DSLI DDPCW 
(SEQ ID NO: 19) 

Top Blast Hits 

Score E 

Sequences producing significant aligniaents: (bits) Value 

035788 (035788) CYCLIC NUCLEOTI DE-GATED CHANNEL BETA SUBUNI 60 2e-08 

GARP_PLAFF (P13816) GLUTAMIC ACID-RICH PROTEIN PRECURSOR 57 8e-08 

Q9YTL7 (Q9YTL7) ORF 4 8 56 2e-07 

094922 <094922) KIAA0835 PROTEIN 55 3e-07 

O0899S (008995) MYELIN TRANSCRIPTION FACTOR 1 55 3e-07 

Q93424 (Q93424) HYPOTHETICAL GLYCINE-RICH 37.0 KD PROTEIN E 55 5e-07 



Comments : 

EST confirmation of the predicted transcript: 

An isolated EST has equence identity to nucleotides 575 to 1 of CT634: this 
corresponds to nucleotides 282678 to 283252 of the genomic reference sequence. 

TaqMan Primer/Probe Sets: 

5'start=210 

5*stop-231 

3'start=287 

3'stop=308 

5'primer=CCTCTCCCTCCCATGTAAAGAA (residues 210 to 231 of SEQ ID NO: 18) 
Tm5=58.19 

3'primer=TCGTCGTCCACTCTGTCTTCAT (residues 287 to 308 of SEQ ID NO: 18) 
Tm3=58.4 0 

probe 1=AAGAGGAGG AT GAGGAAG (residues 266 to 283 of SEQ ID NO: 18) 

probelstart=2 66 

probelStop=283 

direction l=Re verse 

Tml=68.99 

scorel^l , 98 

length=99 
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CT635 

Nucleotide 

Genomic coordinates : 

Start: 285773 

Stop: 284075 (SEQ ID NO: 20) 
Amino Acid 

MARSVGLLSVTPEYDTFKYIKMEEFKTLKVKNGFTISGENPDKYEHILLSFKSVDRVTKS 
ELRDGLYIVRLKDKEVLHIKNGVHRLRQLTGDNTLQVGLKYTHNLPRLGSLLQDDGCEDY 
GEKWNESLPIDMQNINKIVKEKALLSDKNFKFSPLYRLLHERLSNAAVKKCDYMIITTDF 
LVGCGYTPSHCPRTLRNMEQLLVEQCGFSSRISVYDICDRLTYKGAYIANPITGSYSNMC 
LIVPMDKLGLIFYNSTHPSAKSIGNYMSSLFNATVIYANERDNLQMDNFRREIKFAENEV 
NMKEEELKELRKRCAVSEEQRISLRDVHKKSSIATSRYDGGACLVFAFSDRDFSLLCRTN 
GNGSFYSATEEGIRYVSSPEYKKRDVGERRPRLIMSITGSDAPICIRDSVRNHFKTRLFS 
RTSGNSITFAVPPGERELMEMVREVTGTDIKIFMDNGKVYQNGAEINVIDPTSKEYKELL 
KREENLPEDERKRLRRERRMIFNTSRAISMYNEERGDGGSGGETSEDGDGNGSTSSKGEK 
RKREENEGNEYVVLLNKACKDIKVC 
(SEQ ID NO: 21) 

Top Blast Hits 





Score 




E 


Sequences producing significant aligninents: 


(bits) 


Value 


Q21885 (Q21885) COSMID R09H3 


37 


0 


.32 


077336 (077336) PFC0425W PROTEIN 


37 


0 


.42 


000164 (000164) RIBOSOMAL RNA UPSTREAM BINDING TRANSCRIPTIO 


36 


0 


.71 


UBF1_HUMAN (P17480) NUCLEOLAR TRANSCRIPTION FACTOR 1 (UPSTR 


36 


0 


. 71 


YPT2_CAEEL (P41880) HYPOTHETICAL 21.6 KD PROTEIN F37A4 . 2 IN 


36 


0 


.93 


YG2K_YEAST (P53253) HYPOTHETICAL 106.5 KD PROTEIN IN CTTl-P 


35 


1 


.2 



Comments : 

EST confirmation of the predicted transcript: 

An isolated EST has equence identity to nucleotides 501 to 1 of CT635 : this 
corresponds to nucleotides 28404 7 to 284547 of the genomic reference sequence. 

TaqMan Primer/Probe Sets: 

5' start=686 

5' stop=706 

3'start=799 

3' stop^822 

5*primer=CAAACCCAATCACAGGCAGTT (residues 686 to 706 of SEQ ID NO: 20) 
Tm5=58.55 

3'primer=GGTTGCATTGAAAAGAGATGACAT (residues 799 to 822 of SEQ ID NO: 20) 
Tm3-58.19 

probel^ACTCCAACATGTGCCTAA (residues 707 to 724 of SEQ ID NO: 20) 

probe Is tart =7 07 

probelStop^724 

directionl= Forward 

Tml=68 . 89 

scorel=l .89 

length=137 
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CT636 

Nucleotide 

Genomic coordinates : 

Start: 286706 

Stop: 286076(SEQ ID NO: 22) 



Amino Acid 

MIVFVEGSPLTGKTSWVDNMRTAGKGKQSFLNFMYTNYRDYLPIFPWTIQEHLRASDYQE 
RPRLVDGMFGSSLNFFTGMWRHDTEQFPESKIGLREYLEMYGEEFKACVAEWVKYKPVFH 
VMVYREEDVKKMEPIIQELNDAHNWFIDVLKEERALFVKIEVIPRNVYKGNICSSCFSTS 
KNYVYRVGKCTNSIVHCDMKCKFIAEKII 
(SEQ ID NO: 23) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

P87968 (P87968) ENVELOPE GLYCOPROTEIN, VI -V5 REGION (FRAGME 33 2.0 

P88315 (P88315) ENVELOPE GLYCOPROTEIN (FRAGMENT) 33 2.0 

P88314 (P88314) ENVELOPE GLYCOPROTEIN (FF^GMENT) 32 2.7 

DYHC_PARTE (Q27171) DYNEIN HEAVY CHAIN, CYTOSOLIC (DYHC) 32 4.6 

040068 (040068) GP120 (FRAGMENT) 31 6.0 

040067 (040067) GP120 (FRAGMENT) 31 6.0 



Comments : 

EST confirmation of the predicted transcript: 

An isolated EST has equence identity to nucleotides 1 to 594 of CT536: this 
corresponds to nucleotides 286111 to 28 6704 of the genomic reference sequence. 

TaqMan Primer/Probe Sets: 

5*start=239 

5'stop=259 

3*start=317 

3*stop=335 

5'primer-^GGAGGCACGACACAGAACAGT (residues 239 to 259 of SEQ ID NO: 22) 
Tm5=58. 81 

3'primer=CACTCGGCGACACATGCTT (residues 317 to 335 of SEQ ID NO: 22) 
Tm3=5 9. 42 

probe 1=TGGAGATGTATGGAGAAG (residues 293 to 310 of SEQ ID NO: 22) 

probels tar t=293 

probe lStop==3 10 

directionl=Reverse 

Tml-68. 95 

scorel=l . 95 

length=97 
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CT637 

Nucleotide 

Genomic coordinates : 

Start: 300432 

Stop: 299085(SEQ ID NO: 24) 



Amino Acid 

MGGEDSFDDRYDSDALWENEGAKSIQVKETDLEVYRMHRRAVPTLEEKNRTALRYYSDWS 

PVYRVPLFSLKDGSDPHERDFSFNVDPRRFGKVPVKVRRVDVRNPSRTAAIFVPTGPGLH 

VSSYTGDGMLVCPNHNFIGDLCSEIASDITIYNTSSSGRLSYATNFNSVEDNSPVGILFE 

TLPDDKMFQQVSIFSATEPASNISIGPMSHVKIKLGYYDEENATAVGVIRYGGLFYTSVG 

ACIIPEGVFFDDVVGNHSSMNIYNMTNQPKEIVLKEPRGEDAMEEDDGEEADYNFLGYVV 

RFEHDLKMQAMSSAYSSVSIDINSSSFHKCFLIKPKYNSILQPLVSSEVVLNDLSLNTRG 

REVEFHDRLPSGAQDNSYSIVKYMKIVSLKEGLKWNPIINTELYKKKQALKVHVLNMTR 

DVVGLDTSEHSFGVIVCHAAKLPEVIGQ 

(SEQ ID NO: 25) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

023592 (023592) CARBOXYL-TERMINAL PROTEINASE HOMOLOG 34 1.6 

YGA1_YEAST {P53199) PUTATIVE 3 BETA-HYDROXYSTEROID DEHYDROG 34 2.1 

VILIDICDI (P36418) PROTOVILLIN (100 KD ACTIN-BINDING PROTE 33 4.8 

Q9Z5B9 {Q9Z5B9) PUTATIVE TRANSFERASE 32 6.3 

Q9YUY4 {Q9YUY4) ENVELOPE GLYCOPROTEIN 32 8.2 



Consents : 

EST confirmation of the predicted transcript: 

An isolated EST has equence identity to nucleotides 3 to 1254 of CT637 : this 
corresponds to nucleotides 299132 to 300383 of the genomic reference sequence, 

TaqMan Primer /Probe Sets : 

5'start-^678 

5'stop=702 

3'start=788 

3'stop=812 

5 *primer=CGGTGTCATTAGATATGGAGGATTA (residues 678 to 7 02 of SEQ ID NO: 24) 
Tm5=57.26 

3 'primer-^TCTTTTGGTTGGTTAGTCATGTTGT (residues 788 to 812 of SEQ ID NO: 24) 
Tm3-57.75 

probe 1=CTACACCTCTGTCGGTGC (residues 7 05 to 722 of SEQ ID NO: 24) 

probe 1 start ==7 05 

probe IS top=722 

direct ion 1= Forward 

Tml-68.85 

scorel=l . 85 

length=135 
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CT1040 

Nucleotide 

Genomic coordinates: 

Start: 137588 

Stop: 139940 (SEQ ID NO: 26) 



Amino Acid 

MAAAAVSGEGRISADLLLLEQLTPDGDVIRYDSEQYTKPRKIFGDKSVIETIGHFLIHNH 

NQGESYQIASSVLEKFPALLNCIWNGESGGMALWKALYRAKKYRLLNSLLVHKIKNWPSV 

AVIPIYGSVCDREERPIIMSEIIDKETLQTICKSDIRSLLGMMNAKHGTLGGNFLHFYAR 

STKPFENFQYEAMGANAVLMAAEAIYDGFRDHGLNPSEYTFPGLESADVYGNNPVEIAIS . 

GDDDNMLLNLICNYGVSYEKTRGRVNRSLLDFLKMNTASKCLSVLKFVEKHFKIESNTPK . 

GEFEEKAETCVNCLDRNNVLTKGSEQESYKLSCGHFLHVKCLRNICIVSQHLRCEKCLKR 

FDESILRKCTPNLNWWLTMPAGAGNEEEICFMRNKKLVDDFRKLLSPVSIPHFFKNSRQR 

NLDMLCPYSDHTIIPNKEDPKKNEDGNRVRVNHTAISEKQNKEEEDARIKRVAVRTFTAI 

REKQNKEEEDARIKRAVDMAVAAINEKNKEEEDARIKRAVDMAVAAINENNKEEEDARIK 

RAVDMAVAAINEPCNKEEEDARIKRAVDMAVAAINENNKEEEDARIKRAVDMAVAAINENN 

KEEEDARIKRAVDMAVAATNEKNKKEEDARIKRIIDLTVDMRIQRIVDMAIAAATKKDKK 

EEEKRTKREQELRADLRRAMDMVNEVQKKLEDMELEKGCNKDEAKNTSNVVSSSSVVAYS 

KEIVPCLGNNNNAVIGMTSTNYSANNTKNNVFGSPHKFSFNDASRFSNIVETPKMSFNFS 

FKT 

(SEQ ID NO: 27) 
Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

Q9ZU69 (Q9ZU69) PUTATIVE VICILIN STORAGE PROTEIN (GLOBULIN- 71 4e-ll 

Q26775 (Q26775) TB-292 MEMBRANE ASSOCIATED PROTEIN 70 5e-ll 

INCE_CHICK (P53352) INNER CENTROMERE PROTEIN (INCENP) 67 5e-10 

YDF3_SCHPO (Q10475) PROBABLE EUKARYOTIC INITIATION FACTOR C 66 7e-10 

Q26774 (Q26774) TB-291 MEMBRANE ASSOCIATED PROTEIN 66 9e-10 

P91257 (P91257) SIMILAR TO C, ELEGANS UNC-89 65 2e-09 



Comments : 

EST confirmation of the predicted transcript: 

An isolated EST has equence identity to nucleotides 853 to 415 of CT1040: this 
corresponds to nucleotides 138912 to 13934 9 of the genomic reference sequence. 
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CT1041 

Nucleotide 

Genomic coordinates: 

Start: 140110 

Stop: 141616(SEQ ID NO; 28) 
Amino Acid 

MVYKGFVSPSFITTRTMTSNRPTTSPLSFSEGFSLSGDKYDTYEDTLLEQFNCFKTSSPS 

SARKSEIEDKTLIFQLKEGEKFHLAKGIEELREILDDNSATIEPIISPTTFNDRNELLNH 

EGDI S S S PLYTQIMKH I SPEH DI YELDLI VGT DLLFGLGVNLRN VSKLMKKI S YGTLN VV 

DVCHRKFFNNRIIVNPISSSFSKNVCIIPLFSAAEEFS5LGECRDLFNGICDDVERYINS 

YFFYPENTTTTTTTAPSSPEMEIADEEEOSPKTIKRNDNASRNWSGVCLIFEVFKNTYYI 

INRGDRGGSFEKAVKSAISSIKEKRCKITDINGNKPRLVMVITGCYTELYFKDALKQIGE 

NRRKFLKMNGNYFSLIDEQADLIEFAMSVSGAGERIFVNGLGMFQNRKMIPVIDPLTYEN 

VVCGEHDIQKEDAILSVRRAIADYNDFVSKNKRGKKRSAEEENEDEDADASSSSSSSPPP 

SSPPAHKKSRLPDEGEKCTLC 

(SEQ ID NO: 29) 



Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 



Q26258 (Q26258) BR2 . 2-BALBIANI RING |5' REGION, REPEAT UNIT 


38 


0. 


. 12 


Q23804 (Q23804) SPID PRECURSOR (FRAGMENT) 


36 


0. 


,82 


097324 (097324) MAL3P8.1 PROTEIN 


36 


0. 


, 82 


Q99112 (Q99112) HOMEODOMAIN PROTEIN BW2 


35 


1 . 


. 1 


Q99111 (Q99111) HOMEODOMAIN PROTEIN BW2 (FRAGMENT) 


35 


1 . 


1 


YA55 METJA (Q58455) HYPOTHETICAL PROTEIN MJ1055 


35 


1 . 


. 1 



Comments : 

TaqMan Primer/Probe Sets: 

5'start=822 

5 ■ stop=84 6 

3'start=929 

3'stop-950 

5'primer=AAAGAGAAATGACAACGCAAGTAGA (residues 822 to 846 of SEQ ID NO: 28) 
Tm5=57.4 6 

3'primer=GCACTCTTCACAGCCTTTTCAA (residues 929 to 950 of SEQ ID NO: 28) 
Tm3=58.24 

probe 1=AACTGGTCTGGTGTCTGT (residues 847 to 864 of SEQ ID NO: 28) 

probelstart=847 

probelStop^864 

direct ionl=Forward 

Tml-69.07 

scorel=1.92 

length=129 
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CT1042 
Nucleotide 

Genomic coordinates: 
Start: 141695 

Stop: 142541 (SEQ ID NO: 30) 



Amino Acid 

MAVNLDNVLVNINNKDEDLTKLVSEAIKRRAKTVFDTKNQAGFDMRRQVEAALYEAISKKK 
EKAIKAFDELIQERGDEITPLTTMQYEEWVNRTITPSLTTENfLLGDVEHADFLLDRMTPVS 
EEDIEGFAASTFKEVSDSKTATVIVKADCETGDIDEVYNLAPSFGVTQEIKIYRSNNSSEL 
DNVADSFHIYKISATDSDSGNTKKLLYGLRNKKAGYTCLCRIFAEIESDGIMANTNIGVAE 
NNRDEIDENEEGKYGFLIPKQPAGAKLIIYFFLNCWTX 
(SEQ ID NO: 31) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 



gi 124 92980 |sp|Q10043|YRPl_CAEEL HYPOTHETICAL 37.6 KD PROTEIN Rl . . . 


33 


0 


79 


gi 1401691 I sp|Q00933|YSCI_YERPS YOP PROTEINS TRANSLOCATION PROTE . . . 


32 


1 


8 


gi 1 1706480 1 spl P51892 1 DNLl XENLA DNA LIGASE I (POLYDEOXYRIBONUCL. . . 


32 


1 


8 


gi|2675701spiQ01250|YSCI_YEREN YOP PROTEINS TRANSLOCATION PROTE . . . 


31 


2 


3 


gi 1 3121979 1 sp|O07597 1 DAAA BACSU D-ALANINE AMINOTRANSFERASE (D-A... 


31 


2 


3 


gil 6686325 |3p|P71018|PLSX_BACSU FATTY ACID/PHOSPHOLIPID SYNTHES... 


31 


4 


0 



Conunents : 

EST confirmation of the predicted transcript: 

An isolated EST has equence identity to nucleotides 700 to 1 of CT1042 : this 
corresponds to nucleotides 141748 to 142447 of the genomic reference sequence. 

TaqMan Primer/Probe Sets: 

5'start=350 

5 * stop=367 

3*start=392 

3 • stop=414 

5 •priit\er=GGACCGAATGACACCCGT (residues 350 to 367 of SEQ ID NO: 30) 
Tm5=57.70 

3*primer=CCTCCTTAAAAGTAGAAGCAGCG (residues 392 to 414 of SEQ ID NO: 30) 
Tm3=57.66 

probel=AAGCGAGGAAGATATTGA (residues 368 to 385 of SEQ ID NO: 30) 

probelstart=*368 

probelStop=385 

directionl=Forward 

Tml=69. 01 

scorel=l . 88 

iength-65 
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CT1043 

Nucleotide 

Genomic coordinates: 

Start: 142610 

Stop: 143699 (SEQ ID NO: 32) 
Amino Acid 

MTVLAVYTAPQrKKSKKRKTEDE^3EEEPVKTLEDFVKGRI.LNAVKEKPAEYFELLISADT 
EAALKTAEETALRDFVIENDSVEIDVEEVLEEKPREYVFKLAGATSETLTNTIIAEVQKK 
AALITEEDITIKMLKQFRAANKDNKDGEATPEEKEDFTNNSDLVGLYLNEVVEKTTNIVI 
NKIFPHEMVFERCAILIEDFDTGVVTDQAIQIPSNKYKIRLVEGDEPEVFPGDCLDLAVS 
VDKINHVLKISAKNGCENNCFVIIPRFSPVGSVSSMILGSTDQVKPKTFLFLANKNDSTH 
FQFTMDKQHSVGCELDMLIFSERNLRNLPDSKPRPLSDADILASYGKRLGTGVFTTENLV 
DD 

(SEQ ID NO: 33) 
Top Blast Hits 



Comments: 

EST confirmation of the predicted transcript: 

An isolated EST has equence identity to nucleotides 1 to 982 of CT1043: this 
corresponds to nucleotides 142640 to 143621 of the genomic reference sequence. 

TaqMan Primer/Probe Sets: 

5'start=543 

5 'stop^Se? 

3'start=635 

3 • stop=659 

5 'primer^CAAAATATTCCCTCATGAGATGGTT (residues 543 to 567 of SEQ ID NO: 32) 
Tm5=58.37 

3 •primer=CTGATTTTGTATTTGTTGGAGGGTA (residues 635 to 659 of SEQ ID NO: 32) 
Tm3-57 . 33 

probe 1=GACTGATC7VAGCCATTCA (residues 615 to 632 of SEQ ID NO: 32) 

probelstart=615 

probe IS top=632 

direct ion l=Re verse 

Tml=69.01 

scorel=l,98 

length^ll7 



Sequences producing significant alignments : 



Score E 
(bits) Value 



Q26938 (Q26938) KINETOPLAST-ASSOCIATED PROTEIN (KAP) 

023332 (023332) CENTROMERE PROTEIN HOMOLOG 

Q9ZES5 (Q9ZES5) CTC PROTEIN 

AAD45753 (AAD45753) AN0N1A3 (FRAGMENT) 

066878 (066878) CHROMOSOME ASSEMBLY PROTEIN HOMOLOG 
AAD457 59 (AAD4 57 59) ANON1A3 (FRAGMENT) 



40 

39 
39 
38 
38 
38 



0.022 

0.037 

0.049 

0.083 

0.11 

0.11 
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CT1044 

Nucleotide 

Genomic coordinates: 

Start: 143759 

Stop: 144689(SEQ ID NO: 34) 



Amino Acid 

MSSSSSETPKTSTDTGEERIKDIVNALDNNGEWLSSYIDPIINNHISRKTAETVQKINQE 
VDERYDRKIADKINEIKSSIFTSAQTMYDQYAIDTFQEGKGANGTGPVMGPVNTVIDTTL 
NKMRGNMLEYAEDMWDGDDWKRFSSSMTTLEFDLSYSDLTMMRGSDGYFAFPFRGTKKIK 
MDGSRKKEDPINCIISVTYPNKVGDEWEEGKEREVNFNLERVDDYERDIHVSILCMLHAQ 
LDNFEQALGENANSFYFKKGQRVMFLPKKSKLFNRPTVEDSDMFSIIFPPASDQDFADDI 
YYRIIVTCS 
(SEQ ID NO: 35) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

013706 (OI3706) HYPOTHETICAL 11.8 KD PROTEIN C13F5.07C IN C 36 0.47 

Q20960 (Q20960) COSMID F58A6 35 0.61 

CAB59514 (CAB59514) HEAT SHOCK PROTEIN 70 34 1.8 

Q45851 (Q45851) NEUROTOXIN TYPE F 34 1.8 

017208 (017208) C01B12.2 PROTEIN 33 3.1 

BAA83026 (BAA83026) KIAA1074 PROTEIN 32 4.1 



Comments : 

EST confirmation of the predicted transcript: 

An isolated EST has equence identity to nucleotides 1 to 954 of CT1044: this 
corresponds to nucleotides 143768 to 144721 of the genomic reference sequence. 

TaqMan Primer/Probe Sets: 

5'start=330 

5 'stop=350 

3 •start=406 

3'stop=427 

5 •primer=GCCAGTGAACACGGTTATCGA (residues 330 to 350 of SEQ ID NO: 34) 
Tm5=59.57 

3 'primer=ATCGTTTCCAGTCATCTCCGTC (residues 4 06 to 4 27 of SEQ ID NO: 34) 
Tm3=59.04 

probel=CTCGAATACGCTGAAGAT (residues 382 to 399 of SEQ ID NO: 34) 

probelstart=382 

probelStop=399 

direction l^For ward 

Tml=69. 00 

scorel=l . 99 

length=98 
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CT1045 

Nucleotide 

Genomic coordinates: 

Start: 150687 

Stop: 154344 (SEQ ID NO: 36} 



Amino Acid 

METTMDNVVQNNDVTKPTPDVATVTTATEKRQSCKEKKDQLKAECPQVLRALKLSNTLKA 
NFGKSMSAIFAQHLVDMTNAKHFKDPKTKKILELDGSSSSDSEEEEETSSSSKRKRGSGA 
RSASSKKEKCPNTIKNWLNDAQGVFRQFADIIINLPSFDDLRDEVKDEQTELKTIYDLYR 
QDMEKVVEEVLGRQDLFDHKSErAKGLARFDTHVSLLPSDRSAVLDSSlSKELEKNSKGP 
NSNIFDTLNTLKEEIKELLCHHVKYLLQNLTPEDANFVFNSSVKYVKKSYQYYIQTSEME 
SDEFKSLLTGVNIKILEKIISSDNNVATPYKHITNPRNIISSLQKVRETKPVSKDYPFRV 
DTARDIVLLPETGGISDLPIKPVTLLQLVSYINALFSLERRNVFTDGFFNAACVLISQCL 
TNANLLSNDFPKPIELAANVTRHNLLSMKMLQEGSSSEKKSKKKEKKKDKKKGGGGGDDS 
DSETDSSSSSSSSSSSSSSSSSEDEEEEKGEAVEKGKKTKRKTKKKPSKDDDLDTISKLI 
LKTGGYFHDTSELGNKIRNLIDKDDFAGVAQYAVTITEMQSTPMNQRLVSSLLDIilMRLK 
EQVKYSVDTESTSSTAKSNNALDSAKLTSQQVVTMMVDSGAELARLAAFFFVVVDNTVVN 
RHEAF I LTSKLLPSNENRGLKT WES FFKNLT I SNKVSTSNEEMMSVMPFEDEQQQQQCP 
QHEQQPDLKRVVGEVFLEMGKSIVNSFPSNKSVQLTADAFKQNYSPMGRRINLAAKIKTA 
ISIGSNISPNILFSNLPESVGNNTVTGLRLTNLLKNISQSAQANNIIKNANTLVNNTMDQ 
QNSAAMSILLFPPTSKETSIFPGNDPSSIKLQDMTTMSNLARGFYSIAEGCIGVVRSREF 
DEGGVKAYTLLVDSNTMDMAVNFAAQSLEKSMSEALTNNANMNPSNVLEGGSFVDGALSY 
MFEKNGSDCEPTPLAKYTMKDVSNRYLKKFNNDKNTQDLYKNRAERALVEQVTNKPTSVV 
HSQLANAMGVAVIGAASIKLMEAEAAE3EMRAANYQATSKSTNAINITNTIGMIRNTTHL 
CTTIAVSAAADMSKLANNHFMSVLNTANNSHSSRRGDRSSLMLQQQQPTHSAFLEQTRGR 
GGGVLGSGTEQTKDHVERMKRDWILNMISPEDKNTTTTTPSNAGRTLGYGSNITGINTIK 
QDDKSMMDKLSEMSSFRT 
(SEQ ID NO: 37) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

Q9XY26 (Q9XY26) HYPOTHETICAL 75.5 KD PROTEIN 59 2e-07 

055035 (O55035) PEPTI DYLPROLYL ISOMERASE MATRIN GYP <EC 5.2 59 2e-07 

VTA2 XENLA (P18709) VITELLOGENIN A2 PRECURSOR (VTG A2 ) [CON 57 5e-07 

Q07034 (Q07034) RNA BINDING PROTEIN 57 9e-07 

NAB3_YEAST (P38996) NUCLEAR POLYADENYLATED RNA-BINDING PROT 57 9e-07 

095367 (095367) CBFl INTERACTING COREPRESSOR CIR 57 9e-07 
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Comments : 

TaqMan Primer /Probe Sets: 

5'start=1624 

5 'stop=164 6 

3 'start^lTOZ 

3 'stop==1720 

5 •primer=AAAACAGGAGGTTACTTCCACGA (residues 1624 to 164 6 of SEQ ID NO: 36) 
Tm5=57.88 

3 'primer=CTGCATATTGGGCTACGCC (residues 1702 to 1720 of SEQ ID NO: 36) 
Tm3=57.81 

probel=CACGAGTGAACTCGGCAA (residues 1647 to 1664 of SEQ ID NO: 36) 

probelstart=l647 

p robe 1 3 1 op=- 1 6 6 4 

direction l=Forward 

Tml=69.00 

scorel=l . 99 

length=97 
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CT1046 

Nucleotide 

Genomic coordinates : 

Start: 154556 

Stop: 156932 (SEQ ID NO: 38) 



Amino Acid 

MSLVENNTQEEMILETTVEGVVEGAEVAPRGVKRPLPS5SSSSSASDSEDDEGGEQPQTK 
PPKKKRNINSGKYWKIETIEPASPEMLSAVNDIDNVSKTIPLIDNSFGVQFKKSVSEEQI 
KTLLT ET I AVE YGT I TN VKY S TFNQLERTGE PLKKKRSNNGNNN YR YWQ I RI E AAAAEN V 
TQAVLDAIVEGNDTVIKAILLPEGEGIGLQFNKSVSSQQAKNIVQAADIEFGQVAHMKCN 
LFHKMEKADESSNSSGESPKVKKVRRNKSQPTNSYYTFTMIGDSLQERIDNAIKVIEMSP 
VKRPFSNSAAAACEDTTTTTTTSTGVVNPRGIKDIHFFDSSISKGCFTVRNIVAANGEVP 
QEEFVSELYTNLLKVEEPWDHPTFKKLIHDRTMNRHIKAWYCICPYYTTGGVPPAADKVS 
AKGIATYRIYEDRTGVFQFDGAHTSTTPAQAAEATGAIHKSMLFQSPGTDIQKFLDAKKA 
EGLEPISSGEIVYRSKWSPNDSRATRCFKFYSSSDEKMNIADVLSIVHTDGLFSSVHFRK 
DTMEYGVAKSKSKIIPKTIKIKKGGDTFHSEEDIEVPVKFTAITSEELNRECNTKGMNSL 
RAHKKRKSNSSTTTTSTTSTSTTANTPKKTKKSASAASDPFAKLTLDYVDSTSFVFYNIS 
KEMVQRILAQERVKTLKAVKNEEKMEIVEGEEAQETYRGIVKIKTNAKAYNLANKTSGVL 
FPADKVCLKHTLEDLGDVLDFDVVREDNVNKTVASTTTTSSENKASGGDDEETPMEFETD 
GEKLLHELLNE 
(SEQ ID NO: 39) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

Q45759 (Q45759) CRYSTAL PROTEIN 41 0.023 

097003 (097003) L1156.7 PROTEIN 41 0.030 

002061 (002061) B0041.7 PROTEIN 41 0.039 

AAD55361 (AAD55361) XNP-1 41 0.039 

KI67_HUMAN (P46013) ANTIGEN KI-67 4 0 0.051 

P7906S (P79065) NOCl PROTEIN 40 0.066 



Comments : 

EST confirmation of the predicted transcript: 

An isolated EST has equence identity to nucleotides 1 to 935 of CT1046: this 
corresponds to nucleotides 155963 to 156897 of the genomic reference sequence, 

TaqMan Primer/Probe Sets: 

5*start=1338 

5*stop=1355 

3 ' start-1419 

3 • stop=1440 

5 'primer=TACCCCAGCACAGGCAGC (residues 1338 to 1355 of SEQ ID NO: 38) 
Tm5=59.36 

3 •primer-TGCCTTCTTAGCATCGAGGAAC (residues 1419 to 1440 of SEQ ID NO: 38) 
Tm3-59. 46 

probel^GAGGCAACTGGTGCTATT (residues 1357 to 1374 of SEQ ID NO: 38) 

probe Is tar t=1357 

probelStop==1374 

direct! on l=Reverse 

Tml=69.01 

scorel=l . 98 

length=103 



r 
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CT1047 

Nucleotide 

Genomic coordinates: 

Start: 159378 

Stop: 161256(SEQ ID NO: 40) 



Amino Acid 

MSCSSSSSCSSSSEEENEVGVEGGGGRIGPTEAKKKILRKRKRSSVKSTSSSSSSSSSSD 
DSDSDREEKEGRKLYVDIADTRKPPKVRKLDTPSQTLENDLYMSSSSSSSSSSSDSSSSS 
GEEESDDDDDDDYDPDNVHVLGCKKEKSPQDIEAEKEKEEEYEEEFKRMALPSRINTSVD 
DCVIPDRILTLFSTLLKKNSFQFSQPVSFLRLVMKQVNEAMNSAFSSMLSSSGMRLVEDS 
LGDTSKISSFITPQTDTSNSSSSSTFVNNCTDEDTKKRNIAMGRVAELLSNIAASSNEEN 
NFRPVVSLMRGPTCGGSNASNKKLNSNRQTIPQVLNKVIFFREIHSVIALYLSSVCVQRA 
MNNDNTNSSGYAEGMVTKILNIIGKIPYNEMSREKFISVGRDALYLYQNVITDMTGPKHN 
KRLRIPQQQADFCYIIAMLVNDVPITSDLLLTGKATNLVQFASAMVDPAYRLAVHKMASV 
FNSSYSVYKVLDLDHKMLLRANLILSILSARNKCLSERKPRTLTQSVYLFLNHLLRNKLR 
SSGLTSEESSLGTAVKLVSQQLMYEGVTRQTIEDGCSMISGNFEDEDGVTLKCLGADVKD 
VKTVGLSALLSDRLRKWIRRNVPFY 
(SEQ ID NO: 41) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

SR40_YEAST (P32583) SUPPRESSOR PROTEIN SRP40 74 2e-12 

NSRI_YEAST (P27476) NUCLEAR LOCALIZATION SEQUENCE BINDING P 63 6e'09 

VIT2_CHICK {P02845) VITELLOGENIN II PRECURSOR (MAJOR VITELL 61 3e-08 

095815 (095815) DENTIN PHOSPHORYN (FRAGMENT) 60 4e-08 

VIT_ICHUN (Q91062) VITELLOGENIN PRECURSOR [VTG) [CONTAINS: 59 le-07 

GAR2_SCHPO (P41891) GAR2 PROTEIN 58 2e-07 

Comments : 

EST confirmation of the predicted transcript: 

An isolated EST has equence identity to nucleotides 2 to 888 of CT1047: this 
corresponds to nucleotides 160277 to 161163 of the genomic reference sequence. 
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CT1048 

Nucleotide 

Genomic coordinates: 

Start: 161717 

Stop: 165020(SEQ ID NO: 42) 



Amino Acid 

MFGSSANNFNGDKKSSSSSSAAASSDDQQLGPLGLSTADFKKVAAILANRTESLYLLPDS 

PNFKNVINNPNQISIVPFLGSSKAAESGSANKNENQAENSSKGGSDGKKSSQQNKFNLLN 

KVEAEEMAFKRVAELIADTPPSKDNPLRDDPDAIPSRNPWVKLTQKNLEYLFWEAVTIEV 

SNDRSIRSGRYLQASEVGENPFLMTISVDIRILQRMALNVVWFFNRFFRMVSGLGVENRA 

NSTYVATSDAIAQIWVEMLLKNFISGENVPQALKYLKEHYEHVYNKISKCGRQPSYFVVE 

FERVDNTIGFVNSDTEHNGSSYMEYRCFDTIRKNASSGPSGGGKSGVLSSGTFFIDNEMG 

NNNSSAAAASAPAVSAGVSPSLSPFSSDGDDDDDDCSGDDVWGKKMIFNTSGDGSGESSG 

QNGGGASTYKRFRCGENTASLSQKENVRLMAMPKGNEDKQLLKNIINFLNSALNSVENHV 

MCTDENIFDEDQAEHYTSNKELYKAIVCSNPANVYRVMVELFVNLILPRLRNPIVSDIET 

VQNLPSNNGSVRTKKMVEHGCTDMRYDIPPYAKGKIRLSAKRACECRKLCKDVRCFDKSR 

EANLTPSQKAGREVEEPFPRNHNSHRSNAHDFTFYDKYRARMNKLKKDSKKKVKKIDTFT 

TTDDFLLQDRNAFDLLRKCFLSASLHHIFCPDVLMVHRGDSFNINFANNKLECYNERNGI 

EEVTSSQTVNAKEALEDITKIKMKRGDDIIDVVKSKGLSLREFSKKVSKIVRRFNEITNQ 

LCNNCNVNSSNGDVDFHVFTSVCVYIHNIIPVLEDISIFAELGEELTKLVKECRDVAGED 

KTYDDIIRNYEITVKYFKLFNALVKFCHRNYNVAVTSAINRRGYMCMVSNLVGYYCKLSD 

NAIQYHESLCSLHSSISYADYYTSRNNNSEDGGGNSSSEKSNADVAKTMASFYDQFDKSE 

DSKKNKNKTSNEILIKMFQMDRVLDGMDDDDDEDSDSSSSENEEEEEEEEIVKKPAKKRK 

VEDVDSNKKTLPKEPAVKKVKQEEDVEMEEVKEAAAEEEKKEEQEAKEEDATEYDDDTEE 

DEKAVASDEDEDDEDSKAIF 

(SEQ ID NO: 43) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

BAA83091 (BAA83091) HHNBV-XIA 427 e-118 

Q9YTL7 (Q9YTL7) QRF 48 88 3e-16 

Q18401 (Q18401) COSMID C33G8 77 7e-13 

035788 (035788) CYCLIC NUCLEOTIDE-GATED CHANNEL BETA SUBUNI 74 5e-12 

096127 (095127) PREDICTED SECRETED PROTEIN 73 8e-12 

096229 (096229) HYPOTHETICAL 78.6 KD PROTEIN 73 le-11 



Comments : 

EST confirmation of the predicted transcript: 

An isolated EST has equence identity to nucleotides 109 to 282 of CT1048: this 
corresponds to nucleotides 164 907 to 165080 of the genomic reference sequence. 



FIGURE 3, Sheet 24 of 160 



wo 01/38351 



66/201 



PCT/USOO/28888 



CT1049 
Nucleotide 

Genomic coordinates: 
Start: 170115 

Stop: 170733 (SEQ ID NO; 44) 



Amino Acid 

MAGIGRRDNRPVLHLDIDPNKEIPYNVPPTPIICEKNPFVFNMQKCSDCAPFPPYPGTEK 

PFPPYPGTAVEEEEKQKEIEELLVDQSFPPPFPGNKLRDIPRTYPLEFPEKKEKDFPCVD 

TTGHSDIPFIDLEKTPPRSDVRHGYHYLINPNKVGELNHIVGKLTEKQENLNKLVLDVDD 

VVINLSSTLKELEKLRAGLCKFSKN 

(SEQ ID NO: 45) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 



PGCA_BOVIN (P13608) AGGRECAN CORE PROTEIN PRECURSOR (CARTIL 39 0.020 

CAB40774 (CAB40774) EXTENSIN-LIKE PROTEIN 37 0,13 

AAD45972 (AAD45972) WISKOTT-ALDRICH SYNDROME PROTEIN INTERA 36 0.30 

Q22807 (Q22807) SIMILAR TO E. GRACILIS MAJOR MEMBRANE SKELE 35 0.39 

014686 (014686) ALR 34 0.67 

Q28226 (Q28226) MUCIN 34 0.87 
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CT1050 

Nucleotide 

Genomic coordinates: 

Start: 170831 

Stop: 171461 (SEQ ID NO: 46) 



Amino Acid 

MASSSSSPVALSSVASSVMMERDEENTLSLRNRNVNKPTPVSAAWVPVDEEDEDREEMRR 

LEDFSSDEEDDDNKSCHCDHSDDDDDDEEDPSCFKGFSAGLCSFVRGFFGFLRKSLTKKQ 

VFLLTSAAVAAIFKTRDVAKTEEGAATMEENSTDVITGGDGDSGIAADVVSLASEGEGEN 

GSLLESIATTLIKTTIENLVDGGEETTEL 

(SEQ ID NO: 47) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

077320 (077320) PFC0335C PROTEIN 45 4e-04 

CAQC__RAT (P51868) CALSEQUESTRIN , CARDIAC MUSCLE ISOFORM PRE 43 0.001 

O49209 (049209) PUTATIVE HISTONE DEACETYLASE 43 0.001 

Q9YPA9 (Q9YPA9) HYPOTHETICAL 45.2 KD PROTEIN 4 3 0.002 

Q9814 8 (Q98148) ORF73 HOMOLOG 43 0.002 

040947 (040947) ORF 73 43 0.002 



Comments : 

EST confirmation of the predicted transcript: 

An isolated EST has equence identity to nucleotides 1 to 637 of CT1050: this 
corresponds to nucleotides 170864 to 171500 of the genomic reference sequence. 
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CT1052 
Nucleotide 

Genomic coordinates : 
Start: 173177 

Stop: 175862 (SEQ ID NO: 48) 
Amino Acid 

MTRHGVLVPK GRSRHVILGN VDYTFCTTDN NCVSLDIDFK DNITDQNIQL 

LNKKLGKKTA KKIKKEDAPE TKENSDEDIY ATKEFEQTIK GLQTKKGATE 

ENAIAAAAAA ATAAAVEKAM LSESEGKSMV INRARMVLSK RDTSQKQFTA 

LKNRESFFSV LIFETGSVIV VGLQDPSLTK LCVIKATTDI ADILQKNISV 

ANVSIVNTVS TFNRFHLNFI RLGKFFERNC ISYSYNPETF PGMFFKLRVP 

AKPLLPGETI GEYYTKVAMM RDSKDPNFKM SDWLRIKTAL TFKVGKITVL 

GEGESGCGDV SVVSKLLFGL FHYFMDNNIK MSPKEAQRVR EKYGIPHLEW 

YLYIDMLLHS YPYVKPSAEQ VKRAMVDQQH ISEVDRTYYG TKNSMDAAMS 

ANLVPSKEES ISFIKKIRSQ QLFGHLCKPS KETTRRAIDT LSFDPINQDR 

WWNKNDQYYG KERCDPFSVA RLVSVSENTN SMMNSRISCQ GKWWLDENEY 

KDKLDHIVDL CTEEIVEECE SKGFIASPFL RKHQKEKIPT PYVLLARACN 

QKNGNKMSIN NNSNYLSGSS RAKRNAKLQE KHRVTLARLN TMMASYRFLN 

NYISTDIAPD FAKLFGNDVY SLLHLMTNLP KSRGHALTYN ERALSSNEST 

YKTPGNAYFS TLFEKSIINN QETANKGNNR KRKFSRIGQE KSSFLCNACG 

VNLNKGSDEI IKGICTSCDQ NSTSYIENAL SDINRDKKIK RFKAAATHPP 

VKQELVDSLS SSSSPSSSSS QTSNKNNRCT PSDFIDYVYK FTDETTGAPK 

VGLVFKMCDI LASLASRRGM EDRPTANYRT SLHSATQNKT NLNKLLVSAI 

KETGATETEA QIFNKIIGSE KGLSILCQLV ERRNKDNNVF D 
(SEQ ID NO: 49) 



Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 



gi 1 6647869 1 sp 1029874 |TF2D_ARCFU TATA-BOX BINDING PROTEIN (TATA-... 


44 


0. 


001 


gi|66^7868 |spl027664 ITF2D METTH TATA-BOX BINDING PROTEIN (TATA-.., 


43 


0. 


002 


gi 1 3122925 jsp I 012731 |TF2D_EMENI TRANSCRIPTION INITIATION FACTOR... 


43 


0. 


003 


gi 1 135643 1 spl P13393 1 TF2D_YEAST TRANSCRIPTION INITIATION FACTOR 


41 


0. 


010 


gi 1 135627 1 spl P28148 1 TF22_ARATH TRANSCRIPTION INITIATION FACTOR ... 


39 


0. 


038 


gi 1 135626 1 spl P28147 1 TF21_ARATH TRANSCRIPTION INITIATION FACTOR ... 


39 


0. 


050 


gi!3334374 1 spl043133 |TF2D_CANAL TRANSCRIPTION INITIATION FACTOR... 


38 


0. 


065 


gi 1 135639 1 spl P17871 1 TF2D_SCHP0 TRANSCRIPTION INITIATION FACTOR ... 


38 


0. 


11 


gi 1 2833518 1 spl Q57930 |TF2D_METJA TATA-BOX BINDING PROTEIN (TATA-... 


38 


0. 


11 


gi 1 417896 1 spl P32085 1 TF2D_CAEEL TRANSCRIPTION INITIATION FACTOR ... 


37 


0. 


15 


gi 14 17882 |sp|Q02879|TF22_WHEAT TRANSCRIPTION INITIATION FACTOR ... 


37 


0. 


19 


gi 1 121568 1 spl P22010 |GR78_KLULA 78 KD GLUCOSE-REGULATED PROTEIN ... 


36 


0. 


25 


gi 13122941 jsp IP93348 |TF2D_TOBAC TRANSCRIPTION INITIATION FACTOR.,. 


36 


0. 


25 


gi 1 3915894 1 sp J P52653 |TF2D_ENTHT TRANSCRIPTION INITIATION FACTOR,,. 


36 


0 . 


33 


gi 1 1729908 1 spl P50159 |TF22_MAIZE TRANSCRIPTION INITIATION FACTOR... 


36 


0. 


33 


gi|1351224 |sp|P48511 |TF2D_MESCR TRANSCRIPTION INITIATION FACTOR.., 


36 


0. 


33 


gi 11729907 1 spl P50158 |TF21_MAIZE TRANSCRIPTION INITIATION FACTOR... 


36 


0. 


33 


gi 1135640 I sp|P2 6357 |TF2D_S0LTU TRANSCRIPTION INITIATION FACTOR ... 


36 


0. 


43 


gi 11709903 tsp|P54 637 |PTP3_DICDI PROTEIN-TYROSINE PHOSPHATASE 3 ... 


36 


0. 


43 


gi 12833459] spl Q55031 ITF2D SULSH TATA-BOX BINDING PROTEIN (TATA-... 


36 


0. 


43 


gi 1 586175 1 spl P32623 1 UTR2_YEAST UTR2 PROTEIN (UNKNOWN TRANSCRIPT... 


35 


0. 


56 


gi 1 2833446 1 sp IQ52366 |TF2D_PYRKO TATA-BOX BINDING PROTEIN (TATA-... 


35 


0. 


56 


gi 1 3915739 j sp t P18428 1 LBP_HUMAN LIPOPOLYSACCHARIDE-BINDING PROTE . . . 


35 


0 . 


56 


gi 1 135634 1 spl P26354 1 TF2D_ACACA TRANSCRIPTION INITIATION FACTOR ... 


35 


0. 


74 


gil 135636 |sp|P20227 1TF2D DROME TRANSCRIPTION INITIATION FACTOR ... 


35 


0. 


74 


gi|4171021sp|P32103|Hl_EUPEU HISTONE Hi, MACRONUCLEAR 


35 


0 . 


74 


gi 1 2833477 1 spl 057050 ITF2D PYRFU TATA-BOX BINDING PROTEIN (TATA-... 


35 


0. 


74 


gil3041729|sp|Q03410 |SCP1_RAT SYNAPTONEMAL COMPLEX PROTEIN 1 (S... 


35 


0. 


74 
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gi|312294 6|sp|Q4 2808 |TF2D_SOYBN TRANSCRIPTION INITIATION FACTOR... 
gi|126745|sp|P14873|MAPB_MOUSE MICROTUBULE-ASSOCIATED PROTEIN 1... 
gi 1 4033393 I spl P78695 I GR78_NEUCR 78 KDA GLUCOSE-REGULATED PROTEI . . . 
gi I 126511 I sp I P12744 I LUXB_PHOPO ALKANAL MONOOXYGENASE BETA CHAIN... 



35 
35 
35 
34 



0.74 
0.74 
0.74 
0. 97 



Comments : 



TaqMan Primer/Probe Sets: 

5'start=l268 

5'stop=1291 

3'start=1327 

3*3top=l350 

5 'primer=TTGGACATTTGTGTAAACCTTCAA( residues 1268 to 1291 of SEQ ID NO: 48) 
Tm5=57 .29 

3 ' primer=CCTGTCTTGGTTTATAGGATCGAA (residues 1327 to 1350 of SEQ ID NO: 48) 
38)Tm3-57.4 5 
primerScore=0 . 74 
allelel= 

probe 1-AACTACTCGACGTGCTAT (residues 1296 to 1313 of SEQ ID NO: 48) 

probelstart==1296 

probe 1 S top= 1313 

direction l^Reverse 

Tml=69.02 

scorel=l - 97 

length=83 
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CT1053 

Nucleotide 

Genomic coordinates : 

Start: 175839 

Stop: 177107 (SEQ ID NO: 50) 



Amino Acid 

MSSTDLSKNAFHDWVVSKTDCEVFDVHCETDRDCGAACENTYSVDGKEVTKFSCNQQSGR 
CARSVYSASSLERAANDLGHIIGIIKKNPKLEEELPESFLWFINHNGGDLFVNKRAAYYD 
TMHLS I GKLDN VDTLAQGLDKRMAS S LREHLLRKLDS I LLQI DKVKYEKAKKW I LDITQE 
AGTEEDNKEEEDAKKEDQSLSVSEIVDVLTGTHDPMPLRARGFIQKKIYPLSRNELRELA 
LKELFPEETTSPQVLSRQHDVSTREDLCNESMNAGRAESIFSDPDSGEYVATCACLYSEY 
LTGPACKHKTYRYVIDYDKWKRTGRPEFLTDPVLHFKKAEAVCKSTNPNLRAIYSPDNKG 
FLCAPVAELVKTALTFRGSHEPSLIVERDINQAENLPSNSFGVNWPYVNLLNRIQDQYT 
(SEQ ID NO: 51) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

Q26648 (Q26648) TEKTIN Bl 37 0.30 

RRPP_VSVSJ (P03520) RNA POLYMERASE ALPHA SUBUNIT (EC 2.7.7. 36 0.52 

RBB1_HUMAN (P29374) RETINOBLASTOMA BINDING PROTEIN 1 (RBBP- 36 0.68 

CENE_HUMAN (Q02224) CENTROMERIC PROTEIN E (CENP-E PROTEIN) 35 0.89 

RRPP_VSVIM (P04880) RNA POLYMERASE ALPHA SUBUNIT (EC 2.7.7. 35 1.2 

Q89487 (Q89487) PHOSPHOPROTEIN 35 1.2 



Comments : 

EST confirmation of the predicted transcript: 

An isolated EST has equence identity to nucleotides 1 to 1109 of CT1053: this 
corresponds to nucleotides 175933 to 177041 of the genomic reference sequence. 



FIGURE 3, Sheet 29 of 160 



wo 01/38351 



71/201 



PCT/USOO/28888 



CT1054 
Nucleotide 

Genomic coordinates: 
Start: 177123 

Stop: 178524 (SEQ ID NO: 52) 
Amino Acid 

MSASLILDEYLKKTASAVLDVADSFEKIKGEIQSPEEAAALSVALYGAPPKPSASAVASII 

TGERTSLNDKYLSDNVLLKMSVARVGQENNRKRADQAADEIRTIMEDITGSLSGAYRQYSP 

LEEENKVHIGIMNNKTPSIVCGYYTMDTSISSEPLSLTDFQNPTVIANVTKRMESIFSKVD 

SARSTRFDAFVNGVANNMDIKSSIDWANMVENVIKLPDSTPNPCSVDTIVSRDASVVKTAV 

NDIYASVGKSYCRPATQLTFMSEIEKLRKAAVVCFEALMSDTRERAFVEFLFYVSFKEDAS 

NTNSKLFVQNKLSSMSGNPRQPIKLVRRSAEETLFGLCFMFKVMPPEFMNCIFNFPTIPHS 

TQYHGLYGTCLTPLLRKYGSSFEKSWAHFEEILSERANAVKKFGVNDTRIDCLDAVANLTG 

PVYVLILDLVRTLSAQRSCSTKFLREIKENYLLWNRFVSX 

(SEQ ID NO: 53) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

gill26194 |sp|P17279|LEU2_RHIRA 3-ISOPROPYLMALATE DEHYDRATASE (I... 33 1.4 

gi|6685598|sp|095613|KEND_HUMAN KENDRIN (KIAA0402) 33 1.4 

gi 12497227 I sp|Q04893 I YM96_YEAST HYPOTHETICAL 113.1 KD PROTEIN I... 33 1.4 

gill34393|sp|P17065|SEC2_YEAST PROTEIN TRANSPORT PROTEIN SEC2 32 2.4 

gi 1547908 I SP1Q024 55 |MLP1_YEAST MYOSIN-LIKE PROTEIN MLPl 32 2.4 

gi 12496893 |sp|Q09462 I YQ52_CAEEL HYPOTHETICAL 30.9 KD PROTEIN CI... 31 4.2 



Comments : 

EST confirmation of the predicted transcript: 

An isolated EST has equence identity to nucleotides 1 to 899 of CT1054 : this 
corresponds to nucleotides 177561 to 178459 of the genomic reference sequence. 

TaqMan Primer/Probe Sets: 

5 •start=678 

5*stop=701 

3'start=754 

3 •stop=777 

5*primer=CACCTAACCCTTGTTCAGTTGACA (residues 678 to 7 01 of SEQ ID NO: 52) 
Tm5-59.08 

3*prxmer=CAATAAGATTTTCC7VACAGAAGCG (residues 754 to 777 of SEQ ID NO: 52) 
Tm3=58.08 

probe 1=TATTGTGTCCAGAGACGC (residues 703 to 720 of SEQ ID NO: 52) 

probe lstart=7 03 

probelStop=720 

direct ion l=Forward 

Tml=68.92 

scorel=l . 92 

length=100 
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CT1055 

Nucleotide 

Genomic coordinates : 

Start: 178S29 

Stop: 179348 (SEQ ID NO: 54) 



Amino Acid 

MAQTSKMGTNKRCFEEEVEEERQQPFTKKSKSEPPSFEDKSSSTSSKKKSKSNKHTKTKE 

EQLLEFVKDLERSDPTVPDEKVKQEVEEKSPEAIAEIFSMFGIAQDSKFKSLLPIERIKS 

ITTKIVIDAINQPVRKMLVDHLYHFKEMQNVVEKYKDDSDEKLSVILKSKKSPKEFDLSF 

SDYVDRLNRILVGVIKRVAGAIESKELLQSNSMIMNSVLGTVVSNIPYNMKINICVFLTN 

FICTFANDDLYTFFRDDEKFVMSQVTRYISKD 

(SEQ ID NO: 55) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

001761 (001761) C. ELEGANS UNC-89 (GB:U33058) (NI D : Gl 1 60355 45 8e-04 

Q17362 (Q17362) UNC-89 45 8e-04 

Q17595 (QI7595) SIMILARITY TO MYOSIN HEAVY CHAIN 43 0.003 

031329 (031329) ERPM 42 0.004 

FKB3_YEAST (P38911) FK506-BINDING NUCLEAR PROTEIN (PEPTIDYL 41 0.007 

AAD55361 (AAD55361) XNP-1 41 0.009 



Comments : 

EST confirmation of the predicted transcrapt: 

An isolated EST has equence identity to nucleotides 1 to 661 of CT1055: this 
corresponds to nucleotides 17 8612 to 17 9272 of the genomic reference sequence. 
Hit to public sequence gi I 685 6160 I gb I AF173992 . 1 to CT nucleotides 647 to 819 of 
nucleotides 910 to 738 of the public sequence with a 100% homology, a score of 
343 and an Evalue of 6e-97 . 



TaqMan Primer/Probe Sets: 
5* start=269 
5 • stop==289 
3* start=340 
3 • stop=360 

5'primer=CCCCTGAAGCTATTGCTGAAA (residues 269 to 289 of SEQ ID NO: 54) 
Tm5=58 . 07 

3'primer=GCTCTTTATGCGTTCAATGGG (residues 340 to 360 of SEQ ID NO: 54) 
Tm3=58 . 32 

probe 1=AAGTTCAAGAGCCTTCTT (residues 322 to 339 of SEQ ID NO: 54) 

probe Is tar t=322 

probelStop^339 

direct ion 1= Forward 

Tml=69. 00 

scorel=l .88 

length=92 
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CT1056 

Nucleotide 

Genomic coordinates: 

Start: 185432 

Stop: 186830 (SEQ ID NO: 56) 



Amino Acid 

MAVGDYLSMSS VGEATLVG FM I LNFI N FVT IliSLI I YAVTDVYRRCKRPSTNG YSGCTTN 

VVSSTLQEANLVTTEKDKPVQFVRGLVPRKMMEKYRSDLSPKNVGEYILPSEKETDKLKS 

DYKKGKECVGLLTALSNGHDSNKRIIGPRDLISRDDVKDKSYVFKRLSKDPLVYYSSATSK 

YVRKFSPFRAKKFMTSTQLGSKLVYPHPIRYGTAFVLPTGYVINKAYGMDNEDLHTWNPP 

SSSVLVPDSNNDRLTVECAKTDPTHRIGIYGFGGSDDNRRAKEEGYVEMLLCNCDNHKDL 

LKAPLITEYSTNPTEIQVDVAAKRVLFPAPGSEPVKSSQVTSAAHQLDGATGEHDISHEP 

VKLSDTGDYAVGSPIVFKPVYGTSLVNLPETGSPLALNCPCTDKADGIYQVNQKGGILYR 

DMVGYLNANPVEAASLSSSDSSSWLTTGNKISSVTCEGEKIKKIV 

(SEQ ID NO: 57) 



Top Blast Hits 

Sequences producing significant alignments: 

Q99175 (Q99175) HYPOTHETICAL 58.5 KD PROTEIN 
CAB49723 {CAB49723) HYPOTHETICAL 52.6 KD PROTEIN 
Q9Y6X0 (Q9Y6X0) SET-BINDING PROTEIN (SEB) 
Q9ZXH3 (Q9ZXH3) INT44 

055105 (Q55105) MULTIPLE LIGAND-BINDING PROTEIN 1 PRECURSOR 
Q9WXH8 (Q9WXH8) PYRUVATE ORTHOPHOSPHATE DIKINASE 



Score 
(bits) 

34 
34 
34 
33 

33 
33 



E 

Value 

2.2 
2.9 
2.9 
3.8 
5.0 
5.0 



Cornments : 



TaqMan Primer/Probe Sets: 

5'start=663 

5' stop'-684 

3'start=705 

3'stop=725 

5 'priiner=CGTGATCAACAAAGCATACGGA (residues 663 to 684 of SEQ ID NO: 56) 
Tm5=59.59 

3' primer =GAAGAGGGTGGGTTCC7U1lGTG (residues 705 to 725 of SEQ ID NO: 56) 
Tm3=59.41 

probe 1==GGATAATGAGGATCTACA (resxdues 68 7 to 704 of SEQ ID NO: 56) 

probe Is tart =68 7 

probelStop=7 04 

directionl=Reverse 

Tml=68 . 99 

scorel=l .88 

length=63 
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CT1057 

Nucleotide 

Genomic coordinates: 

Start: 190875 

Stop: 193236 (SEQ ID NO: 58) 
Amino Acid 

MEYMEEGD I AERRS EGVDY I LDENS AC VVN VKS I RNRLGAM DAEEAQ YAQDI S AQLVTH I 
IRLAHCSESNKIKDTIASIAGLFINNIFDNNSTKNKLKTYNQFKAESQNKSSVLNIFGSL 
DPLSMLSSFMGSDPAKSGGENLDKSLGVLFEVLQNYNPCKIDDIVLLEMCPSKCAACTGL 
KEAIRQEQPMEAMLLFFKCINHNRFNFGSDIKSAYASETCMRYSQDERAVVVPLRSILLG 
CLDRDDPAHTLSSFGDTIEYADSDNAWVSSLFAAVSRMPMVDRAVIAHFYVYTMLSRHRR 
VSGDSFKQFVYTVFVRMIYSAIEILFCDTENSSVECDGKHFLSYVNAMVNVSVLGSTFNV 
LKAYRSWVVDQASVAPVLDIISGGWKKNYPSPDHIKRVAYDISQVINHLASPSRMVKGNN 
KASNVTSGLDSIRSVRQAEKYIPFGILENKAGYGVINIAKHNISRPAREQSNGRNFNCNA 
LHILPSIKGCEALGAQKGSADQTVNVFDNFVASHMDIAMKKQGSGKILGLLTSMIDRQGL 
TTSFPSSEAEYKKRIHDFTRYVIFSSTPINDELVNSRCILPHSNVLNSPISLRNIDPESV 
PDTRFHFLLMMWQRPNIDEPNLSALTTSQLELLLSKNQKWDKLTTRAFFNIDRINFQMAD 
AIIKNVSGSGFLDGSKTASSSSSAPNFFQIFSGAECTAKQLQSIRKFIGESMQHVQKEWS 
SAVNNGNRGVENYDGLNAQFSEELFELLYKLIIEEDMRPSSLIASSEFLSNYVNAMDELL 
I RAN AS 

(SEQ ID NO: 59) 
Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

CAB52133 (CAB52133) RCOP C7 (FRAGMENT) 34 3.0 

IDHC_SOLTU (P50217) ISOCITRATE DEHYDROGENASE [NADP] (EC 1.1 34 5.1 

Q23615 (Q23615) ZK822.4 PROTEIN 34 5.1 

YSW1_CAEEL (Q10017) HYPOTHETICAL 63.8 KD PROTEIN T25D10.1 I 33 8 . 8 



Comments : 

EST confirmation of the predicted transcript: 

An isolated EST has equence identity to nucleotides 1 to 636 of CT1057: this 
corresponds to nucleotides 192639 to 193274 of the genomic reference sequence . 

TaqMan Primer/Probe Sets: 

5'start=1214 

5'stop=1236 

3'start-1281 

3'stop=1303 

5'primer^TCATCAATCATCTTGCATCACCT (residues 1214 to 1236 of SEQ ID NO: 58) 
Tm5=58. 13 

3'primer=CAGACCTGATACTATCCAGGCCA (residues 1281 to 1303 of SEQ ID NO: 58) 
Tm3=59. 10 

probe 1-AGGCTAGCAACGTTACAT (residues 1262 to 1279 of SEQ ID NO: 58) 

probe Is tar t=12 62 

probelStop=127 9 

direct ionl^Forward 

Tml=69. 01 

scorel^l - 98 

length=90 
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CT1058 

Nucleotide 

Genomic coordinates: 

Start: 209615 

Stop: 227849 {SEQ ID NO: 60) 
Amino Acid 

MDQYPEVRDTPQTEQEQAAAQQQOAATTTAAAAAAAAPTQYSNTVSAETLSAISEDGKLE 
RSIAASCWINNLNPDEKMAQRVQFHPLSSTTTYDSENVNPGSSVVFLKPRALPTGGTCLA 
PNYIAVPTLRAASEIIDSIASTSLYQCSMFNSWNLIPIFMSNSKHSQFGDRVIKRSMIRN 
CFSKQKNVENLLKELRRRKVNAAKAFSHAVQQKSAVNTALAAWNAGSAANLEKLVDFCKL 
KYSPDRKYKAGGLFSASATAQSQSGTSSSSVEHTSNDFLLDILKRHKGTSLDLDSATNTF 
DTALSRVFTEFKEQARAAVDAAADSDHLSASDPIFSIVRHNSRREGILDSVPNIGMLAPR 
SKYSVAEYLMADRDESADIAAKIGTKIATDFEALRGDNNKRRADTSVDDLKESLADSIEK 
TSIKNTGDINSVTNIPTDTEEYEFSLHITQLFAQAFLETMGSLLSCAFGVQFPFSDEGFA 
AIERIIRKTDPDTGKVSEMDPSSLSDQYLLLVGNFQVSPFHVSDPKDIVFGRQVTPNTPI 
LSIITRSKNDKNETSTIINFRDRLLVNDTVLRDATQNVSTSTPSQRRVPTAAGEPKKPML 
SGCLPIIRGPQVVTRESDDMISGLVGDWYISLGVYYAMGSSAAAIAAGHQRALASAESIN 
SPMMKKFSKKGGKYTEEEKRIKKT^RRNADRSARILALLGQTDAQYGYVEHNSTLDSFWS 
SNAAIRAKAKEDALSRAEILAVRKQLDGKCSSSRDEYSMVERYLRDSFFRSVNRSGGGYE 
MFDQGFDMGRFADFLSDNSAARNAWQQYAEVMRGLSKHEKRVFNIEGLFSALNSFKFPIiV 
PEQGRKKTVGGRHRLNNLKAANKIINGITEMTLQSAIDGTGISDIIGSVSDGWGNTTAQP 
SRVKALKTLSNFSGNGNWSIPVSRAVKCAAGSRGGETLKCVDIPSVIIANLISDKRILD 
QLCGGGMNLAHEITNFIETIAGKEHTGKESVFLSPRLSVILLRYIWFNAAWSLTDSNIK 
MPLNTMSEGTGDDTYRDYLAIRGMVNNYNSSLSSISVKAISDRYNCGSGNTSTSNKNVTI 
KTQGELLTVLQQTANALSAFTNKGGVGATPDAANMANVISPIANADVVKNTNVVVSGLDR 
ITETINFFSFLSQIKTMNENIEEYLRRYRLGEGLDKKELDNFVYPNIAAIVKRELGVSGS 
ALSSNLDTDRPITIDLNTEQPLIVKASKGYASNRYAKLFNKTTRTAAEQAQMEQYNAQMA 
ANTIPQLVNRLTIPGSITADTAINVVKAFTENGEFSNAETHLGVMGNAINEMQPLFTDGF 
NVANKRLTVNVGSVSKLIQNGLTVSLILAHSKASPYVFKPLVQDFAKLLLAVTAETSLVV 
SRSQKSFFPIPPSVFSSGGLFKIDREMFDNMKTDYVVEVIRQLSKNATAAIERCNDSDSA 
ARIAKSGEIYNKDVASTTAAPGTSSSALTLFANNLONPAKVWSMGALPHFDMAVVPKLHG 
ISHDQMFRLSTYYQGIHKMELNSDCKPEEWDNSLPGNRASKFFGLSSVSDNNRSFNLALD 
TLLASPAEICDLVTREMVKTSNDIVHNIGSNSNTDALQKSLQVGASAVEKYDESTLSTKE 
TDVYSLVSALAKSKSPLSSSSSLSSEGHLTSKEIDRTWNTPALLGTAKTTSYSVSEDALN 
APLSAVLDFRRNVVDATKSLYEVAAVCSVMSKEEDVRSSSRKIMGMVEQESPVMQDIGID 
RIASLVSTVATPKQHRRFLQTVNDYKNYLIRICVASNPLLSSRLGGISPTSGNTDYNLKAV 
YDGVVSSSSSMTPSSMSVSDRFWSGVFSQCLETGPSMFADAGHGGSNMFQITAPKLYGSR 
VNTYAALSSGVERLRDSISSATQERKNRIAKSIEALETFVTDVVGGDTLDQLRKAQNMYN 
KLSDITSNSIYSDFGNIDCAKIMKNVTSKKMTARQQSDTILSSLLHELAGLVHKQQPQLA 
TQFALASHVIKAKYVTNDLNNIHEKETFSQLMAVAGVADYYNVSAAAMCQRLVASDVTMF 
LGGTMLQQGLFVSFLLNNVLFSQVSDNIKMNELNDETKSLLVKLVGFCGTVSDALGSRHV 
SSIRRVQNEEDKKLDRSFVTSLYSAYRDLRKKTELYRETDTINKLFGHQNFMSYESSMLK 
RTSLVHDAVSGPRPRRYSTLEDVLEAPSTVHKSFMVSYPERAAASRRVKRAGLRALADNR 
MESLYGEEVLNDMRSSAVSSEMMDIEYGEGGFMMMISDDEDDIAFIDSEEESESSTDFSS 
SDEYSDSSDEYDFDDDNNGQSPYSTTSYSYDALDRLNSAAKPLTAIYGCRGEGEDDEEND 
LYEEEQERRRRRSSKMGKILRDLHESDDDDDDYFDDEFDGERSMSETIATRRAGRIQYGP 
GFLSHSNILNRPAKARAFLTRGKKFRPSAYDRFFMEDDDSLLFSDESTTSSSSSDSPFSS 
FSKGRKCKRRTSEDQCAFVKRVVRAFVPTRVTMINGRVSMITPVTSENTVGFYENYQKAN 
KRERARLIEEYKIVKGASATLPDEYVEGRASKQVSPRELRRSLIKAAAYVARTQESNLNI 
IFDALTTTSNATLVNDPSTLLGDTLLFAKQLEAITERRNRLMKDLTEISPSLFTSFGDAS 
KDTQMMADAKQIVSGGNFKSAGYLGVPLRTLASCIKGTNTVDRLLATKNKNHLEWMTTAA 
IVFARSFNDTTFHALEDTLKMTSALTDMYSAFTNLVGSEHSORLKVKSTLLDSIFNTRMA 
HTEAVMGLVYPTAFINHEMPS DYTQRREMQSLALN I LRGVNCSQLPRKDIGDTAGLLTFI 
TSRKFAGYGGERGGLSLYRMSIVDALSCPSDNRLKGAVSLEVGKWQDMGEEIFYKRSNDL 
VDFCSKNNISLENAVGPIARFVPNGTNMADIGMTDIISRTVKDDASMIRLRRAEEGAGAA 
GKFITASAMGNLYGGIDTVVNLTEKLYDSFVLLQDSDSFNTPTEMATAIINRMKSRKHKA 
LKTPFGGDIATYKNFPSSSEAIVVRAKEMRNSI5TIVMDISKSRGINSFSSRSGSTLAKI 
STSEFERILETSAVLSNTKANLRTIENRLAEHYNKLKQFSHISNDGLSETRAVVAVIAES 
LTPVYADDTSERGASVSELLTDNTLLKFIVQNELKNIEEAKRHVT7\AIEGSSQLHEKMLS 
LLVASADINRMSAQNNLECKKLTEGNSNFVPMTNDQGGTFIKHKETGIWLKTDEENNTSS 
IKDNDQRRVAKTILAIVEDNRNATIRSRLQSLCFGKYAMNDIFALDDADIKNMDKLIEKL 
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GEALAEKASPSSSAISSSSSSNTTSSSSSPSSSPSSSSSSFSMDYSNNLAKTIPYMPIVF 

QNKQSNVNSSDASSSSPSSSSSSSANIDNVEHKKVALQQLQTQESNDLSNVLSVTTKHRF 

ASHNQAATVGIFNGRQHAETVVAIPNANKANNNATVSAGQGILTRFSAPENVSSTSMQLP 

PSSSSSSNGDDNKVPVTVRLNQYANSILSSIENASEFKDLKEAERKIDLAIQAASTTETK 

EMVTVSKCPSANQTAITAISQAKSLKKSALELLERVIKAVEVYTPDSSIAAVSLPVNGDS 

tyiVSSSSGSGSAPSSSSSSSSSSSSSNVTDYFNYAYGKLKNIDENTEEGAETVQKNMVEQD 

AAVRIPLLVSYAPFSEMMRRAIDKLNEYYQLIDAIKTKIVSDTKQASSWAIKETDKELDM 

DKEQVISKINNLQQNFSNESDKIKMAISVLDWKRNELELQNNKTRSFIETTKSRIEAGGG 

DVANFKEIIDYENTSENDNNLFQSLKAFAADNSGTVYTPTDMSNGRDTKSDSKFVDMYNK 

QILEGGIKLINEGQNTVKVDFSKALEAFPRQSNGASEPVSSSVVERRQRERLQAVEMFMA 

IMMERTESLRKRLADSAAQWNTVNNVEETVNSGMVNIKSERLTEIRNQAQIAESTALNSI 

NDEIVESPLTLSLGARVDQLLIKVDRVGSIQQQQQQQQQQQQLPKLTATEQRKEQQYAAD 

RVVYDPSYTCFLQPLHETIKRISSVYNSKNKGPLSNTRGVPTSDADLQLMTITDLSRSVL 

DSSSTSSKKMLYENVPSSIVPGLCQQCAMMITNVHEATHTSPHSFNFENKRSLKQLTEML 

NAATSSSDGPAVRHDVLTMLESNNGYVKDFGFTHRQKVACITPVNTLLGGTFSGNVAPNT 

VILPTSELFNCPGVENDKFRSMVNRTTDKNVADAPKSSASIVETLARTSPNAEHLYFPFK 

DQRRHFNSITDAIISGMSGESSSQLNTTCDQNLVNIDQTTGFPVFTGRKQGERRIVHTEN 

TMEGARKDKNSGIPSCTKDRQTYIDMGTKFMVAPGSLLNANKEETLRLNRLSDINNVRHY 

GTDVHVAGANSAWRIGEVVRAASSFPDGDKESAMKKMLLLGSVSAISAQKSASHINDPTA 

LLSTNTSIQNLVKEAFPDPVCSSNYLGSAESTFATQLAYRQRLFPNGDDENVTTVSNICP 

MDLMGSTKR YN DAFNN I FGSKMTSTNKKGSNCENLLKS AMSN V PAI NTAFGAFEEAS SS V 

RNRLSPLYEDSTKYSSNQLAVQAMTDTAVDALSAVSTVVGRQNGRNTLLSLPTSITSIAT 

SGRPSLSYSSDMKSNLIKTISRINRDASLLSMGDSQVAAGSSFFNSFLRSSSIPVTTSQD 

GNVAAAEIVLGTILDKTVEINKRFEMLGGGKMVAGSPEARAIORNTMSSILQMNENELAR 

DLCEIENKIETRQLRDAFQDLKRSMLMTPGGVGAISSGASTNNVPLSLLMSRVDASSGLL 

MNNNSANVMEAVDSFNTTPLLVRHMMLDSGKSPVPMAKEIRSMLTQPRALTARALLSESS 

PLLTEICLYNTRDTQPERAVDRLLTSAYLVKQAKRFDGVDPAFPAALTCASHLMLSSMDS 

HTKSSFMDNIKLHMTDTQCFFKNIERFEKFLGRYGDEYAMSHKQNCNCPFHLHHTFTPSD 

NEHLVSSFAFARPEVSMEEIRATPYQANKLISDKHYVMNMSKIDSRVTGSSLLKKVSEWT 

EMRMNSNFNGTFEPSRLALSNSGMTTAGVNLDVIVKPNNARSVLGILECHRQHVCTADAK 

GTVASAMPAVFQATDGNGNESELIQNALPRNRYIQKSTMNAQTVVFANVLEQLIADLGKV 

IVNELAGTIAESVPESVYENTKEMIDRLGSDDLFKSNNNGGVESMDYEDSETTSNNGPVL 

ISEAMKNAVYHTLISGKAARPENVPFASCASGPLAFDFLLSKGDTFEEKNAEOGAAAAVS 

STYSSSSNTTLRKHLARVFEAISKQVTDAEFKDILNDIERNISSDYTNCPPNTNQNAFAL 

AIKREFSRIVSFLTILRKNITPALVDPKGALHEKVAIYLTLLSTKSKLENFFQYGLSNSS 

SVDLSHLKPINCSNNVKNIEDTFMYRNVHPILIMALPENFTALLQQEQMDPDTAIESRRS 

LTTFLNHPNTASMANGARAAVGAGGGNPMGLYLSSHILHESTVTTSNPVTDTTENVNYHS 

SVTODPVMVVNPFKDSARLIVNNNNTGIDVLNDKSCNYLQVSMPSESSGLVTNTGCSSSS 

SSSSSDTFKYVRRDNTPVNIiPRVTPAVLCSDASSNLLDVFSRADIVLENMNVRFGFMPEI 

lAAVSKFKGLTKEEVIKQMVSQNNINNNSNNNNGNGKKTTVDPVTGDIVITNATFPDTRP 

LYTAANGGTSSFKWGDINDRKMHAKAFPTFFIGNPTAAATANGVPLTSEGISLTEEKRKK 

I AG I SEGS I GTGALRAAANTRLSSDMEP VMKGWNN I VQLQQT FKKAS DKL.TH LLRSGGI P 

PRSQETNAIINKMHDSFKTLEECRRVIQDEAALLVATSDLLTGGYGGDAALAMVSPVRPE 

MTGLIGAISAPVRGISHLLKLGGVSAANAAIRKRLNLPTSNGKTLPEHGIVHKSAKTLLL 

DSDSISNLYNTDLQDVVSNARDNNNLGRIMQSLGLKGNNAGDLVYSARQLTDLITVPEYG 

NNRDLTKRQAILKMLISNPEILENVADTIYLTTGKNALAPVSAQEMACASLTVGGSGGGK 

LSSDDNVQSLNRLYFRV 

(SEO ID NO: 61) 



Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 



SR4 0_YEAST (P32583) SUPPRESSOR PROTEIN SRP40 80 5e-13 

094317 (094317) SERINE-RICH PROTEIN 80 5e-13 

095815 {095815) DENTIN PHOSPHORYN (FRAGMENT) 80 6e-13 

YM96_YEAST (Q04893) HYPOTHETICAL 113.1 KD PROTEIN IN PRE5-F 76 9e-12 

AGA1_YEAST (P3232 3) A-AGGLUTININ ATTACHMENT SUBUNIT PRECURS 73 8e-ll 

Q9Y076 (Q9Y076) PROTEOPHOSPHOGLYCAN PRECURSOR (FRAGMENT) 71 3e-10 
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Comments : 

EST confirmation of the predicted transcript: 

An isolated EST has equence identity to nucleotides 1 to 704 of CT1058: this 
corresponds to nucleotides 2272 4 0 to 22794 3 of the genomic reference sequence. 

TaqMan Primer/ Probe Sets: 

5*start=8829 

5 ' stop==8848 

3*start=8894 

3*stop=8917 

5'primer=CATTACAGCCTCAGCCATGG (residues 8829 to 8848 of SEQ ID NO: 60) 
Tm5-57.7 8 

3'primer=GCAGAACGAACGAGTCGTATAGTT (residues 8894 to 8917 of SEQ ID NO: 60) 
Tm3=57 . 7 6 

probe 1=GAGGTATTGATACCGTTG (residues 8861 to 8878 of SEQ ID NO: 60) 

probelstart=8861 

probelStop=887 8 

direct ion l=Reverse 

Tml=69.00 

scorel=l . 99 

length=89 
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CT1059 

Nucleotide 

Genomic coordinates : 

Start: 228374 

Stop: 230564 (SEQ ID NO: 62) 



Amino Acid 

MDKVCVISNTRERTFKVPADLLCVATEPEISTKEEDAGIEIETRVVVFSRCVSVQELHTI 
NPNDEGFSVQLFKDYLKLOSAQGKKPIGLYIQIKAGEDLERRLISGGTAYLDPATHLFYL 
DFSLYPNYS I FNDISSRLKI I DEDTYNGVVFSNSEEKEKDALVLIRVTFSTHEKAIEAAI 
KKIMLRKVFFKDGDLDFGYLRIPKSKLDKFTPYFRSQYGIVNVEKNIPGYIWGEIMKQRV 
RCSRWYLYNTDSEWEYKNVAEERVGPRQLVKKYGAKCENLCFRDIDLRKKEAKEKRDIER 
ETESRYVVVTLTHKHEMPENMPYFGPKCSVVRLDETRILLCFVDEISYNDEDVDEILSEN 
RSLRNVSIRHKENVPVHTLLKKGVSIHARFTLNGLDDALIILKRIPKTYFEDEELQAACA 
HVNLEQYEWLCSNNRGNKVEHVKSRVVTRAVKRRRKCRHWIYFDKDTLNLNYKYFDKKVT 
ASMASKICNAKHDCLVFHRKMELEDLTESAYFKVEPSPINFAKLKSCPDVKYVQKKTDGT 
FSVIRFFRNMTKGDLIQRMDLFCRFIPDSHTITLLSRADFYACKRGESMHMCTNKHRILH 
YKFSNAPHAAIEQITNIISDTRGRKGIHIEYAIENVQEMYEEDGRRYEAKYTGTLTEYKR 
NEDKTFKSLLAPHLTPVNKPYNINHLYEQYGNFDEELEDKLRSGFISYDTYVTAKDNWGR 
CATGKGACI 
(SEQ ID NO: 63) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

Y061_CAEEL {P34 600) HYPOTHETICAL 84,7 KD PROTEIN ZK1098.1 I 36 0.95 

Q9ZAJ8 (Q9ZAJ8) BONT PROTEIN 36 1.2 

Q01794 (Q01794) MAJOR SURFACE ANTIGEN MSGl (FRAGMENT) 36 1.2 

Q9X7 08 (Q9X708) BOTULINUM NEUROTOXIN TYPE B (FRAGMENT) 35 1.6 

GLND_HAEIN (P43919) [ PROTEIN-PII ] URIDYLYLTRANSFERASE (EC 2 35 2.1 

BXB_CLOBO (P1084 4) BOTULINUM NEUROTOXIN TYPE B PRECURSOR (E 35 2 . 1 



FIGURE 3, Sheet 37 of 160 



wo 01/38351 



79/201 



PCT/USOO/28888 



CT1060 

Nucleotide 

Genomic coordinates : 

Start: 230616 

Stop: 231582 (SEQ ID NO: 64) 



Amino Acid 

MCTLKTYKMTTSTEISKNLSDVLSIKATGDWCSNIKTVFSPFTEGKGNLPNSLPFTRSPN 

TTCGSREAANATEHFITVFAKDKYERKRVKRTIGFTLDNTKELTPNRYLVADVYSWQEEK 

MVFEGFCVPPGKSGTFVRYSNEDKSFLLADTGRYMKKKYDDPENKTSSGGDDDDDDDDDD 

DDNNNVDVYEENDPRNVFEVEKDEKYACTFSILVYRAMKKSPPVCRGLLVETDGPSSHPK 

RAPSAFNPFGGSSMLNGYGAGADALEEEDEVDGVPERERITNFALKRGPATGQNFVSVKL 

EHDGSKADLYNVTCFSKQRGV 

(SEQ ID NO: 65) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

097300 (097300) PFC1035W PROTEIN 51 ie-05 

KEX1_YEAST (P09620) CARBOXYPEPTI DASE KEXl PRECURSOR (EC 3.4 47 2e-04 

Q83970 (Q83970) (CPV) 46 3e-04 

SIS2_YEAST (P36024) SIS2 PROTEIN ( HALOTOLERANCE PROTEIN HAL 46 3e-04 

YBOO_YEAST (P38114) PUTATIVE 126.9 KD TRANSCRIPTIONAL REGUL 45 6e-04 

077384 (077384) PFC0760C PROTEIN 45 8e-04 



Comments : 



TaqMan Primer/Probe Sets: 

5'start=477 

5 • stop=500 

3'start=555 

3'stop=57 8 

5 'primer-CGATGATCCAGAAAATAAGACCAG (residues 477 to 500 of SEQ ID NO: 64) 
Tin5=57.85 

3'primer=TCGTTTTCTTCATACACGTCAACA (residues 555 to 57 8 of SEQ ID NO: 64) 
Tm3-58 .34 

probel=ATGACGATGACGACGATG (residues 515 to 532 of SEQ ID NO: 64) 

probelstart=515 

probe IS top=532 

direct! on l=Forward 

Tml=69.04 

scorel=l . 95 

length==102 
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CT1061 

Nucleotide 

Genomic coordinates: 

Start: 231602 

Stop: 232799(SEQ ID NO: 66) 
Amino Acid 

MQLILSHHLTMAGRVELVTGPMFAGKSTYLKNIYQQENGGNKHCLFVKHSLETRYGCGTG 
TIVTHAGEVIEGCTTVSSIKELISVLPEVVDVILIDEGQFFTDLVLVNRLADKGKRIVIA 
ALDGTSDQQMFSPIHKLLPYTNSIVKLASKCMICKIDTKEAPFTVRFGNDNDNNVICVGG 
AEMYAAACRDCYKKINKKKNKGKLVVLEGGDRCGKSTQAKLLLTNKNSPLYGGEYMCFPD 
RSSHTGKLINDYLTKKIELDDHAAHLLFSANRWEVCSKIKQLLDDGIHVVMDRYYYSGIV 
FSLARGVDTVEWCSASDEGLPQPDLVLLMLLDVEKCSNRDTFGVERFETNSIQERARALF 
LDLANKDEKNVWIKVDARGTIEEVQTKIINIVYNIVEE 
(SEQ ID NO: 68) 

Top Blast Hits 



Sequences producing significant alignments: 



Score E 
(bits) Value 



074528 (074528) THYMI DYLATE KINASE 



KTHY_HUMAN (P23919) THYMIDYLATE KINASE (EC 2.7.4.9) ( DTMP K 

KTHY^MOUSE {P97930) THYMIDYLATE KINASE (EC 2.7.4.9) (DTMP K 

KTHY_CAEEL (Q22018) PROBABLE THYMIDYLATE KINASE (EC 2.7.4.9 

KTHY_YEAST (P00572) THYMIDYLATE KINASE (EC 2.7.4.9) (DTMP K 

KTHY_SCHPO (P36590) THYMIDYLATE KINASE (EC 2.7,4.9) (DTMP K 



165 
159 
157 
148 
143 
141 



6e-40 
3e-38 
le-37 
8e-35 
2e-33 
6e-33 
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CT1062 

Nucleotide 

Genomic coordinates: 

Start: 232848 

Stop: 233334 (SEQ ID NO: 68) 



Amino Acid 

MLPRKTLPDTENGYFVLDESLLEKVYYDNNNELIVRVGGIYMQICKSKYIFHHDDPERFF 
YSVLEDYHPIKEIVERLAEEDGVFLGPWEFLSRKQVNLQHGCYKALLSLPEDKYCNLLLP 
QQMKTNLEKMEEIQRTRLIHSRTYNTPQIELSDQLDGCVIC 
(SEQ ID NO: 69) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Val 

O60678 (O60678) PROTEIN ARGININE N-METHYLTRANSFERASE 3 ( FRA 31 5.4 

P95966 (P95966) ORF C04027 30 9.3 

Q9ZW94 (09ZW94) F5A8 . 4 PROTEIN 30 9.3 

058387 (058387) 358AA LONG HYPOTHETICAL PROTEIN 30 9.3 

TaqMan Primer/Probe Sets: 

5*start=223 

5'stop=245 

3'start=309 

3 ' stop=330 

5' primer =GAACGACTAGCAGAAGAGGATGG (residues 223 to 245 of SEQ ID NO: 68) 
Tm5-58.17 

3 ■priiner=TGGCAATGACAAAAGAGCTTTG (residues 309 to 330 of SEQ ID NO: 68) 
Tm3-58.99 

probe 1-AAGTGAACCTCCAACACG (residues 284 to 301 of SEQ ID NO: 68) 

probelstart=284 

probelStop=301 

direction 1 ^Forward 

Tml=68. 99 

scorel=l . 96 

length=108 
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CT1063 

Nucleotide 

Genomic coordinates : 

Start: 236678 

Stop: 238604 {SEQ ID NO:70) 



Amino Acid 

MVASTPCPGPGPVPTQELLSTNFLEAHKLVVELLLPSYSSDVVYCDSETYTKPIPIFGNK 

SIVSTIGDYVLSNPNEDVSYQMVSSVLEKFPLLFHCTYKrNEEDKGIPLWKKLYNKRKFK 

LLNSLLVHNNKNWTPVPAIPFDRENICDASGRSVLMSEIMSTSTFQTICKNNTHYLFDML 

NMERGKQGGSFLHFFASRKNSFTNFENEEMDSHVLSNIAKFICNEKEKLDSFIPANGKIP 

CPDKTNDEGYIPLEIAIMEDNYPALLYLVCRYGASWANTYGDHNESLKAFAIRNDAKDCL 

EIIEFISDHYSFNKNVTKEEFVKEKTVECVGCLYDIEDEKRCYKLPCGHFMHTFCLSNKC 

SKAN FRCVKC FQTFDDTI FRKCPPT I QWKMG I NQTTNHKEMDLFNRAFDTYLDFICS YN V 

KLDKKSKPKHKPENKKVEEELAKRTAEIEEAIKKKEEELAKRTAEIEEAIKKKEEELAKR 

TAEIEEAMKKKEEEELSKYNKIIEKGKRRLNEECVKLRDISTAAINMYKEKVRINGVLLK 

DSDQELAEAKERLRKILLLEEETKLDRFLFRPKRVEERIFLTKDDETLAFKLALEKKTED 

1 1 AKKNNQKG SERRDGEYTITSHIEKLPQS T AL AS VC VLNE 

(SEQ ID NO: 72) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Valu« 

Q26938 (Q26938) KINETOPLAST-ASSOCIATED PROTEIN (KAP) 74 3e-12 

Q9ZIU2 (Q9ZIU2) VIRULENT STRAIN ASSOCIATED LIPOPROTEIN 73 5e-12 

050870 (050870) HYPOTHETICAL 54.3 KD PROTEIN 73 5e-12 

Q9ZU69 (Q92U69) PUTATIVE VICILIN STORAGE PROTEIN (GLOBULIN- 68 2e-10 

023230 (023230) TRICHOHYALIN LIKE PROTEIN 62 le-08 

MNN4_YEAST (P36044) MNN4 PROTEIN 62 le-08 



Comments : 

EST confirmation of the predicted transcript: 

An isolated EST has equence identity to nucleotides 362 to 1 of CT1063: this 
corresponds to nucleotides 238163 to 238524 of the genomic reference sequence. 

TaqMan Primer/Probe Sets: 

5*start=775 

5*stop=797 

3'start=850 

3'stop=872 

5 *primer=GAAGAC7VATTACCCTGCATTGCT (residues 775 to 7 97 of SEQ ID NO: 7 0) 
Tm5=58.33 

3'prxmer=GCAAACGCTTTGAGAGATTCATT (residues 850 to 872 of SEQ ID NO: 70) 
Tm3=58 .56 

probe 1=TAGGTATGGAGCATCTTG (residues 810 to 827 of SEQ ID NO: 70) 

probelstart=810 

probelStop^827 

direction l=Forward 

Tml=68.95 

scorel=l . 95 

length=98 
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CT1064 

Nucleotide 

Genomic coordinates : 

Start: 238658 

Stop: 239438 (SEQ ID NO: 72) 



Amino Acid 

MSTCSNLLSVFGGGDWTTTFPFDLVHTRQECDKKREQDYSFFITETCKGENIGIHSYEHT 
SKIIDTGNNDSTSIEELEVLNIYKAINHLENILKLNKGEKIILMDVETMILETHKILMKG 
ILPKGPCNGSFSTCVRFAVNKNNERHYYPVFETEKEAFNSIQNLVDYYNEIVAHTNDQIKI 
IKACAYFMYNFLTLHPFNDGNGRTARLLYSFLLKGNGIVPHFSPITHPRDQFVDTLVYFR 
EHGDGRPLLYVLLES IKNK 
(SEQ ID NO: 73) 

Top Blast Hits 

Sequences producing significant alignments: 
Q23544 (Q23544) 2K593.8 PROTEIN 

Q9ZHQ9 (Q9ZHQ9) HYPOTHETICAL 26.1 KD PROTEIN (FRAGMENT) 

068899 (068899) HYPOTHETICAL 29.5 KD PROTEIN 

Q48249 (Q48249) PLASMID PHPM180, COMPLETE SEQUENCE 

075406 (075406) HUNTINGTIN INTERACTING PROTEIN HYPE { FRAGME 

AAC96089 (AAC96089) HYPOTHETICAL 23.9 KD PROTEIN (FRAGMENT) 



Score 
(bits) 

50 
44 
43 
43 
42 
42 



E 

Value 

2e-05 
0-001 
0.002 
0.002 
O.O04 
0.004 



Coituments : 

EST confirmation of the predicted transcript: 

Nucleotides 1 to 677 of CT1064 : this corresponds to nucleotides 238717 to 239393 
of the genomic reference sequence - 



TaqMan Primer/Probe Sets: 

5'start=397 

5*3top=419 

3'start=450 

3 'stop=472 

5 'primer^^TGCGTACGCTTTGCTGTAAATAA (residues 397 to 419 of SEQ ID NO: 72) 
Tm5=58. 67 

3'primer=TGAACGCTTCTTTCTCTGTTTCA (residues 450 to 472 of SEQ ID NO: 72) 
Tm3=57. 92 

probe 1=AATGAACGGCATTACTAC (residues 424 to 441 of SEQ ID NO: 72) 

probelstart=424 

probelStop=441 

direct ionl=Forward 

Tml=68.96 

scorel=l . 88 

length=76 
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CT1065 

Nucleotide 

Genomic coordinates: 

Start: 240712 

Stop: 241192 (SEQ ID NO: 74) 
Amino Acid 

M E DLKST I ERV YEERVENLEQWTNTVEEEERT VS AI DS VLEEQKRAL DAWEAA I KEREN D 
LAVKEGISALVFNAADAKTRKELINTWIAERETSEKRRKEATSTNNQLKNQMSSLVNTTK 
TLKEK YNKYYRRS AI LNMQY I NNKRDYEASQFWV YTNNA 
{SEQ ID NO: 75) 



Top Blast Hits 





Score 




E 


Sequences producing significant alignments: 


(bits) 


Value 


RADI^HUMAN tP35241) RADIXIN 


43 


0 


.001 


RADI_PIG (P26044) RADIXIN (MOESIN B) 


43 


0 


. 001 


040947 (040947) QRF 73 


42 


0 


.002 


SSP5_STRGN CP16952) AGGLUTININ RECEPTOR PRECURSOR 


41 


0 


.005 


Q9Y489 (Q9Y489) CENTRIOLE ASSOCIATED PROTEIN CEPllO 


41 


0 


.005 


Q21952 (Q21952) SIMILAR TO MYOSIN HEAVY CHAIN 


41 


0 


.005 



Comments : 



TaqMan Primer/Probe Sets: 

5*start=228 

5'stop=24 9 

3'start=297 

3*stop=320 

5'primer=CGCCAAAACACGTAAAGAATTG (residues 228 to 249 of SEQ ID NO:74) 
Tm5^58.34 

3'primer-TGATTATTGGTAGAGGTTGCTTCC (residues 297 to 320 of SEQ ID NO:74) 
Tm3=57.45 

probe 1=AATACGTGGATAGCCGAA (residues 253 to 270 of SEQ ID NO: 74) 

probe 1 s tar t=2 53 

probelStop=270 

directionl -Reverse 

Tml=68.84 

scorel-1 . 84 

iength=93 
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CT1066 

Nucleotide 

Genomic coordinates: 

Start: 241184 

Stop: 241505 (SEQ ID NO: 76) 



Amino Acid 

MHKFSNKFYFIIKGVLIIIFVPDVVFSIFLLPPLGVRHKNGGGGNEEQKSGPSQKHHIPG 
PVLIFVLIIVIVGSVVIIIGVLISVRIAVLLWSHPYIHDGQDEDTN 
(SEQ ID NO: 77) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

Q19507 (Q19507) F17A2.11 PROTEIN 33 0.78 

Q37367 (Q37367) NADH DEHYDROGENASE SUBUNIT 1 (FRAGMENT) 29 8,9 

PSC_DROME (P35820) POSTERIOR SEX COMBS PROTEIN 29 8.9 

YG35_YEAST (P53273) HYPOTHETICAL 117.0 KD PROTEIN IN ASN2-P 29 8.9 

AAF05147 {AAF05147) ORF33 29 8.9 



Comments : 

EST confirmation of the predicted transcript: 

An isolated EST has equence identity to nucleotides 452 to 1 of CT1066: this 
corresponds to nucleotides 241246 to 241697 of the genomic reference sequence. 
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CT1067 

Nucleotide 

Genomic coordinates: 

Start: 241774 

Stop: 243409(SEQ ID NO: 78) 
Amino Acid 

MFRQFCSLYLLQRRVNDNLRSTASASAAASLKGDGTEFITGEPPSHKMRGPSYSVLGPDP 
CEDPERVYV DI VVS I LQTNN I QVTKEWELFS DKLRKLGPW I DRSGIENNGEGEEDGDENE 
DGGGNGGRIEDREAHRRKMMKKLSFVGREDPVAVDLPTWRENSTEFARRLTLKELCDLIV 
ECGCIKSKEELFDFIFEEPWEIKEAADVRGMANRSKFTKESLIDWFFEFDTYSKCVVFFE 
AVNWYLKSQASPISLVLDDIYCCVFSYIRRQTFLTRAKNPSLTVASSFSPTPDTKLLAID 
ECVQHFLKSDINISQMALTERDCFFPLLTEMPRQQKKVNTFLDTMKRPTLSLLPSTSSSS 
SSNNKRKRNTAAANILLPVYRSNFSTASNNKRLKTDDGENASACILIEGYANGKISPIRI 
MVRKSTIIPEVFNHLLFPVFASKDTGANILFFIKMKSFASASLLLPGLFRHPKQFLNGPC 
KWMTLAENNINDNNINSSTMWSYTLADYCPLGYYTQESPQPYQTCGNFTSTTNKRLQNVQ 
PLYF 

(SEQ ID NO: 79) 



Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 



CAB38842 (CAB38842) HYPOTHETICAL 35.2 KD PROTEIN 




39 


0, 


.061 


CAB52581 {CAB52581) CONSERVED HYPOTHETICAL TBC DOMAIN 


PROTE 


39 


0. 


. 10 


084402 (O84402) RIBONUCLEASE FAMILY 




37 


0. 


.31 


Q59085 (Q59085) RNA POLYMERASE SIGMA-54 FACTOR 




37 


0, 


.31 


080815 (080815) T8F5.22 PROTEIN 




35 


1. 


.2 


PI4K_DICDI (P54 677) PHOSPHATIDYLINOSITOL 4-KINASE (EC 


2.7.1 


35 


1 . 


.6 



Comments : 



TaqMan Primer/Probe Sets: 

5' start=717 

5 • stop=740 

3'start=791 

3*stop=814 

5* primer =TGAAGCAGTCAACTGGTACTTGAA (residues 717 to 74 0 of SEQ I D NO : 78) 
Tm5-57.74 

3'primer==TTTGGCGTCTTATGTAGGAAAAGA (residues 791 to 814 of SEQ ID NO: 78) 
Tm3=58,08 

probe 1-ATCTC7VAGCGTCTCCAAT (residues 741 to 758 of SEQ ID NO: 78) 

probelstart=74 1 

probelStop=758 

di r ec t ion 1= For ward 

Tml=68. 99 

scorel=l .99 

length=98 
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CT1068 
Nucleotide 

Genomic coordinates : 
Start: 243216 

Stop: 243798 (SEQ ID NO: 80) 



Amino Acid 

MDDSSRKQHQRQQHKLFHDVELHASRLLSSGLLHPREPSTLSDMRQFYFDYKQETTKRAA 
IILLNTLLEYYRTPSEEWEIPFNLLLNVMNNKWSTLIPGVKISAGIISKLPWTMKTMYEI 
VSSPNNNNNNGDYYSTCRRMVMEYPIGGLLHTPAITNKYPRSRMVTCTKGKDHQKLYDIS 
RQMFDIIEANGQL 
(SEQ ID NO: 81) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

P3K3_DICDI (P54675) PHOSPHATIDYLINOSITOL 3-KINASE 3 (EC 2.7 32 3 . 0 

001590 (001590) K09H11.1 PROTEIN 31 6,7 

RRPL_DUGBV (Q66431) RNA- DIRECTED RNA POLYMERASE (EC 2,7.7.4 31 8.8 

Comments : 



TaqMan Primer/Probe Sets: 

5'start=234 

5'stop=257 

3'start=308 

3'stop=331 

5'primer=GGAAATTCCGTTTAATCTCTTGCT (residues 234 to 257 of SEQ ID NO: 80) 
Tm5=57 . 99 

3 'primer=GGAGTTTCGATATGATACCTGCAC (residues 308 to 331 of SEQ ID NO: 80) 
Tm3=57.75 

probe 1=GAGTACACTCATTCCAGG (residues 279 to 296 of SEQ ID NO: 80) 

probelstart*=279 

probe lStop=2 96 

direction l=Reverse 

Tml=68. 82 

scorel=l . 82 

iength=98 
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CT1069 

Nucleotide 

Genomic coordinates: 

Start: 244241 

Stop: 244856(SEQ ID NO: 82) 
Amino Acid 

MDLSFTLSVVSAILAITAVIAVFIVIFRYHNTVTKTIETHTDNIETNMDENLRIPVTAEV 

GSGYFKMTDVSFDSDTLGKIKIRNGKSDAQMKEEDADLVITPVEGRALEVTVGQNLTFEG 

TFKVWNNTSRKINITGMQMVPKINPSKAFVGSSNTSSFTPVSIDEDEVGTFVCGTTFGAP 

lAATAGGNLFDMYVHVTYSGTETE 

(SEQ ID NO: 83) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

085179 (085179) FLAGELLIN A 
FLAA_CAMJE (P22251) FLAGELLIN A 
BAA83944 (BAA83944) UNKNOWN 
FLAB_CAMJE (P222 52) FLAGELLIN B 
AAC25644 (AAC25644) FLAGELLIN A 
AAC25648 (AAC25648) FLAGELLIN A 

Comments : 

EST confirmation of the predicted transcript and hits to pblic SBV sequences: 
Nucleotides 1 to 803 of CT1069: this corresponds to nucleotides 244132 to 244934 
of the genomic reference sequence. 

Hit to public sequence gi I 68561 62 ( gb t AF173993 . 1 : CT nucleotides 1 to 615 match 
nucleotides 323 to 937 of the public sequence with a 100% homology, a score of 
1219 and an Evalue of 0. 



34 0.60 

34 0.60 

34 1.0 

33 1.8 

33 1.8 

33 1.8 
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CT500 

Nucleotide 

Genomic coordinates: 
Start: 2425 

Stop: 1537 (SEQ ID NO: 84) 
Amino Acid 

MKNSRQRSGVWRGNSCLYKSFYFSGAIIECKKIRIIMMFLLLSLILFVCFVGVVVGVIFM 
SRPNKTTTTSNKKTKKDKEKEKEDDTEGAVLGRREPENRPIGRDEEGAVEDGKEEEEVFE 
FEQPSVNTGSNTGGGGTGTVPGEGLLPPPPPTPTPTPPPTPTPTPPPPPTRTPSPSSSLG 
EDDDDDIDIDFDDNDIDEFLDSGEEMEEDEEEEDLDTLLSRLETGMSGEEVDFDASSAYI 
QPDPVVVKNIERSDYTLDPMESWKVLNRSEGDIRFFVDRGITNKIKAMTEDLKEL 
(SEQ ID NO: 85) 

Top Blast Hits 

Sequences producing significant alignments: 

Q48373 (Q48373) CHITINASE PRECURSOR 
086476 (086476) CLUMPING FACTOR B PRECURSOR 
092451 (092451) ACMNPV ORF91 

Y091_NPVOP (010341) HYPOTHETICAL 29.3 KD PROTEIN (ORF92) 
Q69023 (Q69023) (B95-8 ISOLATE) U2-IR2 DOMAIN ENCODING NUCL 
Q42421 (Q42421) CHITINASE PRECURSOR 

Comments : 

EST confirmation of the predicted transcript: 
An isolated EST has equence identity to nucleotides 24 4 to 513 of CT500: thi 
corresponds to nucleotides 1979 to 2248 of the genomic reference sequence. 

TaqMan Primer/Probe Sets: 

5 • start=419 

5 ' stop=440 

3'start=505 

3 •stop=523 

5 *primer=TGCCTGGAGAAGGTTTGTTACC (residues 419 to 440 of SEQ ID NO' 84) 
Tm5=59. 11 

3'primer=GAGATGGGGTTCGTGTCGG (residues 505 to 523 of SEQ ID NO: 84) 
Tm3=5 9. 88 

probe l=CCTCCTACTCCTACTCCT (residues 448 to 465 of SEQ ID NO: 84) 

probelstart=4 4 8 

probe lStop=4 65 

dx rectionl=Forward 

Tml=69.04 

scorel=l . 95 

length=105 



Score E 
(bits) Value 

64 2e-09 

59 4e-08 

58 9e-08 

58 9e-08 

57 2e-07 

57 2e-07 
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CT501 

Nucleotide 

Genomic coordinates: 

Start: 7645 

Stop: 7042 (SEQ ID NO: 86) 
Amino Acid 

MTMWNKTVITTKRMNWPMVVGVFFILAITALAVALYIRHASKQEKYSTSHINEQFTAKQL 
PVTYLSKTGKLKDMHLTHSDFMAYVDVHNRTKTLKHPMCTDEAGWAHFCLLASAEAYRRI 
RYGRGEFGPEKHSLAETIQSTVQDMSEPYITHIFKKNTDVDGHGMQSVLEKNRNKIRMGD 
GKTSSETYNLSDKS I SI VGV 
(SEQ ID NO: 87) 

Top Blast Hits 



Comments : 

EST confirmation of the predicted transcript: 

An isolated EST has equence identity to nucleotides 587 to 1 of CT501 : this 
corresponds to nucleotides 6996 to 7582 of the genomic reference sequence . 

TaqMan Primer/Probe Sets: 

5'start=245 

5 * stop=266 

3*start=315 

3*stop-332 

5 •primer=TGGCATATGTTGATGTGCACAA (residues 24 5 to 266 of SEQ ID NO: 86) 
Tm5=S9. 44 

3 'primer-AGCAGGCAAAAGTGGGCC (residues 315 to 332 of SEQ ID NO: 86) 
Tm3=5 9.78 

probe 1=ATGTGTACTGACGAGGCT (residues 292 to 309 of SEQ ID NO: 86) 

probelstart=292 

probelStop=309 

direction l=Reverse 

Tml=68. 96 

scorel=l . 96 

length=88 



Sequences producing significant alignments; 



Score E 
(bits) Value 



Q21859 (Q21859) R09D1 . 3 PROTEIN 
Q9Y3S0 (Q9Y3S0) EMDC II PROTEIN 

AAD25099 (AAD25099) METALLOPROTEASE DISINTEGRIN CYSTEINE-RI 
031548 (031548) YFJL PROTEIN (RIBOSOMAL PROTEIN L6-HKE PRO 
Q4 4 602 (Q4 4 602) PHOSPHORIBOSYL ANTHRANILATE TRANSFERASE 
CAB52230 (CAB52230) HYPOTHETICAL 33.8 KD PROTEIN 



34 
32 
32 
31 
31 
31 



1 . 0 
4.0 
4.0 
5.3 
6.9 
6.9 
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CT502 

Nucleotide 

Genomic coordinates: 

Start: 8502 

Stop: 7641 (SEQ ID NO: 88) 
Amino Acid 

MSSGSINNHPSSNMDTNKMEEGEEQDFDVLELDYSKIIHDITAMLSVAAPPPNSILDASD 
GLIATASATAPAAETGNSNRMRLDKDVCQLIERDIELVKSDTIEVDSIIRQLLYFGESAS 
EKNIKTNSTEKEPVYFPKEPKGEAVKLAKNTPVLDTITKLDWMANICQSNKIGVENLASA 
LQSGOLIWTTFPAAVYASLDSFYHIAIMWKLLGSFINIEALSKGSKDNLLPRDDIQVVHA 
KQEIAAMLQSRQNILGRGPSEYPPVPITAILSRTIIPLLRNFSEKL 
(SEQ ID NO: 89) 

Top Blast Hits 





Score 




E 


Sequences producing significant alignments: 


(bits) 


Value 


043631 (043531) SPINDLE POLE BODY PROTEIN SPC98 HOMOLOG 


38 


0 


.063 


060853 (O60853) PROTEIN ENCODED BY SACCHAROMYCES CEREVISIAE 


38 


0 


.063 


O60852 (060852) PROTEIN ENCODED BY SACCHAROMYCES CEREVISIAE 


38 


0 


.063 


AMPR_HUMAN (P15514) AMPHIREGULIN PRECURSOR (AR) (COLORECTUM 


34 


1 


.2 


BYN_DROME (P55965) T-RELATED PROTEIN (TRP) ( BRACHYENTERON P 


33 


2 


. 1 


AAB32396 (AAB32396) T-RELATED PROTEIN 


33 


2 


.1 



Comments : 

EST confirmation of the predicted transcript: 

7^ isolated EST has equence identity to nucleotides 1 to 753 of CT502 : this 
corresponds to nucleotides 7704 to 8456 of the genomic reference sequence. 

TaqMan Primer/Probe Sets: 

5'start=315 

5'stop=336 

3'start-392 

3'stop=412 

5'primer=TGACTCCATTATTCGCCAACTG (residues 315 to 336 of SEQ ID NO: 88) 
Tm5-58.70 

3 •primer^-TGGGGAAGTAAACTGGCTCCT (residues 392 to 412 of SEQ ID NO: 88) 
Tin3=58 .70 

probel=TGGAGAATCTGCATCAGA (residues 345 to 362 of SEQ ID NO: 88) 

probel start=34 5 

probelStop=362 

direct ion l=For ward 

Tml^68 . 85 

scorel=l ,85 

length=98 
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CT503 

Nucleotide 

Genomic coordinates : 
Start: 9248 

Stop: 8552 (SEQ ID NO: 90) 



Amino Acid 

MDPGASAASRRALWSSTVTNTRHYQQQLNRALNKIEEEDDVEEEHGQVTTTNKEMASTST 
SSSSSSSSSPTSSAIPSSDEEEEEEEEYDSESDTNVDSLLGEEEEEDSDTESTSADANFL 
RSSSRNSTTRNRLIKKYVDRFIKYEKDILLADRNKRKKRHRNRQPQIHKLNNKRLKKPTD 
KKQKTNKKKTWRRLPKFIKKMSPASRLKFFSACIISGIKITSIIVLSIMAL 
(SEQ ID NO: 91) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

NSR1_YEAST (P27476) NUCLEAR LOCALIZATION SEQUENCE BINDING P 56 2e-07 

FKB4_YEAST (Q06205) FK506-BINDING PROTEIN A ( PEPTIDYL-PROLY 55 4e-07 

GAR2_SCHPO <P41B91) GAR2 PROTEIN 52 5e-06 

095367 (095367) CBFl INTERACTING COREPRESSOR CIR 52 5e-06 

SR40_YEAST (P32583) SUPPRESSOR PROTEIN SRP40 51 6e-06 

008904 (008904) BRAIN X-LINKED PROTEIN (BRX PROTEIN) ( FRAGM 50 le-05 



FIGURE 3, Sheet 5 1 of 160 



wo 01/38351 



93/201 



PCT/USOO/28888 



CT504 

Nucleotide 

Genomic coordinates: 

Start: 13936 

Stop: 9328 (SEQ ID NO: 92) 
Amino Acid 

MAHKLLFLEEEDAKEIGTLSHPEPSFALYESETFRSVGFCKNVTDAYPKFLPRPMDINSV 
QALAVRLALIQFYKGRGWKKNMSIIDLVKDKVERNFKVDKKTSGGFIIGDGTGVGKTREL 
AAFVMSVILQEKALLDVQKHVGPSIFGQDSDKVITAINSGVWKRHPFFIWLTCSKPLFNS 
CQQGMREVVTNSRGLRDPKFSWRKLQVPCANKPTSFKSDGKSGSMTVDVENSVSSAKDSV 
DIRFFTLRDVKEFHSKRSSRSIGDFLTETPTILFMTYSDLRTNLEFVLKFITGGTDLDSN 
KVMPIDNFVTALLCDEFHKTQNISDSFRKELAKTWEEEDTRVLRNIQKRANPSVSDLINR 
FKSAMSDDRNFKVKRMKSSNNKGRVTMSNYLKLLSQADAFRIFLEILKYDTFTVMASATP 
FQSNADLHMIDHILRKSAPAYTSIQAFKEVSSATPDAMAEHSEYVTVFLEQVIKLLRNRG 
QLV S RS I SMAGVDCSTTNCKAS PLQKYAI DELAS YC LNARQVL I DSEKVGGH VRRAFT K I 
IREHQEGGILEEEDVEKLVAEINSPSRKRKRAANDDDLYEVMENIDRRFKVVVVRDRDVA 
HDGKTTLRSIVQDAIKTYSQKKDALSNGGGGITTSPEVDISSIDMVAQDLYDAIKKKEKP 
SKGKTDFNEDYDDGANEEDGWGEVFDDECFEKLRRQYFINTASTSVAACKGALLNIKATS 
VTDAVKRLRTTNESKKMVMSLEQTGDSFLKNLTTRILQTIAKDESDAKYGIVDVGTFDSS 
PVANTIFSGYRLLCRAVMiylASAFTISLKNKTNRRTSPAHVMLVPSVPDTEPLMALAGNPI 
DSITQSIGEDSNAEITNRKLCSRITNRGLFLVKNNTKTANTNKCISAFNNTKEVDVIMLG 
PKGNTGLSLHDSSNNSMYAKRYHCVLDVPYNAIAFLQTIGRTHRNGQLSVPQFLIFSTDS 
PAERRFFDSLDKRIKDSKAGTYADRYSNNSIDIAAAVMREQFIDQGLVLKTMGNIVQIVT 
ASMTKVHLMEHFSKMMMRTNRGGVAFVEGLTLENGIFTEVIVLAMHIALVVIGAQNKITS 
SDDLGHALSFTSVLPHNQILSIVKSASQFVFSNLCLHLVHFKSDCDNLLPREKRVRDAAS 
ALIDTLNTKNNEVTSKTNKIESDAPSLTALMLPSGPRNRKMDVFSNIMAYNNNNGMDFDE 
DVPDNDEDEGCLPLQEENATTLALSNFPHDYDRAIKDAHQLVTVRIVGQGEKEGVIPISE 
CLDVPELDMTNLIPVVTATNVIQSLAKENPGLLFTIHNAALAHSHREGYGGSHLLGLAKK 
LSRGFINFRQFQNQLFSPKKESKIMYDIFLSVKAIMARDDRYDGLCDMRMNSMMDASFLK 
VRKKPECVFITKLLDKNFRRHIINDEEEETRERFGGEEEEEDDDEEFEDEEEEQAEREWG 
EEEGESAyDISVINDKNNTIGHDVDIILCNRKKLTLTKENSVFVNEHIDSFMVGNLIGAE 
GSLIQICFDNCTGEFEGLPKFCLYDSSSKDKDTIP 
(SEQ ip NO: 93) 

Top Blast Hits 





Score 


E 


Sequences producing significant alignments: 


(bits) 


Value 


O0194 0 (001940) STRAWBERRY NOTCH (SNO) 


72 


3e-ll 


Q9Y2G9 (Q9Y2G9) KIAA0963 PROTEIN 


62 


3e-08 


075257 (075257) R31180 1 


62 


3e-08 


O01737 (001737) F20HH.2 PROTEIN 


62 


4e-08 


085862 (085862) PROBABLY METHYLASE/HELICASE 


59 


2e-07 


064516 (064516) YUP8H12R.3 PROTEIN 


53 


le-05 



TaqMan Primer/Probe Sets: 

5'start=2131 

5*stop=2154 

3 • start=2210 

3'stop=2233 

5 'primer^GGTGCACTATTGAATATCAAGGCA (residues 2123 to 2154 of SEQ ID NO: 92) 
Tm5=59. 03 

3 *prxmer=CAGTTTGCTCTAGTGACATGACCA (residues 2210 to 22 33 of SEQ ID NO: 92) 
Tm3=58.06 

probe 1=TTCTGTCACCGATGCAGT (residues 2157 to 217 4 of SEQ ID NO: 92) 

probe lstart=2 157 

probelStop=2174 

direction 1= For ward 

Tml=69.00 

scorel=l . 99 

length=103 
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CT505 

Nucleotide 

Genomic coordinates: 

Start: 16983 

Stop: 14064 (SEQ ID NO: 94) 



Amino Acid 

MDERRRDPLLYPTNRSMRFTAQITLFVTVFVLGCIFALVCAAMAYNVAKPMSVNFQAIHE 

LGMKSKLKAVQGANPEKTLEEYLEARGRHDGVEDASNYPPHPALLDMMNLTVKGNKWNVP 

SETKERNSRFESHDLAANRSSSLLVPEHHIDRLSEATIEKSNKYLDAVSGKKFRQRMVNL 

KDNIEKDDTELYDSLFGVHDIHHHSASGVSGDAPPPPPSTSEGHDEDVDILAYNTGGYCS 

NPVPLKEGQTCTSVCYTSRAVRVMTPFVAGGTFITHKSGEDPKPYCWSGNVPGDHIETSP 

TTGERVVKECSVHTSIVVLTDDGGWQCRPKYPTYFGGSGGTSMTACAFNPSTHKGPPPPS 

SSTPIYYDVLKKQQIRNHTEFRNSSYISKLRQSSSLAEFKIKCNDPEFLYKNPITCFCNN 

KKDVLNNDLLSQDVTKDMKFRGMYECMENPCVMMPNIDPSFVTFDVSTMKCVPGVNNPQD 

SNRHAIIGDDRTPLVGTVPAMGIFLADQSKRGDQIHQQRPKSSIDETTAKKIALAQAPII 

TPLNLDATNTSKNVLFVPIPSTVLPPLENIPHVIIRPSSLLHRSCLAPVLNKPSSGQHRP 

FCTAPFYIEPAANVLAGNIPQKPYEHSMLATECLRNSRMVSGSVHGGSELLFSTLLSQNK 

PSSYIRTPPGGTPAPEYNSTGDQRLEEIRDFFERNFNDERRLSQTEYVIKKHARGMRTSE 

lYLKSSSWDSLMKRKEFLRHIIKKSEDTFVLKEGLLMRSYGPYAATVLARDMFDLDYLKG 

KPASKTSSTLKVSNPLQYAFPTSYSVLPEEGATDDIFSVDHNRIFDSETIPSYFDCSNVT 

PGSEKLFGTSSSSSEYRVDIDDDAWGLQSFRLDHNPKSGPVVQSDPRLAFDASNISSTPE 

GATITPLSLFKKSLVEWGHKKADVQETSWFRDGVDTSEAYRRLLVETSMAVRNSWFSLAW 

ENKNYYFAKNSS 

(SEQ ID NO: 95) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

CUP5GALME (Q24998) PUPAL CUTICLE PROTEIN PCP52 PRECURSOR { 46 9e-04 

CFHDHUMAN (Q02985) COMPLEMENT FACTOR H-LIKE PROTEIN DOWN16 38 0.33 

P8991I (P89911) PROTEIN KINASE 37 0.57 

CAB39619 (CAB39619) AIGl-LIKE PROTEIN 37 0.57 

CAB53064 (CAB53064) DJ15D12.2 ( FHR-3 (FACTOR H-RELATED PROT 37 0.75 

Q24552 (Q24552) TF125 PROTEIN 36 0.98 

TaqMan Primer/ Probe Sets: 

5*start==1446 

5*stop=1466 

3*start=1511 

3'stop=1532 

5*primer=CAGGCATGCAATAATAGGCGA (residues 14 46 to 1466 of SEQ ID NO: 94) 
Tm5==59.66 

3'primer=CTTTTAGATTGGTCGGCCAAGA (residues 1511 to 1532 of SEQ ID NO: 94) 
Tm3=59.2l 

probel=GACAGGACACCGTTAGTG (residues 1468 to 1485 of SEQ ID NO: 94) 

probe 1 star t=l 4 68 

probe 1 Stop== 1 4 8 5 

direct ion l=Reverse 

Tml=69.00 

scorel=l . 99 

length=87 
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CT506 

Nucleotide 

Genomic coordinates: 

Start: 25878 

Stop: 25197 (SEQ ID NO: 96) 



Amino Acid 

MASVFEDPADLFANMDLTGKVPTRPNILFFEGLLPNSGKEIMENRLIHKGKCGAFEADTQ 
UVYFFPSNNEENTKKLNIGFQIKSNCLSFFIRDFLNDWLEEIKDCGPYCTFSQYMDGDKE 
IFGNSVFGQDFTIVAMDWIDKGVTFYIFVDGSDSMENMASLWMCDKLKRMNANVVKVFVD 
NASKPKFSVCKTCRWEFPGPVSYVIEGHGMGHSDLSCDEISEFLVQ 
(SEQ ID NO: 97) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

AMY_BACCI (P08137) ALPHA-AM YLASE PRECURSOR (EC 3.2.1.1) (1, 35 0.44 

Q85449 (Q85449) PROTEIN 8 32 3 . 8 

Q85439 (Q85439) COAT PROTEIN 32 3^8 

VP8_RDV (P17379) OUTER CAPSID PROTEIN P8 (STRUCTURAL PROTEI 32 3^8 

Q85451 (Q85451) OUTER CAPSID PROTEIN 32 3.8 

Q24284 (Q24284) PLC-GAMMA D 31 6.6 



Comments : 

EST confirmation of the predicted transcript: 

An isolated EST has equence identity to nucleotides 1 to 652 of CT506: this 
corresponds to nucleotides 25216 to 25867 of the genomic reference sequence. 

TaqMan Primer/Probe Sets: 

5' start=287 

5* stop=307 

3*start-351 

3'stop=373 

5 *primer==ACGACTGGTTGGAGGAGATCA (residues 287 to 307 of SEQ ID NO: 96) 
Tm5=58 . 18 

3'primer=TGTTTCCGAAGATTTCTTTGTCC (residues 351 to 373 of SEQ ID NO: 96) 
Tm3-58 . 28 

probe 1=AGGACTGTGGACCATACT (residues 308 to 325 of SEQ ID NO: 96) 

probelstart^308 

probelStop=325 

directionl=Forward 

Tml=68 . 99 

scorel=l . 99 

length=87 
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CT507 

Nucleotide 

Genomic coordinates : 

Start: 29077 

Stop: 28330 (SEQ ID NO: 98) 



Amino Acid 

MIAIANHKHDVSDALVGAHGAKINMLYGKSSTLSVTEAALLMFNDTALTQFAQRGYEPSI 
PTILKAALDFSLQEEEPLVAATGLDVNKAPRSWPILNCRLGYIASSNYPWAEHIISGDKE 
EIKRALEEHEKNANVRFDSDNCPVCLEDFSSTNIIRTTRCGHCIDEKCWDRLVLSTQRGE 
ITRCPVCRERTSLRPDADQVKEMLVEPIVSCKRMAVPDEQVSCKRRRIGYNRYQFLINDV 
WTDESETV 
(SEQ ID NO: 99) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

Q92T52 (Q9ZT52) RING-H2 FINGER PROTEIN RHAlA 4 5 7e-04 

CAB51420 (CAB51420) RING-H2 FINGER PROTEIN RHAIB 44 0.001 

Q9ZT51 (Q9ZT51) RING-H2 FINGER PROTEIN RHAIB 44 0.001 

CAB51421 (CAB51421) RING-H2 FINGER PROTEIN RHAIA-LIKE PROTE 44 0.001 

017099 (017099) F42G2.5 PROTEIN 42 0.005 

016682 (016682) 2K1240.2 PROTEIN 41 0.008 



Comments : 



TaqMan Primer/Probe Sets: 

5'start=257 

5'stop=279 

3'start=308 

3'stop=330 

5'primer=TCAATAAAGCACCTCGTTCTTGG (residues 257 to 279 of SEQ ID NO:98) 
Tm5=59,2 6 

3'primer=CCAAGGATAATTTGAGGATGCAA (residues 308 to 330 of SEQ ID NO: 98) 
Tm3=58.96 

probe 1=CAATACTGAATTGTCGCT (residues 281 to 298 of SEQ ID NO: 98) 

probe lstart=2 81 

probe 1 S top=2 9 8 

directionl=Reverse 

Tml=68.8l 

3corel=l .81 

length-74 
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CT508 

Nucleotide 

Genomic coordinates: 

Start: 30861 

Stop: 29073 (SEQ ID NO: 100) 



Amino Acid 

MAEAAPRYRQVLEEVLENIEPYMSFLDVFTERELALLNDIITSRNSPPVPSSSFKKLDNK 

EEFRDIIYFFINNNTKSDSSPICEGMTFINALTTVCKTFRGLYENIHDDFLFVKYSLLVS 

MDNGFLRRETHGIKFGTGDDSRTGFKFTSKEQAEEEREKVMRRIKKLDGVLASLKKSTSS 

ARSGIVFYVEKCSSVIRFRLFSRIVNITSDYVAEMKKSAPLEPFDISFGYKYFVDESPCV 

TKAKRLISNGNFIIGRPFSCLETSPSSVSTDFREEMNMDARSIARLNWTNEERASAYRSV 

IIKSFLSSIEEEMVEEYCETTTKTVAEMAVEFVDVFIEKAETIQHFQTLY5IFDTMPKFS 

AEMMDNILKNVAINEAVGSGLCGAILLWMINSRPFEEIDYNYFKICLREIMVRKKTDKLC 

DNLIVKRIVSHKNVVITDPHEVKGYVRLCVKVSCFMEDLEAFLTKNPWLKHTYFDEKGNT 

LLCYCIINKYSHTSKLVKQEKLNILKPSAKGMSPLMVCAAISSPFTTRVGIEILTTNSLA 

FSFINENNENVFHAAAVATSCNFLDALAKKYKNIIYDFDRSIVNARRRAMVQRP 

(SEQ ID NO: 101) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

CUL1_CAEEL (Q17389) CUL-1 PROTEIN (LIN-19 PROTEIN) 36 0.99 

02329G (023296) HYPOTHETICAL 74.2 KD PROTEIN 35 1.7 

057577 (Q57577) CARBON MONOXIDE DEHYDROGENASE CORRINOID/IRO 35 1.7 

GIDA_COXBU (P94 613) GLUCOSE INHIBITED DIVISION PROTEIN A 34 3 . 8 

Q47427 (Q47427) PLASMID P15B GENES R, S, SC, SVMl , SVMR, SV 34 3.8 

Q39068 (Q39068) CYCLIN 2A PROTEIN 33 5.0 



Comments : 



TaqMan Primer/Probe Sets: 

5'start-^742 

5 • stop^764 

3'start=822 

3'stop-845 

5'priiner=^TCCAATGGCAATTTTATTATCGG (residues 742 to 764 of SEQ ID NO:100) 
Tm5=58.75 

3'primer=GATCTAGCGTCCATGTTCATTTCT (residues 822 to 845 of SEQ ID NO: 100) 
Tm3=57. 51 

probel=CATCCGTGTCAACTGACT (residues 797 to 814 of SEQ ID NO: 100) 

probe Is tart =7 97 

probelStop=814 

direct ion l=Reverse 

Tml-68.91 

scorel=l . 91 

length=104 
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CT509 

Nucleotide 

Genomic coordinates: 

Start: 37245 

Stop: 36048 (SEQ ID NO: 102) 
Amino Acid 

MAGTDITSSSSSGSSSSKKGGCIVSKKGKTIKGKNIVFKTSIKTSSSSEMMKKHKKRMEI 

KDMVKKCASCKKVDYSSSTLENDALRASIESTCSALNRFPEIKYGEGEIGDVLSAIRLMA 

GCLLAKNEKSFYKFFLRGFQFDKNGFMMLSEGMKRIEPCMHTKIAKKVFGGCKAAPLKEDR 

EGKIPCQEFHKPSSYKGEYTTPLPPTPAPVKVLPPLLPYKNVKNKPVFVPDLAVGEAKKP 

CWVHKLFSDDPEERKRLFERHQAGRRDALMEDYGVIPN!>3DNEAEDTERFVSNALEYQAQM 

LELLDTANMPPPASTPVRRGRTRIVRDYDASPVPSPYSSPLHTPFDAPNVNLNPGSGRMV 

DRVRDGRRNTSRRTSAVMARRINQLQHQFLYYSSDSDF 

(SEQ ID NO: 103) 



Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 



001693 (001693) COSMID T08B2 


39 


0. 


.073 


CAB52863 (CAB52863) PUTATIVE MEMBRANE PROTEIN 


36 


0. 


.37 


001348 (001348) ZINC-FINGER PROTEIN KLU (KLUMPFUSS PROTEIN) 


36 


0. 


.48 


Q92DN6 (Q9ZDN6) VIRBIO PROTEIN (VIRBIO) 


35 


0. 


.83 


Q9Y2W9 (Q9Y2W9) ENDOCRINE REGULATOR 


35 


1. 


.1 


GAR2_SCHPO (P41891) GAR2 PROTEIN 


35 


1 . 


. 1 



Comments : 

EST confirmation of the predicted transcript: 

An isolated EST has equence identity to nucleotides 1 to 973 of CT509: this 
corresponds to nucleotides 36148 to 37119 of the genomic reference sequence. 
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CT1070 

Nucleotide 

Genomic coordinates : 

Start: 249425 

Stop: 253221 (SEQ ID NO: 104) 



Amino Acid 

MASSGGFFTGIDDLFKTVIQQEKQEKNKPTQAPETEPKPGPSQAPDPVPDPVPKTPTNFC 
PPPPNPLPPPPPPPPKPSREERLKTSKIRLNICALSDIVEATNERVDALKENQALNTEYDK 
KDNYFQVLKCSITPSVPTAIIGAHVKQVAKSSEIELAVNELDIKNKCSLVYNENESLKFF 
RDHENLILQIAVQLFSRHDNTKCVGAEICVKGNEKNKFVNKLVVKKLPNAPSSSSTVLEI 
RGATRNLLENNFNKGENNTVNENKDIPPSERANLDTTKAEISHVFSTLHRLDTKRKLFFK 
GNTFYQRKPTFDNKFRWTEVIGWTESEASKQTTKSLDKPTDDNLFVLPHSFNNLADHLRL 
KFKNVLYKNSTAHPGKRNYYKTQETLINPQIDSAKEYKMVFAEIDKCLDVLLAIGKNDKY 
TKSTVIQYRGKFRRYLIFCYAFYALNKAKHSRAVSPLPFNFFNLFSFMYCHGPFLHSASF 
LSTLTFVYQHMFFPMGTAAPSVSAKRLMDIDSALMKGGKGVGVRDFGSPSKTSLHTRTLV 
SFLGFAEMAMGTMTALLSGVEVRVSPALQQRISKSLERWCDSVIFIYFTFVLFHRFSGAK 
KVSLESALRLIMGQTHAHTNKVRAAKRCRIEAAEMEGVEEEEAGLTLSYAHLLGLPYSIQ 
KALGLPVPKINPLMTASSSQYNLGDFVGVEQLLKAKREFPAEGETAGFLGMFDNLVKDSI 
DKYYGEGAFSDVVENVKQGMEQNTPYDTSSALMTPIPKAFYEEEKDVPQQEENSTQQRYR 
LNRDVEEYLMASPMKMVFVSILDKTNQKERFMSVGDIALLAVWCKRNVLKKDWNEYAIAK 
GNYEWLGAKMCNHLLLADLVNFGILGDLKITNKLDTNTDTFHRDSDRLPSVADQKKFIKN 
TSLSDRKQLALVHSCVNVSTRTHVGRVTATSWAVDALRTYTRGDKDMFAALSSSLDMYHL 
GHTNSANFVPYFSRNYLCNEQENGLWGYTRRTSEKLAKEELGRGRLGGLNKVGVAKTELA 
AAAIAISSALDMGEVEAVMDDSSKVRKIASTCLNVNAAKVSAAREKAREASIKRLLLATN 
APAAGSSRNSNRFLLKDLWGFFSDPDKRQKLIKGEAVSVLCPNTGFLHAAVPDFVIEYSF 
ESETSIVRLRLRLIKPEKQDEMVCPSTAPEANKKRKLVRNNQDAVLTLDDEDNIVKYNKY 
DMVEDEEARERLRHQDKQSVIAARISKVCERKNPKKKRRLEDPELQSVDEQLIRELAAIA 
Y 

(SEQ ID NO: 105) 
Top Blast Hits 

Score E 

Sequences producinq significant alignments; (bits) Value 

SSGP_VOLCA {P2i997) SULFATED SURFACE GLYCOPROTEIN 185 ( SSG 64 8e-09 

YPRO_OWEFU (P21260) HYPOTHETICAL PROLINE-RICH PROTEIN (FRAG 62 2e-08 

013305 (O13305) PROTEASE 1 61 5e-08 

P93797 (P93797) PHEROPHORIN-S PRECURSOR 61 5e-08 

AAD37432 (AAD37432) M-LIKE PROTEIN PRECURSOR 59 2e-07 

Q54071 (Q54071) M PROTEIN PRECURSOR, MSZW60 59 2e-07 



Comments : 



TaqMan Primer/Probe Sets: 

5'start=1884 

5'stop-1904 

3'start=1931 

3*stop=1952 

5 ' primer=CCGAATAGAAGCAGCGGAAAT [residues 188 4 to 1904 of SEQ ID NO: 104) 
Tm5=59.32 

3'primer=TGGGCATAAGAGAGTGTCAGGC (residues 1931 to 1952 of SEQ ID NO: 104) 
Tm3-60.21 

probel=G7VAGGTGTGGAAGAAGAA (residues 1906 to 1923 of SEQ ID NO: 104) 

probelstart=1906 

probelStop=1923 

direct ion l=Reverse 

Tml=68.98 

scorel=l . 98 

length=69 
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CT1071 

Nucleotide 

Genomic coordinates: 

Start: 253296 

Stop: 255120 (SEQ ID NO; 106) 
Amino Acid 

MRDDTFNQETAVKLVRWYTEYDCCCPLVNRVERLLGSFGGGVDATSVRSRPALYEEDKKG 
DKCIPFRITSLIEGILLERALTKPDLAAAAFDVSEKLVYCSCNNTQGNFDVSSMTIWIDG 
NNSKKYEVTCPSCTVEKISGGAESIHKKPMSLLAFFNNLVEKEAFAERIELKKLYLSLLT 
GSAAGGGGMYKDSSQQSSFNGSWTSLLFHTSKKDKTRLEAEVLVSNKIKHTSRLQPRCVC 
S DLL Y ALCSTTNNSAS YAYKARNLCV I EGGE FLY FKYT I FEENGPFDSKT DLQSLVNNEP 
VSETNSSALAASSSSLEDDDDCCDDDDDDDDDEDEKTKKKQPKKQTKKQKTTTSTLPPIS 
KTNHDNMLMNVLKKGAVNGKRKMMDSLSGKKGQHSKKLKTSAAAGGGASSDVVAGENEEE 
NNPSSVSPTNNRDRKDYVLPCPQIEEVTIFSQHRMNNNKLAESVVKHSVVINGNCLNLFV 
TQHRKKYILPHENILFCPPLVQHVGFNKFRILTGVSCFFDRIEIVFSDQSDSVVLSNNAA 
HSAILRLLSYIRENSLKRSVRTASVKGIDFVVKSQDTNIGIPLSNKEIRERQLCSASTLS 
MLAGLGK 

(SEQ ID NO: 107) 
Top Blast Hits 





Score 


E 


Sequences producing significant alignments: 


(bits) 


Value 


Q20497 (Q20497) F47A4.2 PROTEIN 


52 


2e-05 


KSl HYDAT {P38978) KSl PROTEIN PRECURSOR (HEAD-SPECIFIC PRO 


50 


6e-05 


YIL2 YEAST (P40480) HYPOTHETICAL 123.6 KD PROTEIN IN POR2-C 


48 


2e-04 


FKB4_SP0FR {Q26486) 46 KD FK506-BINDING NUCLEAR PROTEIN (PE 


47 


3e-04 


Q06459 <Q06459) NUCLEOLIN 


46 


9e-04 


NPM_XENLA (P07222) NUCLEOPHOSMIN (NPM) (NUCLEOLAR PHOSPHOPR 


46 


9e-04 



Comments : 



TaqMan Primer/Probe Sets: 
5 • start=1007 
5'stop=1032 
3'start=1084 
3*stop=1105 

5'priiner=AAACTAAGAAGAAAC7\ACCCAAGAAA (residues 1007 to 1032 of SEQ ID NO: 106) 
Tm5=57.02 

3'primer=TCAACATGTTGTCGTGATTGGT (residues 1084 to 1105 of SEQ ID NO: 106) 
Tm3=57.4 4 

probe 1=CACTTCCACCTATCAGCA (residues 1064 to 1081 of SEQ ID NO: 106) 

probe lstart==l 064 

probelStop=1081 

direct ion l=Reverse 

Tml=68.99 

scorel^l . 99 

length^99 
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CT1072 

Nucleotide 

Genomic coordinates: 

Start: 255074 

Stop: 257477 (SEQ ID NO: 108) 



Amino Acid 

MLSFNPEYASWFGKMITDPGVILPVSKDVVLFGSRGQSDVGIMTLDPHDLDIKITSKRIG 
VEERLAQYNTLPMDFTRAMEKELNNSRNMKESIFTGIFLDTGSAIFEDNMFNGGGSALRL 
IRSPALNSAVFSSKNYIIKQLPTITKSLRRSQARDKQVDKTREKIVVDSFSILSAIAAQV 
MHLTDGEMTYVPDGHCVNWMSETNASSIYLIINDPTGSGWKIMPNNFNKTLEMRDGVID 
RVETLVEFACKCVASSLIKRGMDLVDMQRTIRSMDFLPPASSTSNNTPRVAIMTSGSSTT 
TGIGSLSILAEDGSTHHQIKLSEYRTGLSITENNREVSFTVEPSIDGVQAEHPLSPSILQ 
WLPPLVKRPEVVAAAAAAWEEENGDNKPSDKDNEDKYSDTDFWSNVPVTPLITPKKWRA 
CKINDRAMISSWKNNLVKLHKYDWTNKTTKVDYFDKMAAFVALMTFRKFQDILADNYVPP 
QTPSQGSEYAVTMSNVATLFTDVYGFESNGNKPLFALEOLENETGIESIYVLNIIGNSPD 
GNSVRVVRLEKEMSFLLKAKQYFTEMAIPPINEKCKWTDKAPSSVKEYKYFCDLTAPISK 
RPRKDNNDGGVEHSALTYTPRCIYHTERCLVHLYSEPEKITEHVSFNKDLNILEIGKNIT 
NQYQTNYKSIFEIVDVPIIVASMSSTKTMTVNNYIISTPSATTKFVQDPPKTGKQLLAVE 
EVRNFKLKSVLVPPPYFRDNKRNTTLCSQITEQNCPSSSEGGRFSCPSESLILKYSNLSK 
KRALEEIAPETETSILSLAM 
(SEQ ID NO: 109) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

TFC3_YEAST (P34111) TRANSCRIPTION FACTOR TAU 138 KD SUBUNIT 39 0.12 

Q9XIC1 (Q9XIC1) F13F21.2 PROTEIN 38 0.20 

P95629 (P95629) PUTA GENE 37 0.60 

Q9XGN1 (Q9XGN1) TTGl PROTEIN 35 2.3 

085475 (085475) CELL DIVISION PROTEIN 34 3.0 

Q00741 (Q00741) TAMA 34 4.0 



Comments : 

EST confirmation of the predicted transcript: 

An isolated EST has equence identity to nucleotides 1 to 932 of CT1072: this 
corresponds to nucleotides 2 5652 0 to 257 4 51 of the genomic reference sequence. 

TaqMan Primer/Probe Sets : 

5'start=1048 

5*stop-1068 

3*start=1099 

3'stop=1120 

5*primer=GCAGAGCATCCTCTATCCCCT [residues 1048 tol068 of SEQ ID NO: 108) 
Tm5=58 .38 

3 'primer=CTGCTACCACTTCTGGCCTTTT (residues 1099 to 1120 of SEQ ID NO:108) 
Tm3=58.57 

probel=TTCAGTGGTTACCTCCTC (residues 1076 to 1093 of SEQ ID NO: 108) 

probelstart=107 6 

probelStop=1093 

direct ionl^Forward 

Tml=69.00 

scorel=l . 99 

length=73 
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CT1073 
Nucleotide 

Genomic coordinates: 
Start: 257S51 

Stop: 259132 (SEQ ID NO: 110) 



Amino Acid 

MGVQKNILVGGGGGVSLLLGVVTLLGTVTEGAPAVPPFSSSSYSFTPESSVFWVEGNRVL 
SGTKKDTLINVLGKKI PYYANSIFRHDCSETRSIQWPETSPLGLNLI FCSCASHEHQHRT 
HETTEPDDLLWDGSRKTTTIILPKKWWSDVVWTSLWRDNDQKCGCGQAFVSSFTSTQKEV 
QGEWLAAHTNGKTSEGDTNSAYLFISLQRTTLKPIITDVTEDNMMMGRMSGTPMNPKDMT 
YFVNDFSDDIGSTPQCLVSNSDILNKREEWIAVWGVADSKDLLTKHQLGEREYGSEGRRR 
NPGVEEEEEERVEEEEEVEVALPYIKKSGKLIGPRRRPLTTTTTTTTTTTTNPIVREVVE 
DFDYESFNEPEIFGSNSKLPFIRFLDQKNWRLGIMSRVSSSIANFKIEQESSKALFCLAV 
WVGDEHTPKFRLSVWKNWKPFTSAPIIVQNVGYSSDVFWHETLRSKIVDRSRDLIETKVT 
KKIGEDWANKKQTVVAMFISGIVCITVTVISIFSIVIYYKIKMPKF 
(SEQ ID NO: 111) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 



G1184543 (G1184543) NAD+ : PROTEIN (ADP-RIBOSYL) -TRANSFERASE, 44 0.003 

017412 (017412) CHITINASE 41 0.015 

AAF00095 (AAF00095) HISTONE ACETYLTRANSFERASE MORF 39 0.10 

AAFOOlOO (AAFOOlOO) HISTONE ACETYLTRANSFERASE MORF BETA 39 0.10 

015087 (015087) KIAA0383 (FRAGMENT) 39 0.10 

AAF00099 (AAF00099) HISTONE ACETYLTRANSFERASE MORF ALPHA 39 0.10 



Comments : 

EST confirmation of the predicted transcript: 

An isolated EST has equence identity to nucleotides 511 to 925 of CT1073: this 

corresponds to nucleotides 258 607 to 259021 of the genomic reference seq^uence. 

TaqMan Primer/Probe Sets: 

5'start^698 

5'stop=721 

3'start-823 

3' stop=846 

5'primer=CCATGAACCCTAAGGATATGACAT (residues 698 to 721 of SEQ ID NO: 110) 
Tm5=57,04 

3 'primer^GAGGTCTTTAGAGTCTGCAACACC (residues 823 to 84 6 of SEQ ID NO: 110) 
Tm3-57.2 9 

probe 1=TTCGGACATCCTGAACAA (residues 780 to 797 of SEQ ID NO: 110) 

probelstart=780 

probelStop=797 

direct ion 1= Forward 

Tml=69.00 

scorel=l . 99 

length-=149 
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CT1074 

Nucleotide 

Genomic coordinates: 

Start: 274526 

Stop: 275153(SEQ ID NO: 112) 



Amino Acid 

MYIFVEGSPLTGKSSWMSKLIDTGSCGMSFLNFLRMNTSDYYNWPAEIGTEHLQLGFRET 
RVVDGMFEPVLKTFVDSWKKEQGKESLKEYLDYNGQVMEIYIAEWLRQRPLAFHVFTYTD 
EAVKSGFLNEEDLDMDTATKWMAEIIREKRGNIQEIKVTPRVVFNGNVCSACFSNTKRNL 
YNFGTNYNNVVHCDLLCPFARHRIVHFL 
(SEQ ID NO: 113) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

Q9Y7X6 (Q9Y7X6) HYPOTHETICAL 75.7 KD PROTEIN 38 0.079 

Q9YBC0 {Q9YBC0) 431AA LONG HYPOTHETICAL PYRUVATE DEHYDROGEN 35 0.52 

067329 (067329) DIHYDROOROTATE DEHYDROGENASE ELECTRON TRANS 33 1.5 

DP01_B0RBU (051498) DNA POLYMERASE I (EC 2.7,7.7) (POL I) 32 4.5 

YHC3_YEAST (P38742) HYPOTHETICAL 130.0 KD PROTEIN IN SNF6-S 32 4.5 

051342 (051342) ATP-DEPENDENT CLP PROTEASE, SUBUNIT A (CLPA 31 5.9 



Comments : 

EST confirmation of the predicted transcript: 

An isolated EST has equence identity to nucleotides 1 to 563 of CT1074: this 
corresponds to nucleotides 274555 to 275117 of the genomic reference sequence. 
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CT1075 

Nucleotide 

Genomic coordinates: 

Start: 277704 

Stop: 278079(SEQ ID NO: 114) 



Amino Acid 

MWRSCISNIREMGDNKDYETRLIQRINDLESEIENKTELCEKINEQMKNTQLKYDKCFVE 
EETEKFRKMEERVKYLKEQGIPLDPEERRTMLAEIDKSNKELDALLEENERIIKLIDEEL 
ESMK 

(SEQ ID NO: 115) 
Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

044929 (044929) MICROTUBULE BINDING PROTEIN D-CLIP-190 47 5e-05 

Q13439 (Q13439) TRANS-GOLGI P230 (256 KD GOLGIN) (GOLGIN-24 43 7e-04 

059037 (Q59037) HYPOTHETICAL PROTEIN MJ1643 43 0.001 

025662 (Q25662) REPEAT ORGANELLAR PROTEIN 43 0.001 

KINH_DROME {P17210) KINESIN HEAVY CHAIN 42 0.002 

AAD29948 (AAD29948) MYOSIN HEAVY CHAIN 41 0.002 



Coinments : 

EST confirmation of the predicted transcript: 

An isolated EST has equence identity to nucleotides 388 to 1 of CT1075: this 
corresponds to nucleotides 277689 to 278076 of the genomic reference sequence. 
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CT1076 
Nucleotide 

Genomic coordinates: 
Start: 278221 

Stop: 278473(SEQ ID NO: 116) 



Amino Acid 

MQKKYDKLFEDDKRFREIEERILQQKEKGNPLDPEERLVLSADIDRSMKEIDDCLEEINH 

lELSIDTLLDECENLHYGLQTTK 

(SEQ ID NO: 117) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: - (bits) Value 

018164 (018164) Y6B3B.1 PROTEIN 35 0.076 

P70388 (P70388) RAD50 HOMOLOG (S. CEREVISIAE) 35 0.10 

Q06268 (Q06268) INTERMEDIATE FILAMENT PROTEIN 33 0.30 

US01_YEAST {P25386) INTRACELLULAR PROTEIN TRANSPORT PROTEIN 33 0.39 

Q07380 (Q07380) HYPOTHETICAL 206.5 KD PROTEIN YDL058W 33 0.39 

096275 (096275) RESA-H3 ANTIGEN 32 0.51 



Comments : 

Hit to public SBV sequence: 

gi 1 6007410 igbi AF178573. 1 : CT nucleotides 13 to 252 match nucleotides 2050 to 22B9 of the 
public sequence with a 100% homology, a score of 476 and an Evalue of ie-i37 

TaqMan Primer/Probe Sets: 

5 ' start=88 

5 'stop-Ill 

3*start=147 

3'stop=170 

5'primer=AACCCTCTAGACCCAGAAGAAAGA (residues 88 to 111 of SEQ ID NO: 116) 
Tm5^57. 69 

3'primer=TCCTCGAGACAATCATCAATCTCT (residues 147 to 170 of SEQ ID NO: 116) 
Tm3-57, 97 

probe 1=TATTGTCGGCTGATATTG (residues 116 to 133 of SEQ ID NO: 116) 

probelstart=116 

probelStop=133 

directionl=Forward 

Tml=68 . 98 

scorel=l .88 

length=83 
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CT1077 

Nucleotide 

Genomic coordinates: 

Start: 278723 

Stop: 280976(SEQ ID NO: 118) 



Amino Acid 

MEKKTETAATTEKDPEPSVSKRSRNKEPKTTSTVYTSVKCYLSSIIKSESSRSNVTSTKE 

RFEERCKSVSKMMVKGSLFLRLVVDECLRRYNHLEDEIDKWPDMTKDNFYVQLLRKGLDK 

KKLKEGSTHPVVEDVWNSPIVQETFLSQQGEGNNPIKRHLMDFNTITYAAKQLKTCFETN 

LRTHFRTRQQRAISGWLAENGFDKKYTKLVQHWIIGCTYKSDWVDSGDLERVKEGTKNFV 

TLHRKHLCVISDKKNGTISYSPEEKYPIPSILNYYKFLQTEYPQNKKIQKMIVVPKHKLK 

IHYCTFDQTTIQGICKDLGVWKDMEERHKQSEDILYKQGWYLLFDVKKIKKLRPNWNFHS 

IQTDGEGVSVLFSREVEEVETVSKKSKKNKKPRGDEDRRNYPPTNAKYVVGVDPGRTNVV 

SCSVFDTRQKRVVRKHRMTAKQYYQESWMTDRRKANETYKKNNKEYKEALEEITRYDNGE 

EIINDGNGDTSTPTKKFEAYLKVVNEHYRLLWNEKGKKKYRKNAMKVYSRKQKCXSNFID 

ELIPKRDKIEDYHIAFGDAKFACTGRGEQYASPARIFAKKIKERVGGDKRFTFVDEKYTS 

KVCHRCNQPLNMLEKDCFSPNKKRKPPTIVTTTTTTTTEEDEENGKWKKATPLRENRDTR 

RCSSEKTQFGYSSNRKVSTGDISMETPVPSSTSSSFCTPTSITCVLGGKFVDRDFNASTN 

IVHKFLGFWDKKLMEKKDKMPLKYHFIRVA 

(SEQ ID NO: 119) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

Q9ZAY5 (Q92AY5) SURFACE PROTEIN C 46 7e-04 

CYL1_HUMAN (P35663) CYLICIN I (MULTIPLE-BAND POLYPEPTIDE I) 41 0,022 

062231 (062231) F35E2 . 9 PROTEIN 39 0 086 

Q12080 (Q12080) P2610 39 q.h 

MYS2_DICDI {P08799) MYOSIN II HEAVY CHAIN, NON MUSCLE 39 0.15 

069188 (069188) C3-BINDING PROTEIN 38 0.33 

Comments : 

Hit to public SBV sequence: 

gi |60074l0|gb|AFi78573.i: CT nucleotides 12 to 297 match nucleotides 2549 to 2833 of the 
public sequence with a 99% homology, a score of 551 and an Evalue of Ie-159 
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CT1078 

Nucleotide 

Genomic coordinates: 

Start: 282175 

Stop: 282586 (SEQ ID NO: 120) 



Amino Acid 

MGNSESRSSGIEIVHKNGAPKRSHKTLYLSNRTERHAQIQKQIEELHHKTNKQFEQAQKV 
LDKNEERKKHQQQQQIITPLDPEERRAILAEIDKHMKEIDGFIEESERLGLLV DAEINNL 
EEKEVEEEHLLKQKED 
{SEQ ID NO: 121) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

ARP4_STRPY {P13050) IGA RECEPTOR PRECURSOR 45 2e-04 

AAF05247 (AAF05247) 0RF133 43 6e-04 

MYSG_CHICK (P10587) MYOSIN HEAVY CHAIN, GIZZARD SMOOTH MUSC 43 8e-04 

AAD46501 (AAD46501) LATENT NUCLEAR ANTIGEN 43 0.001 

040947 (040947) ORF 73 43 0.001 

Q9ZGM5 (Q9ZGM5) M-LIKE PROTEIN (FRAGMENT) 42 0.001 



Comments : 

EST confirmation of the predicted transcript: 

An isolated EST has equence identity to nucleotides 5 to 344 of CT1078: this 
corresponds to nucleotides 282190 to 28252 9 of the genomic reference sequence.. 
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CT1079 

Nucleotide 

Genomic coordinates: 

Start: 286863 

Stop: 289635(SEQ ID NO: 122) 



Amino Acid 

MSSSSSSSFSFRISTYQTFLKALAHPDLVDKITQKCDETGRNQKCPIQFXADISHLIQGE 
RNGGNLFPLHPFKNQPHLEPRIVGSLHGRTLDNDIEESYCYFVKDLYNGVFSYVNGVKEL 
QGVLDKKISGSGSGESSSSRAPLIPITDVDLLYIFGTLVVLPPRSKAYRVITEAVLALPF 
NEFSNNWPPTNIKGAYVSRDFRMFNLLAGLDHIEGEVGGESEWESIHASWKRMVTIMRN 
KAEKKPPSTSRIFRVYVAEPVNDAVTKIPIRVLSKLFGSRLAGILQKVYSYSMLNLPYLL 
SSNSIDIKQGVKGTTLSIPSARKLGFYLLQKDTTLQSSLSQDVADCIVSINAGIIGDDFS 
EKIRQCIEEKNKPENCCMCFCEIDKTPDFSYSEHVARHNFFPVHAFSSSHDDKCCGAKIC 
SECIFPYIISLYEKMTGVAGVKVVDLFQCPGCKSGMLNLKGRCYEFSNLCKRMILPYTST 
HCSSLFDATINRAEACFYSLEFLQYDFETARRIAHGAKDIPHVYNKVVKNVKDLDRLCAL 
YCYKCVSPVVCDEPNE5TDYEMVDVTPPLINLTEIVDSEEYDDGPGNHMWPAKFTCNFIA 
GSSGETPTISTCRDAVTFLGRAPRKKMAGWDDQSAVGQAIIALANWRKSGELPKNMFDLL 
EGVNAVLYRGDSFLLRAINYPCVIGRSMSPSLELVKRKVNKIALIKAFFHEKRVRPDASK 
KLLEWAELLVKSYLMEVLLQTPECVIHRAHSFVGKTLLITDELVHMRPDDATRNAYIQNL 
NAARQNAAAAASFSGSLPKPEFVPCKERTIEWMYEKDNDDVRVVNCPSCKKAIQKYGGCV 
NVFCECGTNMCWICEEKVSPADSNHCVEKHRIVYSNCVRVKYALESMYGFEICTMKNVEE 
GVKNYYVMENGFFFDVQEMVAKK 
(SEQ ID NO: 123) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 



AAF04637 fAAF04637) HYPOTHETICAL 84.4 KD PROTEIN 213 3e-54 

097226 (097226) PFC0175W PROTEIN 49 le-04 

Q94981 (Q94981) ARI PROTEIN 48 4e-04 

CAB45785 {CAB45785) HYPOTHETICAL 262.6 KD PROTEIN 46 0.001 

CAB36714 (CAB36714) HYPOTHETICAL 68.5 KD PROTEIN 45 0 . O02 

Q9XII0 CQ9XII0) F7H1.11 PROTEIN 45 0.003 



Comments : 

EST confirmation of the predicted transcript: 

An isolated EST has equence identity to nucleotides 1 to 910 of CT1079: this 
corresponds to nucleotides 288641 to 289550 of the genomic reference sequence. 

TaqMan Primer/Probe Sets: 

5'start=114B 

5'stop=1167 

3'start=1192 

3'stop=l211 

5'primer=TTGACAAGACGCCCGATTTT (residues 1148 to 1167 of SEQ ID NO: 122) 
Tm5-59.14 

3'priiner=TGGACGGGGAAGAAATTGTG (residues 1192 to 1211 of SEQ ID NO: 122) 
Tm3=59.35 

probe 1=AGTGMCATGTGGCAAGG (residues 1174 to 1191 of SEQ ID NO: 122) 

probel3tart=117 4 

probe 1 Stop=l 191 

direct ionl=; Forward 

Tml=69.00 

scorel=l . 99 

iength=64 
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CT510 

Nucleotide 

Genomic coordinates : 

Start: 38917 

Stop: 37381 (SEQ ID NO: 124) 



Amino Acid 

MAETVAVDEVPTCPICMGDYDSDTDCYNWSNGGMPCCRKSVHLECLFTWRFEEHMVNENH 

LLCPMCRAYIPPVWFFRKVYEEVYKYASFHSFLLSADYVNDEGVKDTLNKMSTILAPTFF 

VPNAKGVNENE DVYMERAYTKLS FMLETLSRQEMHAFS E ET FEDNHEAALMGKFKDI PP Y 

EYEGEWLKYVAPNTIDITQCLSNDDDDDEGDNNVSPSLLSGVTSFNFIEDDEDTVVFVPP 

EVDDNDDSESLPDLTVPPRSNNITFDTISGISSSLYDVNDDDDDDDTMSLPDLNMPSAST 

SSAPTSSAPTSrSLNINVNLCFNVDSDSDDEEVIPSSSSVNQPSTSSGSSSSSSNSRKRP 

RYGRDEDRMSNISSESKRLCVDVKRYMCRLDNIDEEYNEIANRYLAELSALRERRQETEN 

KLGDCISRGNLFHTTVNDVIGKSLCSKKLKVKRKYASKWSANKQLIGSCLIKSASNNARL 

DDEIAHVHSSLLNGFDTDPSEADQISSLPNL 

(SEQ ID NO: 125) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 



070567 (070567) DENTIN SIALOPHOSPHOPROTEIN PRECURSOR 56 7e-07 

P97399 (P97399) DENTIN SIALOPHOSPHOPROTEIN PRECURSOR 56 7e-07 

SR40_YEAST (P32583) SUPPRESSOR PROTEIN SRP40 53 4e-06 

095815 (095815) DENTIN PHOSPHORYN (FRAGMENT) 53 6e-06 

P87736 (P87736) RING-FINGER PROTEIN (FRAGMENT) 51 2e-05 

Q53653 (Q53653) CLUMPING FACTOR 51 2e-05 



Comments : 

EST confirmation of the predicted transcript: 

An isolated EST has equence identity to nucleotides 1 to 44 4 of CT510: this 
corresponds to nucleotides 37411 to 37854 of the genomic reference sequence. 
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CT511 

Nucleotide 

Genomic coordinates: 

Start: 40718 

Stop: 38972 (SEQ ID NO: 126) 
Amino Acid 

MGGPTVITTTINTGGDHHHQQYVYHQGNKKRPVEEYNNNNYASGSTSEATTVPAYNNNNN 
NITIKTWDDVINLSITPPPPKRFKKSEVAPSPPTTRTFSNVCASKVIRQCKRQYNEWIER 
DSPYYFKGIEKSCSLEDNYDTCQQLRIGHRSIVKSSKYVHDTCFYGKDPKVGFYWPTSSC 
DEEMRFFDTRHILKELSSRNIPSSQIMDIMYMAVEVFQLPSSACERIROKTSTLIKEVSD 
OCENWENFRKTARGCLSDI.VEVPEDVKDFNTFICPWETFFEIKYGVYYIVNRGTVVKFMK 
DMNYEEFVFECVNGLSVYRKNIKGVVGVTGVCPQGLCLEMPFAGISIDDVIRCVKDSLDG 
GEYYESRDARLLYGVVMLQRMGRLPEVKGVDTVAPITDSFIARKVVRSMFEKLKVNMPFV 
LAETCNVITRVANEGIINVDIKADNFVIDSISGQPKMIDLGLSYPLGYCYNDEYFRNTEE 
LIRQYIHTPPEFFRGHCLGAYSMTYSFSVMASSILEDVVACSNMEGPAFNLMSNMHFLML 
LQSGTDTDFYQNRPSITEYALAMKHIFPFKGTVMNLFKVKK 
(SEQ ID NO: 127) 

Top Blast Hits 





Score 


E 


Sequences producing significant alignments : 


(bits) 


Value 


Q15208 (Q15208) NOR PROTEIN KINASE 


41 


0.017 


Q56921 (Q56921) PROTEIN KINASE A 


41 


0.017 


085239 (085239) PROTEIN KINASE YOPO 


41 


0.017 


068717 (068717) PROTEIN KINASE A HOMOLOG 


41 


0.022 


CAB54949 (CAB54949) PUTATIVE TARGETED EFFECTOR PROTEIN KINA 


41 


0.022 


YPKAYERPS (Q05608) PROTEIN KINASE YPKA PRECURSOR (EC 2.7.1 


41 


0.022 



Comments : 



TaqMan Primer/Probe Sets: 
5 •start=878 
5 • stop=902 
3 •start=981 
3 •stop=1001 

5' primer =GGACTGTTGTCAAGTTTATGAAGGA (residues 878 to 902 of SEQ ID NO: 126) 
Tm5=57.4 7 

3 • primer ^TGAGGACACACACCAGTCACC (residues 981 to 1001 of SEQ ID NO: 126) 
Tm3=-57.98 

probel=TGAGTGTGTTAATGGCCT (residues 927 to 94 4 of SEQ ID NO: 126) 

probelstart-=927 

probelStop=^944 

direct ion l=Reverse 

Tml-68.98 

scorel=l . 98 

length=124 
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CT512 

Nucleotide 

Genomic coordxnates : 

Start: 61872 

Stop: 60672 (SEQ ID NO: 128) 



Amino Acid 

MLSTCDLKHPSSTDGNVLKNIHFSESIPANDIISFPSSDTEELNKDLLDSVRNQIKFGFD 

PITETLKNCITTQTLLHSFLKSSLLTLQEKFNEWGSIQLEKGGQEMALCASLKIMGQISA 

LIETAKEASMDNKKKNNNACANCRDSKCSASLVTLFNKTIDEKYVKQNSSSASALLANTF 

TAGANKPPKEFITKDNAHGNSDTNYTAMSDNLICPGKYYSSDITYEVTKQAKERIKNNNK 

KMRLATGVEMVMKELEAENNKEGGRVEVEVEGVEQQQPSTSGEEMQMEIMLPTPPPPDLE 

SLVTEGVDDYPVFSPLPSLLSPMPASPLPSNGNSALEDGGPFAPSADIVVDKTSEIMGRT 

PGSEWVHQRDRNSKMEIRNYGARGSGINTGRYRRNNTVL 

(SEQ ID NO: 129) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

GARP_PLAFF (P13816) GLUTAMIC ACID-RICH PROTEIN PRECURSOR 
YMEIYEAST (Q03433) HYPOTHETICAL 32.0 KD PROTEIN IN CAT2-AM 
Q9WTU0 (Q9WTU0) PHD-FINGER PROTEIN 

015029 (015029) KIAA0312 (UPSTREAM REGULATORY ELEMENT BINDI 
MML3_MYCLE (O06081) PUTATIVE MEMBRANE PROTEIN MMPL3 
Q06166 (Q06166) MATURE PARASITE- INFECTED ERYTHROCYTE SURFAC 



38 


0. 


. 12 


36 


0, 


.36 


36 


0. 


.36 


36 


0. 


.36 


36 


0, 


.47 


36 


0. 


.61 



Comments : 



TaqMan Primer/Probe Sets: 

5*start=549 

5 • stop=570 

3 • start=626 

3 • stop=64 9 

5 'primer-TGCAAATAAACCACCCAAAGAG (residues 54 9 to 57 0 of SEQ ID NO: 12 8) 
Tm5-57.79 

3 'primer=TGCCTGGACAAATAAGGTTATCAC (residues 626 to 64 9 of SEQ ID NO: 12 8) 
Tm3=58. 13 

probe l^GCACATGGCAATTCTGAT (residues 58 9 to 606 of SEQ ID NO: 128) 

probelstart=589 

probe lStop=606 

di r ec t ion l=Re verse 

Tml=68. 99 

scorel = l - 99 

length=101 
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CT513 

Nucleotide 

Genomic coordinates: 
Start: 77506 

Stop: 76273 (SEQ ID NO: 130) 



Amino Acid 

MEEHLSFNKPSPENGVVFFDFSDNTSMSNMVDNIRHRLPMDKKFSSKALLLASTPIPSDE 
QLSTKVNKAIFSHRETIVLSKALKIVVTGLYVDGEYVDDVICLYPEKHTLNGILRYVVHL 
NMMLMDKAEDADEIRCGLIPLGRGFNREAFKFVDPVIPCAGYNILNGYHPDNGHQISPSS 
TQPQVQRRCAVKQMYKQINGMFEVVKQFSIKHNNRIFTINQVDFKGEEMKMFFALYSEEL 
LPFYSETGKLLSEKHVSKSFSQLPPHVTISVFYLRNMEEYNTLMKTDFGSCFAPAIKIDT 
GDNFELFGMNNNILVSKVCVGDDALDLRRRIMEHISDAIGRNVELADNRLNPHITHGKIN 
EGVVGEWVSRFAPCNFLCKPREEIVFGGTKFIFGRVSNGNYVIKOPVDYV 
(SEQ ID NO: 131} 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

032743 (032743) RECOMBINASE 37 0.23 

SYA_RHILV (P24075) ALANYL-TRNA SYNTHETASE (EC 6.1.1.7) (ALA 34 1.5 

O03942 (003942) LACTOBACILLUS BACTERIOPHAGE PHIGIE COMPLETE 34 2.6 

Q9X257 (09X257) CONSERVED HYPOTHETICAL PROTEIN 32 5.8 

022993 (022993) CELL DIVISION PROTEIN ISOLOG 32 9.9 

YKG5 MYCTU (Q10851) HYPOTHETICAL 30.9 KD PROTEIN RV2003C 32 9.9 



Comments : 

EST confirmation of the predicted transcript; 

An isolated EST has equence identity to nucleotides 1 to 982 of CT513: this 
corresponds to nucleotides 76314 to 77295 of the genomic reference sequence, 

TaqMan Primer/ Probe Sets: 

5'start=502 

5 ' stop=522 

3 • start=559 

3'stop-578 

5*primer=TACCACCCAGATAATGGCCAC (residues 502 to 522 of SEQ ID NO: 130) 
Tm5=58.25 

3 'primer ^TGCTTGACTGCGCATCTTCT (residues 559 to 578 of SEQ ID NO: 130) 
Tm3=58.35 

probe I'-ATCTACTCAACCACAGGT (residues 537 to 554 of SEQ ID NO: 130) 

probelstart=537 

probelStop=554 

directionl=Re verse 

Tml-69.03 

scorel^l . 96 

length=77 
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CT514 

Nucleotide 

Genomic coordinates : 

Start: 102885 

Stop: 100046(SEQ ID NO: 132) 



Amino Acid 

MDSTSTTTIEAEKALLKEYVNENLTWEFVDRVIRHEKLMQRTDMRLKTSSRRLFSFISIY 

SFLQDFFTARDGVNSDEWCTQSALYHMLDGVASIISCFRKRIDYYNKKMERLACTSIREG 

YFLVDVKTIESRHVELLDPDKKIWQRLYAEKIAPEKVVDAYNEVSKLLPDEAMANYNYRT 

GLVHLSDTLKNAKKPPTDLTMTDFDFYEKYIRSDIVLGKSNKLSGMFSENFEILPDINIK 

VPRRLERYFNVETNYSLEHNFRFPSNHXRGLIFAYFIGNIFGGAFSCVQLYLLGFTLSTVA 

SACRENVLDTPFSKLKQYIKNDNKTKNSSSNEDNDGEEYYPCELQYARINSNDKNACRKS 

IVKAVKFVADRVEKASVTMMRTPIAEHESDGYMADWLSLQISKLLGRKVSASYALLFIVN 

WVAHKYKQSFTNDVNGSEKYEILLKKLTVACGLTYNHKCGMVVPVIGFGSGMTNRKLRQY 

AVHCIENVIGSFISSGKRKKDIHEDPKKLEEMSLMQLSARLFKNNDVMKRGQDGKVTFAN 

EDNVQDFLEELKTKEFVPNERRRKIHEEEYTKSLHTNLKMTFRFGVCGFQHPLPASSDKP 

TQVSLQLLKQRQTFVQRETAAAVNWTRLLQFLFPSDERDNKRHQNSLPWNRLGSNLNRHF 

ISLASKFIKRSVHCERVVNDIISKFNADILPLGKDPDHFUyiTKAGLVIEDHARENIDNAM 

YSLCGGFNNQTTEQKLNSIRLRISAEALKNARNCVLATTFSKSYNEDRPFLPRTDEAKFV 

PIPLFGVEPLHPliLNSFIDNTANKCNDSVSDFWLEESDDIFKEALVSHTILTDSSVYSTL 

VGEDEDYCDNNKSGKRIGNTLVCTLYDMMGRANYNGLHSDKPRKHDPTPWSSKNTGQSGR 

STTDFSPNSVIVLLDTENVADDYEDEEEDYEALKQSERDNVITLNNX 

(SEQ ID NO: 133) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

Q12537 (Q12537) GLUCOAMYLASE PRECURSOR (EC 3.2.1.3) (GLUCAN 39 0*11 

AMYG_ASPAK (P23176) GLUCOAMYLASE I PRECURSOR (EC 3,2.1,3) ( 38 0,32 

AMYG_ASPSH {P22832) GLUCOAMYLASE PRECURSOR (EC 3.2.1.3) (GL 38 0.42 

MSH6_YEAST (Q03834) MUTS PROTEIN HOMOLOG 6 36 1.2 

YMM1_CAEEL (P34489) HYPOTHETICAL 81.8 KD PROTEIN K01B6.1 IN 36 1.6 

AMYG_ASPNG (P04064) GLUCOAMYLASE Gl AND G2 PRECURSOR (EC 3. 35 2 . 1 



Comments : 

EST confirmation of the predicted transcript : 

An isolated EST has equence identity to nucleotides 3 to 841 of CT514: this 
corresponds to nucleotides 99939 to 100777 of the genomic reference sequence . 

TaqMan Proe Sets: 

5'start=1126 

5*stop=1147 

3'start=1173 

3'stop-1194 

5'primer=TCCGTCACAATGATGAGGACAC (residues 1126 to 1147 of SEQ ID NO:132) 
Tm5=5 9.41 

3'primer-GGAGAGCCAGTCTGCCATGTAG (residues 1173 to 1194 of SEQ ID NO:132) 
Tm3=5 9.8 6 

probel=CGAACACGAATCAGATGG (residues 1155 to 1172 of SEQ ID NO: 132) 

probe lstart=l 155 

probelStop=1172 

directionl=Forward 

Tml=69.00 

scorel==l . 99 

length=69 
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CT515 

Nucleotide 

Genomic coordinates: 

Start: 114953 

Stop: 110132 (SEQ ID NO: 134) 
Amino Acid 

MSRNSLRVKGLKENGGIIPNPFDPLYVDTDAPFGMAGVKSDIIGKGFVESLLPGEISSHY 

NTFDCFKTPKKCRVGGNDFECISCRSLGGGTCVKSSRELKTEYGIEDDDEYDGVCVPLAD 

TIFSASSAFDKHDDDVATDAAYRNVNPFTTVEEAYLHYESGGVITGGGKKGTTYITKKRG 

CVDSSVVRKDPSLLNKDPRLEPILGCTDIVLCGGKGVGRPIHPTTFSIIDDVDDIDFDIS 

SMTSTMDCLCEPGYSQQRDPATNAPKCEKKEGGIQEKEQGLGCPVMFRYGVVGDTGTKGC 

LCDESTQIRLEEVAGIDLPDAAKTDYAQPFVEGAKLLLQITERYETLGGSTKDACLPRPG 

NDTRMSALGYSYAASLFGRAPEITAFNGGHLITGGLLRESAMDAAGNWHSRIEDSDEQGK 

LTVSESVGGVVPYSGTGSVAAHIWNGDALNDNGLVGAGGGNFTEHPNASLRVVPLPHSNI 

PGLGIDSIDHAVGIIASQGKIFPETVHMRAGDPSGVKTDRRDAHNDTTIETSFLKDSDKA 

GYDSYKDNPLQKLRKSHDSGICATAYVVPSLHRVIKEKPSAKNDKTVNKILPLVHYRPTA 

KRMAHTPIETIFKHSLLTAQERDQSFANSTLNSMMVTNSSNSFDDVTNLLLDYFFPNLNG 

EGKERSGLPINTRSIYNEPNNAKFKEIGGIILQPVTAQGAKKSSTFARFSEKILSTNSPK 

IIDHYKAGSSAVFKKVGEKEAYEMFAHPPTAWRIASNEGTFFSGRGLNNGIEGTGMREAE 

RVAKTLSKKPDIFAGAILTGDGVLMNGASSPLVRPMEIPASSLPEHTWFERRSPVNARGD 

PGSADNLTAINNTYDRVTKGDIRAILNSTTDIKTSFNSYAPARPFSKPLAPPAGVSAAAQ 

ATSFLGVLGGFPLPIPSSFLIQKSVQESVSNGTVGSMHGIVPLKFHEGDELWQQCEVKET 

EGALNFIPPPMALFESLLRVRTLSSETFIRPELIPNRFRADWGLSPHTAGHYLNGVYSPP 

CVREETGQSFGYPCSGALSQYTTMMVPKPLGPQSHSSLSKFSIKSYVEEQTRLLPANIGE 

KSIFEMQDPNSKNIFDKIGELGEKENCNCTKGLFCPKVNGGGRNKTDPIAATPSRGNRHS 

RFPLMTTLPKNDVHLSAALLRAQSGDARILNTIGETKTNGRKIWLKAATENIWDISSNVM 

LAPNKFCTVMRRSTAYTPYSTRQEKVPAAVLDERKGTFDRNAELLGDVGMTDIVSNDILME 

DYERLPGVPPAEAEIFHIIRDAAKTGQEGAECARRIVDFFESSHGVTASTFNVGTFSPYVE 

GVKDIVSLYATPCFTDIDSPTISADSATINEGASIEPTDGSEVVVEVVNSNMEMLGGSTA 

GSTKKRRLSISDYVDLEEDAEVFTINKQGKATENLRVRTSSSSKYVEGGQKDMVGFYEAS 

KRVPRVMRRVHVLPVLTPYHGGFESCAPTAAQSACTRGVEITYADFMRPSDLSGTKTTLE 

GVRVKGPEPFDDLSTLYFRSVGGPNLRKFAHHHHFGYEGLMSRYYYTREKTVSVSEGDLK 

DRFPFVCQSDRGPFPPKRDGTIQPLALVDMGVLPEGALTRRTISME 

(SEQ ID NO: 135) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

Q24 341 (Q24 341) LARGE FORKED PROTEIN 37 0-96 

Q24340 (Q24340) SMALL FORKED PROTEIN 37 0.96 

013788 (013788) HYPOTHETICAL 59.6 KD PROTEIN C17G6.10 IN CH 37 0.96 

CHOD_STRSQ (P12676) CHOLESTEROL OXIDASE PRECURSOR (EC 1.1.3 37 1 . 3 

HAGA_PORGI (Q51845) HEMAGGLUTININ A PRECURSOR 36 1-6 

P94986 (P94986) HYPOTHETICAL 88.5 KD PROTEIN 35 3.7 



Comments : 
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CT516 

Nucleotide 

Genomic coordinates: 

Start: 118885 

Stop: 115402 (SEQ ID NO: 136) 



Amino Acid 

MKIVQNNFTPDERSGVIHIRKPAKIEKAVFGNIAAAIDDSAAVRKDPKKKRNLKNGLEPA 
SKKLAKNIERISSEEiKRVTDVQDPKLLHSIMKRTARQIGYDIGDDISPQSAPDRDGSSS 
SSLLPIRMINIRTEELLEKGGKDTIVRIHILDGILPDNVPLPFKAEIKVDLVDEKYEGED 
GGGSSDSGPSLFETFPEFVPAGWPPITNDPNAFSRNNGNKQQAVFKHVEVNSLADGITLS 
TKGSIFNTGNRLKISIVTEDKNKTVLFDSQVTISSPIPKITEVFACRNVALMRLDMPKAI 
NYDNVEYTPDTLNEKYVSDYPANFPRLSRQAEIASNLAAKLPRENQLSDINKPSVSFVYS 
KTNTVNTPVLNKVLYNETLKNMEGNESEGYKILNATEITHLRNPSNPARTFICVSVPESE 
lEAQWKMLGWIVGFKTSSDVLTTSSGYNIVFPASKVTQSDKLFSVISTDVNANTNKVVVH 
NTPSRVGCFGSSVNFRVDAATAPDWPGPTNGPDFFSYQLRPCIILKTDNDNREPRITAVL 
SSPATEYAGERTTSLLPRALNVSVGPLTEVRGGDIITPVQTALLGGEOPTFKAPAEPTKL 
YAVFPVLDSHNGLVKASDNPFQPIHSITSRNKTTVLTVSDVIVNDDDDDVVLEDKSYHIT 
VSDPVSGSILAKENVLSSRITSRPIFIDGARDDRVFSVKMEVFGGDDKGIQMPFTMDGHF 
EGQFSDMSVPSNELAIWNDPSTFTAPVRDTPATDITNKGIVYCRTTLPPISNRGIRDPFM 
KQTSLVPLPTSIPEWAFADYGGEIKYPRHIFISSIRTNDTTNIVNTDTQTEFSIENWLRE 
QIDKEQERHRQLLPAPSEAYTQGEKVYAKMYMGDGVSEETLDQIVHTSNTTYVVDESGTK 
KENLLVNKEDKKLAAILGKWGIVVFGANKYPDEPADRYTNWRNTGRLRAVGSYSQLRQPV 
APLQTRLATWPSGDPVTRLADGQFLVRLDPRCGGIGSANGFYNNNGANNEFTSSLLFAIV 
GNQDKVVSYAERVRFYMKIVARNEGKKHLKNDDGLVLVDRNSALHRRLWNRTTFDHDDIV 
LCVKIPQNVMSKIEPGTSSGVLVDPLVFANVASSTDREEFYKKFIDTSSGPVVIDRASVT 
SSYNISVPLNFYTTCGFIVG 
(SEQ ID NO: 137) 

Top Blast Hits 

Score E 

Sequences producing Significant alignments: (bits) Value 

DNA2_YEAST <P38859) DNA REPLICATION HELICASE DNA2 37 0.68 

094534 (094534) PUTATIVE SPINDLE POLE BODY-ASSOCIATING PROT 37 0.89 

BAA84527 {BAA84527) ALP14 37 0.89 

YGS4_YEAST (P4 6947) HYPOTHETICAL 30.5 KD PROTEIN IN SAE2-KE 36 1.5 

028907 (G28907) GTP-BINDING PROTEIN 36 2.0 

Q49547 (Q49547) LMP3 PROTEIN 35 2.6 



Comments : 

EST confirmation of the predicted transcript: 

An isolated EST has equence identity to nucleotides 1 to 888 of CT51G: this 
corresponds to nucleotides 115494 to 116381 of the genomic reference sequence. 

TaqMan Primer/Probe Sets: 

5»start=1869 

5'stop=1892 

3 • start=1984 

3'stop=2001 

5'primer=CATTCACAGTATCACGTCAAGGAA (residues 1869 to 1892 of SEQ ID NO: 136) 
Tm5=57 . 80 

3 •primer=ACCAGACACGGGATCGGA (residues 1984 to 2001 of SEQ ID NO:136) 
Tm3=58.39 

probe 1-AGCTACCACATCACAGTC (residues 1966 to 1983 of SEQ ID NO: 136) 

probe Is tart =1966 

probe lStop=l 983 

directionl== Forward 

Tml=69 . 00 

scorel=l , 99 

length=133 
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CT517 

Nucleotide 

Genomic coordinates : 

Start: 180036 

Stop: 179421 (SEQ ID NO: 138) 



Amino Acid 

MEFGNLTNLDVAIIAILSIAIIALIVIMVIMIVFNTRVGRSVVANYDQMMRVPIQRRAKV 
MSIRGERSYNTPLGKVAMKNGLSDKDMKDVSADLVISTVTAPRTDPAGTGAENSNMTLKI 
LNNTGVDLLINDITVRPTVIAGNIKGNTMSNTYFSSKDIKSSSSKITLIDVCSKFEDGAA 
FEATMN IG FT SKNV I DI KDE I KKK 
(SEQ ID NO: 139) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

SCH9_YEAST (P11792) CAMP-DEPENDENT PROTEIN KINASE SCH9 (EC 35 0.45 

039292 (039292) COUNTERPART OF HSV-1 GENE ULIO AND VZV GENE 33 1.7 

HS78_YEAST (P32589) HEAT SHOCK PROTEIN HOMOLOG SSEl 32 2.3 

G435756 (G435756) LYSOSOMAL MEMBRANE GLYCOPROTEIN LAMP-2 HO 32 3 . 0 

LMP2_HUMAN (P13473) LYSOSOME-ASSOCIATED MEMBRANE GLYCOPROTE 31 5.1 

AX01_CHICK (P28685) AXONIN-1 PRECURSOR 31 6.7 



Comments : 

EST confirmation of the predicted transcript: 

An isolated EST has equence identity to nucleotides 711 to 1 of CT517 : this 
corresponds to nucleotides 179367 to 180077 of the genomic reference sequence. 

Hit to public sequence: 

gi |6856ieoigbi AF173992 .1 : CT nucleotides 1 to 615 match nucleotides 50 to 664 of the 
public sequence with a 100% homology, a score of 1219 and an Evalue of o.o 

TaqMan Primer/Probe Sets: 

5'start=227 

5 • stop=247 

3'start=294 

3'stop=311 

5*primer=TGGCCATGAAGAATGGTCTCT (residues 227 to 247 of SEQ ID NO: 138) 
Tm5=58. 19 

3 •primer=GTCCTTGGGGCTGTGACG (residues 2 94 to 311 of SEQ ID NO: 138) 
Tm3=58.60 

probel=TGCTGATCTTGTCATCTC (residues 273 to 290 of SEQ ID NO:138) 

probe Is tart =2 7 3 

probelStop=290 

direction l=Forward 

Tml=69. 00 

scorel==l . 99 

length=85 
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CT518 

Nucleotide 

Genomic coordinates: 

Start: 190743 

Stop: 188172 (SEQ ID NO: 140) 



Amino Acid 

MTESKDYVLALVAETKTDEKRLNYVSEGLVAAISNLQNTPEKQRKVVISSDVFGPTWFNK 

TTEFFNSGLRLAKGHLSKDAVMRSVYRDIEGVREHIIDPSWRLTETAAEELCDFTFLKQA 

PLLNLLNAFENIMDGVFRSAANLVLYSTRGDTNEPSWVIDSEMLANRNNSTVADLAMGRA 

KRAIALFLGYTLCDILRWKQSIASRMKERGLDPFAAMPPHLEYGRAADMIEKRIKDFIEG 

SFSDGVTVSEEDGQSYVVPTISTVLTNMVSVIQEGFYPPKVGSFHEALLGREIMVLLSAA 

IDAEYRAVLSRTRNAKPNPLTTKLDKYVNNPHLQMPSESVTEREKEWVERERERIKTTDM 

TAENLFRDHPYLPKAIDGILGPKRTPTALQALQREYKRCNKFKDIVSPETLEYFLVNNRQ 

VMFSNYSVTRVLDPDSAARFSMYVLWNALFLCSGGLTQKTNSSAVKSRLILOVFLKDMHS 

LFVCQRCESGFITKSLDTFTISLKEQSKPSMGEQELETYWKAVLDALGGGGGWNKGAENV 

NGLGELMVEILSADSGLLRGGGLGGDIGFEGKMKQKREDEEVRNMHLVDKKGYVFEAAKY 

VHVSKGFAALSFYLLYAAAATSNPSITNNFDRAVYLLLARWGDLKFPTHNLWGNVPTDEN 

TSSLLSFASFWALRNAVRARRNVIDASNTSFVPGRPLPLLSAFSSKMLVDNMLKNNYVKV 

ENVNREKLIWKAFREMQTESEIWKTSGSKAASDRNVKKAKQDLIRNASIGRLIVEPVGKT 

PISSIALFRSMKRSRSEDLKMGSNNKYRLARDTKTATPRNPLSYTGKIVFSLDDLKNFSK 

DSYTTMKVFPLTPLDG 

(SEQ ID NO: 141) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

CAA22155 {CAA22155) HYPOTHETICAL 46.4 KD PROTEIN 37 0.49 

095517 (095517) DJ1170K4.1 (NOVEL PROTEIN SIMILAR TO KIAAOl 35 1.9 

Q43688 (Q43688) GLYCIN-RICH PROTEIN (FRAGMENT) 34 4.3 

YHO0_YEAST (P38800) HYPOTHETICAL 149.7 KD PROTEIN IN IREl-K 34 5.6 

P97572 (P97572} CALPAIN SMALL SUBUNIT (EC 3.4.22.17) (FRAGM 34 5.6 

048591 (048591) GTL2 GENE 33 7,3 

Conuments : 

EST confirmation of the predicted transcript: 

Nucleotides 515 to 1 of CT518: this corresponds to nucleotides 188146 to 188660 
of the genomic reference sequence. 

TaqMan Primer/Probe Sets: 

5*start=1308 

5'stop=1327 

3 • start=1383 

3*stop=1404 

5 ^ pr j.mer=AGCTGCCCGATTTTCAATGT (residues 1308 to 1327 of SEQ ID NO:140) 
Tm5=58.2 6 

3'primer=CCTGCTTTTCACTGCAGAGCTA (residues 1383 to 1404 of SEQ ID NO: 140) 
Tm3^58.54 

probel=TGTGCTATGGAATGCATT (residues 1329 to 1346 of SEQ ID NO: 140) 

probelstart=132 9 

probelStop=134 6 

direct ion 1= Forward 

Tml=68.99 

scorel=l . 88 

length=97 
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CT1003 

Nucleotide 

Genomic coordinates: 

Start: 23709 

Stop: 24300 (SEQ ID NO: 142} 



Amino Acid 

MDVSSYKSTIDYHNIEDMDDLQRATYKDRMETELVLEMAKKEGRYVRSLATMDELEVPEE 
PATCYTCGYTFIRRRAPPPKRKSIFREPCAYPELLPDAPSPVRLEELVDVPEGASFFTYP 
PYDDGSSTSSSQAECEDDYPPPYDPSENPQRSQVCDYCTTRQVLSSMTDHARANLIKNLK 
REKKALGLGRRNNFSY 
[SEQ ID NO: 143) 



Top Blast Hits 

Sequences producing significant alignments: 



Score 
(bits) 



E 

Value 



VP40_EBV (P032341 CAPSID PROTEIN P40 (VIRION STRUCTURAL PRO 
Q9Y4G2 (Q9Y4G2) KIAA0356 PROTEIN 

088508 (088508) DNA CYTOSINE-5 METHYLTRANSFERASE 3A 
Q9Y6K1 (Q9Y6K1) DNA CYTOSINE METHYLTRANSFERASE 3 ALPHA 
Q23804 (Q23804) SPID PRECURSOR (FRAGMENT) 
088799 (088799) ZONADHESIN 



35 
34 
33 
33 
33 
32 



0.46 
1 .0 
1,8 
1 .8 
1 . 8 
3 . 9 



Comments : 

EST confirmation of the predicted transcript: 

An isolated EST has equence identity to nucleotides 1 to 736 of CT1003: this 
corresponds to nucleotides 23688 to 24423 of the genomic reference sequence. 



FIGURE 3, Sheet 77 of 160 



wo 01/38351 



119/201 



PCT/USOO/28888 



CT1004 

Nucleotide 

Genomic coordinates : 

Start: 26630 

Stop: 27257 (SEQ ID NO: 144) 



Amino Acid 

MAPNSFQKFAPVIKTEKKEEERDEHDDPLRQIDFRDRKTLICLTANCVSRKRKAGSAHDR 

VYKViRYGNPYKYRRPNRTHRGLALSMDQGEVGTCLPLRPMEETEENPIDKCGVAFLYSN 

YNEGDGMTHLYNDEEYIKKCKTIEGGTRTWVKKNRQEYFRQALETLMMSHSIKQYSNFIF 

FKEDMEEGFVHKLHTFINMVHPKKVSVL 

(SEQ ID NO: 145) 

Top Blast Hits 

Sequences producing significant alignments: 
Q9XX10 (Q9XX10) Y39A1A.22 PROTEIN 

P70392 (P70392) RAS PROTEIN-SPECIFIC GUANINE NUCLEOTIDE-REL 
075381 (075381) PEROXISOMAL MEMBRANE ANCHOR PROTEIN HSPEX14 
Q40112 (Q40112) HYPOTHETICAL 28.4 KD PROTEIN 

ECEl_CAVPO (P97739) ENDOTHELIN-CONVERTING ENZYME 1 (EC 3.4. 
O706S1 (070651) GAG POLYPROTEIN 



Score 
(bits) 

34 
32 
32 
32 
31 
31 



E 

Value 

1 
5 
4 
4 
7 
7 



Comments : 

EST confirmation of the predicted transcript: 

An isolated EST has equence identity to nucleotides 1 to 654 of CT1004: this 
corresponds to nucleotides 26601 to 27254 of the genomic reference sequence. 

TaqMan Primer/Probe Sets: 

5'start=248 

5'stop=266 

3"start=291 

3'stop=309 

5*primer=TGGCCCTCTCAATGGATCA (residues 248 to 266 of SEQ ID NO:144) 
Tm5=58.53 

3' primer =CTCTTCCATGGGTCGCAGA (residues 291 to 309 of SEQ ID NO: 144) 
Tm3-58.32 

probel=AAGTAGGAACATGCCTCC (residues 272 to 289 of SEQ ID NO: 144) 

probelstart=272 

probelStop=^289 

direct ion l=Forward 

Tml=68.97 

scorel=1.97 

length=62 
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CT1005 

Nucleotide 

Genomic coordinates: 

Start: 31091 

Stop: 31961 (SEQ ID NO: 146) 
Amino Acid 

MEGEHQYLNLVREILERGVKKDDRTGTGTLSIFGPQMRFSLRDDTIPVLTTKKIFWRGVV 
EELLWFIRGNTDAKELAKKKIHIWNANGSREFLDSRGLYDRAEGDLGPVYGFQWRHFGAE 
YDTCSSDYTGKGIDQLANILKTLRENPDDRRMIMTAWNPMDLHLMALPPCHMTAQFYVAN 
GELSCQLYQRSGDVGLGVPFNIASYSLLTHLMASMVGLKPGEFILTLGDAHIYNTHIEVL 
KKQLCRVPRPFPKLRILMAPEKIEDFTIDMFYLEGYQPHSGNLQMKMAV 
(SEQ ID NO: 147) 

Top Blast Hits 











Score 


E 


Sequences producing significant alignments: 








(bits) 


Value 


TYSY_HUMAN tP04818) THYMIDYLATE SYNTHASE (EC 


2 


.1 . 


.1.45) (TS) 


410 


e-114 


TYSY_MOUSE (P07607) THYMIDYLATE SYNTHASE (EC 


2 


-1 . 


.1.45) (TS) 


407 


e-113 


TYSY_RAT (P45352) THYMIDYLATE SYNTHASE (EC 2. 


, 1 


.1. 


.45) (TS) 


406 


e-112 


Q8994 0 (Q8 994 0) THYMIDYLATE SYNTHASE 








390 


G-108 


P90463 (P90463) ORF 70 








387 


e-107 


DRTS TRYBB {Q27783) BIFUNCTIONAL DIHYDROFOLATE 


REDUCTASE-TH 


381 


e-105 



Comments : 



TaqMan Primer /Probe Sets: 

5 •start=424 

5 ' stop^446 

3 ■ start=479 

3'stop=:501 

5*primer=ACCCTGAGAGAAAATCCAGATGA (residues 424 to 446 of SEQ ID NO: 146) 
Tm5=57.82 

3 •primer=AAGAGCCATAAGGTGAAGATCCA (residues 479 to 501 of SEQ ID NO: 146) 
Tm3=57 . 91 

probe 1=ATGACGGCATGGAATCCT (residues 460 to 477 of SEQ ID NO: 146) 

probelstart==4 60 

probelStop=^4 77 

direct ion l=Reverse 

Tml-68 . 98 

scorel = l - 98 

length=78 
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CT1006 

Nucleotide 

Genomic coordinates: 

Start: 32124 

Stop: 32802 (SEQ ID NO: 148) 
Amino Acid 

MAFNFEDSTNLFANMDLTAGTTTDPTRPNIIFFESLLPNSGIEVMKRRLVRQGKCGNFEA 
SGGAMSYFWLEDNAEDMENLNSGSHVKTNCLALFLQEFISNWIEETDRHGQYCTFPQYMD 
GGDGSRGGYFTSLAMKWMARDVTFFVFVDRNNTVENAASIWMYQKLLAIGAKVVKVIVDN 
ASNPMFSVCNACRCKYPGPVSYVIEGHGVGHSDLTCDEISGFFV 
(SEQ ID NO: 149) 

Comments : 

EST confirmation of the predicted transcript: 

An isolated EST has equence identity to nucleotides 1 to 624 of CT1006: this 
corresponds to nucleotides 32182 to 32805 of the genomic reference sequence. 

TaqMan Primer/Probe Sets: 

5'start=268 

5'stop=28 9 

3 ' start=340 

3*stop=361 

5*primer=TGCTTGGCATTATTCCTTCAAG (residues 268 to 289 of SEQ ID NO: 148) 
Tm5=58,01 

3'primer=CGTCCATGTATTGGGG7iAAAGT (residues 340 to 361 of SEQ ID NO: 148) 
Tm3=58.72 

probel=CGACATGGACAGTACTGT {residues 322 to 339 of SEQ ID NO: 148) 

probe Is tar t==322 

probelStop--339 

directionl=Forward 

Tml=69.04 

scorel=l . 95 

length=94 
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CT1007 

Nucleotide 

Genomic coordinates : 

Start: 32947 

Stop: 34216 (SEQ ID NO: 150) 



Amino Acid 

MDSNTSILPPSKRPGLNLLQVLGIIITVALIASVSSFIFYRVGKRKYYPSSSSSSELSDV 
DNGVEGGGGTTTTPTQPSPDGGDGYVDLSPQKKAELRTRVANVIFQEVSKDQGVAFRRAM 
NDSTDKIMEETEARINNFSEPFREATVEREVFKDDTDKNFILSTLDLTEEQFKDIVMAEV 
KNQLENFDYEDMTRLIFDNIPETDYLWTTHFDPKKYDTYSEKVLGFSDINSIERISSTFY 
KGKKYEVTTGNVAVLVDFESETIKEKAGNSLIRNVEFIVVDEQTYKSFFPAFNQVFFSFK 
VNKEKREVTVSINNGCVGIVANITPLTTPVGAASGHYIYGTSTAKEKTYLFVIDKYDTTE 
FVCGLSNKSTPLMALNILFMSDTVFPSFDEAERPLTDAKAVEILGKRLGVGRYTNANIRN 
TQ 

(SEQ ID NO: 151) 
Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

Q56711 (Q56711) HOOK-ASSOCIATED PROTEIN TYPE 3 39 0.045 

077363 (077363) MAL3P4.5B PROTEIN 39 0.059 

YLM5_CAEEL (P34379) HYPOTHETICAL 49.8 KD PROTEIN D2007.5 IN 37 0.23 

GBA2_CAEEL (P22454) GUANINE NUCLEOTIDE-BINDING PROTEIN ALPH 36 0.51 

Q9YW03 (Q9YW03) ORF MSV089 PUTATIVE NTPASE, RABBIT FIBROMA 36 0.51 

P87199 (P87199) KINESIN MOTOR PROTEIN 36 0.51 



Comments : 

EST confirmation of the predicted transcript: 

An isolated EST has equence identity to nucleotides 1 to 907 of CT1007: this 
corresponds to nucleotides 33238 to 34144 of the genomic reference sequence. 

TaqMan Primer/Probe Sets: 

5'start=648 

5'stop=-674 

3'start=773 

3'stop==7 99 

5*primer=TGACACGTACTCTGAAAAGGTATTAGG (residues 648 to 674 of SEQ ID NO: 150) 
Tm5=58.05 

3' primer =^CCTTCTCTTTTATTGTTTCAGATTCAA (residues 7 73 to 799 of SEQ ID NO: 150) 
Tm3=57.41 

probe 1-ATGTAGCTGTCCTCGTTG (residues 752 to 769 of SEQ ID NO: 150) 

probe lstart=752 

probelStop=7 69 

directionl- Forward 

Tml=68.89 

scorel=l .89 

length=152 
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CT1008 
Nucleotide 

Genomic coordinates: 
Start: 34217 

Stop: 35048 (SEQ ID NO: 152) 



Amino Acid 

MEGVILDKIETIAKRASPSYGSIDVGTAILRRQFMEKIRGKINEETTMEKIMGTKEERED 
TIRSIVANVIKENTVKENVTEKIRAMTDKELNDNRE FMHDFGKISTGDGGTFHLFEDTPG 
FESALKAEYKNVPGATTPKYVSMNSLRIDAINGKIEEVYNPSPIMGIREYGTIRRGRYEE 
NAGSKELVFMTKIEKRPNNVAENLIIRVANQQYNVMRMVFFIDYETKKGVSKEEMFIPYN 
VQKTKALKGRSTYFSFVRKIPDEPEGSIIIHALGFY 
(SEQ ID NO: 153) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

BAG_STRAG {P27951) IGA FC RECEPTOR PRECURSOR (BETA ANTIGEN) 41 0.007 

Q99051 (Q99051) IMMUNOGLOBULIN ALPHA FC RECEPTOR PRECURSOR 41 0.007 

YIBA_ECOLI (P24172) HYPOTHETICAL 31.8 KD PROTEIN IN RHSA-MT 38 0.079 

Q46749 (Q46749) ORF-Al 38 0.079 

Q25920 (Q25920) MATURE-PARASITE- INFECTED ERYTHROCYTE SURFAC 35 0.53 

Q06166 (Q06166) MATURE PARASITE- INFECTED ERYTHROCYTE SURFAC 35 0.53 



Coitiments : 



TaqMan Primer/Probe Sets: 

5'start=361 

5 • stop=384 

3 • start=429 

3*stop=451 

5'priiner=TTTGAAAGTGCTTTAAAGGCAGAA (residues 361 to 384 of SEQ ID NO: 152) 
Tm5=58.35 

3 •primer=TCGCATCGATACGTAAACTGTTC (residues 429 to 451 of SEQ ID NO: 152) 
Tm3-57 . 96 

probe 1=CCAGGAGCAACTACTCCA (residues 397 to 414 of SEQ ID NO: 152) 

probe Is tar t=3 97 

probelStop=4 14 

direct ionl=Reverse 

Tml=69. 05 

scorel=l . 94 

length=91 
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CT1080 

Nucleotide 

Genomic coordinates : 

Start: 291719 

Stop: 292205(SEQ ID NO: 154) 



Ammo Acid 

MTSPAPSPSSTPKSSCTTIVNRCGFLLDNNKEVVIYDTNSKFKCEPKNLELIGVLSGVSD 
NVVTQISPDQIFVGTYMVKYNWSKSGHERFSDMSNNCLDNITRPSEVIESVIKKTSSDFK 
MKYTRSLMDHTEKYYFSGDQKLSKISSWCTTPIRQWVCNSV 
{SEQ ID NO: 155) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

CAB4S519 (CAB45519) GIBBERELLIN 2 0-OXIDASE-ARABI DOPS IS THAL 32 3.0 

Q52743 {Q52743) CELB 32 3.0 

Q39110 (Q39110) GIBBERELLIN 20-OXIDASE 32 3.0 

060963 (O60963) L549.2 31 3.9 

FLPA_ARCFU (028192) FIBRILLARIN-LIKE PRE-RRNA PROCESSING PR 31 5 . 1 

DPOEYEAST (P21951) DNA POLYMERASE EPSILON, CATALYTIC SUBUN 31 6.7 



Comments : 



TaqMan Primer/Probe Sets: 

5'start=201 

5'stop=220 

3 • start=290 

3'stop-312 

5'primer=CCCCGACCAGATATTTGTGG (residues 2 01 to 22 0 of SEQ ID NO: 154) 
Tm5-58 . 70 

3'primer=AGGGCGTGTAATATTGTCCAGAC (residues 290 to 312 of SEQ ID NO: 154) 
Tm3=58.09 

probe 1=CGCTTCAGTGACATGAGT (residues 265 to 282 of SEQ ID NO: 154) 

probelstar t=265 

probelStop=282 

direct ion 1 ^Reverse 

Tml=68 . 90 

scorel=l . 90 

length=112 
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CT1009 
Nucleotide 

Genomic coordinates: 
Start: 35073 

Stop: 35967 (SEQ ID NO: 156) 



Amino Acid 

MALQEKDITIGNVSAALRELMYSPTHMQHHDKLNTFLDRNVESSSEEKIRQIVDKIRSQT 
TSDISETVNNVTTNGTAFSLFEDTLEGMVKKNIGDNLQSGDFIDGRKKLNDMKSLATGAI 
LSRQRDFVAESITGTKDWLKAIMGCGIIRYTVFVNNLARSTLDWDDDKAATYYNTPIYGG 
YCKMAIKDYEIPDSYSKVEAEHTVEGRKMTFNIKWRGDTINNLITIIPSVTGYLASISED 
ADVQAPLLLNCNNCFIEADMSSLYMDEKKTEASFTLNLPEIEGADANAVYEICIVVV 
(SEQ ID NO: 157} 



Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 



YB75_YEAST (P38321) HYPOTHETICAL 101.2 KD PROTEIN IN FAT2-P 

Q08281 (Q08281) CHROMOSOME XV READING FRAME ORF YOL138C 

Q92271 (092271) 12,8 KBP FRAGMENT OF THE LEFT ARM OF CHROMO 

Q26032 (Q26032) VARIANT-SPECIFIC SURFACE PROTEIN 

002179 (002179) CALYX PROTEIN 

001394 (001394) POLYHEDRAL CALYX PROTEIN 



34 
33 
33 
32 
31 
31 



1.3 
2.9 
2.9 
3.8 
8 . 6 
8.6 



Comments : 

EST confirmation of the predicted transcript : 

An isolated EST has equence identity to nucleotides 1 to 780 of CT1009: this 
corresponds to nucleotides 35166 to 35945 of the genomic reference sequence. 
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CT1081 

Nucleotide 

Genomic coordinates : 

Start: 292189 

Stop: 298777 (SEQ ID NO: 158) 



Amino Acid 

MQLRLENFVKEEHETVVVHNPSGMTGFNIFNSSPVYFEVHNEMDALIFMAAFLKHNSLWG 
EINANMDLYTFDYAGAFLDERWCHHEKSFSVVRAQLINSYYKCRRKIMQALDNNYNNKNK 
KRKNVGGAPAFTFMSGDGEGGKEALEASFDVIGGTRGGRFGVDSTPCPHSSAMQLKLDNE 
GNYGCIACFASMFFVLENPGDESSFISTDASKIGQAQAWIDERLRNNENGGEENNVFKKT 
FHMIiADITQKAHETAYSNTIPLGPNGRQWNWPTHTVEPIAHEFVTHSLVNTLKNLGDRKL 
PRFNFDILYNLLNPFGKMLLVFIQNCHILTGHKNNENVVPRGSASGKWWTINFVGVNMWT 
FQVTKCKVEKDRKISDLACMETLPRLPNPGSTTVDDRIVFKGFCRGENLGSVGEVVSDXT 
QSVKNFCLMVENRKFSVDKETGFISSESIVSDPFFSLEVTGCRSNRAQDTINNGRVSARV 
MRILKSREGARVWLAKDENAIIFENVNHDTAISTD/IMERAIGQHKILYYDIETTDKDFTD 
KKSVITSIGFCLCTGGDMTHGGERGVFGLVAPGSDVEKVKETIINSYDPEEKEDIMKQCP 
QVIEIFTNEFEMLLGFGKYIDKVKPHVISGWNNVAFDDPFVFTRIVKHLSDHTKDMSYCV 
ADASTAESVLPRATEGGGGGETPYRLSTPQERIQLASTGIFNKLGKFVDKKTGMLKPEMT 
ADLLAGAESQANTKFKERNKLSSSNKGSAGWFQKIIGGMCSAIRLDLMKVCEKAYKESLS 
EFNLNAVLAKVSSVGDKVKNVKDEVDLHFHLLGFLKLKKAQDQAKVHVYCCKDAYLTGIV 
STSINKEGEIFRLCMDSALTEAVVTANLATPLCIGEGAICRNMGEERADRRGVGVRRHSI 
ATDTKGGMVSQPIVNHVPYQTIDMTSLYPMTMCQNNLCTTTFVTHRQIMQLRDRLVLEKM 
KNKTTDSLLLLDVIDECNQIVLSEYRPIDIAVASWKNSNSNRQTPITRIEESLGLRFIEN 
LDAEKTNNKTWCTNTSPNMNVTAAGMDYFPEIVCDINMQFAAKVNDDMHIAPASLEYMLQ 
VLPIMLIDRPYIGAHITAGKCRTLEDILSELEKDFSVEKDEEIIRTHWTFKGQKQYDFCH 
SPVTQMARHIIESTGRNIRDYEGNEKFERLVSLSDRIYRRVGAFDSANDPAVRLWSSRLI 
WVGMLVRTWNVKTDILKGIIPQMQATYRADRVVMQNKAKEFAKMGDMKRAGLNKVGQNIM 
KLGMNSMYGHLALRARSSRKEFASGSANTASSI3NMSATGGIGGGTRHSVTANQITENAR 
CVFGNIGCGLQMALPGTKQTYGDTDSVFCVHNIVGDGGMIPEYDEQTGKYYYVMDIALKN 
KMAAI I PILVNSLTKG I QFVERRDAGVGMMN I AHERLAVAGLLFAKKT YHMLHFNENSAA 
FNDMIKLKSTDNNNKFASFIKRPSHADGYVVPHNPSLILRAAEGPAGKKLKSFLEEEGIH 
DEKSMEEWFTSSPTWMT^DASVINNLYASQIVGVEKGNWIDAMTSRPIEAGTEMMEAVTQ 
ANAAFTPYKKGAFVKKGITPTTKLKGLQSLIARFLPKIEEKKSCYLDVMKNHVENFASHI 
TNPAMMITSSRVNKFDTSKEQSRPNPLALAINNHLNPSSEISLGQKFKTVTSVSSWSLSA 
EEGEVPAGYFNAGSVRWDATNMKGSVPAFSVKNLSVVPNAITSVYKMVESDKTAIKSMIA 
KNVEVLCSTSANTGFSLRRGALSFNTGVIVTKDVAMACIRSLNNKQMLLFVGGGKDYGED 
DDDDDEEAEEEDEENGENEENKGDCVTEKKIPGRSTNKDVGEETKTSEKTEGERKGSKTA 
KGKTEEIASSLSKCGKKDARDVILDRLLKATHSSCTNNEERTRVLQQYSNCTLSSYITSV 
MKLDQRVADQMENLISQLDQIRNLSNKKRQEKGGPFKSELDAMVAAVKVKFFPVLDASRK 
LTQDHWKKCPVSIPETREEKPLMGVPFEVALNSLIGKHKCTDTCDMACCQSLYFVLLYTL 
ALKFENERLARQIGLDDSVDLMAEMLFGGDKLLAQEVLKRVKDAQDRKLVKSLLPLNYNH 
DTNTIIFLFESLRFAQKPVAGMSVSEIKDAVRGLAFSTTTGTVWNYTDERFFGPLYNMDE 
LCNERVNGNCKLSFITGIYHTAAVELAAACLSCVL 
(SEQ ID NO: 159) 



Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 



DPOD_YEAST (P15436) DNA POLYMERASE DELTA LARGE CHAIN (EC 2. 52 5e-05 

DPOD_SOYBN (048901) DNA POLYMERASE DELTA CATALYTIC CHAIN (E 51 8e-05 

DPOD_SCHPO (P30316) DNA POLYMERASE DELTA LARGE CHAIN (EC 2. 50 2e-04 

CAB58156 (CAB58156) DNA POLYMERASE DELTA LARGE CHAIN 50 2e-04 

UBF1_RAT (P25977) NUCLEOLAR TRANSCRIPTION FACTOR 1 (UPSTREA 50 2e-04 

UBF1_HUMAN (P17480) NUCLEOLAR TRANSCRIPTION FACTOR 1 (UPSTR 49 4e-04 



FIGURE 3, Sheet 85 of 160 



wo 01/38351 



127/201 



PCT/USOO/28888 



Comments : 

EST confirmation of the predicted transcript: 

An isolated EST has equence identity to nucleotides 4 to 1205 of CT1081: this 
corresponds to nucleotides 297608 to 298809 of the genomic reference sequence. 

TaqMan Primer/ Probe Sets: 

5'start=2738 

5'stop=2760 

3*start=2817 

3'stop=2838 

5'primer=TCGTCAATCATGTTCCCTATCAA (residues 2738 to 2760 of SEQ ID NO: 158) 
Tm5=58.05 

3'primer=TCGATGGGTCACAAAGGTAGTG (residues 2 817 to 2838 of SEQ ID NO: 158) 
Tm3=58.56 

probel^GATGACCATGTGTCAGAA (residues 2787 to 2804 of SEQ ID NO: 158) 

probelstart=2787 

probelStop=2 804 

direct ion l=Re verse 

Tml=69.00 

scorel=l . 99 

iength=101 
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CT1082 

Nucleotide 

Genomic coordinates: 

Start: 300935 

Stop: 305108 (SEQ ID NO: 160) 



Amino Acid 

MTEQGDQGIKVRKLHGPRGERGETGPAGAVGPAGPQGERGAIGPAGKDGAVGPAGPQGER 

GAIGPAGKDGAVGPQGPPGERGENGRPGRDGAVGPQGERGAIGPAGKDGAVGPQGERGAI 

GPAGKDGAVGPAGPQGERGENGRPGRDGAVGPAGPPGERGAIGPAGRDGAVGPAGPPGER 

GATGIPGRDGVDGSVGPQGERGEIGRPGRDGAVGPAGPQGRRGATGRAGKDGAVGPAGPQ 

GEKGEAGKDGSIGPQGIQGPRGETGPPGRDGTAAERGERGFPGPPGETGPPGKDGVDGSE 

GPQGKRGETGPVGPRGEPGLAGLPGRDGAIGPAGPPGERGATGLPGRNGVDGSIGPQGRR 

GATGRAGKDGAVGP AGP PGERGATG I PGRDGV DGS VG P PGERGETG PAGRDGS VGPAG PH 

GERGENGRPGRDGATGPIGPAGPQGEKGENGRPGRDGATGPIGPRGETGAMGKNGVDGSM 

GPQGRRGATGRAGKDGAVGPAGPPGERGETGPAGRDGSVGPAGPQGETGIiTGSPGRDGAT 

GPIGPAGPQGEKGENGRPGRDGATGPIGPAGPQGEKGENGRPGRDGATGPIGPAGPQGET 

GLTGRPGRDGATGPIGPRGETGAMGKNGVDGSTGPQGRRGATGRAGKDGAVGPAGPPGER 

GENGRPGRDGATGPIGPAGPQGETGLAGLPGRDGAIGPQGEKGENGRPGKDGATGPMGPP 

GERGETGPIGPAGPQGATGLPGRDGVDGSVGPQGKRGLIGRTGRDGAIGPVGPAGPKGET 

GLAGLPGIDGKDGSVGPQGAIGPIGPRGERGETGRPGRDGEDGSTGPMGPQGLRGATGAP 

GPQGERGLKGRPGKDGETGPPGRQGRDGIMGPRGLRGEKGAPGNDGLEGPEGRDGAPGPA 

GPIGPQGIRGLKGIQGRPGRDGEMGPAGKDGIEGPRGQDGTTGAKGPRGLRGFQGRTGET 

GAQGSRGEKGDRGLTGPQGRDGPPGEEGPQGLRGERGAPGPRGPRGIRGRSGPQGSNGVO 

GPRGPRGTKGRTGIQGLTGIEGPRGPRGIQGKEGRMGKIGHRGEKGDKGDRGEQGIAGAD 

GEKGPRGLRGIRGPIGAPGKPGTEGVRGPRGVRGVPGYPGAQGELGPQGPTGPQGPAGPQ 

GPMGRTGDTGPMGPPGAVGPRGEKGGRGRKGKNGPKGADGKDAVNIIQKYSITHARAEIM 

WEGNEIGEAYIGRSYGTDTIPVMIENRIGMTNEDKKNEYCIQVMTMHSITTRGRTSGVFV 

VSNKTDYILLVTLLMPESVSCRTDVSTNARSERVNAVRERESKSYRFIRPSDQSIGTHSR 

SKIAVVMYPDASMSYSVDTLDADVARRETTSVLLLAETIHGEKDRGFYADRGTVGRLMVP 

PTEEELLVLQX 

(SEQ ID NO: 161) 

Top Blast Hits 

Score E. 

Sequences producing significant alignments: (bits) Value 

Q14054 {Q14054) COLLAGEN TYPE VII PRECURSOR 940 0,0 

CA11_CHICK (P02457) COLLAGEN ALPHA 1(1) CHAIN PRECURSOR 935 0.0 

CA1'7_HUMAN (Q02388) COLLAGEN ALPHA 1 (VII) CHAIN PRECURSOR { 933 0.0 

Q63870 (Q63870) TYPE VII COLLAGEN 928 0.0 

Q60444 (Q60444) TYPE VII COLLAGEN (FRAGMENT) 925 0.0 

CA11_HUMAN (P02452) COLLAGEN ALPHA 1(1) CHAIN PRECURSOR 923 0.0 

Comments : 

EST confirmation of the predicted transcript : 

An isolated EST has equence identity to nucleotides 1 to 567 of CT1082: this 
corresponds to nucleotides 304552 to 305108 of the genomic reference sequence. 
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CT1083 

Nucleotide 

Genomic coordinates: 

Start: 50094 

Stop: 50295 (SEQ ID NO: 162) 



Amino Acid 

MAVTEIPCGTRNIAEEDVELELILVTAEAEVREAMAAALAAAIIGAVVVQIGRVLDEVVA 
AEVELM 

(SEQ ID NO: 163) 
Top Blast Hits 

Sequences producing significant alignments: 

LIPB_MYCTU (Q10404) PROBABLE LIPOATE-PROTEIN LIGASE B (EC 6 
Q64033 (Q64033) ANTIGEN LEC-A 

OL56_STRAT (007017) OLEANDOMYCIN POLYKETIDE SYNTHASE, MODUL 
RPSD_PSEFL (P52326) RNA POLYMERASE SIGMA FACTOR RPOD (SIGMA 
054540 (054540) RNA POLYMERASE SIGMA FACTOR 
Q9YAS2 (09YAS2) 136AA LONG HYPOTHETICAL PROTEIN 



Score 
(bits) 

30 
29 
29 
28 
28 
28 



E 

Value 

8 
1 

2 
9 
9 
0 



Comments : 



TaqMan Primer/Probe Sets: 

5 • start=24 

5 ' stop=4 5 

3'start=95 

3' stop=112 

5 •primer=TGGTACTCGGAACATTGCAGAA (residues 24 to 45 of SEQ ID NO: 162) 
Tin5=59.21 

3 'primer^CTGCTGCCATCGCCTCTC (residues 95 to 112 of SEQ ID NO: 162) 
Tm3=59. 19 

probel=TTGTAACAGCAGAAGCAG (residues 71 to 8 8 of SEQ ID NO: 162) 

probelstart=7l 

probelStop=88 

direct ion l=Reverse 

Tml=68. 98 

scorel=^l . 92 

length=^89 
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CT600 

Nucleotide 

Genomic coordinates: 

Start: 1118 

Stop: 491 (SEQ ID NO: 164) 
Amino Acid 

MHMWGVYAAI LAGLTLILWI S I VVTN I ELNKKLDKKDKDAYPVESEI INLTINGVARGN 
HFNFVNGTLOTRNYGKVYVAGQGTSDSELVKKKGDIILTSLLGDGDHTLNVNKAESKELE 
LYARVYNNTKRDITVDSVSLSPGLNATGREFSANKFVLYFKPTVLKKNRINTLVFGATFD 
EDIDDTNRHYLLSMRFSPGNDLFKVGEK 
(SEQ ID NO: 165) 



Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 



051578 (051578) EXODEOXYRIBONUCLEASE V, BETA CHAIN (RECB) 


39 


0 , 


.033 


068195 (068195) DIOL DEHYDRATASE-REACTIVATING FACTOR LARGE 


32 


3. 


.2 


BACC_BACLI (068008) BACITRACIN SYNTHETASE 3 (BA3) [INCLUDES 


31 


5. 


.5 


TKT_MYCPN (P75611) TRANSKETOLASE (EC 2.2.1.1) (TK) 


31 


5. 


.5 


Q9ZER8 (Q9ZER8) REPACl PROTEIN 


31 


7. 


.2 


CAAlOOOl (CAAlOOOl) REP AC PROTEIN (FRAGMENT) 


31 


7. 


.2 



Comments : 

EST confirmation of the predicted transcript: 

An isolated EST has equence identity to nucleotides 1 to 569 of CT600: thxs 
corresponds to nucleotides 513 to 1081 of the genomic reference sequence. 
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CT520 

Nucleotide 

Genomic coordinates: 

Start: 209342 

Stop: 196799 (SEQ ID NO: 166) 
Amino Acid 

MFKANVLNLGGGKFLESDVRDHLIKCANQMKEEPTTLRICLSNKLPEYDNRRLPLLLLNE 
GEQILVTDNLTKNGNPLVKQMGHLAVQDRVGGDGSVNPNNLLYAGCNVVEYDTVNRGNDG 
KLIMYSQPATLKDVAKSKKKGMYKVKKVPEITGDQFLDKLNERSCQNENREIMDEEGPHVG 
TGKLLRELIIMMRLYEEETSSAEKLCVTPAFREFLGCGRTATDVPVFKVAFITWASLMGL 
KVIFYPTILEEERLAAVSDTENVVLLKSILKVQLELLSECMPRIVERVESMIKKTVACFK 
I DIGGS DNWNLPGHCKVSDTAFPYHHAQLVGEKKNTLS I SNENMVTSLGVVKADRAEEWM 
CKTLESFEKKCLYLENLMGSMANTDDWRRKILFSELGPEMPYRNKSLIMDQDFCTIGMCY 
KFLAEGGGLLLTKTNATLLKEKMACKGLDDSGDGDDEEEDNEEGGSGGKSGGGSGDENNI 
NKPPPAPKQIPPLAANVYNSIINDDDKLDQIVCFFKRKHGFFLSDIDNSPLLAMEFLLPQ 
KAMSKKNCVERVKPETKNIIRNLTGVNTIKFDTIMPFAILQIVVRYENRNLKLPRDTDIL 
QQRLKNNTWDALSKGKFAEMWQFTHKESLKPPTIEELESIPPPPTQSEEEAAAAAAASTS 
STTPDMVSSLEEGATSTSSSDENQIASLEN I KKLLS IITSTFATGADKNDTI FAWTVVTL 
AERFCALYNITSHPEEYYQQIIREDFEFEGGFEKFRHMCDAINRELSIYVPKSVLEKQSV 
CRMGVAAYENSMERIKNKTNSKLCKIKYDESTMVYELNNDTFKTFDYDESDKSFGPMYEC 
APMETFQRLFASVKSDKEAVLADKKSEKREKLYQQKQEYLRKCDNDDVSARQILNNVASN 
ESDEESDEESDDEENYGAAKGGATGDYYGGDDEDDCYGFLGEFGSSDDENVPSDNASSIN 
NVQDDVFRDVNFIKTFNFRSSLCHRQKYVSTVIVEEMEKNLCDVLTLDNSAAESGDILKE 
INRRSLRMRNWVVPFTMPVREIVKPNVNSEDGTANSNNNIPPFCSCASLNNFKSDSPLSS 
NNTMSNEKCIKLLPIPSSKHLKDLTVALRFNTMACERRYFSDVTAALGFVKKDKVNGNIR 
SILDNKRWDAIKQCKLAGKCLSSALPLGIYENVISEDNKLINTFRPRSi.ARLACSSGGDG 
VSDKSVNNGFFSGIWALCANQDLESVVLGSTVVDPLKPTKVFNQSLSEKELKEKRQQMCL 
DAANYFKDHNVSKLNIYECFKMMEECIMRTALNGKTSNDSEFFSNLITRYGSGTNSPASR 
LWTILETVRECFNNSLPIDWGSLVKDWDGSDMLNLKAGVSNVDESGAVFELSEFLGVSAR 
AFFGKDLDTNLDADTWECLLNDDNKDWKAQVAKAYEFALKDNDIRSVENFINSSNLLTNN 
NVIKKLKIKPTPSNDVRHQIWVEDEYYPRNKSTLRSRAEWMAATEEVLKTEMSLSCVLAM 
VAMYRIMMQGESVREIATAPLRLSVDKMVPLIRCFKITSKWCSCTGKGDSPKKADASIKE 
GRFYDIEEDPLHFYRFAAYVIGQVASNDIVIEEMTRKILMSFDFNGFDTSNWLQFITYRF 
SHVLMGRRSRLLSRPLSLVKNLVSVSSLADKNSEKSNDMYEKRVGKVMKRIARLVLVKAA 
DSVRASSNDLLDCCILDVNDVSVKSLDEFRAKTRQELQETRIDTNYNLVSNSCTTAQLAA 
VEKSSRIINTNISFHNIPAGQAKVMDANEEAFIDPSLEEINKEDNSGAKQMTGKGGSNRG 
RSKKSGGGGFNNAGGFYNDDSSRGSSSVVDEDSRSRTGFSQIHMDARNEEDRESGLFSYD 
GYVLNRIKNMITQNQINNDIVKVISDIENFFKICVPFSKKEYALYGVTETALSAGMDAIE 
RWNKAVEEETNKIRKECRDLTDTGSVYDMNIICPGDYMSSVGEGGNGGCGGGSSSSGHLL 
SNNNNEANQTNEISEDQLKHEGSDCSFWFNFYKKVVNSSEKKQGKGSVLANTGHEGRIVG 
RPLRTFIQYKGKGFAETKVLTRYFSNHISDSYWSQVMPICYIKNMALGDEDKSKKKFGKR 
PWKNFNNNSNSSSNSSVKYVSIQDLEKKDSLKNVPMGYDEDLLSLYDDSLTTSTEKLENI 
KIVNDSKDAYVILGSSNQSSFDQTFSQQYFTHQKISNINTYKSLGKMWNCNNGMSPKNOI 
VLLKKLLFKNLNILWIKLYERHISVLCNWGCIHPNSSKNSHFEMTKNNAPCGVTDSNPPL 
SVYHSGFLSVEDYGQLLKDTFPLMNLHRTFSAKSKDWNSSDPSPEKISAASLAKAVYARE 
VLSSCLDPEGNFCTSWITNSCSVLFTPGTNIRRGGDFFNKSCYRQQDNDYCFIGKEETKK 
CPNFVSSEIEIVSILKTAVFLSTNSDGHKRVLRVINYNKDHSGLYAGIDTGCADDEDDDD 
DQGGTDKTCLLQEDSMDAKRMLISMRSVINGKSLDESSLAIKKDNFNFLAGTDKGFYLDN 
SFFNSPVQGKFVAPRGTKIFKKCCDFLLNKGTGGVFARIFFTDWACIVSSSKGKNNKKAI 
ESTLQIRNGGCFSERLTPSMFDNESEQGELFHDRYCPDFLSDYNKQNIFSEQAYKCSFLA 
NPVCPAKNMLKRAKNIRLCITNAGTALISKIMAEVEKMGNARTFISNGTAIPFRLAENTA 
CISVDNNRYFLIDGTYLLGGRLEGINLVTDMYTRCKLKAEKHVILNSLFSTEFISAALAS 
SMEGTTMGRGLCLIEHVSYMKNTDSVSNMNKNFWSMAEDQEETDENEDDDDENEEDEDEN 
EENTENTSVVKYEPVSKTAFSSSLKPPSIFIADEDYIFLSILYELAKATSDCETASSSSS 
SSSSSSSSSKHSSSSSSSNKKRKQKDDVNSTTTALHALRKCYISCVDQKTGMPRMDVVYL 
LRGLMNFGGMCTAIASGDGEKAHHMVQTLCSVALNIATKTAVVFVGTKGNNLKTTLVDLC 
KRTWFERFTNINVTALNNAGDSSSSTQANLASFAGKKGIVIIDEVGHQGSFGSKKSSSED 
DKDESASRSGNVDFGGSGGEMNSVDINEARNAYGDGGNSKIVFSNINRLMTESKLKVCDQ 
EYDFISELKHEKNRKNACNDTKKRKRGGEIEDEGVECEEIERNDGKNDENGVRIKDPINI 
SFFARKAHWWNCSSGVVSTTFKEKNIVYNMLHRGAMPFSIKDCTDSPWLNETDAVYRHCK 
KPIEYEGKFSKSEVKTALKCILGKFGSKICDNESFESIIDENCQVNNLHSWNDCKEDIDE 
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WNEKFMSKNKKNKQNMKIEDKVDAIMNIIQKNNGLLKWNTSFDRDGSPVLVCNPATERFS 

EMITSSLSAQDMLEIKKYLGDNCLSTNGGVKKSVIDGNTSAPGVLIAYHCVYTGKISDDL 

SKTNYPVLLPPPKKQHFVAVDDAAEKALLGPTLSNINIDSIRNIKTISRKLSSIIKDPEA 

AKLLVDRDLDFMNMYERYDASLFDVVKKPSKYSFPGFTSDGSVVLSTSTSDCENVLSCLK 

KRIEKDKMSAKNSGSFIRMCMDKNLLSDEKDDSSSNSSKNTSSLPKTDDNSSDIANFLSV 

FGENRQQSSQFSFASNSSGGGDSNKEACFNVDTPKRRQLVSALQKHNSDGSSSIITEIAK 

AIPQKNDVSSSITKHMLPGQFPSSLLKNMTSPQNSVMIRGIFQQGAKSSITVSPIMMSNS 

YIFSFFVDEAMSKRLIVFPCDTTFVFENKNEDVKKIIGLLDRGMKYIHSSLMMERCIKFG 

KHGIKQRQHEFNHHKKAWNDFSGHSSDNKKKDRISDVSSVLPSVLMKNLIRNKVLELRDV 

KSVSRLEENTNTFFHLYTSMSLCAKAATNYGESSSSSATITEVEEDNSCDAEEQQLRRKK 

PANYESMCNKLPSPLQMCQINPKSLNTMAMNIARSRQGAWAQLNSMLNSVLFVEMPFVKT 

TRFFGRDFNIKMHSPATKNRPAINFDNCIGMSLPNPDMDVVGYDKEGELIGVGSSLTKHL 

CDAWGSMDVRDLMYSCHHLHMLFEMALQYTECKRRLSSLKTLKSDKTGVDYVAVMLACMV 

YQLMVSNLKYPVFLSSSSHKRANTEDIADENQVSSLSVPMFLAMVVNKPLHALRHSTNLA 

LPNASQKSDHSDIVKYIVMNQWGLRLNPDYLCPNCVKHVL 

(SEQ ID NO: 167) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

SR40_YEAST (P32583) SUPPRESSOR PROTEIN SRP40 54 2e-05 

0921T1 (Q9Z1T1) AP-3 COMPLEX BETA3A SUBUNIT 54 2e-05 

AAD56625 (AAD56625) NUCLEOLIN-RELATED PROTEIN NRP 52 7e-05 

P90493 (P90493) HERPES SIMPLEX VIRUS TYPE 2 (STRAIN HG52 ) , 52 9e-05 

P70475 (P70475) NEURAL ZINC FINGER TRANSCRIPTION FACTOR 1 ( 50 3e-04 

095815 (095815) DENTIN PHOSPHORYN (FRAGMENT) 50 3e-04 



Comments : 

EST confirmation of the predicted transcript: 

An isolated EST has equence identity to nucleotides 1064 to 1 of CT520: this 
corresponds to nucleotides 1967 7 8 to 197841 of the genomic reference sequence. 

TaqMan Primer/Probe Sets: 

5'start=7375 

5'stop=7397 

3 •start-7450 

3 •stop=7472 

5 'primer=GATGACGATCAAGGTGGTACAGA (residues 7375 to 7397 of SEQ ID NO: 166) 
Tm5=57,66 

3 'primer=CCATTAATGACAGACCGCATAGA (residues 7450 to 7472 of SEQ ID NO: 166) 
Tm3=57 .78 

probe 1=CAATGGATGCTAAGAGGA (residues 7424 to 7441 of SEQ ID NO: 166) 

probelstart=^742 4 

probe 1 S top=7 4 41 

directionl=Reverse 

Tml=68.98 

scorel=l . 98 

length=98 
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CT601 

Nucleotide 

Genomic coordinates: 
Start: 1511 

Stop: 1196(SEQ ID NO: 168) 
Amino Acid 

MYLSHIRQTPLVEERRALTFKMYHHNNNNQHSFVNCQCRRTSSSINCSSCSRETFNSVKA 

IQYFNKTSRNNTAHHFKMPASKDRNYSSFEYAETAVAAHNISQW 

(SEQ ID NO: 169) 

Top Blast Hits 





Score 




E 


Sequences producing significant alignments: 


(bits) 


Value 


YT66_CAEEL (Q11082) PROBABLE G PROTEIN-COUPLED RECEPTOR B05 


32 


1 


.4 


000885 (000885) MAP KINASE KINASE PROTEIN DDMEKl 


30 


4 


.2 


CHD1_HUMAN (014 64 6) CHROMODOMAIN-HELICASE-DNA-BINDING PROTE 


30 


4 


.2 


097292 (097292) PFC0965W PROTEIN 


30 


4 


.2 


096226 (096226) SER/THR PROTEIN KINASE 


30 


S 


.5 


096563 (096563) C-13 ANTIGEN (FRAGMENT) 


30 


5 


.5 



Comments : 

EST confirmation of the predicted transcript: 

An isolated EST has equence identity to nucleotides 353 to 3 of CT601: this 
corresponds to nucleotides 1184 to 1534 of the genomic reference sequence. 

TaqMan Primer/Probe Sets: 

5*start-^96 

5'stop=116 

3' start=142 

3*stop-162 

5'primer-CTTTGTGAATTGCCAGTGCAG (residues 96 to 116 of SEQ ID NO: 168) 
Tm5=58.03 

3' primer =AGTTTCACGGGAACAGCTTGA (residues 142 to 162 of SEQ ID NO: 168) 
Tm3=58.4 3 

probe l^TCTTCCTCCATCAACTGT (residues 124 to 141 of SEQ ID NO: 168) 

probelstart=124 

probe lStop=l 41 

direct ion l=Reverse 

Tml=68.98 

scorel=l . 98 

length = 6'7 
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CT521 

Nucleotide 

Genomic coordinates : 

Start: 272387 

Stop: 268691 (SEQ ID NO: 170) 
Amino Acid 

MSKSSSSTVKSASFFNSLMENAPSSKIELLEDGWTKKAAAADTDTPTAKPTGLSISLMDI 

SGSMGSVKSAVADSCSGIMATLNVIAPGIQNAIVYYNDFDKHSIESGPVVRAPDCSEWEG 

GDFVKHMRKTEVCGGGGGGSEALHSSLMYVFNNMIPAFKKMHGITRDEKFPILIFVFTDE 

DVRIAWSDTGKLCANSYDSETAPEEEFIMKTWGQKPLTILDMRKALVENDCWLRILNFSR 

CSGSNQSELCQEDVINFSGYDNNRWQLFESFDRRSCNVRKNIATFIMRQSISLFKNLNDQ 

FSAFPILREINQEELNVFIESEGRSEPAGFEKYGDAQRESFKSRVLNMAPLDFGRVVQGG 

GRYNNHKRSVFLNCAYDSAFCCSKQTFNPQQQQQQQQSSSGGGGISKLAVVTQRAQSITG 

GGNAASTLALHMNACFQSLDDFGIDHTNLCDCKGCTKLMASVEATSDQGRKTKLSRKYAR 

VHWAKMFAEKLFKMMIKEQSMMYACSAVPDEIGAIYAFVTGNWAGVCSRVSTILSDLGTE 

CGNKAEYAFLKEGKHMKSASYDALQVINNTDLTPEQSSMFMWFYVPNDALEEAGKIFHQS 

FSFSNSYTGGGLLSLDEYKRFEFGQCFDFIKKLVSCLKITRNVEDVLLETSKTSNRYFAI 

PVFCGSDDQKEVLREELASDLFGGREDVAEMMFIDLETVIQKLGTLYDVRLSLPEGGYAA 

IKSVCAAASWAASCEVPSNTSNMILSIAKMAFTKYYQEQNSSSETDLDIILPSILEGTAD 

GEIENNLSGVVFLRCLITWANKIGVDKNFTNKLEHFLALRILTKAGDSKIGEKYETFPVR 

RLDLSEKDLKY I CKRCG VKS LKME Y DN DEKLCLRCKGN YRMGKPMV YHWDNKLTRDPRAK 

TASPTTLNLLNAKKI DDKVKEMASDI IGALNLPPTDKDNEIAVSAAAKAVGILYGKTCLL 

YKLLNEGNIDIPVAVCVECDCCKSKYMMSTLGPDKPQNRKCPWCRYANKLVAMGRGGKKL 

LMDLIECGAPSLAMVEEAIRTSGDVMYEELGEGEEFYIIDYFLKLKNTAIAEGNKLQQNN 

NKRPAPLQVTSPSSPPKKMRSDLPDSLLAAIGECAIETKEKTTVNLIGLGEVKVVENVGP 

NDLDGKDPFISLQEYCSWDKFNSLFVNPWLGYRLDEQWDDWNTFLIHVKKNDVWKFLCNK 

TSPFSVVVMNDGSGLLNVDNVNVLVRQKICV 

(SEQ ID NO: 171) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

AAF04634 (AAF04634) HYPOTHETICAL 45.2 KD PROTEIN 76 le-12 

Q9Y1T1 CQ9Y1T1) DNA POLYMERASE ALPHA CATALYTIC SUBUNIT (EC 36 1.3 

P9ia05 {P91805) ARYLPHORIN GENE-SPECIFIC BINDING PROTEIN-2 34 6.4 

045322 (045322) Dy3.5 PROTEIN 34 6.4 



Comments : 

EST confirmation of the predicted transcript: 

An isolated EST has equence identity to nucleotides 1 to 8 95 of CT521: this 
corresponds to nucleotides 268736 to 2 69630 of the genomic reference sequence. 
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CT602 

Nucleotide 

Genomic coordinates: 

Start: 2996 

Stop: 2702 (SEQ ID NO: 172) 
Amino Acid 

MDILE D I YKS AI TLVLQS PEFVNDVKQEASQVVEGL I PS IREAVFRRLLEEERKKHEDEV 
GDVEDKRQAVIDKANTMITTMAAEYLESVDILEEFGF 
(SEQ ID NO: 173) 

Top Blast Hits 







Score 




E 


Sequences producing 


significant alignments : 


(bits) 


Value 


CRCM_HUMAN ( P2 3 5 0 8 ) 


COLORECTAL MUTANT CANCER PROTEIN (MCC P 


36 


0 


. 055 


ACVS PENCH (PI 97 87) 


DELTA- (L-ALPHA-AMINOADIPYL) -L-CYSTEINYL 


33 


0 


.48 


ACVT PENCH (P260 4 6) 


DELTA- (L-ALPHA-AMINOADIPYL) -L-CYSTEINYL 


33 


0 


.48 


Q9XJC5 (Q9XJC5) HYPOTHETICAL 26.4 KD PROTEIN 


32 


0 


.62 


NOSZ ACHCY (P94127) 


NITROUS-OXIDE REDUCTASE PRECURSOR (EC 1 


32 


0 


.82 


EZRI_BOVIN (P3197 6) 


EZRIN (P81) (CYTOVILLIN) (VILLIN-2) 


32 


1 


. 1 



Comments : 



TaqMan Primer/ Probe Sets: 

5'start=121 

5'stop=144 

3*start=200 

3*stop-222 

5'primer^AGAGAAGCTGTCTTTAGACGGCTT (residues 121 to 144 of SEQ ID NO: 172) 
Tm5=58. 14 

3'primer=TGCCTTGTCTATCACTGCTTGTC (residues 200 to 222 of SEQ ID NO: 172) 
Tm3=58, 14 

probe 1=AACACGAAGACGAGGTGG (residues 164 to 181 of SEQ ID NO: 172) 

probelstart=l 64 

probelStop=181 

directionl=Forward 

Tml=69. 14 

scorel=l . 85 

length=102 
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PCT/USOO/28888 



CT522 

Nucleotide 

Genomic coordinates: 
Start: 276736 

Stop: 275206{SEQ ID NO: 174) 
Amino Acid 

MASGFAIKGIVKNYRRIPSIIESIKSIRRSELAEGVYIVSLHKNTPKHEVDEIVNKIRLS 
AGNPCLEKTSLFLQHHSQMRNFYTRKGAESESDWLKRLPEDLRNINNIVKREALPHDKSF 
TFSPLYRILTDRLFNAAIHNCKYIIVTADLLMGCGITNNKVEKKLLSMGSILGGESMVPL 
HDIAHRLSYKGLRIENPIVGSCHC)QCLVVPVSMLGKIFSSNMYPTFKNFDQCMALFLNAV 
VTHSAEKMDGKHERNKVIHMPNEVYLDAARRKYLEEKLEETNKLDAIDEEAREEYGNEIG 
RIGDKSTCLVFALSARDFFLTNRFNEDTPLYSGTERGIRFMCSNYCTMRDEGGFRPRLIM 
SAYGPTSYPIIFNTLYDQFNVQYYPCVSGVVLSFIGDDQLAPEPESLVDIWRSIKNPSI 
RIFSGDGETVYQDGRRVDVGGEGKNQKFNREERTILNVLRIIKAYNEERTKEDEDEEEEE 
EEEEEQQTAATVTVESDWDLSLERGENWV 
(SEQ ID NO: 175) 

Top Blast Hits 







Score 




E 


Sequences producing significant alignments; 




(bits) 


Value 


051310 (061310) TSJ5 




44 


0 


.003 


CINA ELEEL {P02719) SODIUM CHANNEL PROTEIN {NA+ 


CHANNEL) 


42 


0 


.009 


MYCBRARE (P52160) MYC PROTEIN (C-MYC) 




41 


0 


.019 


Q9Y0C6 (Q9Y0C6) J5 PROTEIN (FRAGMENT) 




41 


0 


.019 


035788 (035788) CYCLIC NUCLEOTIDE-GATED CHANNEL 


BETA SUBUNI 


41 


0 


. 025 


MYC_CARAU (P49709) MYC PROTEIN (C-MYC) 




40 


0 


.033 



Comments : 

EST confirmation of the predicted transcript; 

An isolated EST has equence identity to nucleotides 374 to 1 of CT522 : this 
corresponds to nucleotides 275340 to 275713 of the genomic reference sequence. 

Hit to public SBV sequence: 

gi 16007410 |gb|AFi78S73.i : CT nucleotides 1 to 552 match nucleotides SS5 to 6 of the 
public sequence with a 99% homology, a score of 1063 and an Evalue of o.O 

TaqMan Primer /Probe Sets : 

5'start=682 

5 ' stop=703 

3*start=767 

3*stop=787 

5 •primer=AATTTTGATCAATGCATGGCAT (residues 682 to 703 of SEQ ID NO: 174) 
Tm5=57.93 

3*primer=CGTTTGGCATATGGATGACCT (residues 767 to 787 of SEQ ID NO:174) 
Tm3=58-39 

probe 1=CAGTTGTTACACATTCGG (residues 716 to 733 of SEQ ID NO: 174) 

probe lstart=7 16 

probelStop=733 

direct ionl=Forward 

Tml=68.99 

scorel=l . 99 

length=106 
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CT603 

Nucleotide 

Genomic coordinates: 

Start: 24906 

Stop: 24660 (SEQ ID NO: 176) 
Amino Acid 

MTCPEISKHISGTDRRFWNTADPGGLSYPFNPLFTLHLHLKNFSKIFSAHSSLGGGPLTR 

PYVKFEGWTAGSTQRQITERS 

(SEQ ID NO: 177) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

CAB4 9803 (CAB49803) HYPOTHETICAL 69.0 KD PROTEIN 31 1.8 

074 312 (074312) PUTATIVE TRANSMEMBRANE 29 5.2 

Comments : 

Hit to public SBV sequence: 

1 6i65655|gbl AF099142 .1 : CT nucleotides 118 to 231 match nucleotides 5652 to 5539 of 
the public sequence with a 90% homology, a score of 139 and an Evalue of 3e-36 
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CT523 

Nucleotide 

Genomic coordinates: 

Start: 281865 

Stop: 281127 {SEQ ID NO: 178) 



Amino Acid 

MVSTRSMEAKAAAAAKAKEVSPTTSKRKAEDLTEGTEEEEESVETHPPSKLPRVDEDEVY 
IDENVDGDVQILASSIEVARMERERLAEAMVRDIKIEEEKAATEARKEIASRLIYKEMVY 
LLPQLENMTNRLRPRSLLRHNEMTITDRTFSDLQIFNKVTFEFPILTDIAFLAREKSRVE 
GSRFYNDMKIGPITAYKLNLMCNKFIESWQKVKAEISPFVEVSVSSELEGSPFWDFKQR 
IVKHT 

(SEQ ID NO: 179) 
Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

Q23915 (Q23915) PROTEIN KINASE 43 0.001 

077819 (077819) CORNEAL EPITHELIAL RHO-ASSOCIATED-SER/THR K 43 0.002 

Q134 64 (Q134 64) RHO-ASSOCIATED, COILED-COIL CONTAINING PROT 43 0.002 

P70336 (P70336) RHO-ASSOCIATED COILED-COIL FORMING KINAGE 2 40 0.017 

P70335 (P70335) RHO-ASSOCIATED COILED-COIL FOE^yilNG KINAGE 1 39 0.029 

Q63644 (Q63644) RHO-ASSOCIATED KINASE BETA 39 0.029 



Comments : 

EST confirmation of the predicted transcript: 

An isolated EST has equence identity to nucleotides 1 to . 7 12 of CT523: this 
corresponds to nucleotides 281143 to 281854 of the genomic reference sequence. 

TaqMan Primer/Probe Sets: 

5*start=404 

5'stop=425 

3*start=502 

3'stop-522 

5 'primer =GATCACTTCTCAGGCACAACGA (residues 4 04 to 4 25 of SEQ ID NO: 178) 
Tm5=58.93 

3'primer=ACGGGCAAGGAAAGCAATATC (residues 502 to 522 of SEQ ID NO: 178) 
Tm3-59.06 

probe 1=AATGACCATTACAGACCG (residues 426 to 443 of SEQ ID NO: 178) 

probelstart=42 6 

probelStop=4 4 3 

direct ion l=Reverse 

Tml=68.98 

scorel^l . 98 

length-119 
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CT604 

Nucleotide 

Genomic coordinates : 

Start: 50300 

Stop: 49079(SEQ ID NO: 180) 



Amino Acid 

MSHINSTSAATTSSNTLPICTTTAPMIAAARAAAIASRTSASAVTSINSNSTSSSAMFRV 
PQGISVTAMPPVPALTSLTESTGTRMSSTPNVDVIPVPGPKNKSKSKKKDSKRKKNQNGN 
RSSDEDEPSLVIDDGSGRQSKNKKYSWVTSLATTTAERNNDTLAPPRPFLPTPEEGNMSE 
IDAGLSNPVTRQITGEVYSAALTSGVGDNGLYPSHFTVADTSYGDCETPIPGPAFVLDDG 
TVSRGTSLLHREEAEFLNDGSKVIHTVKPRNSKYSNIQRAASCMAYAVDLLNNHNITSDQ 
FDFMAMTAWAARQRCGEMAKFFEKRDKDIGEYRNKVVQYNRGIFTRTTEMNKRAKIILEQ 
QQRREAAAAAAATGATAPIPTTSAAGVGATSSATTNSLEYQEIRYQ 
(SEQ ID NO: 181) 



Top Blast Hits 

Sequences producing sxgnif leant alignments: 



Score 
(bits) 



E 

Value 



AMYH_YEAST {P08640) GLUCOAMYLASE S1/S2 PRECURSOR (EC 3.2.1, 
AAC4 9609 (AAC49609) GLUCOAMYLASE 

039781 (039781) MEMBRANE GLYCOPROTEIN 

039782 (039782) MEMBRANE GLYCOPROTEIN 
Q9Y075 (Q9Y075) PROTEOPHOSPHOGLYCAN (FRAGMENT) 
Q14 888 (Q14 888) MUCIN (FRAGMENT) 



55 
55 
53 
50 
48 
48 



8e-07 
8e-07 
4e-06 
3e-05 
8e-05 
8e-05 



Comments : 

EST confirmation of the predicted transcript; 

An isolated EST has equence identity to nucleotides 1 to 666 of CT604 : this 
corresponds to nucleotides 4 9128 to 4 9792 of the genomic reference sequence. 

TaqMan Primer/Probe Sets: 

5'start=572 

5*stop=592 

3'start=640 

3'stop=658 

5'primer=GCCAAATCACCGGAGAAGTTT (residues 572 to 592 of SEQ ID NO: 180) 
Tm5-59.06 

3'primer^CAGCAACCGTGAAGTGGGA (residues 640 to 658 of SEQ ID NO: 180) 
Tm3=59.24 

probel=CTCACTTCTGGAGTTGGA (residues 604 to 621 of SEQ ID NO: 180) 

probelstart=604 

probelStop=62l 

direct ion 1= Forward 

Tml=69.03 

scorel=l . 96 

length=87 
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CT524 

Nucleotide 

Genomic coordinates : 

Start: 291298 

Stop: 289681 {SEQ ID NO: 182) 
Amino Acid 

MEDFKQLKVKNGICLSGENTENYERVLLTFKSVKSVRRSELKEGHFIVRLRDKEVLHIKN 
GNERLRQLTGDPTLQIGLKYTSSLPKQGSFLEDEDPNYGKKWNESLPSPFQEMNKIVEEK 
ALVNDECNFKFSPLYRIIHERLSNAAVKKCDYMIITTDFLVGCGFSPRNCTRTLKNMEQVL 
VQHGGTSSRVSVYDICDRLTYNGLSIANPIVGSFSNMCLIVPMDKLGLLFYNSTHPSAKS 
IGNYMSCLFNAAVVYTLEKSNQKLDNFEKEIRFAKNEVNLLVSERSVLEEKLKESKKLYA 
ASEEQRISLRDVHKKSSIASSRYDGGACLVFAFSDRDFSLLCRTNGNGSFYSATEEGIRY 
VSSDDYRKRDVDERRPRLVMSITGSDAPICIRDSIRNHFNNHFIASGKGNEISFIDPPNE 
RLLMEMVREVTGSDIKIFMDNGKVYQDGVEIKVIDPSSKEGKDIIKKEETLPEEERKRLR 
RERRMIFNTVKAIETYNEERGEEEEVATSSGGTKRKREEKEGDYVALLNKACKEIKVC 
(SEQ ID NO: 183) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

Q57105 (Q57105) FESMID CLONE 31, COMPLETE SEQUENCE 39 0.10 

094986 (094986) KIAA0912 PROTEIN (FRAGMENT) 39 0.10 

094944 (094944) KIAA0866 PROTEIN 38 0.23 

MYST_HUMAN (P35749) MYOSIN HEAVY CHAIN, SMOOTH MUSCLE ISOFO 38 0.23 

YDH6_SCHPO (Q92351) HYPOTHETICAL 140.8 KD PROTEIN C6G9 . 06C 38 0.23 

Q22'705 (Q22705) T23G7 , 3 PROTEIN 38 0.23 



Comments : 

EST confirmation of the predicted transcript: 

An isolated EST has equence identity to nucleotides 1642 to 1 of CT524 : this 
corresponds to nucleotides 289650 to 2 912 91 of the genomic reference sequence. 

TaqMan Primer/Probe Sets : 

5'start=723 

5'stop=74 6 

3 ' start=828 

3'stop=852 

5'primer=TGGAAATTACATGTCATGCCTTTT (residues 723 to 746 of SEQ ID NO: 182) 
Tm5=58.47 

3'primer=TTCGCTAACTAGAAGGTTGACTTCA (residues 828 to 852 of SEQ ID NO: 182) 
Tm3=57.59 

probel=TGCTGCAGTTGTATACAC (residues 750 to 767 of SEQ ID NO: 182) 

probe 1 star t=7 50 

probe lStop=7 67 

direct ion l=Forward 

Tml=68.96 

scorel=l . 96 

length=130 
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CT605 

Nucleotide 

Genomic coordinates : 

Start: 51809 

Stop: 50423{SEQ ID NO: 184) 
Amino Acid 

MDSSASVVFMRFAPPGEETALPPRRATPGSVAYDLFPSEEMDIEPMGLAKISTGYGIDKF 

PDGCYGQIVSRSGMTWKNNTSVPTGTIDVDYRGELKVILRNHSAEKSVPIRKGTSIAQLI 

FLRYCDVEEEQIVYINETTGERTIIDSSSKKDNKNQARSVRGTGGFGSTDNPNFTETTVS 

RNQQEENKKEELEEGEIVEMEGFIDIPFLEGFENILAEQSNETGVTYPNTNQDVEEKDTK 

NIDVVRELEAEFSSGIGSGSMDSSDSSDSSSSSSDSSDSSDSSDSESSDDSEGGDNKVRR 

IRRHQYHRRQLSYSDDVNGGGRNSEKMEMDRVTHTKTEHIKREDEPRYEERERYIHPRRM 

QVPKDYYCEQYEHYDAPAAAHHHRHHQHRHQHQRHFNQPRSNNSSDVTAYVNENSPTRPC 

RDRNSRFSERPNNGGYNRINSRYTTFDPYRYGARRGRGGVY 

(SEQ ID NO: 185) 

Top Blast Hits 



Sequences producing significant alignments: 



Score E 
(bits) Value 



Q89662 (Q8 9662) COMPLETE GENOME 
Q86612 (Q86612) 0RF2 



90 
89 
88 
87 
83 
83 



2e-17 
8e-17 
le-16 
2e-16 
3e-15 
5e-15 



DUT_CHVP1 (041033) DEOXYURIDINE 5 ' -TRI PHOSPHATE NUCLEOTIDOH 

DUT_YEAST (P33317) DEOXYURIDINE 5 ' -TRIPHOSPHATE NUCLEOTIDOH 

DUT_LYCES (P32518) DEOXYURIDINE 5 * -TRIPHOSPHATE NUCLEOTIDOH 

DUT_HUMAN (P33316) DEOXYURIDINE 5 ' -TRI PHOSPHATE NUCLEOTIDOH 



Comments : 
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CT606 

Nucleotide 

Genomic coordinates : 

Start: 65024 

Stop: 64010(SEQ ID NO: 186) 
Amino Acid 

MSSSQGLNNNMCTTEILLPKCTSSSLSLEESVDYLEKDFEELGIPLVEGKEVLLEFAYKI 
LNKRDTIRVIGDEQGDVCSVFFLRFGKKKTFNPQTKMWLVKLANAIALSMGVVPEPACTC 
SRMMTTAKKIPVPESYKNVNRNIQKFEDVHYIDINFQSFVREQIGLSVLGKNDVQKKKKE 
ETPFFAPFNKSKIGGECIEDLKYDSESVSIIRDVFNLLGEMPTEDVKTSRSCINPSHNDT 
NPSMRLVFRPMYWRNSKLVMDKLSKEQDSALIEKYMGGEHQHCIIGGRNVLLYCITALCF 
SSDCGPKKMLTNDEIKQLIWYLVLLFFHIICPIIQSK 
(SEQ ID NO: 188) 



Top Blast Hits 

Score E 

Sequences producing significant alignments : (bits) Value 



YNHE ECOLI {P77522) HYPOTHETICAL 56. 


.3 KD PROTEIN IN LPP-ARO 


35 


0. 


.69 


TTK MOUSE (P357 61) DUAL SPECIFICITY 


PROTEIN KINASE TTK (EC 


34 


1 . 


.2 


Q56333 (Q56333) FLIL 




34 


1 . 


.2 


CAB49S08 (CAB49508) HYPOTHETICAL 32. 


,8 KD PROTEIN 


33 


3, 


. 5 


BAA85006 (BAA85006) ORFIP 




32 


6. 


.0 


BAA85071 (BAA85071) ORFIS 




32 


6. 


.0 



Comments : 

EST confirmation of the predicted transcript: 

An isolated EST has equence identity to nucleotides 531 to 1 of CT606: this 
corresponds to nucleotides 64063 to 64593 of the genomic reference sequence. 

TaqMan Proe Sets: 

5*start=546 

5* stop=567 

3'start=655 

3'stop=677 

5'prrnier=CCCTTTCTTTGCACCCTTTAAT (residues 546 to 567 of SEQ ID NO: 186) 
Tm5=57 .43 

3*primer=ACATCCTCAGTAGGCATTTCACC (residues 655 to 677 of SEQ ID NO: 186) 
Tm3=58 . 14 

probel=GTATGATTCTGAGTCTGT (residues 606 to 623 of SEQ ID NO: 186) 

probelstar t=606 

probelStop=623 

direction l=Reverse 

Tml-68.79 

scorel = l . 7 9 

iength=132 
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CT607 

Nucleotide 

Genomic coordinates: 

Start: 68659 

Stop: 65032 (SEQ ID NO: 188) 



Amino Acid 

MGVPEAKKVYENAYGAQNGRVIKEKTGYEDCYDDEDDEDYCSGEEDCTTSSLLKATSLAN 

INSKNFLDFGRGKKSSSSSPTCDYTLDMVDLPTYNVSDLVMLGRQIATTMLKGQKNMGQM 

ILFINTTNQQIIDVLHDGFNVIREEDTMHSRMQNKKHIYENFYCRDEKKVISEFFSRKYK 

HEKIKARIERVPIIIPSSQEEVDWLTEPPIEDMMMAPPVSNHKMDDYEGLDYWINKHTDV 

MKKRKFLTNSFLFRNVPTTSFNSSPTAVLKSRFKDAFFASQMEGVILYYAFRMIRVMKNL 

LKSKNLKGRYTVLFTDGKAPAIKMMTRAKRQIRQERSKEKAKSRNENCLNRKTNDLLFYS 

CERMMMRLPQGLMASALLDIMRIPVLKTTGSKCMYLSNASFTEAEDDIVRLTSCLLNLET 

PGKHFSLLEKRKILEYDSYNMSGNRKESKRWEDLLNVLKQHTNDENQTLSMNLFSHDSDV 

LVKWNLMVGHHKNVCRLTGTQFKDSETFLKIGHVKFFRCMNSNSSGENQANELGGFAAKR 

RTKPNTIYNLAESPLMLSPESTLLIMLTKGSDYNSAIVSNCEYDTWVRKEVAVFENTYCT 

CVGGWEIFLSEQEARKNNKDCDDSVGNISMGNLSKSNCRKCDKKLVLPFWTIKFFYLSQA 

IDFVRDPLQLCFPPTHLIDLETDVSLKHALHRALAVNi\AANVMSYLTMGSFNQRVFGTIT 

TLSDISIHLSGANNNESKNTGSDVESDTEDLIPFSNNKRKSGNDPQKSTRKKSKVNATRK 

SAPVTKKLSSSVFESIRGFFESHTEGGIINDRGILTKERIDVFGNNLDTNPEALGEENGG 

GGGIVSSIPGLSTEQTSILKTEQNNSTSDFLDFFKKFNEMDDVEEEEEKMEEGEKEEEEA 

DLETDDWLDEARKAFEYKDSDFLEAVTAATNEMTSSLAKNNIEEDEHSRCSVSSKLNNKQ 

PVMDEEKWAEIVNEFDKCISLDNITYNDNSLLSRLSGVLMDANKREDGNNSNVVLYEPVQ 

GIDDERFSGVPYSVKTMNLLVIVYMNMCGLEDNTIVYQQLMPIIHSEFCGKTEEDKICTD 

RTNFMSAALEYTMLQYMPELKKTPRIKQIKRKNWERIPKVLDDFKDKVSTCTDNYNKLLA 

TLNKEGKIPSENTKWLPSQGQFMPVLGVAISKPWSPLTLWSSFYLQHQQRQDVSLTNITP 

PNSPRPEQ 

(SEQ ID NO: 189) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 



Q12532 (Q12532) HYPOTHETICAL 119.1 KD PROTEIN YPL009C 

Q9XZI5 (Q9XZI5) RANGAP 

Q13387 (Q13387) HYPOTHETICAL PROTEIN 384D8_2 

Q9yTL7 (Q9YTL7) ORF 4 8 

Q25662 (Q25662) REPEAT ORGANELLAR PROTEIN 

096266 (096266) HYPOTHETICAL 283.6 KD PROTEIN 



50 
47 
46 
45 
45 
45 



le-04 
8e-04 
0.001 
0.003 
0.003 
0.004 



Comments : 

EST confirmation of the predicted transcript: 

An isolated EST has equence identity to nucleotides 1 to 860 of CT607 : this 
corresponds to nucleotides 65102 to 65960 of the genomic reference sequence. 

TaqMan Primer/ Probe Sets: 

5'start=1428 

5*stop=1448 

3'start=1516 

3*stop=1538 

5*primer=CGATTCTGATGTGCTGGTGAA (residues 1428 to 1448 of SEQ ID NO:188) 
Tm5=58.31 

3 'primer^TGGCCAATCTTTAAGAACGTTTC (residues 1516 to 1538 of SEQ ID NO: 188) 
Tm3=58.35 

probel=TGGAACCTGATGGTTGGA (residues 1450 to 1467 of SEQ ID NO: 188) 

probelstart=14 50 

probelStop=14 67 

direct ion 1- Forward 

Tml=69. 04 

scorel=l . 95 

length=lll 
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CT608 

Nucleotide 

Genomic coordinates : 

Start: 69118 

Stop: 68'704 (SEQ ID NO: 190) 



Amino Acid 

MDGDSSSLQLLSESEFDYVVETLKEQGVWELALEVFNEVSNSIETVKEEEDYTVLRSRNY 
FPTESITLYKQQQEEEESTPIKKRKLASGKSPRSLCRELRLLQIPSTTTFKAAPRSSSRR 
GKNTRLRRVCKNYGAHQ 
(SEQ ID NO: 191) 

Top Blast Hits 

Sequences producing significant alignments : 

VE2_HPV08 (P06422) REGULATORY PROTEIN E2 

FLP_ZYGBI (P13784) RECOMBINASE FLP PROTEIN 

Q9X2S0 (Q9XZS0) GUANYLYL CYCLASE (EC 4.6.1.2) 

CINA_ELEEL (P02719) SODIUM CHANNEL PROTEIN (NA+ CHANNEL) 

GLMS_HELPY (026060) GLUCOSAMINE — FRUCTOSE-6-PHOSPHATE AMINO 

Q9Z8C4 (Q9Z8C4) TRANSGLYCOLASE/TRANSPEPTIDASE 



Score 


E 


(bits) 


Value 


38 


0.O28 ■ 


34 


0-41 


33 


0.71 


32 


1.6 


32 


1.6 


32 


1 . 6 



Comments : 



TaqMan Primer/Probe Sets : 

5'start=ll4 

5' stop=138 

3'start=235 

3'stop=259 

5*primer=GGTGTCCAATTCTATTGAGACTGTG (residues 114 to 138 of SEQ ID NO: 190) 
Tm5=57.77 

3'primer=CGAGTTTCCTCTTCTTAATAGGGGT (residues 235 to 259 of SEQ ID NO: 190) 
Tm3=58.39 

probel=AACAACAACAGGAAGAGG [residues 209 to 226 of SEQ ID NO: 190) 

probelstart^209 

probelStop=226 

direct! on l=Forward 

Tinl=68.86 

scorel=l .86 

length=14 6 
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CT609 

Nucleotide 

Genomic coordinates: 

Start : 78365 

Stop: 77441(SEQ ID NO: 192) 
Amino Acid 

MWCSTHLSYSEFFTPLQSQKLFRNFFRALEFRGWTASSTECQVPRVDLWVGPMSDYTRNC 
WFQKRTLTFVCFWNRRFWRLVDPEMRGYNLLFSLENFTLPLSQKLFKNFFRALQFRGWTA 
SSTECOVPRVDRWVGPMSDYTRNVIAPETYINFCVFLEQAFLETGRPRNERVYPSVFTRE 
FYSSSISKTFQKFFRALQFRGWTASSTECQVPRVDLWVGPMSDYTRNVIAPEIEEVSYGH 
FWTRCFWTKILLDGNPLPLPPPFBCKGPRVYNDCTTPHSNHHNHHHHHGRTSILQQTLSRK 
WSSLL 

(SEQ ID NO: 193) 
Top Blast Hits 



Sequences producing significant alignments: 



Score 
(bits) 



E 

Value 



DYRK_RAT (Q63470) DUAL-SPECIFICITY TYROSINE- ( Y) -PHOSPHORYLA 
DYRK_MOUSE (Q61214) DUAL-SPECIFICITY TYROSINE- (Y) -PHOSPHORY 
06154 3 (06I543) BHLH-PAS TRANSCRIPTION FACTOR SPINELESS 



37 
37 
37 
37 
36 
36 



0.22 
0 .22 
0.22 
0 .22 
0 .29 
0.50 



DYRK_HUMAN (Q13627) DUAL-SPECIFICITY TYROSINE- (Y ) -PHOSPHORY 
HXA1_M0USE (P09022) HOMEOBOX PROTEIN HOX-Al (HOX-1.6) (HOME 
O60275 (060275) KIAA0522 PROTEIN (FRAGMENT) 



Comments : 
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CTlOlO 

Nucleotide 

Genomic coordinates: 

Start: 41348 

Stop: 41795(SEQ ID NO: 194) 



Amino Acid 

MTHLVLLILSLSLSPVYHHLTPYLSPHLTYTPISPITSIFPHLIHSLQFQHPVLAEPTHN 
QIWTPVFPFIPNRHHLCPQALAVYIRRRGQARSISSLQASRRATQQALSLLLPRRDLPIL 
KLQE W PLQP P PHQVLT PCWTLS Y LVLRN 
(SEQ ID NO: 195) 

Top Blast Hits 

Sequences producing significant alignments: 

062081 (062081) C31A11.6 PROTEIN 
Q05330 (Q05330) PUTATIVE ORF 

Q60974 (Q60974) RETINOID X RECEPTOR INTERACTING PROTEIN 13 
P79230 (P79230) KAPPA CASEIN (FRAGMENT) 
Q9WAL8 (Q9WAL8) POLYPROTEIN 
AAD12B52 (AAD12852) Y8A9A.2 PROTEIN 



Score 
(bits) 

36 
32 
32 
32 
31 
31 



E 

Value 

0.20 

2.2 

2.9 

2.9 

3.8 

6,6 



Comnients : 



TaqMan Primer/Probe Sets: 

5 ' start=209 

5 * stop=227 

3'start-280 

3*stop=300 

5 'primer=TCCCTAACCGGCACCATTT (residues 209 to 227 of SEQ ID NO: 194) 
Tm5-58 . 83 

3' primer =GCTGGCTTGCAGAGAACTGAT (residues 280 to 300 of SEQ ID NO: 194) 
Tm3=58.29 

probe l^^TAT AT AAGGCGGCGCGGC (residues 250 to 267 of SEQ ID NO: 194) 

probelstart=250 

probelStop-267 

direct ionl = Forward 

Tml==69.O0 

scorei==l . 99 

length=92 
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CTlOll 

Nucleotide 

Genomic coordinates: 

Start: 41758 

Stop: 42097 (SEQ ID NO: 196) 
Amino Acid 

MLDSKLLSSEELKELTSYVSTSSRRSDMKKHLLHLFEEHEKIFQFIQGKHKFSLYTLDFE 

IFYVMLNILLVEVKNILSPIPLLFDRNLQPVRRLWMFHNGPASPERCSRSLG 

(SEQ ID NO: 197) 

Top Blast Hits 







Score 




E 


Sequences producing 


significant alignments: 


(bxts) 


Value 


044068 (044068) UNA 


POLYMERASE ALPHA (FRAGMENT) 


31 


2 


.3 


Q49563 (Q49563) DNA, 


TRANS POSON-LIKE ELEMENT ENCODING 3 ORF 


31 


3 


.0 


BAA74885 (BAA74885) 


KIAA0862 PROTEIN 


31 


3 


. 9 


044055 (044055) DNA 


POLYMERASE ALPHA (FRAGMENT) 


31 


3 


. 9 


076063 (076063) LEUCINE-RICH REPEAT PROTEIN SHOC-2 


31 


3 


. 9 


YD89_METJA (Q58784) 


HYPOTHETICAL PROTEIN MJ1389 


31 


3 


. 9 



Comments : 



TaqMan Primer/Probe Sets: 

5'start=107 

5'stop=127 

3 ' start=193 

3*stop=217 

5 •primer=TCGAGGAGCACGAGAAGATCT (residues 107 to 127 of SEQ ID NO: 196) 
Tm5=57.83 

3'primer=CTTCAACCAACAAAATATTCAGCAT (residues 193 to 217 of SEQ ID NO: 196) 
Tm3=57.4 6 

probe 1=ACAAGGTAAGCACAAGTT {residues 138 to 155 of SEQ ID NO: 196) 

probelstart=138 

probelStop=155 

directionl== Forward 

Tml=69.0l 

scorel=l . 88 

Length=lll 
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CT1012 

Nucleotide 

Genomic coordinates : 

Start: 42053 

Stop: 45491 (SEQ ID NO: 198) 



Amino Acid 

MAPPHLNAAADLLDKVMSGPLSPEGAQTSSPAACVGAKVVKALVSFCQKTRFTTNIVMRE 
VKAMEFQGDDFNYSALCASMPQRPVTERQMFALMKSEDEEMGVSANFSPVSDDVINPSSL 
PSGQEVDSSTSAQISGMFQNVWSLLEECGSGSNSNSSPVSRTVLVCTL.FIIQVFKFLVTK 
VSNVNVLNQLFGHVVFGSLDVAPSNNNSVPSTVVNNNNKPSTSNNSNNISNKRVGGSNNS 
GGGRSKKVTATAKNPFNNVDGDNHGMFAGAPVDVNLDDFVFPQVETLTSKSTIPKEEVNV 
DEDLSKMCRKTALTPLEIHTFNVFISEINPSKYDRSMFCKGFLTAWDKFVEGDTAGVKRF 
RNYILTRSNYASAARAVYEASIKGTVYYNDKSKFLFHDNVNPDLDKSWGNKNGKKPRLPA 
NLMAFMGIDIVKVCAKGIQKYMFAKQFQHPEVEELVPPMAVYAKVAAGLKSGTLFDDWDL 
PEYENCQFIKYDTEGCKKHSELYAKQLLRTGLNQYNKLEEGQSAFPFANIVTVTSASSDD 
IHGDTIIELMYKTKDGVKGVSKIEDENIIKVNPAEEKKNNRVQAEKTLYFEIDSDDEVCE 
RTEEEFFRPTSVVAAPTTPLVPSNVEEEEEEEEQMEEEEEEEVEREEGSDKEDDGDAPAQ 
EEMEEEKEEEQQQQPEEESNGNENQEEEQQQQQQPEREEENKDADSDSDSDSSSSSSSSS 
SSSSSSSSSSSSSSSSSENEAEKKKEEEVPAKIQKRKRLSERPSEAASSPKRMRVEEEQQ 
QQLSPSLDILQTAVDEMMEEIPAPEPIVATTSPKAATLALKTGFSySSFVRGDDLSVAGN 
TSPTEPAAVPAAATCTSDVGNDFLDMLDGLPGDIVMQPGECDVTAKFFEGITLPDGTDNE 
CTGFDDLLKATETDNIITTTCFTSPIHPSSNSAPRKDIDNCSSIKRSRAGSLFDTDDDSE 
TNEVEKEAPKRKKHLKKRRNKSHRGSSGSASSSHCMSSDEESEDERDMKSTSKVHKSPKA 
HVKHSPKYDAVNSDVNN5YNNVNSTTCMSSSDSDAEAQPKSHNKSHSRKHSSSSTSDKKQ 
NQQCSINTQNVKKTVVQSPPSFRSFSPKKDELGDFLSRKHTKPVRPYNKKRDNVNTTNNV 
VQRSA 

(SEQ ID NO: 199) 
Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

Q9YTL7 (Q9YTL7) ORF 48 101 4e-20 

095815 (095815) DENTIN PHOSPHORYN (FRAGMENT) 99 le-19 

GAR2_SCHPO (P41891) GAR2 PROTEIN 97 6e-19 

AAD46501 (AAD46501) LATENT NUCLEAR ANTIGEN 94 6e-18 

040947 (040947) ORF 73 92 le-17 

GARP_PLAFF (P13816) GLUTAMIC ACID-RICH PROTEIN PRECURSOR 91 3e-17 

Coinments : 

EST confirmation of the predicted transcript: 

An isolated EST has equence identity to nucleotides 1 to 955 of CT1012: thxs 
corresponds to nucleotides ^3 4515 to 4 54 69 of the genomic reference sequence. 

TaqMan Primer / Probe Sets: 
5 •start=1345 
5'stop=1366 
3'start=1389 
3 'stop-1407 

5 •primer=CATCCGGAAGTGGAAGAACTTG (residues 1345 to 1366 of SEQ ID NO; 200) 
Tm5=59.57 

3 •primer=CAATCCTGCGGCAACCTTT (residues 1389 to 1407 of SEQ ID NO: 200) 
Tm3=59. 65 

probel=TGCCTCCTATGGCTGTAT (residues 1367 to 1384 of SEQ ID NO: 200) 

probel3tart=1367 

probelStop=138 4 

direct ion l^Reverse 

Tml=68 . 99 

scorel=l - 99 

length=63 
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CT1013 

Nucleotide 

Genomic coordinates : 

Start: 45950 

Stop: 47825 (SEQ ID NO: 200) 



Amino Acid 

MSSTTSPSSLWDDDDDDDEEDEKDVKQEVSNRPPIFSYMETVSFSDNDEDDNKGEEECFG 
SNFDMFGDSDNMPSTSTAPFPPPSTTTPLPTPRSIMDTDSDECDEEGAAAASAPSIAASS 
S I PVG ISEAELKKMEKKKRKEI KKLKKMMKDPliPHLYVGGEPPVAADYKTRANI SLYKVD 
PSIDMCGVAPPQFCAELPTPSIDVYTSSYVFPPPTPAMHNKKGSKKCQFLKGRKALRKWI 
HENVCMAPPGKRGGVFLAHLEQRFLAEHGDEYKVPRMFVSRVLNKAFPNLIARADTLCSD 
MTFYTNLCWIVNGVVVCFDKDDGGIHGDA5EYATGENFDTVVFHKREEQKTNGSASKKRR 
LTPDTSNMGTSTDVQEFQTMGTNTDMQEFQSMGTNTNPIETSSVGVNTNPLPNPPPRLVI 
TPLTNDVPELDMMWLYSPSRGGGNSRMSANTGTSPLSNTPIPTCFTGGANVVVPNGFVPP 
TFPLECDEDDPSIPNSYNYEEDKVFHPFYEYMAKYLSPLVPSYNKGQTCNVVQEWFKGSF 
SLAKRRGTVPKFCSNISHAFFCNMDVCTAMCKWAKTVIRHGQYCNRCIVRRSCTSMLAYH 
Y I VCRDASCDVPKCRERVRNDMDD 
(SEQ ID NO: 201) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

AAC97971 (AAC97971) FAS-BINDING PROTEIN 47 4e-04 

Q19871 (Q19871) F28D1 . 6 PROTEIN 44 0.004 

NUCL_MESAU (P08199) NUCLEOLIN (PROTEIN C23) 43 0.006 

AAD56625 (AAD56625) NUCLEOLIN-RELATED PROTEIN NRP 43 0.008 

VP41_YEAST (P38959) VACUOLAR ASSEMBLY PROTEIN VPS41 (VACUOL 43 0.008 

035613 (035613) FAS DEATH DOMAIN-ASSOCIATED PROTEIN (DAXX) 43 0.008 



Comments : 

EST confirmation of the predicted transcript: 

An isolated EST has equence identity to nucleotides 1 to 934 of CT1013: this 
corresponds to nucleotides 4 6867 to 4 7 800 of the genomic reference sequence. 
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CT1016 

Nucleotide 

Genomic coordinates : 

Start: 52093 

Stop: 54913 {SEQ ID NO: 202) 



Amino Acid 

MSEPSVYAFIDIKEIENGWEKEFGLLVQPGQKLAPFRDISYDSSKLDCDAFSCIPSDILH 

SDNEKRVGECNFAEHTSVSFPVKNPEGKTLRHFTACGPGCYRRYKQRDPHTGLPVARGVL 

MQDHVDHETGNKMCEYLNQSLVMWAAVPWIRPGDLTEGYNTTHVPGFAFKEDDERDSKRV 

KYENVVISKAYCDFFKQYYDADSGSCYRSGWMKFVHLMFGQYFTNLSYNLANPKPYNLTG 

NTWSDVVSVLTDNPIVDAGAAPSRSEMDEIITKKKFNVFPSEQTSARQKAENIIRSQYGD 

GVEIDPSSVDALMQFVNREGVVGTEKKSDRLMRVADAVMDAAMRLQVMGLDDSQSRRLLL 

KNMIKMSRNNPEYARHFSSSLKLIGVTLAIKRSVFSKGASAKRKETAINNGEQHRRSRWS 

PETVTEEDALLFARENITEDPKHPAPFVDILHSPDINSSIKSGSSSSIWNDILSRISSTR 

KLEEKASVFVKNLVVKVVRQFLDILEGKLFSDGYEWDDNIPLMIGVDQILREVIKAASNM 

CARFASSALESSLVTGFIDSASAITSRLAVQLAARTFSVFLEESVIEFVVAASLRLAIQA 

FADLATLAASALTVIGIVIFVIQVLGLILDLALGLGWYDHIFSPEDLKKQVLVFRREFAK 

AGNVDVGVAQPVTPEEIVAINVFLQTEENGEEKKEEGARKSKIDFLQKYFHSTPLMGKKS 

KFVYIQEAAQEYLGGRTMNAFGQRIITAADDSDTTTTTQEGRRDDETVTKKMRSIILETG 

QTLKDYSSAVNYNASRLDYVGEEWVRNTALKEETRSNTTSDNLFKKTVSLASMAGAFLVL 

GIGVLVASHITLLRFTNIGLAFAFAGLLAFIALMSISYINMNAMGVVNSDAIYRSTALVG 

DIKTDPRRVGMVQRHVGVGAKYNMITDFVSPMLDEIESD 

(SEQ ID NO: 203) 



Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 



067140 (06714G) L-SERYL-TRNA (SER) SELENIUM TRANSFERASE 41 0.048 

YD86_SCHPO (010411) HYPOTHETICAL 222.8 KD PROTEIN C1F3.06C 38 0,32 

Q88606 (Q88606) (CRUCIFER) GENOMIC RNA FOR RNA-DEPENDENT RN 35 2 . 1 

PME3_PHAVU (Q43111) PECTINESTERASE 3 PRECURSOR (EC 3.1.1.11 34 3.5 

081424 (081424) T2H3,2 PROTEIN 34 4.7 

004260 (004260) HYPOTHETICAL 49.5 KD PROTEIN 34 4.7 



Comments : 
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CT1017 

Nucleotide 

Genomic coordinates : 

Start: 55054 

Stop: 58189 (SEQ ID NO: 204) 



Amino Acid 

MDSLTNTVTLLVNDRLGNHRTNKPITEQDVENTLNLNSLERASLLKLYSVFIKEMQSYSG 

CIPKNKYTNVQEIFEDGLITFEWRDGTKVHRSVSPSSPIPLSTKKSPRSSPSPPPSMPSI 

KEEEFEEEFEDDEEIYETDENVEDFINGDGEDSEEEEEEDIIVDDEEEENEEGENKYVLA 

FSNHLRRQTAAAAAAAAAAAAADIEKKDKNHAVSAHDYTLSALQQQQQKLLQQQQQQQHQ 

QRSSSEKVTSTPNKFNKFLLPSNGFSEQXELFVCFDVDKIAQYNGLVELDILPIVAEYII 

NGLGLKCSMETPPVKPCRRKEVKDVWCQPKTSFENDAVEDKHLAFAESPILQRPRDFPIP 

KKITAYFCLDDSVDIKNPWGSCPLLKSGSNFRVSEYSRHFNEFSGVKNDDDTSSNTCFIY 

SQKNPNIEIVSKLNIEFEVMMEGIITHRKDLFETGILSDSSLATAMAFCHPKARVRNVAL 

FYFSVYLPFSKITRKETIKCSETDKVHIGSDAIFSPPSDNPNISAHQNNNNNNNNNTSVN 

lEDRPIRNNNISRKMTITNYQCMACKERCTNNCTNGNYPDRGNQHLSHSVKGEDFFKILN 

NSKVDSLKKLSRVLIPAPPSGNYTSKFCDRSSMCHSFFCRGIEPVSTSFSSDSFEKTKLV 

LYGKVVDVINSYSAIKTSHNNRIRVFFNSEEKDNKTIPSRAESAKNAFKDILVHECNKER 

AVSYFEQNKLSSKDGHLSNKWWIELNDLNIMFEKHVEDFYKKCSKVNDAESLKDIFNDFE 

KTCDKYKTAKRAIIGAQDPSTSTPSKKENGITRIISTLSEFHSKDEATVSALLDKTMLLG 

SRTIMSGVRCVIRNNSVFSGFENKNTNNNWELEIRHYVISMGGAAVTKISDEDLEQFTPV 

RGAVSVTTAPNDKLPVGAHQTWKDEQTLKTNTKRNSLYDSYNSKRNNRDNNKIKNRSLKL 

SDFNWRTPNISIQEFNANKDDVNKKRYAEVVASAAPKSPSPTSSSSSNSNSSSPPLSPLS 

PTVKNSNNKPLYI PPKKRMTTTAV 

(SEQ ID NO: 205) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 



077033 (077033) TRFA 60 7e-08 

09^463 (Q94463) K7 KINESIN-LIKE PROTEIN 59 2e-07 

NUCLCHICK (P15771) NUCLEOLIN (PROTEIN C23) 57 5e-07 

AAD4 6501 (AAD46501) LATENT NUCLEAR ANTIGEN 57 8e-07 

040947 (040947) GRF 73 56 le-06 

Q98148 (Q98148) ORF73 HOMOLOG 55 2e-06 



Coininents : 

EST confirmation of the predicted transcript: 

An isolated EST has equence identity to nucleotides 1 to 215 of CT1017: this 
corresponds to nucleotides 57667 to 57881 of the genomic reference sequence. 

TaqMan Primer/Probe Sets: 

5*start=1741 

5'stop=17 62 

3'start=l847 

3' stop=1864 

5*primer=CGTGGTAACCAGCACTTGTCAC (residues 1741 to 1762 of SEQ ID NO: 204) 
Tm5-59. 16 

3*primer=TTCCAGAGGGAGGAGCGG (residues 1847 to 18 64 of SEQ ID NO: 204) 
Tm3=59.84 

probel=TGAGCAGAGTACTGATTC (residues 1829 to 1846 of SEQ ID NO: 204) 

probelstart=182 9 

probelStop^l84 6 

di rect ion l=For ward 

Tml=69.01 

scorel=l . 98 

length=124 
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CT1018 

Nucleotide 

Genomic coordinates: 

Start: 58947 

Stop: 60029 (SEQ ID NO: 206) 
Amino Acid 

MKICQISSPTLTLSIPLEGVYHVKQLLHLKVHLDVKGVKQLLHLKVRLDVRGAKQNPWRK 
NLCLLKKNVKSAKQLPHLKVHLDVKSAKQLPHLKVHLDVRGAKQLPHLKVRLDVKSAKQL 
PHLKVHLDVRGAKQLPHLKVRLDVRGAKQNPWRKNLCLLKE<NVKSAKQLPHLKVHLDVKG 
VKQLLHLKVRLDVRGAKQLPHLKVHLDVRGAKQNPWRKNLCLLKKNVKSAKQLPHLKVLL 
DVRGAKQLPHLKVLLDVRGAKQLPHLKVLLDVRGAKQNPWRKNLCLLKKNVKSAKQLPHL 
KVLLDVRGAKQLPHLKVHLDVRGAKQQQQLCLPLKTISTSFTHLLLCLYMEYGKHQNLQV 
X 

(SEQ ID NO: 207) 
Top Blast Hits 



Sequences producing significant alignments: 



Score 
(bits) 



E 

Value 



041125 (041125) A643R PROTEIN 

077336 (077336) PFC0425W PROTEIN 

Q65683 (Q65683) 42K TRANSPORT PROTEIN 

Q48275 (Q48275) HYPOTHETICAL PROTEIN (FRAGMENT) 

Q89659 (Q89659) 42K TRANSPORT PROTEIN 



36 
35 
32 
32 
32 



0.34 

0.75 

6.5 

6.5 

6.5 



Comments : 
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CT1019 

Nucleotide 

Genomic coordinates : 

Start: 62203 

Stop: 63019 (SEQ ID NO: 208) 



Amino Acid 

MDVEFGFFHGLLSKALLPDEKHQPVIRRLCADDSRNKGEDGCCSFCGRRGTGESNTACLE 

QLIDVCSFIGTVSSIGTIINSNLSTSCSRLQKTSDSYAALSHSSFLDVVYPSLKKTTEDV 

LPHSLRAIWNKQLPKLYEKTLQPIEEEDIGYKDYVVSIEDDDNVDDGDQQEQMIIDEESY 

KTIGEKSTIELIGMYNNNKFGNEFIRIPLRETALHAQSLRYDTEAKFVNHKDSIPLFYEN 

STCTCKERLI DFSERQLQQLKQDGMDKPTDK 

(SEQ ID NO: 209) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

AAF02799 (AAF02799) F5I10.23 PROTEIN 37 0.14 

O23058 (023058) BAG IG005I10 37 0.14 

0PI1_YEAST (P21957) NEGATIVE REGULATOR OF PHOSPHOLIPID BIOS 34 0.90 

Q24042 (Q24042) HYPOTHETICAL 89.2 KD PROTEIN 34 1.2 

Q24043 (Q24043) HYPOTHETICAL PROTEIN (FRAGMENT) 34 1,2 

076871 (076871) EG:100G7.2 PROTEIN 34 1.2 



Comments : 



TaqMan Primer/Probe Sets : 

5'start=292 

5'stop=313 

3'start=375 

3'stop=396 

5'primer=GCGGCATTATCCCATTCTAGTT (residues 292 to 313 of SEQ ID NO: 208} 
Tm5=58.17 

3'primer=TTGTTTATTCCAAATGGCACGT (residues 375 to 396 of SEQ ID NO:208) 
Tm3=58,03 

probel=CTGAAGACGTATTGCCTC (residues 350 to 367 of SEQ ID NO: 208) 

probe Is tar t=3 50 

probe lStop"3 67 

direct ion 1 —Forward 

'rml=68.97 

scorel=l . 97 

iength=105 



FIGURE 3, Sheet 1 12 of 160 



wo 01/38351 



154/201 



PCT/USOO/28888 



CTIOOOO 

Nucleotide 

Genomic coordinates : 

Start: 3143 

Stop: 6956 (SEQ ID NO: 210) 
Amino Acid 

MVLTLSCTTRRVASSKGNFSKEDAVLGNQFPILKKSNNLSIARPPSIESFSASVEKIFRE 

WNESGGEKIFDISQNEEEWMDIISLVESVYEPVFSKSLKPDKLADKTCLTAAAFAALASA 

VDEKLTILSGSDGSVLQRTTKVMKKDPKKIAESLLNNEKWTSILLDRLKTAKKLLSRRGA 

LKSAERVEVLHRLNKLKEAPLPHHPSLFDNFSGGKTSAVSAGTVIASDMHFKLVEHIFKV 

SFRKWGPCGDKTESGEEEDEEEEEEEKKHSISRFVLQFMNGHNGQHYHRPESASVYFCDY 

YDYLAYRNLPNEYKLSSMHPGTFNMEDLPFRPFAVPSTYKTELEYKRFVQSTNLPQLSFD 

YGEFLCYCIFGADWYKHLGDVVDSLENSSMISFDSQTLSGVYKNTANYKRLGKKRNGIAD 

LAVRSMAEFIRTEAHKALTAEEMEEEEEEEEAEEEAMDQEPAEVDFLSVPHLRRKIRQAV 

SVLNNFVENDLSILVSNFKNVLTDDTVSGTDTDNFGSSGEFEALSSHLFLSRILDEVHIL 

RNTDIQRTLFSTHVSliSDKSPPSRVRGSNVNFNNNAGNISSLQTYGGIEELPENVLVGLS 

GGFEDTDMYSGEDVVVVWDGCDGGKVLSVTFNCGDNFIQLHEKTAETFKDDTDLVERIRD 

VLQTASKTGNLNKKAYSRKNIYAVLRENGIERPGDDFTEKGIALKDKTNQPPPPARSAKI 

TVEGVKGFFSGFRDILETRALTTYSAETFRDLGQGIVKETEGLTAATVAETSFSEGLAES 

LRSDANLGLEFSEDAKTVVFKNDTSRSLLEETRALRANNTSFSSFARDMGVQVSADLDAE 

FAAEMRETYPDAALEQNLKDLDKFEETIPESQVKKLKKIDSYLTENPERAGKEINDTELS 

KATDSVLGKKLGNAVTVLMNNFGKVTIVVGASVVAGFLGPAAVALVHASRGAHLNVVDHT 

SPKGVISYKIVDFSCADRNTGWAKPTKHPFREEIDHVIALDASFLTENGAYVFPEDGGPK 

SKYKAYAPICGTKDAAQGECGSWATFDDPHSVLPWVASMKDLPKGQSLSCDKGMSTLKAV 

SSVLLSIGKDVAEAIFEVAEDAVVGLASKAISAVINNPLFIFGVPLGFGIAATRLNPSNW 

KTGLIVFSILLVVILIVRFFAGSGPLTLNWFGAKNSAKRKQTEQFEDGGGNRSKIVLAEK 

DNANSKLQSRRNETGPMRLEELPGHEDLRPVFFPATTNYSKSAKILGYKSKPFNDFYTKI 

INTDIIKMDR 

(SEQ ID NO: 211) 

Top Blast Hits Score E 

Sequences producing significant alignments: (bits) Value 

YK06_YEAST (P36062) HYPOTHETICAL 75.5 KD PROTEIN IN SDHl-CI 43 0.010 

Q12532 (Q12532) HYPOTHETICAL 119.1 KD PROTEIN YPL009C 43 0.017 

075184 (075184) KIAA0702 PROTEIN 41 0.050 

O60721 (060721) RETINAL ROD NA-CA+K EXCHANGER SPLICE VARIAN 41 0,050 

0434 85 (04 3485) RETINAL ROD NA+/CA+, K-t- EXCHANGER 41 0.050 

AAD28522 (AAD28522) FLAGELLIN (FRAGMENT) 41 0.050 

Comments : 

EST confirmation of the predicted transcript: 

An isolated EST has equence identity to nucleotides 1 to 969 of CTIOOOO: this 
corresponds to nucleotides 5867 to 68 35 of the genomic reference sequence. 

TaqMan Primer/Probe Sets : 
5 '5tart=2102 
5*stop=2125 
3'start=2185 
3'stop=2204 

5 'primer =GGATTGCTCTCAAGGATAAAACAA (residues 2102 to 2115 of SEQ ID NO: 210) 
Tm5-57.56 

3 'primer=ATGTCACGAAAACCGCTGAA (residues 2185 to 2204 of SEQ ID NO: 210) 
Tm3=57.81 

probe 1=AAGTGCCAAGATAACGGT (residues 2148 to 2165 of SEQ ID NO:210) 

probelstart-214 8 

probe lStop^2 165 

directionl=Forward 

Tml=68.99 

scorel=l . 99 

length==103 
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CTlOOOl 

Nucleotide 

Genomic coordinates: 

Start : 17098 

Stop: 20342 (SEQ ID NO: 212) 
Amino Acid 

MGAPTNADFTRTVSGVASSLYLVNPGAPSDREKLVLASSYSDSFVYWYKDAVVTAEAPKW 
CPFNEPALHEHIMNRLEKAGLINRSRFVCNPVKSAGEVCGFRYSGGSTPQNLIFPIGASE 
INWLVILRNAARFGTVAASAKDAIERIPDLREGGTSKHVAKNAMRRLRVWRAFNWIAEAS 
RSAGMIRYEPFSVGCALYDHEVRRRQLKGSYERNVLFLGDHFCKESTLLADISRGGRSSD 
FWTIVEAVIRYKNRHARTISNETNAIPEDSSINLEWEDVLVKNQRDTNQGDDSTLEKTLE 
AAIKEHESIGEKRKKHILEFIKTCLTEEQREMIFKGVGGKGNLSPAHLTNLADAILANWA 
KAGIWVIMQSLLKQINFSILHLIGYEAQRLLMFKLYMPALLALFISQRGIGDVFLNGVFN 
LEVMKERAANAKIRDMVSRDAYKNNTNESNNLGNYSQFDIATIYGQMSDCNAIPLSINIG 
VPLHKSRMNMQDIEKTIQHIVDMGLAKIRVAEENRVARRLLHRRKMQEKAARERAAARQR 
RLRGEEEEEEEEEEEEGEEMEEEEEEAGTSGVNGSGYDQEEEDDGEEEEEEEEEEDEEDS 
EGEDMNGENNSRKRKNTGNTSSTQQPPQKRQRGKNAPISTKGKRLKERDNIGGFLLATIQ 
NDDRQVNVESIQKLLTARQRKYVKGGKCGDNGLPELLIEKVTNLLDSVFHFRKGSILNSI 
HANRRSETGVYTTKANCICDYYERNVSKDSSNNTPHSSECIERARERDASCAESNKRPCP 
VDSNNPEDVEQRMRELIMDPPSLSGVEDSLAIERVLQNEILFTSLVTNPIFNAVLGAEKG 
DLGRFIVLNNIVKFMNMTIACLVDGDMPMLLDSRGKTKNLLEKGTVKNTRKFFKPNMTAA 
ELNVATAQSAGHQYMNAGHCPEPGIKQMLLPDCIMKLKSIAMEKGRGGRSALHRQKCDHA 
FCKMLKCLFFNIDPSNAADTFIDPASRATLFRLDDLCRDRKKYKNIDWVKDLLDPVMKGT 
NKWVGTGEYTNIGRDSNVAAPVDFYTILKYTMIDDGVISVPSRKPNDVYYSTIERADDLL 
TESRDASCESYRPTLFDARAVLEVNGDGRVPYPSSETVEDLGGEEEEEEITGVIDDSTEI 
EDVQVQDSNLFDVELFDIPEIEQHQQGGEEEEliPSAISEVFASLPADNDSSSPAHIPSFG 
NSEEGEKSPEPYNIFDSALDQLLDLIDSDGRNNNPKRVDWNSVTIQE 
(SEQ ID NO: 213) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

GARP_PLAFF (P13816) GLUTAMIC ACID-RICH PROTEIN PRECURSOR 77 6e-13 

094922 (094922) KIAA0835 PROTEIN 76 2e-12 

Q9YTL7 (09YTL7) ORF 4 8 75 3e-12 

CENB_CRIGR (P48988) MAJOR CENTROMERE AUTOANTIGEN B (CENTROM 75 4e-12 

007034 (Q07034) RNA BINDING PROTEIN 73 2e-ll 

NAB3_YEAST (P38996) NUCLEAR POLYADENYLATED RNA-BINDING PROT 73 2e-ll 

Comments : 



FIGURE 3, Sheet 1 14 of 160 



wo 01/38351 



156/201 



PCT/USOO/28888 



CT10002 
Nucleotide 
Genomic coordinates; 
Start: 20783 

Stop; 23729(SEQ ID NO: 214) 
Amino Acid 

MEETITLKESTGTVSPFKNENITRIASNYVRAFTDTWSHLVNISGAPLTAEKNPSAIPAN 
ELNR YWTKTNVLCN PLFKLE DH I TRDE DTGT I TLKFKMYI DDKNGL YQS AVLMLALDS FV 
SLASFSHGADLVSNKSENKFCVKIPHDTRAESLLNNVGFPAGLSGPFKRWSINYKAANLS 
GKSGIDGLSGSMLTVLKNNTNKRATDILHLVNNVSASAQQLDDSEMSRTFNHQKKVGVCY 
DINVSSSRQVNQRNLLHHQNIIGQHLIEFRTKQLERAQNKKVKEEENGEHEEMTSEEEEE 
EDEYEEGGCLSDIDEEDFYEDGYDEEEGDDNRTRKKKKMEEDEEDEEEEYDDEEDEEEAE 
TCGANGVIDCEDDAIIFPNGQNSKRKKMGKKTNIKKRSRRKGECSANTLSFVEKYVGNCK 
SLGIKPVGCPPPSTEFTSLFMKGSEADSCYNTCQSTRGASRIRSLLNKYSVKDLMQVNSP 
SSWKWANPPDRRFVLFDKKTKEEVEVKFEIECEKSEYFDWSELPSNIKVWLKETAKIIK 
HLALIEDFLPAMGAATPKIPLNLIKTMTSIFSVRDIVGFKIPEEVLSFIPIEWKTSISAM 
GLLSVQFDRI I EVI DLM ITNGAFATSCLNNAFFLERGVVPRDGSNTWLHTDLVQLSTS I F 
RSIRNRGVNIGGNNNTGSNSSSSSCGGNKGDYGVRCGLSISKRGITLKPPPAAMTNSSSP 
SSSAMISLPQPTRQSIDLSITTIIQDFSEVSGKLRLNGLQKNMSDKSKDVFNDAIYDSGA 
FKALLTCTVNDKSRRKRKRRTLLASGEGVVRRNLMVSQGNDVNDAHQFQEECGIKIGGGA 
SRVYKRAQRRGSAVSSRRRVRWKPQFTIAVSDEDDDCEEEGDFSSELNPTHSQLLLFQQR 
QQDSCTEDDDVLVSVEEYNNRVSGSSTTAGDRVLAKDLLSTVSPNEKRNSAALAALTISR 
HSLFNALSAKTKLGENGRFFL 
(SEQ ID NO: 215) 

Top Blast Hits 







Score 


E 


Sequences producing significant alignments: 




(bits) 


Value 


Q9YTL7 (Q9YTL7) ORF 4 8 




75 


2e-12 


GARP_PLAFF (P13816) GLUTAMIC ACID-RICH PROTEIN 


PRECURSOR 


75 


3e-12 


Q07034 (Q07034) RNA BINDING PROTEIN 




74 


4e-12 


NAB3_YEAST (P38996) NUCLEAR POLYADENYLATED RNA- 


•BINDING PROT 


74 


4e-12 


Q9YPA9 (Q9YPA9) HYPOTHETICAL 45.2 KD PROTEIN 




72 


2e-ll 


040947 (O40947) ORF 73 




69 


le-10 



Comments : 



TaqMan Primer/Probe Sets: 

5'start=1699 

5 'stop=l723 

3'start=1773 

3 ' stop=1797 

5 •primer=ATGACGAGCATTTTCTCTGTTAGAG (residues 1699 to 1723 of SEQ ID NO: 214) 
Tm5=57 . 07 

3 •primer=TGCAGAAATAGATGTCTTCCATTCT (residues 1773 to 1797 of SEQ ID NO: 214) 
Tm3=57 .23 

probel=ACCAGAAGAAGTGCTCAG (residues 1743 to 1760 of SEQ ID NO:214) 

probe Is tart = 174 3 

probelStop=1760 
^ directionl=Reverse 

Tml=68 . 95 

scorel^l . 95 
/ length=99 
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CT610 

Nucleotide 

Genomic coordinates : 
Start: 85707 

Stop: 83427 (SEQ ID NO: 216) 



Amino Acid 

MPPKHKPNTALKKHIIRNQQRKKEDDAESRrQRNMGQEVSKLDAPTSSKNRQRRKIRTSK 
ILSRSGDCVAGDCSDLENDEGKRDTDQEGGGRGGGNEEEEEGKEEGEGEEQQREEKEEQS 
EEKEEKDGEEEEEENVEDEHVTPTTSVSKRAKQMKKHIFPPSKKRKRSDTESKALAVPAG 
KMMTVSRPLRGAITSGSILGVRSENAPQYDYVSYLADEAVVKEKAIQYRIRSLLANLLKA 
NKTKAFPTSSSLLSSEQGKKKFGGKRTNTFVVTNVGAELVKALLANSCWAISHRKDIRSG 
EIQWQELSSKILKSLNDGNATEINNLMSSIVEDRIQRTVKERVYFEQLATVCNNLFGTRI 
LPNKNFDKNFVSVASDNSNATVRGLS I PRYFRAINNNVWVKMSSTMDLLVGGGMRSKSEH 
SISMLEKCAAGVLARASARPVEKMIKSAVEETSQAFNLSTGVFVPKQQQQQRQQQQQQFP 
PFQPPPFPLPPPQAPFQVQQPTYQGYLNPYYQYNQYYNPYAPQQLQQQYPLYFLGNQSQP 
PPQLQQQQQQQQPPQPPNNIPPPPTPQQQSPSNIPPPQQQQQQPFPVQLISSPPPPPPIP 
NTAPSPPISRVRFDSRSTTPQPPPTPVLPKPTPLPPPSTARAEEENATDMSFTDIDSELG 
SIDFDLPPATPGRNVEEIIKAQRQAVKETGVRGEEEEEEEAFIAPIIRQPRTPGNFRDEL 
LDVNESIYGSDIEPAAAAAAFDWDMGLDDLNGDEPYEFE 
(SEQ ID NO: 217) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

P93794 (P93794) LOW-MOLECULAR-WEIGHT GLUTENIN STORAGE PROTE 84 3e-15 

Q05573 (Q06573) OMEGA SECALIN PRECURSOR 83 Se-15 

AAD43602 (AAD43602) T3P18.1 83 8e-15 

004365 (004365) OMEGA SECALIN 83 8e-15 

048809 (O48809) F2401.18 83 8e-15 

065375 (065375) F12F1 . 9 PROTEIN 82 le-14 



Coiranents : 

EST confirmation of the predicted transcript: 

An isolated EST has equence identity to nucleotides 34 to 361 of CT610: this 
corresponds to nucleotides 83630 to 83957 of the genomic reference sequence. 
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CT611 

Nucleotide 

Genomic coordinates : 
Start: 88938 

Stop: B5761 (SEQ ID NO: 218) 
Amino Acid 

MEGVTSIVAAAVPEVAILITDLMGGRNNKRSTYERIVGIVGESGDLLEAILDICNRNSYR 

DELLEGETVVINPTGLLKEISLLMKKALDMNIKMKSNDDPVPFTTLDQNEQEFIGHLKSC 

KKQDGPAYKDLIHRIYSGMFVMKNTRLMLDEIIRGNAGDAVEEKNALCEAYAEMISDMDL 

IRIFLLLVAIKRDQNKKHRHMKSVIYEDWVSLNTLKDVFHKEWYMWPFSALQVGTKIRD 

ARTFSVLFGSDMHEGRNNDRIWENMAFSVTEAFLSGPSTNNHYNKGHLRMYAARPVYDAM 

EYVPQELHHILFGTKIAKMIDIVYRYSIYNVPYLLAADTERVEEPKKSVMSPSGLIISPN 

ASLLENTPLSLVSRHGIPSARKLGSFILEHENAENMHLEEAIAKCMVSQTLQEESWGESQ 

AAMVYQPSDEVEVIQAHVTKILSGNTTNKTCGLCYADLDMKPKFFNCSHENMKASYDYFP 

VHAFMDTFEARQETCSAKLCPDCTIKHLMYVYEKVSAGSEKLKDVFRCPCCGEYMVQFIG 

RCHEFSSLFERAILAGENVDPEYIAANKLLITELIKRAEKCFYTVELLQAEFMEMCPaylDK 

DFALDKDSKFTVVDNRFRPPVKLFKMVEGETGDSKCSLICTQCLLPNVCDQPNEMEDIVT 

VDVPPPVLPYPPPEQLEDYYFQDVEDAEFDDPPTDELVRYDTGPGLHKWPMRLSCGFIAS 

NFVPPNEEVTWCRQAVSILKRTPEKKIRGWNPESPEGKVLLALANWHSTDRMPENMKGLL 

NDISVIHNTRERFQNRVKVHYLNSVFGGFDDRDFEQVVGVSIPLIATYFYVYEKLNHESA 

LGLW AKM FVKNLI GEMVLERPECVFHRAHS FVLHCV DRRALSG I RPNQGAKME I VKQVN I 

VRQNMTSESI KDP VFT VDEKRTLEWKVEKEGQE I KT VKC PKCKT PN I KLGGC I TMTCY DC 

SGRRDGYPTVFCWICEDEITNPDHILIDHKLLYSDCKSTKAALEKVYNCTLCCLALRKCS 

DS YLSKQRGGGEEEE I E I YVMEDGFE FDVHTKTAVPTK 

(SEQ ID NO: 219) 



Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 



AAF04637 (AAF04637) HYPOTHETICAL 84.4 KD PROTEIN 




1495 


0, 


0 


BAA78677 (BAA78677) HFB30 PROTEIN 




44 


0. 


,006 


AAD21842 (AAD21842) ANDROGEN RECEPTOR ASSOCIATED 


PROTEIN 54 


44 


0. 


,006 


094793 (094793) HRIHFB2038 PROTEIN (FRAGMENT) 




44 


0. 


. 006 


Q9XII0 {Q9XII0) F7H1.11 PROTEIN 




41 


0. 


.042 


YKZ7 YEAST (P36113) HYPOTHETICAL 63.6 KD PROTEIN 


IN YPT52-G 


41 


0. 


, 056 



Comments : 

EST confirmation of the predicted transcript: 

An isolated EST has equence identity to nucleotides 1 to 829 of CT611: this 
corresponds to nucleotides 85988 to 86816 of the genomic reference sequence. 

Hit to public SBV sequence: 

gjLi 6l65655|gb| AF099142 . 1 : CT nucleotides 1 to 2738 match nucleotides 8582 to 11319 of 
the public sequence with a 99% homology, a score of 5398 and an Evalue of 0.0 
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CT612 

Nucleotide 

Genomic coordinates : 

Start: 91607 

Stop: 89060 (SEQ ID NO: 220) 



Amino Acid 

MGSNQQQSFISKRNGTKQEISLEKIIKRIENACLPVNQYVPKLDKNAINPQELASHIMDR 
LPATISFQEMDDFLADYAKTKIVDHPDFGKLAGRFICSNIHKNTKEWNSFSATTQKLRHA 
IHPGTGKPASVVNDTYYENVMANAEILDAVIDYKMDYLFTCFGLRTLEYSYLIKIGSPTD 
RKKRILVERPQDMIMRVAVGIHGSDIKSVIETYDLMSRHYFTHASPTLFNCGTVTPQLSS 
CFLLGLQDDSIEGIYDTLKEAAirSKTAGGLGIHFHDLRAKGSPISSWSGTHPGLMAFLQ 
IFNVSVKKVSQGGDKRRGAAAIYISDWHLDVKDFIDCRKNAGNEDLRTRDLFPAIWVSDL 
FMERVKAGKNWSLMCPHECPGLSDVHGEEFKALYEKYEAEGKGKEVVKARALFDQINSAR 
lETGTPYVCFKDTINRKSNQENVGIIKSSNLCTEIVQYSDSEETAVCNLASIAVNKFVKY 
SPIPSLRPYVDYREMKRVVKIMTRNLDKVIDVNFYAVDKTRISNMKTRPMGLGVQGLADL 
FFKLRIPFESEEAALINKRIFETIYYGALEASCEIAKEKGETYELFEGSPLSKGIFQFDM 
GKENIKNRDIYFNSLPIHDWEQLRRDIMKYGVHNSMFVAPMPTASTAQILGNSESFEPLT 
SNMYNRNVLSGSFQVVNEYVIRELIKLGEWNSVTKQRIMASGGSIQTLPNIPKSTKELFK 
TVWEINPRTTLDMAIQRGMFVDQAQSLNLFVEEPELSKVRSMTMYAWEKGIKTLYYLRTK 
GAARAVQFTVDKNVLQEVKKEAPSPVAAFSAPVREEEEEKKSSIVVPDPAAALLCSINNP 
GACEMC3S 
(SEQ ID NO: 221) 

Top Blast Hits 

Score 

Sequences producing significant alignments: (bits) Va 



AAF04 636 (7U\F04 636) 
RIRl_SCHPO (P36602) 
RIRl_MOUSE (P077 42) 
RIRIHUMAN (P23 921) 
AAD37 4 91 (AAD374 91) 
RIRl YEAST (P21524) 



LARGE SUBUNIT OF RIBONUCLEOTIDE REDUCTA 
RIBONUCLEGSIDE-DIPHOSPHATE REDUCTASE LA 
RIBONUCLEOS IDE- DIPHOSPHATE REDUCTASE Ml 
RIBONUCLEOSIDE-DIPHOSPHATE REDUCTASE Ml 
RIBONUCLEOTIDE REDUCTASE Ml SUBUNIT 
RIBONUCLEOSIDE-DIPHOSPHATE REDUCTASE LA 



1720 
807 
804 
803 
801 
796 



0.0 
0.0 
0.0 
0.0 
0.0 
0.0 



Comments : 

EST confirmation of the predicted transcript: 

An isolated EST has equence identity to nucleotides 1 to 84 5 of CT612 : this 
corresponds to nucleotides 89225 to 90069 of the genomic reference sequence. 

Hit to public SBV sequence: 

1 6i65655igb| AF099142 . 1 : CT nucleotides 1 to 2547 match nucleotides 5913 to 8459 o 
the public sequence with a 100% homology, a score of 5049 and an Evalue of O.o 

TaqMan Primer/Probe Sets: 

5'start=1327 

5'stop=1348 

3*start=1383 

3*stop=1402 

5*primer=GTCGGCATCATCAAGTCTTCAA (residues 1327 to 1348 of SEQ ID NO:220) 
Tm5=58 .78 

3'primer=TGCACACTGCAGTTTCCTCC (residues 1383 to 1402 of SEQ ID NO:220) 
Tm3=58.45 

probe 1=TGTCCAGTACAGTGATTC (residues 1365 to 1382 of SEQ ID NO: 220) 

probelstart=1365 

probelStop=1382 

direct ion 1= Forward 

Tml=69.00 

scorel=l , 99 

leng th=76 
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CT613 

Nucleotide 

Genomic coordinates: 

Start; 94397 

Stop: 94175(SEQ ID NO: 222) 
Amino Acid 

MVIRLCFLESITCFVYGIMAPLSLDTNTDYLSHKKDTNKKIQMQINFIPYSNMHVYIAGV 
YTFHEKKGLTYQQY 
(SEQ ID NO: 223) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 



096794 (096794) AM I NO PEPTIDASE P 


29 


4 


1 


YA2G SCHPO (009705) HYPOTHETICAL 157,7 KD PROTEIN C2F7.16C 


29 


4 


1 


024567 (024567) ESR3G2 


29 


5 


3 


AAC361B3 (AAC36183) PUTATIVE PEROXIDASE 


28 


9 


1 



Coraments : 

Hit to public SBV sequence: 

gi I 6165655 I gb I AF099142 . 1 : CT nucleotides 1 to 222 match nucleotides 3447 to 3668 
of the public sequence with a 93% homology, a score of 357 and an Evalue of le- 
101 

TaqMan Primer/Probe Sets: 

5'start=4 8 

5 ' stop=65 

3'start=169 

3'stop=194 

5' primer =TGGCATCATGGCACCACT (residues 48 to 65 of SEQ ID NO: 222) 
Tm5=57.92 

3'primer-TCATGAAATGTGTACACACCTGCTAT (residues 169 to 194 of SEQ ID N0:222) 
Tm3-58.55 

probe 1=CTTTGGACACAAATACCGA (residues 68 to 85 of SEQ ID NO: 222) 

probe 1 star t=68 

probe lStop=86 

directionl=Reverse 

Tral-68.82 

scorel=l .22 

length=147 
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CT614 

Nucleotide 

Genomic coordinates ; 

Start: 129006 

Stop: 127158 (SEQ ID NO: 224) 



Amino Acid 

MEYIGEKNNNPVSNESVSEKELKLRSSFLMIGKKTSKYEQVMGVYEAIESIRQSELSEDT 
FVVHVKKDKQLKFARGLKRLQELVEDDSLRIERISCAPPEPGHLFKDDAGHVTDEEWLAT 
QEEDVRKINTIVKEKLKRKDKDFKFSQLYRYMSNSLSEAVEKKHDCMIISSDFLIGLGFS 
TMNVTHALKSMERTMQKHGFKDMMVPLVEICHRTHYKGEYIANPIFKSHSSHCLIVPLFM 
VAGVFARSAHPSAASIEMYLSTLAYAVILYSDEKQRQIREELARKNLQIKEELENQVEKT 
TKVEKELETQVVKTTKVEKELETQVVKKEEYKNSYIETEQLFKVSEEQKESLRNVHKKSS 
NATFRYDSGSCLVFSISSTEFYLLCRTDKSGSFETATENGLRYIFSPINKKRDTAGMRPR 
LIMAVTGCDAPIACNDSIKHQNKFKVLKCNR3SIVFQTPPSDEDLKGIVQKVTGSDIRIF 
MNDGTVYQDGQRIDISSPQELDEENMTQFEIEQQRKLHSMMENTSKIVTRYNKERHLTTK 
EARTRNKTEKWFEKVKKREEQKKRENGEQSTSEQEQRGVKRTWENDNEFDSDVEEEEDGN 
NTQEQQRVKRHAISV 
(SEQ ID NO: 225) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

Q06166 {Q06166) MATURE PARASITE -INFECTED ERYTHROCYTE SURFAC 54 2e-06 

06687 8 (066878) CHROMOSOME ASSEMBLY PROTEIN HOMOLOG 53 4e-06 

GLE1_YEAST (Q12315) RNA EXPORT FACTOR GLEl 4 9 8e-05 

035788 (035788) CYCLIC NUCLEOTI DE-GATED CHANNEL BETA SUBUNI 48 2e-04 

Q14789 (Q14789) GIANTIN (GCP372) (MACROGOLGIN) (GOLGI AUTOA 48 2e-04 

Q9ZU69 (Q9ZU69) PUTATIVE VICILIN STORAGE PROTEIN (GLOBULIN- 47 3e-04 



Coinments : 

EST confirmation of the predicted transcript: 

An isolated EST has equence identity to nucleotides 2 to 976 of CT614: this 
corresponds to nucleotides 127217 to 128191 of the genomic reference sequence. 
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CT615 

Nucleotide 
Genomic coordinates : 
Start: 130290 

Stop: 129405 (SEQ ID NO: 226) 
Amino Acid 

MEYIGEQKLINLLDETPEEDELQLRSSFLMIGEKQYEKYEEVMSTFEAVETIRKSEFRDG 
VFIVQLKENKHITFEGGLKELRELTGDNSLKIESLLSSIKPEKGHVILKNTSTTTDDEWL 
ASQDKDVQEVNKLVKEKTRMLFRGFYFSPTYRYITKSLPQIPFGEKERFVVSTDFLIGLG 
FSADDVMEKLIAIEGNMRKSGLKYTWVPVAEVCHLKKYKGDIVVNPIFKSYHSHCLVIPL 
VYLG YM FSRN VQPPS LEVET YLLALAFAI DL YGREEMRKSCMRLCED I SEVKRG 
(SEQ ID NO: 227) 

Top Blast Hits 

Sequences producing significant alignments: 
Q13779 (Q13779) APOLIPOPROTEIN B-48 (FRAGMENT) 

APB_HUMAN (P04114) APOLIPOPROTEIN B-lOO PRECURSOR (APO B-10 
Q13788 (Q13788) APOLIPOPROTEIN B-lOO (FRAGMENT) 
P96470 (P96470) IGA-SPECIFIC METALLOENDOPEPTIDASE PRECURSOR 
028789 (028789) SIGNAL-TRANSDUCING HISTIDINE KINASE 
CAB55172 (CAB55172) HYPOTHETICAL 77,9 KD PROTEIN 

Comments : 

EST confirmation of the predicted transcript: 
An isolated EST has equence identity to nucleotides 1 to 725 of CT615: this 
corresponds to nucleotides 1294 99 to 130223 of the genomic reference sequence, 

TaqMan Primer/Probe Sets: 

5 • start^296 

5'stop=318 

3'start=360 

3*stop=383 

5'primer=TTAAGCCTGAGAAAGGACACGTT (residues 296 to 318 of SEQ ID NO: 226) 
Tm5-58.14 

3'primer=TGTACGTCTTTGTCTTGAGAAGCA (residues 360 to 383 of SEQ ID NO:225) 
Tm3=57.87 

probe 1-CTACTGATGACGAGTGGC (residues 341 to 358 of SEQ ID NO: 226) 

probe 1st art=341 

probelStop=358 

dir ec t ion l=For ward 

Tml=68.84 

scorel=l - 84 

length=88 



Score E 
(bits) value 

36 0.34 

36 0.34 

36 0.34 

36 0.44 

35 0.58 

35 0.76 
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CT616 

Nucleotide 

Genomic coordinates: 

Start: 147517 

Stop: 144748 (SEQ ID NO: 228) 



Amino Acid 

MESIKEQQQQQPTVTFSEEPDQVYEFEDTTTTSAKKPTPSKAKFAAGRRMVSKQRRNTIR 

SPHTETVEEVVGEEEEQQQQTPPEITPAEKKQQSLQELDALMGKVPAHLDVSVLAKSVAE 

FLENDEDEDEELEKNKKAQKSVLFNSVMNSGRTELSPSTFCDGCVSKVKSAFEGKDLVSN 

IVKVEGEAVKKTAIATDTTKLANLFLGCMNLQFHEHVTIETLNKKALDKGGPLFTLKLSD 

AVYVDEMDLEKKRQIFGSNGDKSLFKELGGNYIDSAIKSTGLVMSTPSSSSTKKAGTHFK 

TTNQIVEESVTESMRNGCCCFKNDKWLAKRESNLKSLNNTVFGEEDDEKSAYAYSDSEDE 

DEDENEEEVDYDYNNETIESSVGNVIKNLIRKTIGLSDVEEEKEEGEQSEEEEEDSDDDD 

DDASSVCSSSSSSSSVTWAAAEEEEEEDEEDKDKDTATVVEDEDDKESVISSSSEDSEE 

DEDDDGATSQCSEVVFGDVTECEFDESDGNPLYLASDNSFRPSASVTKYPQSEEEMDVSL 

LSKNRSTPVCLSLCRHSSGCITNSFNMSTILKSLKLFPAGTEAAEDCVHIESTKKKDEDE 

DEEDQGLDLQNSQYYSVLVDVDNLIIFSMGSTTYESSMVEVDYDKSFWSSFDKSVKPYCE 

SKKSALINALCEDNVTAKVYATVHTLAIPFCESMPINHINNTTPYGSYKTFRISLPGNFS 

GQHNDINNNWRSDMYTKMVENLLKREVVENKTHSRRYVRNLIVDGGVGENSGNYLKVHEN 

NEDIFGSIEANSMSAKTAAAAFKNVAKKCDLIQTTTNDILTGPFKQYLIDYKYNSARKNI 

IMEPCEGDETTAHEMKRAQDAYKQALHRAKITASSISLRGIWHEMITRDMNTTYNSMFMY 

IPDFYKYVQVSPVNVSPLYMLD 

(SEQ ID NO: 229) 



Top Blast Hits 

Sequences producing significant alignments: 
Q9YTL7 {Q9YTL7) ORF 48 

AAD56625 (AAD56625) NUCLEOLIN-RELATED PROTEIN NRP 
NUCLCHICK (P15771) NUCLEOLIN (PROTEIN C23) 

GARPPLAFF (P13816) GLUTAMIC ACID-RICH PROTEIN PRECURSOR 
IE68_PRVKA {P24827) IMMEDIATE-EARLY PROTEIN RSP40 
NUCL MESAU (P08199) NUCLEOLIN (PROTEIN C23) 



Score 
(bits) 
85 
77 
76 
75 
75 
75 



E 

Value 
le-15 
6e-13 
le-12 
2e-12 
2e-12 
3e-12 



Comments : 

EST confirmation of the predicted transcript: 

An isolated EST has equence identity to nucleotides 476 to 1 of CT616: this 
corresponds to nucleotides 145115 to 145590 of the genomic reference sequence. 

TaqMan Primer/Probe Sets: 

5'start-1507 

5 • stop=l526 

3 • start=l554 

3 • stop=1575 

5'primer=GAATTCGACGAGAGCGATGG (residues 1507 to 1526 of SEQ ID NO: 228) 
Tm5=59.36 

3 'primer ^TGAAGCAGATGGTCTGAAGCTG (residues 1554 to 1575 of SEQ ID NO:228) 
Tin3=59.21 

probe l^CTACCTGGCTTCAGACAA (residues 1536 to 1553 of SEQ ID NO:228) 

probel3tart=1536 

probelStop=1553 

di rec ti on 1= Forward 

Trnl = 69.02 

scorel=l . 97 

length=69 
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0X61*7 

Nucleotide 

Genomic coordinates: 

Start: 148612 

Stop: 147766 (SEQ ID NO: 230) 



Amino Acid 

MSPVISQQSSPSATSTAAARIISTANLRVLGVKNKEEKDEEEEQQEVEPEIIEPATDFEI 
PFSPALTICIYINANRIHINSKGVCLNRKKIKPTSTINKNQDVPPELANASSYLVQTEHV 
TDKFLSSHCSICNYNVNDGEYKSALSTTRNGDQPLMRKSVRYVPLNEDNVVVQKGTYYGT 
TFIPEKTGRRILWFSHYKKSPRPITAKLCCLLETINSFNGSCSSSSSASSSSNAPGPIEE 
FQVSSSIFFKKEECCPLQMKWVEQNELDAESPVLVLLMLAL 
(SEQ ID NO: 231) 



Top Blast Hits 

Sequences producing significant alignments: 

Q9Z2U2 (Q9Z2U2) ZN-15 TRANSCRIPTION FACTOR 

Q20150 (Q20150) r38B7 . 5 PROTEIN 

094002 (094 002) SEC12 HOMOLOGUE 

Q20497 (Q204 97) F47A4.2 PROTEIN 

013779 (013779) HYPOTHETICAL 71,2 KD MEMBRANE PROTEIN C17G6 

P93002 (P93002) REGULATORY PROTEIN NPRl 



Score 
(bits) 

36 
35 
34 
34 
33 
32 



E 

Value 

0.24 

0. 70 
1.2 

1 . 6 

2.1 
3,5 



Comments : 

EST confirmation of the predicted transcript: 

An isolated EST has equence identity to nucleotides 1 to 741 of CT617 : this 
corresponds to nucleotides 147819 to 148559 of the genomic reference sequence. 



TaqMan Primer /Probe Sets: 



5'start=363 
5*stop-383 
3 • start=446 
3 • stop=467 

5'primer=CGACAAGTTCCTTTCATCCCA (residues 363 to 383 of SEQ ID NO: 230) 
Tm5=-58 . 62 

3*primer-=ATCAAGGGCTGATCTCCATTTC (residues 446 to 467 of SEQ ID NO: 230) 
Tm3=58.15 

probe 1=CGGCTCTAAGCACAACAA (residues 428 to 445 of SEQ ID NO: 230) 

probelstar t=428 

probelStop=-4 4 5 

direct ion l==^Reverse 

Tml=68-92 

scorel-1 . 92 

length=105 
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CT618 

Nucleotide 

Genomic coordinates : 

Start: 150145 

Stop: 148675(SEQ ID NO: 232) 



Amino Acid 

MESVRDVKFYTFMNVLAEKAKKIQRLNKDKGWRTSINAEIGYGGARLMDVRFTGRKSMDE 
LARCLYNCDGEYTTLRLVGSSAGNI IVYSLAFIMGIRGECCGFNVNNRLRMGKI IDRELF 
YKITGLNFPETVKCTCDGVRAICDLFLEVAALQEHPAWHETKEVGKKQQQHFNEFGSQYP 
GTKFNKRHKLSTKIIQQMFSEEKTMEQVLAFSEGTAASGFSDLYVEAPIQYVVNMYRAIS 
NMEGRVGAMYNLSRVLILLCSRWEKKPGYKNDFYSKCEMYIGSKKIVDDESFIFTDLITG 
DLVPLVRLAPSNEDIQRDVIRFNDSTDILMDSIDVRDVVLPVLSKIIWQNVSARLKLRNN 
KSLSKLAKWKWNGMVSTHDNFD3NDYVIEHKRQLAADIMSDSLSKNHLPNFSKTITEYDE 
KENKTTPLICWNYIFELSPMGKHLFPLEEVCGFYEASLPLITPWQLKVVQKKRGRQMVIY 
GPRKRPRTQ 
(SEQ ID NO: 233) 



Top Blast Hits 

Sequences producxng sxgni f icant alignments : 



Score 
(bits) 



E 

Value 



Q64568 (Q64568) ATPASE (PLASMA MEMBRANE CALCIUM ATPASE) 
ATC3_HUMAN (Q16720) CALCIUM-TRANSPORTING ATPASE PLASMA MEMB 
Q12019 (Q12019) HYPOTHETICAL 559.3 KD PROTEIN 



37 0.28 
33 5.4 
32 7.0 



Comments : 



TaqMan Primer/Probe Sets: 



5'start=67 6 
5'stop=701 
3'start=823 
3 • stop=849 

5'primer=GAAGCACCTATACAATACGTGGTTAA (residues 676 to 701 of SEQ ID NO: 233) 
Tin5=56.76 

3'primer-GGAGCCTATGTACATCTCACACTTACT {residues 823 to 849 of SEO ID NO: 233) 
Tm3=57.26 

probel=AGAGTAGGTGCCATGTAT (residues 733 to 7 50 of SEQ ID NO: 233) 

probe Is tart =7 3 3 

pr-obelStop=750 

direction 1= Forward 

Tml=69.06 

scorel-1 . 93 

length=174 
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CT619 

Nucleotide 

Genomic coordinates: 

Start: 169814 

Stop: 165116 (SEQ ID NO: 234) 



Amino Acid 

MSTTQTQTIERPLPGKNNEDNSRLACLLAEGLQQQQQQQDGDSEISLPLVNAGTFACYDS 
TLANLTEGRLGSETENAKIRVKIHPSVFIIETNKEMTIEEISTKSLNALVEKRAREARRF 
SSLTEOKFPRGGGGCYSRKNERFIEGEINNIKLMMEETASSLERLAGLLPWINIKDWTM 
HDEKEIRLDLKGNDGMEELVNISHLNQEEWEMERLSSSIVLKDAYGVFYAHHGILDIVLT 
TSRFTGKLLQHPVIFRLMDVKVWINTPLQIAFPDTSKNPNAKKILYQHPSLTRLRDLNDM 
ASNSKSVSSIIIPELSKFNSTEFGMHYFTAQCFFGKNTNSLKDLVTRYYQLSFKNKPQPK 
LYEPTATATAASSSSSTASLTTEQKEKIAQSILSSKGKSLGDVSSTLSKEYDENRKRTKR 
QKTSTDTNIVPSGAPTSrSMKNPVTCFFGPQYTSIMDCISEKTDWIEMHLFLTSLNDAEH 
NKTLVVDRKSNVSEIHDSGRFLTFGQNNTTAFIPDVDIPTLKLILRDDSGESSAAIIASL 
lYYNNVNLEGREFSNVSDAVVGLFSGGSAITVGDIAREIASIYNIGRESNCDSILFPGEP 
ILAGRRSYGRQYRWYDPINCVVGLYRSCLETMTRNIMRGQPVKVDETAWMYMHQQVLQVV 
LLPFFDCVLKSGVWAVKEARQLTDYIVREVLLKYTADPDQHKFLLFKKPVMDLIAKIVTH 
YAVIHSAADNGGVCLAFPRDPPFIVENDTSLRYYTLTDTPQSILNGDNVAENLKSATSVA 
SSPSSSSRYSSETPIRVVNLPVPTGRFLKMNKDLELFINVPLISSKEQKQQQQQTTATAP 
FSSETISKSFLNYVPPKSLTRNVTYGQNIAEDGFLGLKMKGELVSYFKVVKNTERDDGIK 
DMEIGDINNHQDDTGSLSSSSSSFVDGVRTSFSVDGKIEHVSAFLPGTTSQPTNLPVHAS 
KQVKY S VKELGMETVFFEPLLS S AVLY E ASKTKSTQHLS PMRI YKEC VS PLSTGRI DI FP 
SKVGTVAGTGFEFIWKVLQYDTGLPTTLERLSPKIPSVPISGEDSKMEVIAESGKGVQNI 
lAIAADQLRGSNNIVGGGTRRAIQQQQQQQQQETQAVVPVNVPARFEPTFTEIELFLQNK 
FRNVIATIISRMMMLVSNEEMKIIKEVCEHVSHIMVDGLYVALDPRKAIEEILERITAEQ 
NGITIDTGNEGYGSLRYASSGRLFINDEASEEAAAAIGGGGALGTGRRVPVELRSILDKL 
NTIGSTTQQQQQQQRQQRQANNNNTVPEDIKVHNEQMQKIRDSSLFTSKLLNYIRDDGRK 
DRIKTNISETLKKYSRIPSYFIASKAQKPIPWKHTKDNINLNKIPEDLNFSPAQNLFVPV 
NPEIHILTDMQWLNCISIIETATRDSAIVMQSFQEQADKTTTQLEELLSQWNNIVSQVTDE 
KSPAYVSSVKLEWLNNEASRIAAIRENSEKSKIVMGVQGKIVNIDELGIVAVARSIVDVD 
FYIKMPNVWASRDWKNLIYYAVNIAATPLINNISRGIMAASQTSVLYDSSLAI.IAAEQAT 
RNITM 

(SEQ ID NO: 235) 



Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

ST20_YEAST (Q03497) SERINE/THREONINE- PROTEIN KINASE STE20 { 46 0.001 

YN23_YEAST (P53832) HYPOTHETICAL S2 . 3 KD PROTEIN IN MRPLIO- 41 0.047 

Q24523 {Q24523) BUNCHED PROTEIN, CLASS 2 ISOFORM {SHORTSIGH 40 0.11 

NIT4 NEUCR {P28349) NITROGEN ASSIMILATION TRANSCRIPTION FAC 39 0.24 

062235 {062235) F36F2 . 3 PROTEIN 39 0.31 

MYSG CHICK (P10587) MYOSIN HEAVY CHAIN, GIZZARD SMOOTH MUSC 38 0.53 
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Comments : 

EST confirmation of the predicted transcript: 

An isolated EST has equence identity to nucleotides 234 to 1099 of CT619: thi 
corresponds to nucleotides 165434 to 166299 of the genomic reference sequence 

TaqMan Primer/Probe Sets ; 
5 •start^l920 
5*stop=1941 
3*start=2022 
3'stop=2045 

5'primer=GCCAGTG7^AAGTGGATGAGACC (residues 1920 to 1941 of SEQ ID NO:234) 
Tm5=59. 57 

3 'primer=AGTTGTCTAGCCTCCTTTACAGCC (residues 2022 to 2 04 5 of SEQ ID NO: 234) 
Tm3=58.24 

probe 1-CTTGGATGTACATGCACC (residues 1943 to 1960 of SEQ ID NO:234) 

probelstart=1943 

probelStop=1960 

direct ionl^Forward 

Tml=69.03 

scorel=l . 96 

length=^126 
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CT1020 

Nucleotide 

Genomic coordinates: 

Start : 62990 

Stop: 63659(SEQ ID NO: 236) 
Amino Acid 

MEWINQRTSREDLFNTYTGNAVIRSAAKQALAXEKHAAERRGEKAWTTSAAAAASSNFNN 
VQQDYTDDDITQVSIANSVLNNPFLKRYAKLIDNLAISSLPPDIEDDVIIHTRDASNSTV 
RVDGANIYFAIIDGDLCVYPKQYISDKVLCGSLNREKALFYNSSKNKWTYGCNLNFDIVD 
AAIMKHPDYKEETTSTKHIRKILGIGASEKLNITHYLNYFIQ 
(SEQ ID NO: 237) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 



P75338 


(P75338) 


MG307 HOMOLOG 




32 


2 


. 7 


080990 


(Q80990) 


LATE MINOR CAPSID PROTEIN L2 


(FRAGMENT) 


31 


7 


.9 


Q18668 


(Q18668) 


HYPOTHETICAL PROTEIN C47D12.2 




31 


7 


.9 


Q8C991 


(Q80991) 


LATE MINOR CAPSID PROTEIN L2 


(FRAGMENT) 


31 


7 


. 9 


Q80985 


(Q80985) 


LATE MINOR CAPSID PROTEIN L2 


(FRAGMENT) 


31 


7 


,9 


Q02689 


(Q02689) 


HYPOTHETICAL 45.6 KD PROTEIN 


IN COI INTRON 


31 


7 


. 9 



Comments : 

TaqMan Primer/Probe Sets : 

5 ' start=331 

5 'stop=354 

3'start=379 

3'stop==402 

5'priiner=CACACTAGAGATGCCTCCAACTCT (residues 331 to 354 of SEQ ID NO: 236) 
Tm5=57 -74 

3'primer=ACCGTCAATTATGGCGAAATAGAT (residues 379 to 402 of SEQ ID NO: 236) 
Tm3=58 . 84 

probe 1=CAGTCAGAGTAGATGGAG (residues 356 to 373 of SEQ ID NO: 236) 

probe Is tart =35 6 

probe lStop=373 

direct ion 1= Forward 

Tinl=69.01 

scorel--1.98 

length-=72 
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CT1021 

Nucleotide 

Genomic coordinates; 

Start: 69264 

Stop: 76212 (SEQ ID NO: 238) 



Amino Acid 

MSDTGQMEENRPATQKRRPGDEEEEETGSSNVPYYANFGDDATYSMYTGEGKRGKFVLEP 
PKERSVQRVQKPPKEKEEREQRSNVRTRRPGQEFEQKVLQDRSRERSEKLGQNLAEKGLQ 
ERQKKYTPKVAQTMTKKIIRFREGGRKFKAPQQQTSDKGAATNVLEREEIEMAAEREQPV 
EITGDTILGGLGEEDDEDMGEDELTIQHSSMAVSQPVQQIVVSSPIPPKPTRPAPDIPIQ 
EDIVGKNISQLPPLPLDDYEDEEDEHLYEEVNDFLVAPPTAAAAASTRPPRPNIPPPPPP 
VVAVADETLKNLASIAALEKEAEEQRAAAVEREREVEEQRAAAAAAAAAAAAQREADEKR 
EREAEEQRAAAAAAAAAAAQREADEKKEREVEEQRAAAAAAAAAAAQREADEKREREVEE 
QRAAAAAEREILAQQLQEMKEQMRIKEEERRKELADKEEEKRRELAAKEEEKRQEILAKE 
EQLEKLNFQLGTEITSKRALEQMLEEEKASRSRSAASAQIAIQAIEYEDELPQAVEPQGQ 
LVPMDTDLYGKMYDLNKKLEVQNNTLTSAFEDVNKTNEQNQLVAQSLEKSAKAIEKLTSQ 
KHLPVDDPAFMQRIITERDFSLKNLGMVYKRVLGVYFTLKRDLFKSKALITDKESRDLEV 
RLTDVSTDLRANDLNTILERLDVSVNIRSGGTLYTKFTEADTALADQVPSRIEISNRSRS 
ALLPFSSAGLDTNFTNSSDKYNEIVNQLSSINEAMNILKENIVPTLNQIKIDVTNLLTVS 
SSRQYAIEERVYSDVSRMDSEIRKFLAIMNSKISPYFKGDWTDERQRSIADSISSQIKSN 
DKIKESVATLHDINTTSRIRSNPLLHKSSVLSSPDFLNAVNDFRNFLDIQGGSQFTYDVL 
SGQNIDDLSLASKTTEKVTELCLELSIILDVIHKNALSLNLPAITYPAGETSMEESGSLA 
VDIRQEIGKNISDSSAELSRTLSEALQIFQQQQQQQQQQFQQQLLQQQQDQQNQQQLLQQ 
QIEEQQRVQEQQQQQQRDQQQQEQQOREQQQOQQQREQQQQREQQQQQQQQQSDQFRQQL 
LQQQQQFQQLLQQQGRRRGGDDGDEEREEREEGAEKDDCVRKVAESVATKYTADLTTLFQ 
REENNFQSKIASAKLGTLVFATPPSPIMNLTKLREEYSTFTTQCFSKLTAENNSIMRIFP 
ERIVEVCKSKNLNLMGKYLYIITTAQTEMEDRVKNILSGIFNQIEEFSNNVKQQQQQQAA 
SASSTNPPPPSTPSTTPPVTSMQVCELDDQRTLEKAAIVEAITLANAVLQTTKSASAPST 
AAEREIALKLENGKTSIRMEKVDLSSGATGVSDQQKWIDESTSKQELEDFIAEENFVETA 
HNEMDIGLILDAKKNDPTRDANLRLVKPHGINVQSFPYYVLRTWLGETDILDEDTVHPEY 
FRQYIDRNWKVEEHEREDTLKALGVSLSDTLAHIKDYYSPSVKNDASKSVPFALNTLLYN 
IFAIDGGMISSLSRTAFIYRKFLRQSMTDKEVAQGPVRSQLCEATIASLFTACSNLLRSS 
PLADKVEPRLQEKLAAAAAVDTSTGDMFRIRVCHLMYNFIVAYVNLCNNRINYTLNVLRA 
SGLANKKVVAGKTTKGHTSSSHRFGSYDVTYDFSVLYKILQLQKQNISLLLEKGFNAWES 
CVAAMAAFTADPSLSISDADQSILFPLEGGEIVIEKHENDAEKNVDMVQELWKETALTLM 
AKELNSYYNWFIYSKDTDMEKLARVCRMIIGIVKAVLRLTNKAESLVDTNALSDIFKLPV 
IPIDDTKTLAINIVVFTLNNVIKPWMVSFKQMFRQKDGGVSSAYFSFQNIQQQKHQQTAS 
ILDAWACAPGKLTKAAHVFISGYENHIKLKKDDLLWGASMKFPADGRGTWEGWAQQYNN 
ESVLEDFTDFSIEVNAPASGLLIPPDPLLSSMFGKGNGGSSSSSSKDNTIIGKGGLILNR 
QVVGQEQAPPINTSSDTKKIRRDANIEPIIGTPYSVIKASKGVSISVLDDFNEDSPEDFA 
LKTSIINDAIREIGQRMTYTRPIFDHQTQKNIHYSSPKIILEGSDLKNGQRSGQSWAPSS 
SSLTLASDWNLPSLELLYRELATKQVEKEEEEKSEREEDKGQKLNEKLSFVVNKAIGTIQ 
QQHQY5ERGGGMKRYQQHSAD0ASNGGIDDIELMNSKDATSMRKAKLALAVTNKIAAAAA 
RDGENSSAKPSNFGNRLDEAINPGALLLRRGGGVRGGQTPQSSMLTMFRPGQTGGNSSWW 
TTNTPLIQRTTSVGNNLVVLVPNLLDSHPPTFN 
(SEQ ID NO: 239) 



Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

076853 (076853) SRF RELATED PROTEIN 136 2e-30 

AAD50121 (AAD50121) ADENYLYL CYCLASE 131 4e-29 

077033 (O77033) TRFA 120 8e-26 

AAD46501 {AAD46501) LATENT NUCLEAR ANTIGEN 120 le-25 

088542 (088542) OPA-CONTAINING PROTEIN 1 118 4e-25 

Q62006 (Q62006) CPA REPEAT (FRAGMENT) 118 5e-25 
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Comments : 

EST confirmation of the predicted transcript: 

An isolated EST has equence identity to nucleotides 1 to 462 of CT1021: this 
corresponds to nucleotides 75184 to 75645 of the genomic reference sequence. 

TaqMan Primer/Probe Sets: 

5'start=3774 

5*stop=3799 

3 ' start=3851 

3*3top=3872 

5'primer=AGCTGCTTCAGCTTCTTCTACTAATC (residues 3774 to 3799 of SEQ ID NO: 238) 
Tm5=57 . 20 

3*primer=CGTTGATCATCCAACTCACAAA (residues 3851 to 3872 of SEQ ID NO:238} 
Tm3=57 . 20 

probe 1=CTGTTACAAGCATGCAAG (residues 38 33 to 3850 of SEQ ID NO: 238) 

probelstart^3833 

probelStop=3850 

direct ionl=Forward 

Tinl=69. 00 

scorel=l . 99 

length=99 
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CT1022 

Nucleotide 

Genomic coordinates : 

Start: 79064 

Stop: 83375(SEQ ID NO: 240) 
Amino Acici 

MDFEGTTSSTPSKMSQLYSSVKKVAEHSFANLHDKATLASKVIKDLEGERKKMSTPKSSS 
DGQKLDKAMLDDIINEYQAVKSTADNSIESTIKEIENVLESVRRTKIESEAKNSVTSSPE 
KVFSVEDLEIYSKGRVCKGLKLNANCSRIGGKYAVSMSIKKHNVSSFENNNNQVFSEEPR 
DCFMLETTYPLVGFETSTEDGNTYAVFLTGVGLERSLPKYVPVFDMNAGIQTLNMTGLRM 
AKLPVLCMFGRTEYDNLEDFYITSIETQSFDEEENDARMRCHTEDLERKKRMNDAPAITP 
HVAVYDYSGDGKEQLLYMITEYENTASWCNANGVVTSDSGFSNECAISDMNDLCCFADCI 
DVTVNNEEHEERSMNIVVESDRRLFDASPSPIKTEEDGENSSSSSSSPTVPPPTPYEGNA 
VVEGEEEEEEIDEDESSKYEGSEDALVMKKLAKLSTMKQMRRVKNEPALKITSGGNNSSS 
SINNEDDGDDDDAVDATALCPQTEATVKNSFMAPNDERTENILYETMQISLAKICNNPSS 
MSSYRVFTNKLQECLNTMDDSIRRRPTIWTEESQQFAKGLLFDEVVTSIVAHQMAQDICK 
SEIFGGMFNANSTNIKGKYEGQKKSLYGNKHISSSCFKTNTESNVNNALFAWVKSKLHSG 
TVIPNVFSFKMASEKPSKMKRKRTSSASSSNDEHQEPSTKMMKNDEGEKVAQESSSPSSS 
STPEQQQQAGHDKETINLIPLSFIKMPRSNVNGSASYLSEIFGQRLCGLSDASSTFKRMC 
KTFEDLENEIMRSSFTRLTRYEREVTRLYEKCRSQAVDIEENEMDVLSHQGELFAEFLED 
PIAYFEEVLENIKSWSLENVNTPKRKNKYAKVLVSVNAIRRTYEEYHAFSKFVPMFLFNL 
IKRELEGDNYTHDVHFSSTCLWYLTVMTRNRICDVLQYINNNNNDNEETDIVEEEEEGEG 
EEDKMEESMDVEQQKQVRKGGRKKGQKFN3IGDQVIRKFVKSLCENSMVVSIAINSLISG 
ISWMNKKIPPGFLKDSSTINTLDEVSRFVFSDVKINRKINGTDDKYETVFGVSTRVDSHI 
VGPFSIPVDFSSAGLDKASCGKLYVNTIDGKGILTISPKYDSLNDEDVDSTTTDKLEKDI 
LHLSKHDTFFNINKNKVLPFYNISPSSSLTEKKKTKFNRKKISSGMSNNNGMCVQTPSSS 
NSVSSVSSIVAPSSSVLALSCSLSSTKKKSIWNENMFLTSRNMWRCGFVVPPKLCSFIVN 
HRHAVKLVAETAPKTKLCRNIIDRNRKIRFNGLKKVCKSVSAFTGESTYLLNKNMTATSP 
SDLNLCIYTSSLNDPLYTCKLTHEEYQDGNALDDYGAVFVNYTFKSIKSCSSKDETADDN 
AAAADDDGSTTSTSSSTDTDAAAIQDFMHVMIKKIDAMKDIRGKYKKSLAKKTKKH 
(SEQ ID NO: 241) 

Top Blast Hits 

Score- E 

Sequences producing significant alignments: (bits) Value 

UBPl YEAST (P25037) UBIQUITIN CARBOXYL-TERMINAL HYDROLASE 1 50 le-04 

097236 (097236) PFC0230C PROTEIN 48 3e-04 

CAB43859 (CAB43859) HYPOTHETICAL 85.2 KD PROTEIN 4 6 0.001 

NSR1_YEAST (P2747 6) NUCLEAR LOCALIZATION SEQUENCE BINDING P 44 0.006 

Q21000 (Q21000) SIMILARITY TO C. ELEGANS MYOSIN HEAVY CHAIN 44 0.006 

Q18918 (Q18918) CODED FOR BY C. ELEGANS CDNA CM11B12 44 0.006 

CoiTuuents : 

EST confirmation of the predicted transcript: 

An isolated EST has equence identity to nucleotides 1 to 1010 of CT1022: this 
corresponds to nucleotides 82212 to 83222 of the genomic reference sequence. 
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CT1023 

Nucleotide 

Genomic coordinates: 

Start: 9322B 

Stop: 94137{SEQ ID NO: 242) 



Amino Acid 

MVS S I THLSLLFVVAVVASVVFTTEGASVRVKRCAVS PCPDV I DPDHRCQGRLCRRSTRG 
GDDDDDDDDGGTFDTVGSGILGRKKRAAPPPEDEEEDDFYRKKRAAPPPEDEEEDDFYRK 
KRAAPPPEDEEEDEFYRKKRAAPPPEDEEEDEFYRKKRAAPPPEDEEEDEFYRKKRAAPP 
PEDEEEDEFYRKKRAAPPPEDEEEDEFYRKKRAAPPPEDEEEDEFYRKKRAAPPPEDEEE 
DDFYRKKRAAPPPEDEEEDDFYRKKRAAPPPEDEEEDDFYRKKRAAPPPEDEEEDDFYRK 
KR 

(SEQ ID NO: 243) 



Top Blast Hits 

Sequences producing significant alignments: 

Q18401 (Q18401) COSMID C33G8 

Q9YTL7 (Q9YTL7) ORF 48 

P73032 (P73032) HYPOTHETICAL 185.1 KD PROTEIN 

Q43687 (Q43687) EXTENSIN-LIKE PROTEIN (FRAGMENT) 

Q09085 (Q09085) EXTENSIN CLASS II PRECURSOR (CELL WALL HYDR 

014686 (014686) ALR 



Score E 
(bits) Value 



113 
98 
95 
88 
83 
77 



le-24 
6e-20 
7e-19 
7e-17 
2e-15 
le-13 



Comments : 

EST confirmation of the predicted transcript: 

An isolated EST has equence identity to nucleotides 214 to 1051 of CT1023: this 
corresponds to nucleotides 93435 to 94272 of the genomic reference sequence. 
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CT1024 

Nucleotide 

Genomic coordinates: 

Start: 94623 

Stop: 95742 (SEQ ID NO: 244) 



Amino Acid 

MDNLKGEFVALKTDLTHYKTQLDRSILVFVDVVGRLYVIVNSEQTAKKEGLATRVAKQAT 

EIQQFKDEINNKYNALTNTLDDIIYIFDHGGSFKRAKHKAIIEAREYSKPLRELECMFTR 

lADMLTLTFMTVYTNIITEFRHSSEQATNSINVTLGRLFLCDDLCNQLPKEEEEEEDLKQ 

KFITFHANLYMLDTRLKKDLIIFKDVIQQLHVILQKDTYAVKEGVAIRCAKQMNEISQYR 

DNLKDNYNTFSNILNEIVYIFDHGGHFEEVKHKAITLTRNYLKTLMGLKCMFKRISEMLS 

LTFLTVYTNVIAEFINASNISDREINNYLVQLVTCNELCNQLPKPKQYRPLSLIDNIAYF 

SLSVQKHLSGFL 

(SEQ ID NO: 245) 



Top Blast Hits 

Sequences producing significant alignments: 



Score 
(bits) 



E 

Value 



AAF04635 (AAF04635) HYPOTHETICAL 43.2 KD PROTEIN 
096219 (096219) HYPOTHETICAL 139.4 KD PROTEIN 
064554 (064554) YUP8H12R.45 PROTEIN 
TPR_HUMAN (P12270) NUCLEOPROTEIN TPR 

Q99968 (Q99968) NUCLEAR PORE COMPLEX-ASSOCIATED PROTEIN TPR 
097291 (097291) PFC0960C PROTEIN 



747 
39 
39 
37 
37 
37 



0.0 

0.051 

0.067 

0.20 

0.20 

0.26 



Comments : 

Hit to public SBV sequences: 

gi I 6165655 I gb I AF099142 . 1 : CT nucleotides 1 to 1119 match nucleotides 3220 to 
2102 of the public sequence with a 100% homology, a score of 2218 and an Evalue 
of 0.0 



TaqMan Primer / Probe Sets: 



5'start=^478 
5*3top=499 
3'start=576 
3'stop=598 

5 "primer =TTGTGTGACGACTTGTGCAATC (residues 478 to 499 of SEQ ID NO: 244] 
Tm5=58.28 

3'primer=AATCTTTCTTTAGGCGTGTGTCC (residues 576 to 598 of SEQ ID NO: 244) 
Tm3=57.55 

probel=TGCGAACCTATACATGCT (residues 558 to 575 of SEQ ID NO: 244) 

probe 1 start =5 5 8 

probelStop=575 

di rec t ion 1= Forward 

Tml=69.00 

scorel^^l - 99 

length=121 
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174/201 



PCT/USOO/28888 



CT1025 

Nucleotide 

Genomic coordinates: 

Start: 95824 

Stop: 97369 (SEQ ID NO: 246) 



Amino Acid 

MDSCCLISRITPELAGKLTWIFIPENNFKIVQNSLPDDQVISQFRYFDHRHCYTFMEILM 
ANIKIQDRKQNTTAICELTTGREGLLCRRTIPVFLGSEEKREELLGNLPEGAEIFRPREV 
MQVIGTLLDKKLEIDDGIASVKAALCAGSSSLYLIMSHIVKMTFSAITNMKDINEEYFVD 
FIFRHKQFLNPEFFKHLISLLKNSRKEHVAHLVRRLEHFLMLWTLSKMRFTEMEENYFPI 
SSDSDYGICEKCARKTPKYKLRIFRERKCCDRCCRLYHQQPPPEVYNWDGKITQQSNKGY 
INAGDEIIGMLNSNDKGKTFPPIPKMVVRRVVDGVYGQGTILSKILKFRQANIPTCLFVT 
CNKCNRIFRLTILGPTRNILCPPCRKKSVAVNTQQKGENKPSFVQKGTKRLRVDTGSNKN 
TLEKFCSWERFNTEVLLPWLGYTIESKWQNWESFLGYSSTRYKELWAFVNKQEISSMKDS 
YIKIEDI DQLLRS I LQDQKGVFETVCK I KSRDGL 
{SEQ ID NO: 247) 



Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 



AAF04 634 (AAF04 634) HYPOTHETICAL 45.2 KD PROTEIN 689 0.0 

STA5_MOUSE (P42230) SIGNAL TRANSDUCER AND ACTIVATOR OF TRAN 32 7.5 

AAD20715 {AAD20715) PUTATIVE DNAJ-LIKE PROTEIN 32 7.5 

CAB60246 (CAB60246) HYPOTHETICAL 80.9 KD PROTEIN (FRAGMENT) 32 9.9 



Comments : 

Hit to public SBV sequence: 

gi|6165655|gb|AF099142.1: CT nucleotides I to 1545 match nucleotides 2019 to 477 
of the public sequence with a 99% homology, a score of 3031 and an Evalue of 0.0 

TaqMan Primer/Probe Sets: 

5'start=672 

5'stop=693 

3'start=765 

3'$top=786 

5*primer=CCTTTCC7U\GATGAGGTTCACA (residues 672 to 693 of SEQ ID NO: 24 6) 
Tm5=57.77 

3' primer =ACGGAGCTTGTATTTGGGAGTT (residues 765 to 786 of SEQ ID NO:246) 
Tm3=57,84 

probel=TCCAGCGATAGTGATTAC( residues 721 to 738 of SEQ ID NO: 246) 

probe lstart=7 21 

probelStop^738 

direct ion l=Fo rward 

Tml-69.03 

scorel=l . 96 

length=115 



FIGURE 3, Sheet 133 of 160 



wo 01/38351 



175/201 
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CT1026 

Nucleotide 

Genomic coordinates: 

Start: 97547 

Stop: 98789 (SEQ ID NO: 248) 



Amino Acid 

MESIKLrTVAGLNMEQANQVAEEIKSEYKTEEEKRIAQEVFDKFTKKLIMQVDTSKHLLT 
RENPNRFVSRPIVHEDLWEMYKKEVACFWTLEEIDFERDPKDWEKLTQDEKDFILQILAF 
FASSDGIVIENLTTRLRQVAQIPEARSFFDFQVGMESIHGNVYGELIDRLVPDEKDKAIL 
FNAAQHFPAIKKBCEQWAINWMQSNNDLAELIVAFAAVEGIFFSGAFASIFWIPCNRGILPG 
LTSSNEFISRDEGLHRDFACMLLKKGFVDTPSRERILEIVTEAVRIEQEFLTVSLPVKLV 
GMNCKLMSQY I EFVADKLLVEMGLEKH YNVTN P FPFMDN I SLENKTN FFEKRVAEYQRAQ 
VMASINKIKKDOQTQETGSPLPILTAPPPVSSSSSEQEDVEDGVGDYISYDDF 
(SEQ ID NO: 249) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

Q9XYN8 (Q9XYN8) RIBONUCLEOTIDE REDUCTASE R2 5UBUNIT 388 e-l07 

RIR2_HUMAN (P31350) RIBONUCLEOSIDE-DIPHOSPHATE REDUCTASE M2 381 e-105 

RIR2_MESAU (Q60561) RIBONUCLEOSIDE-DIPHOSPHATE REDUCTASE M2 381 e-105 

RIR2_M0USE (P11157) RIBONUCLEOSIDE-DIPHOSPHATE REDUCTASE M2 378 e-104 

Q27124 (Q27124) RIBONULEOTIDE REDUCTASE SMALL SUBUNIT 377 e-104 

RIR2__BRARE (P79733) RIBONUCLEOSIDE-DIPHOSPHATE REDUCTASE M2 376 e-103 

Comments : 

EST confirmation of the predicted transcript: 

An isolated EST has sequence identity to nucleotides 1 to 959 of CT1026: this 
corresponds to nucleotides 9778 6 to 9874 4 of the genomic reference sequence. 



Hits to public SBV sequences: 

git7672988|gb|AF144620.1: CT nucleotides 1 to 1242 match nucleotides 1 to 1242 
of the public sequence with a 99% homology, a score of 2454 and an Evalue of 
0.0; 

gii6165655|gb|AF099142.1; CT nucleotides 1 to 299 match nucleotides 299 to 1 of 
the public sequence with a 99% homology, a score of 585 and an Evalue of le-169 

TaqMan Primer/ Probe Sets: 

5*start=642 

5'stop=663 

3' start=706 

3'stop=727 

5'primer=TGCTGCAGTTGAAGGAATCTTC (residues 642 to 663 of SEQ ID NO:248) 
Tm5=58.19 

3'primer-AGGTGAGACCAGGCAAAATACC (residues 706 to 727 of SEQ ID NO:248) 
Tm3=58.13 

probel-^TTAGTGGTGCATTCGCAT (residues 665 to 682 of SEQ ID NO: 248) 

probelstart=665 

probelStop=682 

directionl=Forward 

Tml=68.98 

scorel=l. 98 

length=86 
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CT1027 

Nucleotide 

Genomic coordinates: 

Start: 98874 

Stop: 99789 (SEQ ID NO: 250) 



Amino Acid 

MNLLPIFLTTFFVAVDACSCSTICLLPDGKKOPLVFDSVLEEVVYPTDVCGPKGAGELFT 
GVDLLTLCIGGKNNGGEWSGKGPCPRINNAVVERDYSLDEEDCKGFRKGFRIPGTDHFHT 
VFSLCWVDRDMHAKWVRNKINPGIVTDDEDLVDSGIRTKFKYSSKIFGKGFNPRPLYSLD 
YQER I KI LKS H FNKRTGNFFARGHLAPAGDFFLASERWATFALEN AV PQ I QN HNNGEWKD 
lENRARTTPGAAWAETGPIFYQHKKKEYLDKKKKYIPIPHALYKIVYDKNNKELFRVQSD 
MSWK 

(SEQ ID NO: 251) 



Top Blast Hits 

Sequences producing significant alignments: 

CAB55635 (CAB55635) DEOXYRIBONUCLEASE I PRECURSOR (EC 3.1.2 
Q9Y2C4 (Q9Y2C4) ENGL-A 

NUCGMOUSE (008 600) ENDONUCLEASE G PRECURSOR (EC 3.1.30.-) 
NUCG_BOVIN (P38447) ENDONUCLEASE G PRECURSOR (EC 3.1.30.-) 
073911 (073911) K123 PROTEIN PRECURSOR 

NUCG_HUMAN (Q1424 9) ENDONUCLEASE G PRECURSOR (EC 3.1.30.-) 



Score 
(bits) 

59 
48 
45 
45 
43 
42 



E 

Value 

6e-08 
7e-05 
0.001 
0. 001 
0.004 
0.005 



Comments : 

EST confirmation of the predicted transcript: 

An isolated EST has equence identity to nucleotides 1 to 808 of CT1027: this 
corresponds to nucleotides 98914 to 99720 of the genomic reference sequence. 
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CT1028 

Nucleotide 

Genomic coordinates: 

Start: 103891 

Stop: 104680 (SEQ ID NO: 252) 



Amino Acid 

MSLAVTEDYGHNEKLIKRLQTSVYHTPLLGADHVMKSISDYIISRRFMNYTNLLKQVEYV 

FDEETGAVIANICLLKILERCAQKGGIYDAPEDVAFFNSKMGEVTRLFTIIGGRPNMTVR 

VNFKHGQTNNPAYGYLTDDNDTTTVTPPVTPPPSPAARRSPFFTRTLISESSSVDHYVLM 

HDNPKRSSFKVYDIHAETFPHKAPSVPTFPPKTSFEISDVTLDCSMEIFSRDRDVLDNVH 

DYIANDPVPFLVDVVHRGSSLR 

(SEQ ID NO: 253) 



Top Blast Hits 

Sequences producing significant alignments: 



Score 
(bits) 



E 

Value 



PAXI_CHICK {P4 9024) PAXILLIN 36 0.39 

Q24459 (Q24459) POLYCOMBLIKE NUCLEAR PROTEIN 35 0.67 

LA17_YEAST (Q12446) PROLINE-RICH PROTEIN LAS17 34 1.5 

Q14676 (Q14676) KIAA0170 PROTEIN 33 2.6 

Q94399 {Q94399) ZK265.2 PROTEIN 32 3.4 

088699 (088699) HOMEODOMAIN PROTEIN 32 3.4 



Conments : 



TaqMan Primer/Probe Sets: 

5'start=390 

5*stop=413 

3*start-457 

3'stop^477 

5'primer=TCCTGCCTATGGTTATCTCACAGA (residues 390 to 413 of SEQ ID NO:252) 
Tm5=58.7 7 

3*primer=TCTTCTTGCAGCTGGAGATGG (residues 457 to 477 of SEQ ID NO: 252) 
Tm3=59-15 

probe 1=CCTCCTGTTACTCCTCCT (residues 439 to 456 of SEQ ID NO:252) 

probelstart=4 39 

probelStop==4 56 

direction l=Forward 

Tml=68 . 94 

3corel=l . 94 

length=88 
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CT1029 

Nucleotide 

Genomic coordinates: 

Start: 104759 

Stop: 107330 (SEQ ID NO: 254) 
Amino Acid 

MASTSSSTKKRVHEEDENLIPQPKKKKSKKVLPFPVDKYRAVDKKVVNLIHKILDQEKDH 
LSSTELQMITECNGAREDLLKHLLDEGEFNPTIIEVVSSMPIETIYEILSSSADDKKFVQ 
ISLSMLIHILFFADKGTMWVSNACVQNVLGNDYKVEFENIRKKYLILEDLLNGVSNHWSE 
HGPLSHMLHSSIPIVQDMLLNRLVRYFSTYDGDAQFDISFIINSVLWGIDKSVLNELTQL 
ISRGVFIVSYVPMRVRTPSKDSNRPQNTPSQNMSALGMKLNTFSSRISVYRNNTFKKLTE 
LVHNFDYGSKDASSSSPPPPSLSDSVNTFVRLYTNYDIFLKVISDWKMPYGFFKKTFDVL 
YSKGLMTLSVSEYTLKKELVTFLRALKERETLIYKMEKRDIICILKKS1.FGFNFRCLKQL 
LPLFKHFLKIEEVKHIARFVFRDYSLMCKTQKDLQSFPAIQSASLFMEEFPWLAKTWIDD 
DDDEGGKGHTLLTFAIVHRYPLISQLISHPILKSLVNTTCRDKHFTPLMHLANTSIMYQC 
NTLLCLIINGAKPEFINKFNENVLHIAIENVNYGVITELRGTLSSEQIEKMVNVRRMMDN 
TTPLMIALARENIVLAQLFDGLYKPKIKVRFGSSKRLRIPEFVLLKGLKESVAYLETRNI 
SYDINIIKDAVMDNSLFEEEYEIAAAGLRGNNCDPEADEKTMNTWNFFTKNSTKWASSIF 
QKNRQKFVKIVDGMNRTYEDSECAICLDSLDGDLPSGRTTCGHCFHNVCWLSLIRMSGPN 
NGSRARGGGIKCPSCRQVTCLGKRLGVADYDIETEEERDTKNVVPSVEEGRREWRKIGVD 
RYEFLVGGVWTNEIKL 
(SEQ ID NO: 255) 



Top Blast Hits 

Score E 

Sequences producing signxficant alignments: (bits) Value 



Q9Y577 (Q9Y577) RING FINGER PROTEIN TERF 




43 


0. 


,007 


AAD27914 (AAD27914) T23015.13 PROTEIN 




40 


0. 


.058 


AAD56722 (AAD56722) AUTOCRINE MOTILITY FACTOR 


RECEPTOR 


40 


0. 


.076 


Q9WV59 (09WV59) RING FINGER PROTEIN TERF 




39 


0. 


. 13 


082239 {082239) F17A22 . 9 PROTEIN 




39 


0. 


. 13 


R052_MOUSE (Q62191) 52 KD RO PROTEIN (SJOGREN 


SYNDROME TYPE 


38 


0. 


.22 



Comments : 



TaqMan Primer/Probe Sets: 

5*start=1406 

5'stop-1425 

3*start=1491 

3'stop-1514 

5 •primer=AATTCCCTTGGCTTGCAAAA (residues 1406 to 1425 of SEQ ID NO: 254) 
Tm5=58 .29 

3 • primer=TGGCTTATTAAGGGATATCTGTGC (residues 1491 to 1514 of SEQ ID NO: 254) 
Tm3=57.50 

probel=TTGGATCGACGACGATGA (residues 1428 to 1445 of SEQ ID NO:254) 

probelstart=14 28 

probe 1 Stop=14 4 5 

direct ionl -Forward 

Tml=68. 95 

scorel=l . 95 

length=109 
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CT620 

Nucleotide 

Genomic coordinates : 

Start: 172439 

Stop: 171509 (SEQ ID NO: 256) 



Amino Acid 

MDNLITNDNIILVTFLSGLAVGCSMTIGLALAMNMIiVKCIDRTTTCISCSPWEKNKNKKN 
RNGSNTESSFISHVRFNTPDKDLDISEPMLKSTTYDLANVTPQVTKLVTFSGPTYASPPT 
PRPVANTPQQQPTSTNKEEESVYMPMSSCSSSFSSDNSLPLPTPPPSPPRSNGGDYVSYV 
NGRHLKLPSNPPSPIFNIKNEEGEDDNVEEHVYEYVPEVPQQSPSIQKCIQELKEMKHKK 
NTLTRSSSNNNNNAPRITQVTFKKFPPNNNNMWENHVYGNTTIVSSTPSPTFIPSPKSII 
RKLSFKRKQ 
(SEQ ID NO: 257) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

Q24035 {Q24035) ENA POLYPEPTIDE 46 4e-04 

Q41805 (041805) EXTENSIN-LIKE PROTEIN PRECURSOR 43 0.002 

Q40786 (Q40786) ARABINOGALACTAN-PROTEIN PRECURSOR 43 0.003 

Q9XZU9 (Q9XZU9) LARGEST SUBUNIT OF THE RNA POLYMERASE II CO 42 0.005 

036428 (036428) GLYCOPROTEIN PRECURSOR 42 0.005 

YHC4 YEAST (P38741) HYPOTHETICAL 80.1 KD PROTEIN IN SNF6-SP 42 0.005 



Comments : 

EST confirmation of the predicted transcript: 

An isolated EST has equence identity to nucleotides 665 to 1 of CT620: this 
corresponds to nucleotides 1714 94 to 172158 of the genomic reference sequence. 

TaqMan Primer/Probe Sets: 

5*start=4 94 

5'stop-513 

3'start=558 

3'stop=578 

5'primer=CGCCATCTCCACCTAGAAGC (residues 4 94 to 513 of SEQ ID NO: 256) 
Tm5=58.4 9 

3'primer=GAAGGTGGGTTTGAAGGAAGC (residues 558 to 578 of SEQ ID NO:256) 
Tm3=58.68 

probel=GGCGGTGATTACGTGTCA (residues 517 to 534 of SEQ ID NO: 256) 

probe 1 start =51 7 

probelStop=534 

direct ionl=Reverse 

Tml=69. 00 

scorel=l . 99 

length=-85 
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CT621 

Nucleotide 

Genomic coordinates: 
Start: 173054 

Stop: 172505 (SEQ ID NO: 258) 
Amino Acid 

MGESIFDAVSLATNNPKKSNSRNKKLLRELKNMRKDFPSTFLQCRMIDFHFSGDIIDKHY 
CHSVNVPDVVPNTIFAVFLPEEDRANNPGLYDSIEGVCITVEQGELCIINKSSVHEFNIL 
VSLHKDLFGEDILDGIETASREESRSIHLYLEAGQSIRTPIPRPEGTNTVNYTIVFSNQV 
TV 

(SEQ ID NO: 259) 
Top Blast Hits 



Sequences producing significant alignments: 



Score £ 
(bits) Value 



088632 (088632) SEMAPHORIN IV ISOFORM B 

088633 (088633) SEMAPHORIN IV ISOFORM A 

029244 (029244) ABC TRANSPORTER, ATP-BINDING PROTEIN 
054948 (054948) SEMAPHORIN IV HOMOLOG (FRAGMENT) 
Q13275 (Q13275) SEMAPHORIN IV PRECURSOR (SEMAPHORIN ITI/F) 
INA9 MOUSE (P09235) INTERFERON ALPHA-9 PRECURSOR 



34 
34 
34 
33 
32 
32 



0.54 

0 .54 
0,71 

1 . 6 
2.7 
3 . 6 



Comments : 
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CT622 

Nucleotide 

Genomic coordinates: 
Start: 183817 

Stop: 180274 (SEQ ID NO: 260) 
Amino Acid 

MEAASNLRITEGAGVLDIDNEDDINNNVDYSNLYEDEEDEEEMNEDEEEEEEEDYEDEDE 

DTGVRNGRNKDPPSSKKQSKFVRDVTNDMYDDDDEEEEEEEEEEDEEGEEGGEYDGNLED 

EEEEGDEYEDDNEGEGEEDEADPALLALAAQQEDATIIPENQWKSIVNTPSPVGPNRQVL 

PMLNFLLENVNAMGGSAGEEQKNKEDDNQQIEPVEEEEDEEEEEQEEEEEEEEEQEEEEE 

EKEPIEQEKNEPEKDEDAIENESVHSHRVESSPMSEGGNDDGMDYFFSSIAGGGNDNEED 

EEEDEEEGEEEEEEEPAQKSEEHVETKESVQSHTEYIEEEEEYEEYEDESRHTLEDEEIS 

TMHQFNNAPRVRRSPPPDIQECEDAVVFPPIMKETDILPQIKEPSPKAPRMFSILGSGGE 

EQYDQLNDIAPPPVPSIVTFPPDNEMGEESRDIMCK3DSMLMPPPPPPPPPPPHQPPQLKP 

TNILLPPPPPPPTNQSLFSNNNNNPSFLSTVVGKVNNTLGGKEAEERLHKTMESIILKTR 

VKTLLETTKNLQCSELVKVVFQDPENPVKPSEKVMERLKNIIAAELTMKAFLDSAAVTDI 

KSAELFRKTNEKLELFQRKQIMSNPLFSAAYASTYIMGERASKIRPSTPAPSLKKVESIS 

ELNEDETSMSSSAGGVCAEGDESIAGGGGGGGGGGGEVVEHSSFYSNQTQANLHMELINI 

LKEDDDNQPCQTYKLGQRLAFLNNLISFKTSSAVSWSRLVNMLSDIVTKASVALFGDTNK 

AQEDFEKHQTETNDVSDLSTSSKLKQMSKESANIMEEMGLGSIGAEICFGAISTIIEKHI 

NKLCMDVGRLTIFLNIPIVLLNWPKEFTLSKDYKVLLLDSISSCSSKMAVPPIYVLNSIQ 

FDKAVDEEDEDGNGSEAEKRSEDGNMFSEKDKKEAIRRVYDNIRYGDSNDRTSLNHFFGD 

AYSGVSNNNSKNSMFDLQTQGGGRFGVAYSAGSSIIEHRSPIFDNALNTLVNFMDKRKHL 

LSAVVIKLLKKAKLSIEVYCIKYKLNQASEKYNKKGKHGKSTSVVPMRNLMYRPSKNQDV 

SPSTPAAATAMDVPSSVSSHVGRKRTFSFSNDINSNMSSASSVYIDQESSTPSRRRTFMD 

LLNNKSSVNSLAKQVKRMKHTKYYNSSSNSEDDDEDDQYE 

(SEQ ID NO: 261) 



Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 



Q9YTL7 (Q9YTL7) ORF 4 8 


154 


3e-36 


040947 (040947) ORF 73 


148 


le-34 


Q98148 (Q98148) 0RF73 HOMOLOG 


138 


3e-31 


AAD46S01 (AAD46501) LATENT NUCLEAR ANTIGEN 


136 


le-30 


VG48 HSVSA (Q01033) HYPOTHETICAL GENE 48 PROTEIN 


133 


6e-30 


Q91255 (Q91255) NF-180 


130 


5e-29 



Comments : 

TaqMan Primer/Probe Sets: 
5 'start=1940 
5*stop=1962 
3'start-2039 
3'stop=2058 

5*priiner=CGACTCCTGCTCCTTCTCTTA7VA (residues 1940 to 1962 of SEQ ID NO: 260) 
Tm5-57 . 98 

3'primer=TCCAGCAATAGACTCGTCGC (residues 2039 to 2058 of SEQ ID NO:260) 
Tm3=57,66 

probe 1=GTGGAGTATGTGCTGAAG (residues 2021 to 2038 of SEQ ID NO:260) 

probe Is tar t=2 021 

probelStop=2038 

directionl=Reverse 

Tml=69.00 

scorel~l . 99 

length=119 
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CT623 

Nucleotide 

Genomic coordinates; 

Start: 196292 

Stop: 195506 (SEQ ID NO: 262) 
Amino Acid 

MSNGATISDERLILILDKIVERRGVSNLSELLIHPITKHINELLKNTVRHGDRVYMKDAE 

LDVRSRLEDIKKDCVLKAIEKQGIDVRQIITDYLAKRKLTQNLVHWYRPPISCTDIDEKI 

QQETGQVGRCSVATYNLRIGGDDGEFTRYDFSIPLGDFKITAKLFRSINDEDVDAVILVS 

RSDVVNDVLSFEAFNRTGERVVIFFNVIVEGKSKDIDIVCKSRYKHTHILNGESATYAVK 

RIKRGDTRDDILFAITAFKEE 

(SEQ ID NO: 263) 

Top Blast Hits 









Score 


E 


Sequences producing significant alignments: 






(bits) 


Value 


PYCl YEAST (P11154) PYRUVATE CARBOXYLASE 1 (EC 6.4 


1.1) 


(PY 


33 


1 . 9 


BAA85008 {BAA85008) ORF3P 






33 


2.5 


BAA85073 (BAA85073) ORF4S 






33 


2.5 


PYC2 YEAST (P32327) PYRUVATE CARBOXYLASE 2 (EC 6.4 


1.1) 


(PY 


32 


5.7 


027375 (027375) ADENINE PHOSPHORIBOSYLTRANSFERASE 






32 


5-7 


Q88444 (Q88444) GLYCOPROTEIN PRECURSOR 






31 


7.5 



Comments : 

5'start=282 

5*stop=304 

3'start=383 

3*stop-402 

5 'primer =GGCTAMCGAAAACTAACGCAAA (residues 2 82 to 304 of SEQ ID NO: 262) 
Tm5=58.58 

3'priiner-CGTAGCAACACTACACCGCC (residues 383 to 402 of SEQ ID NO:262) 
Tm3=S7 . 92 

probe 1=CCAATATCTTGCACAGAT (residues 328 to 345 of SEQ ID NO:262) 

probelstart=328 

probelStop=34 5 

directionl=Rever se 

Tml-68.97 

scorel=l . 88 

length=121 
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CT624 

Nucleotide 

Genomic coordinates: 

Start: 195503 

Stop: 194651 {SEQ ID NO: 264) 
Amino Acid 

MSSNGDEPAVTEAEIASVEAQLGAAHHONSWITRKSDQLKYRLGAIAYSVAKNASIKYIE 
DQVRQEINSHLTNVMTFEHLYEDAFNPVICEAIFEKGIPVVMEKVYDVNRRIMEPREDFI 
TEILKEERWRRYIPGFYHTSFSFKYNTIAFTDSSTSFSVPINDKHMLSITPPGAAQGDLI 
DLSLSFKIDSSAKTLTLEFNRKSTFAGIVNRPKSWILSNLRNSDSSDNIGDYLKRNDPI 
YISHDTNGIINPSEDSASLITIHMPEIENASDDLYIDFNLFVF 
(SEQ ID NO: 265) 

Top Blast Hits 



Sequences producing significant alignments: 



Score 
(bits) 



E 

Value 



Y687_METJA (Q58100) HYPOTHETICAL PROTEIN MJ0687 
Q9Y717 (Q9Y717) CAGCR3 PROTEIN 
Q83457 {Q83457) FIBRE 

CAB53329 (CAB53329) PUTATIVE FORMYLTRANSFERASE 
PGK_LACDE (032756) PHOSPKOGLYCERATE KINASE (EC 2.7.2.3) 
Q9XEQ1 (Q9XEQ1) TNP2-1.IKE PROTEIN 



36 
33 
32 
32 
32 
32 



0.42 
2.8 
4 .7 
4 .7 
6.2 
6.2 



Comments : 
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CT625 

Nucleotide 

Genomic coordinates: 

Start: 194629 

Stop: 193327 (SEQ ID NO: 266) 
Amino Acid 

MALSNNGGIYIVFAVIVLVIGASIALFFAISGVGKGTLHSNAKTKKSKKYKLDSKYTDDD 
EKTDDDNNNNGGGGGGTVDVINETALQRQTREHFARTLEKAEDEFFTKLADQEFDTYKSE 
NVWLIKDKITDGKVSIPEGDINVPDVGQAIADENLFDLIGTNHDEVKETMDEVVAQKSTN 
ITYEQLVIDLTNILLFGTVTVDPSDENGDESLQRSTDPDAEMVMLTTTPSSQLARQQQPP 
QPTPDYLARYSKELVINNIRGGFISDRDMRTWQGRMSVHVNMKQRTFNVrSAATNLDSLQ 
VGLEPVLQKQGRAAVGGRIEKARIEFSFVVEGNRVRVYATNKTEDCFCSLLPNCYNVKKA 
SDYWISSASTAKEKTYLFIANKNDETSFFYNFEEGVEEIDLDIFMTIDCAPNLPFIKNLP 
RPITDNNIMVALS 
(SEQ ID NO: 2 67) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 



096148 (096148) HYPOTHETICAL 57.8 KD PROTEIN 


38 


0. 


. 14 


TRIl ECOLI (P14565) TRAI PROTEIN ( DNA HELICASE I) (EC 3.6.1 


37 


0. 


. 31 


Q43423 (Q43423) CYSTEINE PROTEINASE (FRAGMENT) 


36 


0. 


.40 


015696 (015696) ORNITHINE DECARBOXYLASE 


36 


0. 


. 69 


MIX2 XENLA {Q91685) HOMEOBOX PROTEIN MIX. 2 


36 


0. 


. 69 


BAA84604 (BAA84 604) MYOM PROTEIN 


36 


0. 


. 69 



Comments : 

EST confirmation of the predicted transcript: 

An isolated EST has equence identity to nucleotides 351 to 1 of CT625: this 
corresponds to nucleotides 193270 to 19362 0 of the genomic reference sequence. 

TaqMan Primer/Probe Sets: 

5'start=651 

5 'stop=669 

3'start=729 

3*stop=74 8 

5*primer=CCCAGACGCAGAAATGGTG (residues 651 to 669 of SEQ ID NO: 266) 
Tm5=58.66 

3 'primer=ACCGGGCAAGGTAATCAGGT (residues 729 to 748 of SEQ ID NO: 2 66) 
Tm3=59.29 

probel=ACAACAACAACCTCCTCA (residues 705 to 722 of SEQ ID NO: 266) 

probel start =705 

probe lStop=7 22 

directionl=Forward 

Tml=69.01 

scorel=l . 98 

length=98 
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CT626 

Nucleotide 

Genomic coordinates: 

Start: 228196 

Stop: 227989 (SEQ ID NO: 268) 



Ammo Acid 

MSDMTRNIIVGLAVVVIALSMVAFMLSVTPALTGFLLGLGVSALGVTLFGCPTMKSPGGG 
NATINPVA 
{SEQ ID NO: 269) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

SUGE^PROVU (P20928) SUGE PROTEIN HOMOLOG 32 0.35 

048546 (Q48546) INTEGRAL OUTER MEMBRANE PROTEIN 32 0.35 

086349 (086349) HYPOTHETICAL 27.4 KD PROTEIN 31 1.0 

Q9Y9X7 (Q9y9X7) 604AA LONG HYPOTHETICAL CARBON STARVATION P 30 1 . 8 

069867 (069867) PUTATIVE TRANSMEMBRANE TRANSPORT PROTEIN 29 4,0 

026370 (O26370) HYPOTHETICAL 5 . 9 KD PROTEIN 29 4.0 



Comments : 

EST confirmation of the predicted transcript: 

An isolated EST has equence identity to nucleotides 1032 to 768 of CT626: this 
corresponds to nucleotides 227951 to 228215 of the genomic reference sequence. 

TaqMan Primer/Probe Sets: 

5' start^eo 

5 ' s top=-80 

3'start=151 

3'stop-173 

5 •primer=CATGGTCGCTTTCATGCTTTC (residues 60 to 80 of SEQ ID NO: 2 68) 
Tm5=59.30 

3 'prxmer=^CCTGGAGATTTCATAGTGGGACA (residues 151 to 173 of SEQ ID NO: 268) 
Tm3=58.94 

probe 1=TGTTACTCCTGCACTTAC (residues 81 to 98 of SEQ ID NO:268) 

probel start =81 

probelStop=98 

direct ion l=F'orward 

Tml=6B.95 

5corel = l - 95 

length=114 
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CT627 

Nucleotide 

Genomic coordinates: 

Start: 234330 

Stop: 233778 (SEQ ID NO: 270) 



Amino Acid 

MFQKWFESFLDSSRPRYLDTTCVCSVYSYFSPCRKHIKFSTSHSHEGIKIHPPSILNHNT 
SSPTSGKMCNHHHKRLYLSTDDHTRWYDKNTSCIYLEDIGGVQFMVYEFHLTPKNNQLFS 
FPVHLQIHNRNTEKTSLLVFENEEDMRVRNIHPKSKILIPVSKDTVLVENGFRYKVKIVL 
SNK 

(SEQ ID NO: 271) 
Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

TRBH_ECOLI (Pig381) TRBH PROTEIN 31 6.4 

BCL6_HUMAN (P41182) B-CELL LYMPHOMA 6 PROTEIN (BCL-6) (ZINC 31 6.4 



Comments : 



TaqMan Primer/Probe Sets: 

5'start=187 

5 • stop-207 

3'start=233 

3'stop=253 

5*primer=CCCACCAGTGGAAAGATGTGT (residues 187 to 207 of SEQ ID NO: 270) 
Tm5=58 . 36 

3*primer=TCGTATGGTCGTCAGTGCTCA (residues 233 to 253 of SEQ ID NO: 270) 
Tm3=58 - 89 

probe l^CACCACAAGAGATTGTAC (residues 214 to 231 of SEQ ID NO: 270) 

probe Is tart =2 14 

probelStop=231 

direct ion l=Forward 

Tml=68.98 

scorel^l . 98 

length=67 
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CT628 

Nucleotide 

Genomic coordinates; 

Start: 240139 

Stop: 239455 (SEQ ID NO: 272) 



Amino Acid 

MDSLISKLENIFSIAEQDFFNADSMFMQTMLLPTDAMFTDCESPLYKNKSGGKNIVTDVG 

ESVLSSSSDEKMSFKVLSHVLRRFPVLLHCNYKQTNTPLWKELYKHGKFALLGDLVLFSN 

PFHPNIPAMPFDKSPICDTTGKSIIMSEVMTKELLYKLADKDIGQFFAVLNVTNPITGDS 

FLHYFAGGNTMRDGEGDKICTSADVLRIIAEITIQKTGKMPYELMKK 

(SEQ ID NO: 273) 

Top Blast Hits 

Sequences producing significant alignments: 

SYT_HAEIN (P4 3014) THREONYL-TRNA SYNTHETASE (EC 6.1.1.3) (T 
SYT_ECOLI (P00955) THREONYL-TRNA SYNTHETASE (EC 6.1.1.3) (T 
CDRl SCHPO (P07334) MITOSIS INDUCER PROTEIN KINASE CDRl (EC 
Q55626 (Q55626) METHYLENETETRAHYDROFOLATE DEHYDROGENASE 
ERG2_SCHPO (P87113) PROBABLE C-8 STEROL ISOMERASE (DELTA-8- 
Q85056 (Q85056) SEGMENT 2 



Score 
(bits) 

39 
38 
35 
32 
32 
31 



E 

Value 

0. 028 
0. 049 
0.55 
3.7 
4.8 
8 . 3 



Comments : 



TaqMan Primer/Probe Sets: 

S*start=246 

5'stop=267 

3'start=288 

3*stop=305 

5 'primer=GCGATTCCCTGTCCTACTTCAT (residues 246 to 267 of SEQ ID NO: 272) 
Tm5=58.22 

3 'primer=TCCTTCCACAGGGGCGTA (residues 288 to 305 of SEQ ID NO: 272) 
Tm3=58. 60 

probe 1=CAACTACAAGCAGACGAA (residues 27 0 to 2 87 of SEQ ID NO: 272) 

probe Is tar t=270 

probe lStop=2 87 

directionl=Forward 

Tml=69.05 

scorel=l . 94 

length=60 
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CT62 9 

Nucleotide 

Genomic coordinates : 
Start: 247143 

Stop: 244950 (SEQ ID NO: 274) 



Amino Acid 

MEGGDQRTKLTPATVMGLYQSKTPGEGEGGEGGGQFKIPSAIAVKSCCSKNATRRSPPSD 

SPYSLRPMKRLKKNNGEVGGKAPPPVTLRLREDYESTPYNFNRNKKKRPITIDENQFATL 

NPTYATDIIKKQQLPSVSAASVLRKHRANADTQYRKRFSHPNCAKFSTVNLKARDYTPLS 

VLRSHVKGPKHLKSSCDTVTETNVVKRNFSSIDKWVKLEKPPCYFAVAEADTNIAAGLES 

PFHLIRQAAKLGLISDVQDVSSNYETIKQSCIDAKEKASKFLWSNNRTKQPPSSWWPVGF 

GSKNLSVLDTSPLLNWNRLCKNNGKGWIKTMSIDHMAKNVFKLSPGACESILEKKTTLLG 

EVTAQCKKWESYRRNIPVPAHVQPEYASQVVMIGPSELYLEVKVGVYYMLSTGKVIKFMT 

DKEMYCEFVFETVFSHALEGRMKGAVGVRKMCVEGFCVEMDFAGISVIDVLNGDLKCKMD 

ENVVQQPNPSTTSSKPAAELMQDHGSLCRMRDTLYGVRMLQATGRLPEGLQSKCKKPITD 

SISAIAIVGKMRERMLNQLPFVLVEIVNIVTRLSQQGLVNPDIKSDNIVIDGITGQPKMI 

DFGLIVPCKKYYNFKCWGTDERFFSNHPHTAPEFINSELCSETAMTFGLAYLLIDMLSIL 

IKRTADLSANSIYTNIPFLSIVSKMYDQEKTNRPRAYEIAPVIGACFPFKDNIAKLFQSP 

KHSLYSKKVK 

(SEQ ID NO: 275) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

Q20443 (020443) PRK-2 PROTEIN 46 0.001 

Q17735 {Q17735) SIMILAR TO PROTEIN KINASE 44 0.004 

KPC1_LYTPI (Q25378) PROTEIN KINASE C (EC 2.7.1.-) 43 0.007 

001715 (001715) PROTEIN KINASE C 43 0.010 

001716 (001716) PROTEIN KINASE C (FRAGMENT) 43 0.010 
Q21734 (Q21734) PUTATIVE RIBOSOMAL PROTEIN 36 KINASE II ALP 42 0.017 



Comments : 

EST confirmation of the predicted transcript: 

An isolated EST has equence identity to nucleotides 1 to 933 of CT629: this 
corresponds to nucleotides 244 994 to 24 5926 of the genomic reference sequence. 

TaqMan Primer/Probe Sets: 

5*start-933 

5'stop=^954 

3'start=1020 

3'stop=^l040 

5'primer=CCCTCTCTTGAACTGGAACAGG (residues 933 to 954 of SEQ ID NO:274) 
Tm5=58. 83 

3'primer=GCTCCAGGGGAAAGCTTAAAA (residues 1020 to 1040 of SEQ ID NO: 274) 
Tm3=58. 49 

probe 1=TGAGCATCGATCACATGG (residues 992 to 1009 of SEQ ID NO:274) 

probe Is tar t=992 

probelStop=1009 

direction l=Forward 

Tml^68.98 

scorel=1.98 

length=108 
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CT1030 

Nucleotide 

Genomic coordinates: 

Start: 108549 

Stop: 109164 (SEQ ID NO: 276) 
Amino Acid 

MSSGKVTYEIVEGGLLNNKYLLDGGAAICLQSNCVARKRHAGSLHDNLFKMLGFGDPYKQ 
RRGKTNSPCNLAIIEDRPQLGSVSVVQHPTEPERFCSMTFLFAQYNMGNGRKCYFPNDKEY 
VESCKKHERVHKSSTEMKRLRLYYFNKCLHAIAKSPAMKKYNKIIFPARIGCAAAGGDWE 
K YHAS I R DFS T II DKE V 1 1 VSQRM 
(SEQ ID NO: 277) 

Top Blast Hits 



Comments : 

EST confirmation of the predicted transcript: 

An isolated EST has equence identity to nucleotides 1 to 604 of CT1030: this 
corresponds to nucleotides 108546 to 109149 of the genomic reference sequence. 

TaqMan Primer /Probe Sets: 

5'start^284 

5 ' stop=307 

3'start=387 

3'stop^408 

5 'primer =GCTCCATGACATTCTTATTTGCTC (residues 284 to 307 of SEQ ID NO: 27 6) 
Tm5=57. 98 

3 'primer^^TTCTGTGGAAGATTTGTGGACC (residues 387 to 4 08 of SEQ ID NO: 27 6) 
Tm3=57.62 

probel=TGTTGAGAGCTGCAAGAA (residues 360 to 377 of SEQ ID NO: 276) 

probe Is tart =3 60 

probelStop=377 

direct! on l=Forward 

Tml=69. 08 

scorel=l . 91 

length=12S 



Sequences producing significant alignments: 



Score 
(bits) 



E 

Value 



Q9YTM9 (Q9YTM9) ORF 36 

Q9XAQ7 (09XAQ7) NUOD, NADH DEHYDROGENASE SUBUNIT 



32 
31 



4 .3 
9.7 
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CT1031 

Nucleotide 

Genomic coordinates: 

Start: 109250 

Stop: 110088 (SEQ ID NO: 278) 



Amino Acid 

MSSNRFSQLRGNEEMVGDYSRWTTVKNRRNRQQQYSHSFRPQQQOQHQKRTSTNSPPAPP 

PPFPIISWGALGSYSMYRLDDQCRNCDETGYYNFHSYDRKRERVRSLNNTPSEGMWRRTS 

RSSPFLNKKKDVDEAPPPQSNQHMYPLNKYSFREYTPSSKLVNWRDPSQEKQDKILQEEE 

ARAPTPTPQEKEPEVETKDDVVIEEETAPEPEPEPAPVPDPDIPAITATTTTTTVATRHD 

DSSTVFLRNVILSIVFWFLGVYSALFAKCIRSKKE 

(SEQ ID NO: 279) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

Q01823 (Q01823) ORF-3 48 5e-05 

PAR1_TRYBB (P08469) PROCYCLIC FORM SPECIFIC POLYPEPTIDE 81- 48 7e-05 

Q99356 (Q99356) PROCYCLIN 48 9e-05 

P73032 (P73032) HYPOTHETICAL 185.1 KD PROTEIN 48 9e-05 

PARC TRYBB (Q06084) PROCYCLIC FORM SPECIFIC POLYPEPTIDE B-A 48 9e'05 

Q27045 (Q27045) SCHI ZONT/SPOROZOXTE SURFACE PROTEIN ( FRAGME 48 9e-05 



CoiTiments : 

EST confirmation of the predicted transcript: 

An isolated EST has sequence identity to nucleotides 137 to 680 of CT1031: this 
corresponds to nucleotides 109401 to 109944 of the genomic reference sequence. 

TaqMan Primer /Probe Sets: 

5*start-255 

5'stop=277 

3'start=347 

3'stop=^364 

5' primer =TTGCGATGAAACTGGCTATTACA (residues 255 to 277 of SEQ ID NO: 278) 
Tm5=58.44 

3'primer=ATCTACTTGTGCGCCGCC (residues 347 to 364 of SEQ ID NO: 278) 
Tm3=58.42 

probe 1=CTCCAAGTGAAGGCATGT (residues 32 9 to 34 6 of SEQ ID NO: 278) 

probelstart=32 9 

probelStop—34 6 

di rect ion 1— Forward 

Tml=6B.97 

scorel=l . 97 

length=110 
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cri032 

Nucleotide 

Genomic coordinates: 

Start: 119056 

Stop: 121081 (SEQ ID NO: 280) 
Amino Acid 

MAGNRTQFVSSLIAKCISDVEQGMECCGRQAQDALMTRLANLKLGDSLKETDVNLEYLRY 
ASTPLLGELNYDKQQYAATVDINLMAHFSYAALGIESILNSIRRVVVANHQRRNNGKKPS 
EPISRPHPLGGVEPPLSSELANAIRDKFISMGALDRLNSAIVTAALGAIASERELFLREN 
AVNYMYDVEFAERDAATTDTGNVVYLSTKMDEDEDDIIKRSEILDKVSKRPAKEGIDWRP 
TPDNSFPYQLIWGDDSVDDTVLIDLITNAIVPNIFMAKFILFICNHLRAVIRSMREILYG 
NISSSSDNYFEDGRKWCFWLNLYNRLEWFMLVVRFVIFLHSKKESFSGADNVNVKRLLVV 
VVESFPPVLLDTEWVKTNITSWPVINNSNNNSTLPVTEDTLMRLAIRTSSGARHPIFDEI 
NSLTTAVTNRITFQSAEFCTKILLGRALDEEEAGTKMLVKSVKETGEEKDKNNTFSSFGL 
LLKNTKNEELEINIGDNDDETTDVACWARTSSTSFIRNRTYAFKKIWGLEDASDVVELKR 
ESDAITSFVTDKSSPLLFPYVSDWSCLLLHPCCKAPAIIKSVWLQILKDFSQENIKTINE 
KVQSLSSEICQKSNDRFKNKKIAAEHVRSVKKLLNTISNREQEAALSTEHCIWLTILWKQ 
VVQNTLNLLENFPV 
(SEQ ID NO: 281) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

025761 (025761) HYPOTHETICAL 88-8 KD PROTEIN 38 0,29 

075130 (O75130) KIAA0635 PROTEIN 34 3.3 

Q17425 (Q17425) HYPOTHETICAL 62.4 KD PROTEIN B0024.10 IN CH 34 3.3 

033600 (033600) PURINE NTPASE 34 4.4 

Y066_NPVAC (P41467) HYPOTHETICAL 94.0 KD PROTEIN IN POL-LEF 34 4.4 

Q53626 (Q53626) ATP-BINDING PROTEIN INVOLVED IN MITHRAMYCIN 33 5.7 

Comments : 
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CT1033 

Nucleotide 

Genomic coordinates: 

Start: 121099 

Stop: 123634 (SEQ ID NO: 282) 



Amino Acid 

MFTHLTRAFRKMNNLVNRSFIDVHRVVAELSYPEFEEDVKNPESSIYRTPISLFONKDIV 
TIVGDYILSPKTDSFQVLYPIKKVIEHFPVIFHCTHNNAPLWVHLLDERHHRLLQSLLTY 
EIVNAKYRGIVVIPYYRRPINYQTGKSLLMSKLASVKVLDILMRCGSYKFISLMCMINKK 
NNTN FLHCCASKWGEVGSKMMLH I AEMFFAN PTTSQHLS DAS S FP DAAAEDDKGKTPAHL 
AIQEDNADALLFLISLYGAPWFQDNNSYMKSALELKSNKCVKVLSFAADKYEILPNINNN 
QLEPDTMCGVCATSVEEDENEGKTTSLSWYQMNCKHYIHCECLMGMCAAAGNVQCPMCRE 
DVGDEVLERCPPTIFRWLKLAERSEHNRVLFEAKKQEFYKOMEAMKPPRVVVPPRRTFLT 
PARRGERAIRIAREIATNAIAEATAQGDVNSYFPVLIDGSGEEYEEEGEEFFNSEEEALA 
FGRPFLEDEEEARQIQMRQFAELSRRGV3VNIINNDNPHRHISTVNIVQPVYGVEKSPAA 
SFIYNMLKNDVFESIRSRDTRVGGERVPVMNLSNDKRALFHAASSMLCDFATETNSQIVG 
LDFQAVYDPHHISNYIETFGSPLHAYPGAVTFLDGAQDYYAESIRYDNDIVSFSEMASEL 
HITEALDVFEGSLLSPLFKKIRTGKSYSNWNDHLRRRNYARDIAEEFVRVCENSLASREH 
PPVHVHPFRDGAIPILIEYIVDFIHHCITWSMQVNALHCMRKYIEHENTNVHLLNLRPTD 
ERVEVLRVSQLRWSRLFNEQYNTRMSLSTKRLSLMKIFNHDLGVSKFGVYKLLDIIEMYC 
FTLI 

(SEQ ID NO: 283) 
Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 

Q9XUM8 (Q9XUM8) W02A11 . 3 PROTEIN 39 0.17 

YEYC_CAEEL (Q21802) HYPOTHETICAL 64.6 KD PROTEIN R07B7.12 I 38 0.22 

CAB40955 (CAB40955) HYPOTHETICAL 22.5 KD PROTEIN 37 0.50 

YBR2__YEAST (P38239) HYPOTHETICAL 13,2 KD PROTEIN IN ORC2-TI 37 0.50 

Q47955 (Q47955) HHDA PRECURSOR 37 0.50 

082239 (082239) F17A22.9 PROTEIN 37 0.50 



Comments : 

EST confirmation of the predicted transcript: 

An isolated EST has sequence identity to nucleotides 1 to 208 of CT1033: this 
corresponds to nucleotides 1234 4 9 to 123656 of the genomic reference sequence. 

TaqMan Primer/ Probe Sets: 

5'start-1206 

5»stop-1225 

3'start=1269 

3' stop=1287 

5'primer=GGAAGCAATGAAACCTCCCA (residues 1206 to 1225 of SEQ ID NO: 282) 
Tm5=58-56 

3*primer=GATGGCTCGTTCGCCTCTT (residues 1269 to 1287 of SEQ ID NO: 282) 
Tm3=58.97 

probel=TTGTTCCTCCTCGCAGGA (residues 1232 to 1249 of SEQ ID NO: 282) 

probelstart=1232 

probelStop=124 9 

direct ion l=Forward 

Tml=68. 99 

scorel=l . 99 

length=82 
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CT1034 

Nucleotide 

Genomic coordinates: 

Start: 123757 

Stop: 126556 (SEQ ID NO: 284) 



Amino Acid 

MAADLLELAIQETIQSELEEXADTEFLNYLPHKTGICEEAAANGRPYLPTLEMRNEVDHF 
WSQDNRKLKLLGHFCGNLYVEAFIAGSIDAETCVGFLRSQATGLGYPLLKKLALIAREDK 
SNTTNYNLYIDRNSMMKQVFSAEIDKRPSSIQNTSHTKSSPVYLKLIDRRTECLALDWLD 
ASKRTAKEIGAARKVCFLQNLIVAILIPAYTETFVLDTGNELEQQVLDDAYFNAENKDKV 
DEMCVVAILSTLHNLFVRKSLPHHLYNAPFRLPPFGQHPIINIENSSFFNEDTTPILASI 
SIPSSMVIKHHTRKNSRWRCPNNLMTAAERSIFLRGVLTVSGDYGWFSVIVGSTIMPSVL 
FYGDRKHLINTVKSNNFSAITCSYWNKYMDCRSYGFEIIDTPENNCGFRIRAAIDCSNTD 
FHSPVTRVNKKKTSIINAVKNPFFIRHTEPKWYNKNAMCGEVLENVGVTLEQHVRVSDEY 
MDRFGSLLLGREKKWTCNYLDRIKSLETISNNLKGKIDTMCKILETKYNYKSSSLYYKQI 
TATSDDPIKMKIIASINKRRYLCNILEFAIISSEKKDEVEEDHTKTGNGGCAFSKYKKKQ 
LEPKQHLIVKVNKYIEAFSLIKMLRNDCERNKCRFKEAEIRECAISIELVRELYRASARSYV 
HDLVLKRTNVHLTWQRPYDENANTIMSLIPKCKLHTVLYDKDSRDVKLLNFLRTRDGNYN 
PIRHSMLELVYGEEYAKDVSTVTCFEWLKWCSKKGVIKYEDFLDRYEKTGEEDKDEREFF 
RLKKCSRDHTKDIKKIENVLNSDTLYSYSLDKNVQTHASSSTVVKNDTDGKTSMVGWDYI 
FSIGKGEKTTKKRKLETIDISSSDDDDEEEEEEDEGKRMKMNNCSSSIKNKSKNKNGRMC 
CTDILNVVEPSLPNTLSFNCVKSMDVLNLL 
(SEQ ID NO: 285) 

Top Blast Hits 

Score E 

Sequences producing significant alignments: (bits) Value 



062235 (062235) F36F2.3 PROTEIN 

AAF04442 (AAF04442) PUTATIVE TRANSLATION INITIATION FACTOR 
060313 (060313) KIAA0567 PROTEIN (FRAGMENT) 
Q904 91 (Q90491) DNA BINDING PROTEIN E12 
BAA82145 (BAA82145) MYOSIN HEAVY CHAIN 2B 
097303 (097303) PFC1060C PROTEIN 
Comments : 

EST confirmation of the predicted transcript: 
An isolated EST has equence identity to nucleotides 1 to 647 of CT1034 : this 
corresponds to nucleotides 12564 9 to 1262 95 of the genomic reference sequence. 

TaqMan Primer/Probe Sets: 
5 •start=1391 
5'stop=1410 
3 'start=1468 
3'stop=1489 

5'primer=AAAATGTTGGCGTGACCCTC (residues 1391 to 1410 of SEQ ID NO:284) 
Tm5=58 .49 

3'primer=ACGTCCATTTCTTTTCTCGTCC (residues 1468 to 1489 of SEQ ID NO: 284) 
Tm3=58.24 

probel=AACAACACGTCCGTGTTA (residues 1412 to 1429 of SEQ ID NO:284) 

probelstart=1412 

probelStop=142 9 

direction l=Reverse 

Tml=69.00 

scorel— 1 . 99 

length=99 



46 8e-04 

43 0.012 

41 0.037 

41 0.048 

40 0.063 

40 0.063 
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SBV.ORF022.txt 

Nucleotide 

Genomic Coordinates: 

Start: 265664 

Stop: 266297 (SEQ ID NO:286) 

Comments: EST confirmation of transcript not predicted computationally. 

TaqMan Primer/Probe Sets: 

5'start=230 

5 ' stop=252 

3'start=321 

3'stop=343 

5*primer=CAAATCAAAGGCCATATGAAACC (residues 230 to 252 of SEQ ID NO: 286) 
Tm5=58- 41 

3'primer=GGTGGTCTTTATTTTCATCACCG (residues 321 to 343 of SEQ ID NO: 286) 
Tm3=^58. 41 
primerScore=0 , 8 6 
allelel= 

probe 1=GAGGATAGCCAGTTCTAT {residues 265 to 282 of SEQ ID NO: 286) 

probelstart=265 

probelStop=2 82 

direct ion l=Reverse 

Tml=68 . 99 

scorel=l . 99 

length=l 14 
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C003 

Nucleotide 

Genomic Coordinates: 

Start: 126709 

Stop: 127126 (SEQ ID NO: 287) 

Comments: EST confirmation of transcript not predicted computationally. 

TaqMan Primer/Probe Sets: 

5'start-175 

5' stop=194 

3*start=265 

3'stop=283 

5 'primer^CATGGTGTGCCTAAAGCAGG (residues 175 to 194 of SEQ ID NO: 2 87) 
Tm5-57.84 

3* primer =TGGCACAATCATGGGTTGA (residues 265 to 283 of SEQ ID NO: 287) 
Tm3=57 . 81 
priraerScore=0 . 91 
allelel= 

probe 1=AATGGAGGCATCGAAGAA (residues 212 to 229 of SEQ ID NO: 287) 

probe 1 start =2 12 

probelStop=229 

direction l^Reverse 

Tml=69.02 

scorel-1 . 97 

length=109 
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COlO 

Nucleotide 

Genomic Coordinates: 

Start: 184094 

Stop: 184931 (SEQ ID NO: 288) 

Comments: EST confirmatxon of transcript not predicted computationally, 

TaqMan Primer/Probe Sets: 

5'start=547 

5'stop^570 

3'start=655 

3' stop=678 

5'primer=AAGATGATGAAACCTGGTTCAAAA (residues 547 to 57 0 of SEQ ID NO: 288) 
Tm5=57 .75 

3'primer=CGTCATGGAAGAAAAGGAATTGTA (residues 655 to 67 8 of SEQ ID NO: 2 88) 

Tm3=58.24 

prxmerScore-0 . 67 

allelel=- 

probe 1=CTCCTTGCAAGTGCGATA (residues 598 to 615 of SEQ ID NO: 288) 

probelstart=598 

probe 1 S t op= 615 

direct ionl^Forward 

Tml-69.01 

3Corel=l . 98 

length=132 
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C020 

Nucleotide 

Genomic Coordinates: 

Start: 60122 

Stop: 60341 (SEQ ID NO: 289) 

Comments: EST confirmation of transcript not predicted computationally. 

TaqMan Primer/Probe Sets: 

5'start=2 

5'stop^26 

3'start=118 

3 'stop=140 

5 •primer=CAACCTCCACATCTCCTTCTAATTC (residues 2 to 26 of SEQ ID NO: 28 9) 
Tm5=58.27 

3 •primer=AACGCTTTGTTT7VACGTGCTTTT (residues 118 to 14 0 of SEQ ID NO; 2 8 9) 

Tm3=58.18 

primerScore=0 . 61 

allelel= 

probe 1=ATTCAAGAGGTCACAAACA (residues 80 to 98 of SEQ ID NO: 289) 

probe Is tar t=BO 

probelStop=98 

direct ionl=Forward 

Tml=68. 75 

scorel=l . 50 

length-^139 



FIGURE 3, Sheet 156 of 160 



wo 01/38351 



198/201 



PCT/USOO/28888 



C028 

Nucleotide 

Genomic Coordinates: 

Start: 277009 

Stop: 277324 (SEQ ID NO: 290) 

Comments: EST confirmation of transcript not predicted computationally, 

TaqMan Primer/Probe Sets; 

5 *start=293 

5'stop=315 

3*start-384 

3'stop=4G6 

5'primer=GGTGCAGTTTCATCCTTACCATT (residues 293 to 315 of SEQ ID NO: 290) 
Tm5=58.07 

3'primer=TCGTCATCATAGTCGTCGTCAAC (residues 384 to 406 of SEQ ID NO: 290) 

Tm3=58.72 

primerScore=0 . 86 

allelel= 

probel-TTCCACAACCACCACTAC {residues 320 to 337 of SEQ ID NO: 290) 

probe 1 star t=320 

probe lStop=3 37 

direct ion l=Forward 

Tml=68.94 

scoreI=l . 94 

length=ll4 
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C036 

Nucleotide 

Genomic Coordinates : 

Start: 115067 (SEQ ID NO; 291) 

Stop: 115380 

Comments: EST confirmation of transcript not predicted computationally. 

TaqMan Primer/Probe Sets: 

5' start=99 

5'stop=117 

3' start=158 

3*stop=177 

5'primer=CCACTGCTGGACGCATCTC (residues 99 to 117 of SEQ ID NO:291} 
Tm5=58.94 

3*primer=CGACGGACAGTGGAGCTCTT (residues 158 to 177 of SEQ ID NO: 291) 
Tm3=5 9.04 
primerScore=l .25 
allelel== 

probe 1=CCATCAATGAAGAAGCGT (residues 131 to 148 of SEQ ID NO: 291) 

probe Is tar t ==131 

probelStop=148 

direct ion l=Forwa rd 

Tml=68.97 

3Corel=l . 97 

length=7 9 
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003-7 

Nucleotide 

Genomic Coordinates: 

Start: 186828 

Stop: 187047 (SEQ ID NO: 292) 

Comments: EST confirmation of transcript not predicted computationally. 

TaqMan Primer/Probe Sets: 

5 '3tart=63 

5'stop=82 

3'start-133 

3'stop=157 

5 'primer=TTGACATCAGACCGACCCAG (residues 63 to 82 of SEQ ID NO: 2 92) 
Tm5=58.08 

3 'primer=GGAATTTTACTCTTCCTCCATCTCA (residues 133 to 157 of SEQ ID NO: 2 92) 

Tm3=57.92 

primerScore=0 . 62 

allelel= 

probel=CACCCTCTAAACTCGAGC (residues 88 to 105 of SEQ ID NO: 292) 

probelstart=88 

probelStop=105 

direct ionl=Forward 

Tml=^69.4 9 

scorel=l . 50 

length=95 
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C0I9 

Nucleotide 

Genomic Coordinates: 
Start : 141541 

Stop: 141647 (SEQ ID NO: 293) 

Comments: EST confirmation of transcript not predicted coniputationaily . 
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1 1 Declaration 1 ^ 1 Declaration 

Subnnitted OR Submitted after Initial 
with Initial Filing (surcharge 
Filing (37 CFR 1.16(e)) 
required) 


Attorney Docket Number 


CL000895USNAT ^ 


First Named Inventor 


Xun XU etal. 


COMPLETE IF KNOWN 


Application Number 


09/914,464 


Filing Date 


11/08/2000 


Art Unit 




Examiner Name 





As the below named inventor, I hereby declare that: 

My residence, mailing address, and citizenship are as stated below next to my name 

I believe I am the original and first inventor of the subject matter which is claimed and for which a patent is sought on the invention entitled. 



ERI MARY NUCLEOTIDE SEQ UENCE-QI^THE-SijRI.MP WHITE SPOT BACILLIF ORM 
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SHRiMP3ZlRUS-OUTBREAK-AND-SEREAD 



(Title of the Invention) 



the specification of which 
□ is attached hereto 



0 



was filed on (MM/DD/YYYY) 



11/08/2000 



as United States Application Number or PCX Internationa! 



Application Number 



USOO/28888 



and was amended on (MM/DD/YYYY) 



(if applicable). 



I hereby state that I have reviewed and understand the contents of the above identified specification, including the claims, as amended by 
any amendment specifically referred to above. 

I acknowledge the duty to disclose information which is material to patentability as defined in 37 CFR 1.56, including for continuation-in-part 
applications, material information which became available between the filing date of the pnor application and the national or PCT 
international filing date of the continuation-in-part application. 



I hereby claim foreign pnonty benefits under 35 U S C. 1 19(a)-(d) or (f), or 365(b) of any foreign application{s) for patent, inventor's or plant 
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□ 
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are believed to be true, and further that these statements were made with the knowledge that willful false statements and the like so 
made are punishable by fine or imprisonment, or both, under 18 U.S.C. 1001 and that such willful false statements may jeopardize the 
validity of the application or any patent issued thereon 
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Inventor's 
Signature 




Date 



Xiamen 
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Rockville 

City 



MD 
state 



20850 



ZIP 



us 

Country 



W\ Additional inventors are being named on the _^ supplemental Additional Inventor(s) sheet(s) PTO/SB/02A attached hereto 



[Page 2 of 2] 



PTO/SB/02A (10-00) 
Approved for use through 10/31/2002. OMB 0651-0032 
U.S. Patent and Trademark Office; U S. DEPARTMENT OF COMMERCE 



DECLARATION 


ADDITIONAL INVENTOR($) 
Supplemental Sheet 




Page 3^ of 4 



Name of Additional Joint Inventor, if any: 
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Family hlanr^^ ^ 
or Surname 
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OA 

State 



C/O Celera Genomics 

Mailing Address 



us 

Country 



us 

Citizenship 
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Given 
Name 


HE 
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Inventor's 
Signature 


Date 


Xiamen 

Residence: City 


State 


China 

Country 


China 

Citizenship 
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45 West Gude Drive 

Mailing Address 


Rockville 
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20850 
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us 

Country 
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Inventor's 
Sianature 


Date 


Shanghai 

Residence: City 


State 


China 

Country 


us 

Citizenship 


c/Q.CeJera Genomics 

Mailing Address 


i§iiyWdg^de Drive 


Rockville 

City 


MD 
State 


20850 us 
ZIP Country 



on the amount of time you are required to complete this form should be sent to the Chief Information Officer, U S. Patent and Trademark Office Washinaton 
DC 20231 DO NOT SEND FEES OR COMPLETED FORMS TO THIS ADDRESS SEND TO. Assistant Commissioner for Patents. Washington. DC 20231 . ' 



O :l. ^Mi^ . 'IDI 'Mi-CJl HXm iff 



PTO/SB/02A (10-00) 
Approved for use through 10/31/2002. OMB 0651-0032 

under the Paperwork Reducfon Act of 1995. r.o persons are required to respond to^'^o^^e^l^^^^^^^^^ 



DECLARATION 



ADDITIONAL INVENTOR(S) 
Supplemental Sheet 

Page 3 of 4 



Name of Additional Joint Inventor, if any: 



□ A petition has been filed for this unsigned Inventor 
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us 
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SEQUENCE LISTING 

<I10> PE CORPORATION (NY); 

THE THIRD INSITITUTE OF OCEANOGRAPHY, STATE OCEANIC ADMINISTRATION, 
CHINA ; 

SINOGENOMAX CO. LTD. 

<120> PRIMARY NUCLEOTIDE SEQUENCE OF THE 

SHRIMP WHITE SPOT BACILLIFORM VIRUS (WSBV) , DISCOVERY 
SYSTEMS CONTAINING THIS SEQUENCE AND DETECTION KITS AND 
ANTIVIRAL TARGETS FOR DETECTION AND CONTROLLING SHRIMP VIRUS 
OUTBREAK AND SPREAD 

<130> CL000895-PCT 

<150> 99124717.5 
<151> 1999-11-24 

<160> 293 

<170> FastSEQ for Windows Version 4.0 

<210> 1 

<211> 305107 

<212> DNA 

<213> SHRIMP 



<400> 1 

ggatccagac 

tgtcgtgagg 

taatttctca 

taggaatgat 

attatcaaca 

caggacggca 

ggactttttg 

tatatatgaa 

aaaaaatgtg 

cttaacagat 

agtgtgttga 

gctgaaaatt 

atatccctct 

ttgtttacat 

ttctttttta 

ttcctggttt 

atggtcaagt 

ttcttgttaa 

cccgccagta 

tcatgagaga 

cactgagata 

cgatcctttg 

aagtactgga 

gaggaagaag 

tggtacattt 

ctcaggtaca 

tcatggcttt 

atctgttcaa 

ttttgacaac 

cgccgctcat 

cttcttccat 

tgtctatatc 



atcaccccaa 
ataaaaactc 
tttttaaaat 
atgtccgctt 
ggatggaata 
acttgtcgtc 
caaacacctg 
tttcaagcta 
tttatttttc 
aatgcctatt 
tcctattttt 
cccttcctgt 
ttgtattatt 
ttagtgtgtg 
ccagttcaga 
gtaatgtgcc 
ttattatttc 
gttctatgtt 
tagcggcgta 
aaaaaacaga 
tattgtgagc 
aagccggcat 
tagcctttac 
ttcgcctgca 
tgaaggtgag 
tccacgctca 
aatcttgttg 
aactttccac 
tacaggatct 
gcctgtttct 
ttct tctcca 
atcatcatca 



caatcgcaga 
cttcatgtga 
acgtcaaacg 
tcttattaga 
gagaagctct 
tagcgcaatc 
ggtttgacta 
caatgtaaat 
cccaacctta 
tgtatcatcg 
cttcaaaact 
agcatttaga 
gtatactctt 
gtctccgtct 
atcggacgtt 
gtttacaaag 
agattcaaca 
ggttacaact 
aaccccccac 
tattaaagtt 
agcaactgcc 
cttgaaatga 
tgagttgaaa 
ctggcaattc 
agcacgtctt 
ccttttatac 
gttatccctc 
gactccattg 
ggttgaatat 
agtct tgaaa 
ctatctaaaa 
tcttccccaa 



cccgattaca 
tgagttgaat 
gtttatggcg 
tttgattttt 
gggaggaact 
ttattacccc 
caataaatct 
aattggttaa 
aacagatcat 
atgtcttcgt 
gttggtttga 
cctggagaca 
gcatacaatt 
ccaagtaaag 
ccttggccag 
ttaaagtggt 
gggtaggcgt 
atagatataa 
atgtgcattt 
tgttatattt 
gtttcagcat 
tgtgcagtat 
gtttcacggg 
acaaaggagt 
tcttcaacca 
caacgtctta 
gatctacgaa 
ggtccagagt 
atgcagatga 
ggagtgtgtc 
attcgtctat 
gagatgaaga 



gtagcagaag 
aaattcatga 
cagagagctt 
gaaaaattga 
actggggcta 
ctccctgaaa 
ttccctaaca 
taaaataaag 
tgccaggaga 
caaacgttgc 
aatataatac 
gtgaaacaga 
ctaattcttt 
atgtgaggat 
ctacatatac 
ttcctctagc 
ctttatcctt 
ccacgagtat 
tgaagaaagt 
tatttagtcg 
actcgaaaga 
tatttctgct 
aacagcttga 
gctgattatt 
aaggtgtttg 
cagttccttc 
gaatctaata 
ataatctgac 
tgcatcaaaa 
caaatcttcc 
atcattatca 
aggagatggg 



ttggaaaccc 
gattctctac 
catctaatag 
tagagttttt 
ttcttatgat 
ccgttggtat 
atgaaaggta 
gtatattttt 
aaatcgcata 
tccaaacaca 
aaatttgtta 
gtccactgtt 
agattcggct 
tatgtctcct 
ctttccatag 
aacaccgtta 
cttgtccaat 
caatgtcaaa 
tgtacaaagg 
tagaaatcac 
agaatagttg 
agttttgttg 
acagttgatg 
gttgttatga 
tctaatgtgg 
agatcttccg 
tctccctcag 
ctttctatgt 
tctacttctt 
tcttcttcgt 
tcaaagtcta 
gttcgtgtcg 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1580 

1740 

1800 

1860 

1920 
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ggggaggtgg 

gtggaggtaa 
cagtatttac 
cagcaccttc 
ctccttcggt 
ttgtagtagt 
acacaaaaag 
caattatggc 
ctgacctctg 
tgttgatgag 
gccgtgcatt 
cgtagtattt 
aaaagagaag 
ttttaaaacc 
gtaatcattg 
tcgtcttcgt 
gaaggtatta 
ggcgattgta 
gttgtttaga 
tgacctcatg 
aaggggtaaa 
cgtctagcaa 
taaagaaatc 
cagtggaaaa 
ctcagaatga 
tattttctaa 
cctttgcagc 
ggagtgtgct 
ctcttttaaa 
aacttctaag 
tgaataaact 
gaggaaaaac 
tggttgaaca 
aaagcgggga 
gattcgtgct 
cttctgttta 
acaaattatc 
tcgcagtacc 
atcttcccca 
attggtacaa 
ttgattctca 
agaaaagaaa 
aagcgcataa 
aagaagaagc 
gccgtaaaat 
tattggtttc 
acaactttgg 
tattggatga 
acgtgtctct 
ataataacgc 
aaaatgtact 
atgttgttgt 
gtggtgataa 
atttagttga 
aaaaagcata 
ccggggacga 
cccctgcaag 
gtgacatttt 
gccaaggcat 
tctctgaagg 
aggacgccaa 



aggagtagga 
caaaccttct 
actcggttgt 
ctcgtctctt 
gtcatcttct 
tttatttggt 
tatgagagat 
accagaaaaa 
ccgcgaattc 
ctaccatttc 
cttcttcaat 
cctccctata 
gattttctgt 
caaactcttc 
tatttgcctt 
gttttttcct 
gcccctcaac 
gtacgagcgt 
cctagataga 
tccgatatat 
aacaaaacac 
ggggaatttc 
aaacaacttg 
aatattcagg 
agaagaatgg 
atcacttaaa 
actagcttct 
tcaacgtaca 
taatgaaaaa 
cagacgaggt 
caaggaggct 
atcagcagta 
tatttttaag 
agaagaagat 
tcaatttatg 
cttttgtgat 
gtcaatgcat 
ttcaacttat 
gctaagtttc 
acacctgggg 
gacattgagt 
cggaatagcc 
ggcattgaca 
gatggaccag 
tcgtcaagct 
taacttcaag 
ttctagtgga 
agtgcacatt 
gtcggataaa 
tgggaacatt 
agtcggtttg 
cgtatgggat 
ttttatccag 
acgaataaga 
ttcaaggaag 
ttttacagaa 
aagtgccaag 
ggagacgagg 
agtaaaagag 
tttagctgaa 
aacggttgta 



gtaggagtag 
ccaggcacag 
tcaaattcaa 
ccgatcggcc 
ttttccttct 
cgtgacatga 
aataaaagga 
taaaatgatt 
ttcataccct 
atatcgcata 
ggcagaagtc 
caaatctata 
aagctggtcc 
taaaatatct 
gtctatcact 
ttcttcttct 
tacctgagaa 
gatcgcgctc 
gagtgggaga 
gtattggttg 
accatggtgc 
tctaaggaag 
tcaattgcaa 
gaatggaacg 
atggatatca 
cctgataaat 
gccgtggatg 
acaaaggtta 
tggacatcta 
gcactgaaaa 
cctcttcccc 
tctgctggaa 
gtctccttta 
gaggaagaag 
aacggacaca 
tattatgact 
cctggcacat 
aagacagaat 
gactatgggg 
gatgtggtag 
ggtgtgtata 
gatttggccg 
gcagaagaga 
gagcctgcag 
gtttctgtgt 
aatgtgttaa 
gaatttgaag 
cttaggaata 
tctcccccta 
tcttccctgc 
tccggaggat 
ggttgtgatg 
ctccatgaaa 
gatgtgct tc 
aacatctacg 
aaggggattg 
ataacggttg 
gcgctcacca 
accgaaggac 
agtttaaggt 
ttcaaaaatg 



gtggaggagt 
ttcctgttcc 
aaacttcctc 
tattttcagg 
ctttatcttt 
aaatcacacc 
acatcattat 
tgtacaaaca 
tccatattct 
tctcgccaaa 
atgtcagctg 
accttctgcc 
atggtgtaat 
acagat tcca 
gcttgtcttt 
agaagccgtc 
gcttcctgtt 
ttgtagatgt 
cgctattaat 
ggtgtaatac 
taactctttc 
atgcggtgtt 
gacctccctc 
aaagtggggg 
tatccttagt 
tggcagataa 
aaaaattgac 
tgaaaaagga 
ttttattgga 
gcgccgaaag 
accatccgag 
cagtcatcgc 
gaaaatgggg 
aagaagaaga 
acgggcaaca 
a tttggccta 
tcaa tatgga 
tagagtacaa 
agtttttatg 
attctctaga 
agaacactgc 
ttaggagtat 
tggaagaaga 
aagtagactt 
taaataactt 
ccgatgatac 
cattatcttc 
ctgatataca 
gccgtgtccg 
aaacgtatgg 
ttgaagatac 
gaggaaaagt 
aaacagcaga 
agactgcaag 
ctgttttgcg 
ctctcaagga 
aaggagtcaa 
catatagtgc 
tgacagctgc 
ctgatgcgaa 
acacctctcg 



aggagtagga 
tccacctcct 
ctcttctttt 
ttcccttcgc 
ctttgttttt 
aaccaccaca 
tatccttatt 
cgagttaccc 
ttgctgcaaa 
aagatccatc 
tcattttttt 
taaagacagg 
aaagtgttca 
ggtactctgc 
tatcttccac 
taaagacagc 
ttacatcatt 
cttccaaaat 
ttttttgttc 
aagtctctac 
ctgtacgact 
ggggaaccag 
aatagaatct 
agaaaaaatt 
ggaaagtgta 
aacatgtcta 
aatcttatca 
ccccaaaaaa 
caggttaaaa 
agtagaagta 
cctatttgat 
atcagatatg 
tccctgtgga 
aaagaaacat 
ttatcatagg 
caggaatctc 
ggatttacct 
aaggtttgtg 
ttactgtatc 
aaatagttcc 
taattacaaa 
ggcagaattt 
agaagaggaa 
cctttcagtg 
tgtggagaac 
tgtatcagga 
ccatttattc 
aagaacccta 
tggaagcaat 
cggtatagaa 
cgacatgtat 
gctaagtgtc 
aacttttaag 
taagaccgga 
tgaaaatggc 
taaaacaaat 
gggatttttc 
agaaaccttc 
aacagtggca 
tctaggtcta 
ttctttattg 



gtaggagggg 
gtattggacc 
ccgtcttcta 
cctaatacag 
ttattggatg 
cccacaaaac 
tttttacatt 
ctccacacgc 
ggtcacgtca 
cgcaacagcg 
aataagatca 
cctctctgct 
tttatctaac 
cgccattgtt 
atctcccacc 
ttctctaatt 
cacaaattcg 
gtccattttt 
atgtggctca 
aaagcaatag 
aggcgcgtag 
ttccccattt 
ttttctgcat 
ttcgacatat 
tatgaacctg 
accgcggctg 
ggtagtgatg 
atagcagaat 
acagccaaga 
cttcatcggt 
aattttagtg 
catttcaaat 
gataaaactg 
tccatatcaa 
cccgaaagtg 
cctaatgagt 
ttccgccctt 
caatcaacaa 
ttcggagcag 
atgatatcat 
aggttgggga 
atccgcactg 
gaagaagcgg 
cctcatttac 
gatctttcta 
acagatacgg 
ctttcaagaa 
ttttcaacgc 
gtcaacttta 
gagttgcctg 
tccggagagg 
accttcaatt 
gatgatacgg 
aaccttaaca 
attgagcgcc 
caaccccctc 
agcggttttc 
agagatttag 
gaaacatcct 
gaattttcag 
gaagaaacta 



1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 
5400 
5460 
5520 
5580 
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gggcattaag 
ttagtgccga 
cccttgaaca 
tgaagaaact 
aaattaacga 
atgcagttac 
tggtggccgg 
atctcaacgt 
tttcttgtgc 
aaatagacca 
tccctgaaga 
aagatgctgc 
tgccttgggt 
ggatgtccac 
aggctatttt 
ctgtaataaa 
cacgcctcaa 
tcatactgat 
caaagaattc 
caaaaatagt 
aaactgggcc 
tccctgccac 
tcaacgactt 
acacattaaa 
taaaaattgc 
tataagtttc 
aaacagattg 
atggctccga 
gtccaaattc 
aaaagtgggc 
tgtgcacatc 
ccgtttttga 
aagtggagta 
ttattgctag 
ttacaatttt 
agtacgtgat 
aatattttgt 
atcatcacga 
ggaacctaaa 
agcagctggg 
aaccccgatc 
tagaactggg 
aactggctcc 
atataacagt 
gtccctttcg 
ttcggctgct 
ggaattggga 
gtagtccagt 
catgtttgag 
agaggaggta 
atagaagtta 
ggagacatct 
tgtttcttgt 
cgattacggt 
ttgataaatc 
gaagatcgta 
tcctctccaa 
tcttcttcct 
gaagaagatg 
tcctcgacat 
tgccttgtgt 



agcaaacaat 
tttagatgct 
aaacttgaaa 
aaagaaaata 
cactgaactg 
agtgttgatg 
gtttttaggt 
cgtggaccac 
agatagaaac 
tgttatcgca 
cggaggaccc 
tcaaggagaa 
ggcaagcatg 
tttaaaggca 
tgaggttgca 
taaccccttg 
tccatccaac 
agttcgcttc 
agctaaaagg 
attggcagaa 
catgagatta 
aacaaattat 
ttatacaaaa 
tgtatatttt 
attgttttat 
tgatgacgtt 
cattccgtgt 
catatcttgt 
tcctctccca 
ccagccagcc 
aacatatgcc 
caagtaagtg 
tttttcttgt 
aataaagaaa 
gttccacatg 
agaattgcag 
ctagattgaa 
ggcaaaagat 
agtttccaca 
aaagttgtcc 
ttgttgcttt 
gtattcttag 
ttttcagtag 
tggcgaataa 
attagttggc 
ggtgctgtcg 
ggaggagctg 
tctaatacat 
gagggatggt 
ggagcaggcc 
tttttatccc 
ttttaataaa 
ctgttggttt 
gcctcttctt 
tatcaacgta 
gaaagtttgc 
gaagagaatc 
catcgcttga 
tagaagtaga 
catcttcttc 
ttgttacagt 



acttcttttt 
gaatttgctg 
gatctcgaca 
gacagttatt 
tcaaaggcta 
aacaactttg 
ccagccgctg 
accagcccta 
accggatggg 
ctagatgcat 
aaatcgaaat 
tgtggatctt 
aaagatttgc 
gtttcttccg 
gaggacgccg 
ttcatatttg 
tggaaaactg 
tttgcagggt 
aaacagactg 
aaggacaacg 
gaggagcttc 
tccaaatctg 
ataataaaca 
atatattttt 
tcctatacac 
tttccatcac 
ccatcaacat 
actgtcgatt 
taacggatgc 
tcgtcagtac 
ataaaatcgg 
actggcaact 
ttagaagcat 
acaccaacaa 
gtcataattt 
tgataggcac 
gcattgcagc 
tgtcctttga 
ttatagcaat 
atattaattg 
ggcagatgtt 
ccagtttgac 
agttggtttt 
tggagtcaac 
acacgtcttt 
cagacgcagt 
caacagataa 
caaaatcctg 
tgttgatact 
ctttttatag 
tgatattata 
tttagggagt 
tttgagtctt 
ccttttattc 
cttttttatt 
atcagcagat 
gacgttagtg 
agggatggcc 
agccatttct 
ttcaatctta 
gctagaccat 



cgtcgtttgc 
cagagatgag 
aattcgaaga 
tgacagagaa 
cagattcagt 
gaaaggttac 
tcgccctggt 
aaggtgtcat 
ctaagccaac 
cattcttaac 
ataaggccta 
gggcaacatt 
ctaaaggaca 
ttcttttgtc 
tggtggggtt 
gagtgcctct 
gcctcattgt 
cgggcccgct 
aacaattcga 
ccaatagtaa 
ctgggcatga 
ccaagattct 
cagacatcat 
attttagtaa 
caacaataga 
ccattctgat 
ccgtgttctt 
ggatagtttc 
gacgataagc 
acatggggtg 
aatgggtcaa 
gtttggcagt 
gtcgtatata 
ccataggcca 
ctcagaaaag 
aggcggatat 
aatctcctgt 
tccttttgat 
gtgataaaag 
cccactttgt 
cgccatccaa 
tgcctcccct 
tatgtttttc 
ttcaatagta 
atctagcctc 
tgctattagt 
catggcagta 
ttcttcccct 
acccgagctc 
ccctaaaggg 
caagcagaga 
cttctccaag 
ttgttgttta 
ctgtcagcaa 
agcctgtttc 
gtggattctg 
tctgattcag 
gaagacgtgg 
ttgtttgttg 
ttcagtgcac 
aaagccctcc 



aagggacatg 
agaaacatac 
gactatacca 
tccagaaagg 
attggggaag 
aattgtagta 
gcatgcgtcc 
cagttataaa 
caagcacccg 
tgaaaatgga 
cgcaccaatc 
cgacgacccg 
atccctctcc 
cataggaaag 
ggcgagcaag 
tggatttggt 
attttcaata 
aaccctgaat 
agacggggga 
acttcaatcg 
agatttgcgc 
gggctacaaa 
aaaaatggat 
caataaagat 
tatagactta 
tttattccta 
cttgaagatg 
agctagggaa 
ttcagcagaa 
cttcagagtt 
gtgcatgtcc 
aaattgttca 
gagagcgaca 
attcattcgt 
ttcctcaaca 
tccgacggtc 
ttagcatgga 
aaagcttcga 
ctgtccaacg 
aacgcagagg 
tctagttttg 
ttcggttctt 
tctgatgcag 
tcactcttca 
atcctgttag 
ccatcagatg 
atatcatgga 
tcctccattt 
atgccgtaca 
ccattatcga 
aaaatttcag 
ttttcttctt 
gcttatgtat 
gtaaaatatc 
tggtcgtgga 
tatctgaa tc 
agtcatattc 
gagaagaaga 
ttgttacctg 
ggttcaattg 
tggaagcagc 



ggcgtccaag 
cccgatgcag 
gaaagtcaag 
gctggcaaag 
aaactaggca 
ggggcttctg 
agaggggcac 
attgtggact 
ttcagggaag 
gcatatgtat 
tgtggaacaa 
cattctgtat 
tgcgataaag 
gatgttgcag 
gcaatttcag 
atagctgcta 
ctactagtgg 
tggtttggtg 
ggaaatcgtt 
aggaggaatg 
ccagttttct 
tctaaaccct 
aggtaaaata 
tatacatgta 
tcactaagat 
ttcttctcca 
tgggttatat 
tgtttttctg 
gccagcaggc 
tttgttctat 
tttaatttac 
ttgatatgag 
gctaaggcag 
ttagtagtta 
ggggaattat 
ctcttccgag 
caacctgtat 
tgtttataaa 
aagcatatac 
ctaaattttc 
ttatcgtatc 
tggggaagta 
attctccaaa 
ctaattctat 
agtttcctgt 
cgtccagtat 
tgattttgga 
tgttggtgtc 
ccgaatgacg 
cagtactatt 
tctacttgca 
attggttttc 
ttggggttgc 
cttttcgtac 
attccgtgaa 
ttcttcctcc 
ttcttcttct 
ggaggatgag 
cccatgttcc 
ctgctgataa 
cgatgctcca 



5640 
5700 
5760 
5820 
5880 
5940 
6000 
6060 
6120 
6180 
6240 
6300 
6360 
6420 
6480 
6540 
6600 
6660 
6720 
6780 
6840 
6900 
6960 
7020 
7080 
7140 
7200 
7260 
7320 
7380 
7440 
7500 
7560 
7620 
7680 
7740 
7800 
7860 
7920 
7980 
8040 
8100 
8160 
8220 
8280 
8340 
8400 
8460 
8520 
8580 
8640 
8700 
8760 
8820 
8880 
8940 
9000 
9060 
9120 
9180 
9240 
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gggtccattt 
tttactttca 
agtcgtacag 
tttgtatcaa 
cattaacaaa 
tgtccacatc 
tttctccttc 
cttcgtcgtc 
cgtcatttat 
ctggcttctt 
aaaggccatc 
acataatctt 
tgaaaccacg 
tgtgggaatg 
ccaaactctg 
caggtacatc 
ttctcaccgt 
ttgatagagc 
cattgtcggg 
tagagaaaac 
aaggtgcatc 
aagtgtctat 
cacaatcaga 
cggatttaac 
gcccaagatc 
gcatcgcgag 
atgcaacccc 
cctttgtcat 
cttgatcgat 
atctatcggc 
atcttctctc 
cgtttctgtg 
aaacgcaatg 
cggtattgcc 
taatacattt 
tcgttatacg 
attgtgtgat 
ctgaaggcac 
agattgtgaa 
ttgtattagc 
cactttcatc 
ccccagtttg 
tcactgcatc 
cagaagtact 
caaatacttc 
cagttttacc 
ccatatcaat 
aaagcgcatc 
ttgttttccc 
cgatattttc 
acggggaatt 
cttggtgttc 
aatcga tgag 
tttgaagagg 
tcctactaac 
tcacgtactc 
cttgtatgct 
cagcattgct 
tctctaagaa 
taactcggcc 
tcattgcact 



ctgatcgcgt 
gccagccgtg 
gcaaaatttg 
agaaccttcc 
aacactattt 
gtgcccgatt 
ttcttccccc 
gtcttcttct 
tatatgtctc 
cctgaccttt 
atacctatca 
agactctttt 
ggacaatttc 
tgctagtgca 
tataacatta 
taaacattca 
cactagttga 
taacgttgta 
aacatcttca 
atccatcttt 
actttctatt 
cagggcagat 
tttaaagtgg 
aa tggacaat 
gtctgatgat 
gacaattact 
tcctctattt 
actggctgtg 
aaattgttcc 
atatgttcct 
tgcgggcgag 
agtcctaccg 
gtaacgttta 
tttaggtccc 
gtttgtgttg 
cgaacacaat 
tgaatctata 
aagcataacg 
agctgaagcc 
aacaggcgat 
cttggcgatc 
ctctagtgac 
ggtgacagaa 
tgctgtatta 
accccatccg 
tttggaaggt 
gctgcttatg 
ctttttctgt 
atcatgtgcg 
cattacttca 
aatctctgct 
tcgtataatc 
tacttgtcta 
ggaggcttta 
caattgtcct 
gctgtgctca 
cgtgtaggct 
ctggaatggc 
aatacggaac 
cttattattg 
cttgaaccta 



acatctaacc 
ttagtcatct 
ggtaaacctt 
gcacctatta 
tctttagtga 
gtgttatttt 
cactcccttt 
tcttcctccc 
ctaaaattct 
aagaaacttg 
tctctggcca 
ttaggtgaga 
ttagccaaac 
gcattatgta 
gttgccgtca 
ctgataggaa 
tgtgcatctt 
gcattttctt 
tcaaagtcca 
ctattcctag 
ttatttgtct 
gctgcatcac 
actaaatgca 
atttgattat 
gtgattttat 
tcagtgaaga 
gtcctcatca 
acaatctgta 
ctcataactg 
gcctttgaat 
tctgtagaga 
atagtctgta 
gcgtacatgg 
agcattataa 
gcagttttag 
ttcctgtttg 
gggttgcctg 
tgagctgggg 
atcatgactg 
gaatcgaatg 
gtttgtaaga 
atgaccatct 
gttgccttga 
atgaagtatt 
tcttcttcat 
ttttcctttt 
tcaacttccg 
gagtaggttt 
acatccctat 
tataaatcat 
actaattttt 
tttgtgaagg 
gcgtttaaac 
caattagtag 
ctatttctta 
gccatagcat 
ggcgcacttt 
gtagcgctgg 
gcatcagctt 
gatgatttca 
ttaataaggt 



ttagcaataa 
acggaatggt 
caaactcgcc 
aatttccaac 
gtgttagttt 
tatcattgat 
ctgcttgttc 
cgccaaacct 
tgtccaataa 
catccatcat 
tgatcgcttt 
aaagttggtt 
ccagaagatg 
tagtaaaaag 
ctacaggtat 
tcactccctc 
ttatggctct 
cttgcaaagg 
tgccgttatt 
gaccagaagg 
tggaagtaac 
gcactcgctt 
gacacaaatt 
gaggaagtac 
tttgggcacc 
taccattttc 
tcatctttga 
cgatattccc 
cggcagcaat 
ctttgatcct 
atatgaggaa 
ggaaagcaat 
agttattaga 
cgtcgacttc 
tgttgttttt 
ttatttcagc 
ctagcgccat 
aagttcttct 
cacggcacaa 
tccccacgtc 
tgcgtgtagt 
tttttgattc 
tattcaatag 
gtctcctcaa 
tagcaccgtc 
tcttgatagc 
gagacgttat 
ttatagcatc 
cgcgcacaac 
catcatttgc 
caacatcttc 
cccttctgac 
agtacgaagc 
tagagcagtc 
aaagtttaat 
ctggagtagc 
tccttagtat 
ccataaccgt 
gagataatag 
tgcgcttcac 
ctgacactga 



ccagaaatac 
gtctttgtct 
tgtgcagtta 
cataaaagag 
ctttctattg 
gacagaaatg 
ctcctcctcg 
ctccctcgtt 
ttttgtaatg 
gctgttcatt 
gacgctcaag 
ttggaactgt 
agaacctccg 
caaaccagga 
gagatttgtc 
tttctctcct 
atcataatca 
taaacaccct 
gttattatat 
gagcattaga 
ttcgttattt 
ctcgcgtggg 
agaaaaaacg 
agacgtgaaa 
tataactact 
taacgtcaaa 
aaaatgttcc 
catagtcttt 
atctatggaa 
cttatccaaa 
ctgagggact 
ggcgttataa 
agagtcatgt 
tttcgtgttg 
aaccagaaac 
gttactatcc 
gagcggttcc 
attggtttta 
aagtctatag 
tacgatgccg 
caaattcttg 
gtttgttgtc 
tgcaccttta 
cttttcaaaa 
gtcataatct 
atcgtataaa 
tattcctcct 
ttgtactata 
caccacctta 
agcacgtttc 
ctcttccaat 
atgtcctcct 
caattcgtct 
gactccagcc 
aacctgttcc 
actggacact 
gtggtcaatc 
gaacgtatca 
tttcaagtag 
tttaaagttt 

agggtttgcc 



gtgctttttt 
ttagatgagg 
tcaaagcata 
tctatgtgtt 
cacagtatta 
tcgtaagcac 
tcttcaaatt 
tcttcctctt 
aaaacacact 
ctcatatcac 
aatatatcgt 
ctaaaattaa 
tacccttctc 
ttttctttag 
atatctaatt 
tgaccaacaa 
tgggggaagt 
tcgtcttcat 
gccattatat 
gcagttaatg 
ttcgtattca 
agaagattgt 
aa ttgagatg 
gaaagagcat 
agcgctatgt 
ccttctacga 
atcaagtgta 
aggactaacc 
ttattactgt 
ctatcaaaaa 
gaaagttgac 
ggtacatcta 
aaagagagac 
t tgaatgcac 
aatcccctat 
tcccctattg 
gtgtcaggga 
ttctt taagg 
cctgagaata 
tattttgcgt 
agaaaagaat 
cttaaacgct 
catgctgcca 
cactcgtcgt 
tcgt taaaat 
tcttgagcaa 
cctccattag 
cttcttagtg 
aaacgcctgt 
ctctttctgg 
attcctcctt 
actttttcag 
a tagcgtatt 
atgctaatac 
agaaacactg 
tctttgaagg 
atgtgcaaat 
tactttaaga 
ttggacattg 
ctgtcgtcac 
cttttttgta 



9300 

9360 

9420 

9480 

9540 

9600 

9660 

9720 

9780 

9840 

9900 

9960 

10020 

10080 

10140 

10200 

10260 

10320 

10380 

10440 

10500 

10560 

10620 

10680 

10740 

10800 

10860 

10920 

10980 

11040 

11100 

11160 

11220 

11280 

11340 

11400 

11460 

11520 

11580 

11640 

11700 

11760 

11820 

11880 

11940 

12000 

12060 

12120 

12180 

12240 

12300 

12360 

12420 

12480 

12540 

12600 

12660 

12720 

12780 

12840 

12900 
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tgttgcgcaa 
agtcactaat 
caatgggcat 
attccagatt 
agaaatctcc 
tgaaaaatct 
tcatggaccc 
gtagttttct 
gcatgccctg 
ggtgcctttt 
aaattgatgg 
tcatgacgaa 
atcctccact 
ctataatgga 
gcactgctag 
catcggttac 
atgaaggttc 
aaagtttgtg 
gttgccttaa 
cggatcaaaa 
ttcccatgct 
tctataagcc 
cttcttatgc 
tccttcgggg 
taccgggcca 
gtcaatatcc 
gggagttaca 
gtggtctact 
ggggaaagcg 
ttttcctttt 
tccgtatgaa 
gatatgtctt 
tatttcactt 
tctcctctca 
acctgtagaa 
gggtttattt 
cgaaaccatg 
ttgaggtatg 
gaatggccta 
taacaatgaa 
gctaggtatt 
cgtgatgatg 
tttaggtctt 
catggcagga 
tgagtcttgg 
aaacgaaggg 
tctgaatttc 
cttattatta 
aattttgaat 
gaattctgta 
agatgaagga 
tgttcctccc 
atctgtcaga 
ggttggggat 
atcttctcca 
aactgctctc 
gttggaacag 
tgatgtagat 
gatatcatgt 
cttcaaattg 
agatttttcg 



aactctagta 
attttgagtc 
aaccttgttg 
tgtgcgtaaa 
tattgatctt 
aatatccacc 
acttttcccg 
ccatgaaaat 
ttgacagctg 
ccacacgcca 
ccctacgtgt 
ggctgccaat 
cgttttcttg 
catgttcttc 
tgcctgcaca 
atttttgcag 
tgggtgggag 
agccatggta 
gaagactctt 
atttggtata 
aaagagaacc 
tcagaagtat 
ccccactcga 
gttgaggaaa 
ctcttggggt 
actctgtact 
ttggagcaat 
gaaaaaatat 
tactgcaaag 
aggtaatcta 
cgcattaata 
agaaattctt 
gtcctcatcc 
tcgttaaaat 
ttgtactctg 
tgactcaaaa 
cgactatttc 
tttccggcca 
tgctgccccg 
gacgggcgga 
ggtacaaaga 
ggcgcttggg 
tgctggtgaa 
acagtcccca 
ggattattaa 
tctatgt tgg 
atatcctttg 
cagaaacagg 
tccgccaagg 
tgattccgta 
ggtggtggtc 
gaacctccga 
acgactatag 
gtttctatgt 
gatttatgtg 
gaggtataac 
tacccgccag 
ggaggaggag 
acaccaaata 
accatccttt 
atggtggcct 



tcttcttcct 
ttgtggaact 
gaatcgaggt 
tcagagtacg 
gaagaacgct 
gagtctttag 
tcgctcttga 
ttaggatcac 
ttgaaaagag 
ctgttgatgg 
ttttgtacat 
tctcttgttt 
tccactttaa 
ttccaccctc 
gaattaatgt 
aagcccaccg 
^gggtgccaa 
attcaaccag 
gtgtactttt 
ctggttagct 

^ggsgttacg 

ccacaccgtc 
cgagtgactt 
tgtttgacgc 
tatgatccaa 
cggatgagga 
caaaatagct 
cgtcggtagc 
gattggacac 
aatcaaacat 
acccctcttt 
tcctcttcat 
ctctcgcgtg 
tacgttcaaa 
gagctggagt 
gtgtagagaa 
gcaaacattc 
aaacattagc 
aggaaggttt 
tgattacatg 
gcacattctt 
ccagtgcaat 
tctggtcccc 
ctaacggtgt 
cgcccggtac 
gcatcataac 
ttacatcttg 
tgatggggtt 
aagaagattg 
tttgttgttt 
ctttgtgggt 
aatatgttgg 
atgtgtgcac 
ggtcaccggg 
tgataaatgt 
acacagatgt 
tattgtaggc 
gaggagcgtc 
aagaatcgta 
gtctaaactt 
cactcaatct 



cccaagtctt 
cgtcacataa 
ctgttcctcc 
tcataaataa 
tactatgaaa 
cagacgatac 
aagacgttgg 
gaagacctct 
gcttagagca 
ccgtaattac 
ctaagagtgc 
ttcctacacc 
aattcctttc 
tccctttgta 
ccatcggccg 
accggaatgt 
tctctttagc 
aaaatcaacc 
tttacgatac 
gctatttttg 
tacagccata 
cctgaaccac 
cttgaataga 
atcaaacgct 
acgaaacgat 
agatgatgtt 
ggggattgtt 
tccttcttcg 
tttcaatgta 
atctcgcgct 
caatacaaaa 
taaactatcc 
tttctttatt 
aaagtccctt 
acctccaggg 
taaaagttca 
tgtggccaac 
tgctggctca 
gttcaggacg 
gggaatattt 
agaagtattt 
tttcttggca 
tcttttagat 
cctgtcatcg 
acacttcatt 
acaggggttt 
agaaagaagg 
tttgtataaa 
acgcaatttg 
ctttaataca 
tgaaggattg 
gtatttaggt 
agaacactct 
cacgtttcct 
tcctccagca 
gcaagtttgt 
caaaatatcc 
accacttacg 
caattcagta 
tttccccgat 
atctatatgg 



tgccagttcc 
taatgctgtg 
tgttataaat 
aatggtgggc 
ttccttaaca 
agaattttcc 
tttattagca 
agaatttgta 
cgtcaaccat 
cttgtccgaa 
cttttcttgt 
agtaccatct 
gaccttatct 
gaattgaatc 
aggaagaaat 
ttcactctca 
atcttcttcc 
aatatatgtt 
ttgaaacgag 
gcaaaataat 
ctagtctcta 
gaagtttctt 
gagaggggtg 
agacgagggt 
tgcaaacccc 
ccgaacaatt 
tcgctgtcaa 
ggaagaacag 
ctggaagttt 
agcacagtag 
gtatcctcag 
caagaacttg 
acgtattctg 
atttcttcca 
ggagttctga 
gatccgccgt 
atactatgtt 
atgtaaaagg 
ggagctaaac 
tctagaggag 
gttgcatcca 
gtagtttcgt 
tggtcggcca 
cctattattg 
gtgctcacat 
tccatacatt 
tcgttattta 
aattcaggat 
gagatgtaag 
tcataataga 
aaagcgcaag 
ctgcattgcc 
ttaactacac 
gaccaacaat 
acaaatggtg 
ccctctttca 
acatcttcat 
cctgatgcac 
tcgtcttttt 
acagcatcta 
tgttcaggca 



tttctaaaag 
acaaagttgt 
ttcagtacaa 
gtttctgtta 
tccctgagag 
acatctactg 
caaggcactt 
accacttctc 
ataaaaaagg 
tcttgaccaa 
agtatgacac 
ccgatgataa 
ttcacaaggt 
agggctagac 
ttggggtagg 
taaagagcga 
tccagaaata 
gtaacaagaa 
tacgcctgca 
agtttttatt 
caagtaatcg 
gcacgtccgc 
ttattgtagc 
ctgactgtac 
acgcatcgtc 
tttcacttcc 
atatcctatt 
agtaggatgt 
ttgatgcagg 
cagcataagg 
atttttttat 
atttgagata 
tctgagacaa 
gcctttggtc 
tataagagga 
gaacactacc 
catagggttt 
gtgcagtaca 
aacttctatg 
gtaaaacagt 
ggtttaaagg 
cgatacttga 
agaaaattcc 
catgcctgtt 
caaacgtaac 
catacatacc 
gtacatcctt 
cgttgcattt 
aagagttgcg 
ttggtgtcga 
cagtcatcga 
agccccca tc 
gttctccagt 
aaggtttagg 
tcatcacacg 
aaggaactgg 
catgtccttc 
tatggtgatg 
ctatattatc 
gatatttgtt 
cgagaagaga 



12960 
13020 
13080 
13140 
13200 
13260 
13320 
13380 
13440 
13500 
13560 
13620 
13680 
13740 
13800 
13860 
13920 
13980 
14040 
14100 
14160 
14220 
14280 
14340 
14400 
14460 
14520 
14580 
14640 
14700 
14760 
14820 
14880 
14940 
15000 
15060 
15120 
15180 
15240 
15300 
15360 
15420 
15480 
15540 
15600 
15660 
15720 
15780 
15840 
15900 
15960 
16020 
16080 
16140 
16200 
16260 
16320 
16380 
16440 
16500 
16560 
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tgaggatcta 
tgacggaacg 
gtgaggtggg 
ttcttctagt 
taattcgtgt 
agcgcagact 
cgcggtgaag 
catcgttcct 
tttagggtct 
gggcgcgcct 
tcttgttaac 
tgactctttt 
tccctttaac 
aattaacaga 
tcgctattct 
aaattggttg 
agacgccatc 
gaatgcaatg 
atcagcgggc 
ggtaagaaga 
tttctgtaaa 
ctggacgatc 
tgaaactaat 
aaaaaaccaa 
tgccatcaaa 
taaaacatgt 
gaatttgtcc 
agcaggcata 
tttaataggg 
tgctcttttt 
agaagtgatg 
ctacaagaac 
tacaatttat 
acctctccat 
ggatatgggt 
acatagaaga 
actaagaggt 
agaagaggaa 
agaagatgat 
gggtgaagat 
ctccactcaa 
gggaaaaagg 
cgatgaccgc 
atatgttaaa 
tacaaatcta 
cgcaaatagg 
ctatgaaaga 
tgagcgggca 
agactctaac 
ttcattatca 
attcacaagt 
tcttggaaga 
tctagtggac 
agaaaagggt 
attgaacgtg 
agaacctgga 
catggaaaag 
ttgcaagatg 
tattgaccct 
aaaatacaag 
taaatgggtg 



tttgcagcta 
ttccatttat 
taattgcttg 
gtcttttcag 
atagcttgaa 
agcgcaaaaa 
cgcatacttc 
gatattacaa 
tataagacgc 
accaacgctg 
cctggagctc 
gtgtacaact 
gagccagctc 
tctcgttttg 
ggaggaagta 
gtcatactta 
gaacgcattc 
aggagacttc 
atgattcgct 
aggcaactaa 
gaatcaactc 
gttgaagctg 
gccatccctg 
cgcgacacaa 
gaacatgaaa 
ctaacagagg 
ccggcccatc 
tgggttatta 
tacgaagctc 
atatctcaga 
aaagaaagag 
aatactaacg 
ggtcaaatgt 
aaatcgagga 
ttggcaaaga 
aaaatgcaag 
gaagaagaag 
gaagaagctg 
ggtgaggagg 
atgaatgggg 
caacctcccc 
ttaaaggaaa 
caagtgaatg 
ggtggaaaat 
ctagattcag 
cgatcagaaa 
aacgtatcaa 
agagaaaggg 
aaccctgaag 
ggggtagaag 
cttgtcacca 
tttatcgtat 
ggagatatgc 
acagtgaaaa 
gccactgctc 
atcaagcaaa 
ggtcgaggag 
ctgaagtgtt 
gcgtcacgtg 
aacattgact 
ggaactggag 



aatcatgact 
tccctttaac 
catcttccac 
ggttagctcc 
aattaacaga 
tacagcctaa 
ggttggtagg 
tggatatctt 
agactgtgaa 
attttacacg 
cttccgatag 
acaaggatgc 
ttcatgagca 
tgtgtaaccc 
ctccccagaa 
gaaatgctgc 
ctgatctaag 
gtgtatggcg 
acgaaccctt 
aaggaagtta 
ttctggctga 
tcatccggta 
aagactct tc 
atcaaggaga 
gcataggaga 
agcaacgaga 
taacaaatct 
tgcagagtct 
aacgactttt 
ggggaattgg 
ctgcaaatgc 
agtctaataa 
ctgattgtaa 
tgaacatgca 
tacgtgtagc 
aaaaggcagc 
aagaagaaga 
gaacatcagg 
aggaggagga 
aaaataactc 
aaaagcgtca 
gagataata t 
tggagagcat 
gtggaga taa 
tgttccattt 
ctggcgtata 
aagacagtag 
atgcttcctg 
atgtggaaca 
attctctcgc 
atccaattt t 
taaataacat 
ctatgcttct 
acacgagaaa 
aatctgcagg 
tgttactccc 
ggagatcggc 
tattcttcaa 
ccaccttatt 
gggtaaagga 
aatacactaa 



ttcgaaccga 
agtcaaattc 
cccg tcatgc 
ttgaactgct 
cattggtttg 
cacgaacacg 
gtaaagtaac 
cctagataaa 
aaataattct 
cacggtatca 
agaaaagtta 
agtggtgacc 
catcatgaac 
tgttaaatcg 
cttaattttc 
tcgttttggt 
agaaggtggt 
agcctttaac 
ttcggttggc 
tgagcgaaat 
tatttcaaga 
caagaataga 
tataaacttg 
tgattctacg 
aaaaaggaaa 
aatgattttt 
ggccgatgcg 
tttaaagcaa 
aatgttcaag 
tgacgtgttt 
aaaaataaga 
tttgggcaat 
tgcaattcct 
agacattgaa 
ggaagaaaat 
cagggagcga 
ggaggaggag 
tgttaatggt 
agaggaagaa 
aagaaaacgt 
gcgaggtaaa 
tggaggattt 
acaaaaactt 
tggactacca 
cagaaagggg 
caccacaaag 
taataataca 
tgccgaatct 
acgtatgcgg 
tatcgaaaga 
caatgccgtg 
tgtaaaattt 
agactcaaga 
atttttcaaa 
acaccaatac 
tgat tgtata 
ccttcacaga 
tattgaccct 
taggcttgat 
tttactagac 
cattggacgt 



gaattccttt 
atcatgtcca 
ctacctctgg 
ttcaatttac 
gctacattat 
gtcacgaata 
ggatctcttc 
tctcatacag 
acgttaaaaa 
ggagtagctt 
gttttggcct 
gctgaggctc 
agacttgaaa 
gctggagaag 
ccgattggag 
acagtggcag 
acaagtaaac 
tggatagcgg 
tgtgctctat 
gtactatttc 

catgctcgaa 
gagtgggagg 
ttagaaaaga 
aaacatatcc 
aagggagttg 
atactggcta 
atcaatttct 
ttgtatatgc 
ctgaatggtg 
gacatggttt 
tattcccaat 
ctatctatta 
aagaccatac 
agagtagcta 
gcagctgcga 
gaagagggag 
tctggttatg 
gaggatgaag 
aaaaacactg 
aatgcgccca 
ttattagcca 
ttgacggcca 
gaattgttaa 
tctattctta 
gcaaactgta 
ccccattcat 
aacaaacgcc 
gaattgatca 
gtacttcaaa 
ttaggtgctg 
atgaatatga 
ggcaagacaa 
ccaaatatga 
atgaacgctg 
atgaaactga 
cagaaatgtg 
tcaaatgctg 
gatctttgta 
cctgtaatga 
gactcaaacg 



ccttcgtttc 
ggagagcggg 
cttctaggta 
ttttcatccc 
aggccattgc 
acgttatttg 
gtcgttcatc 
gtgccaccct 
agcttaaaat 
cttctctcta 
cttcttattc 
ccaagtggtg 
aagctggtct 
tatgcggatt 
catcagagat 
catcggccaa 
atgtggcaaa 
aagcctccag 
acgatcacga 
ttggtgacca 
gttctgattt 
caatcagtaa 
atgttctagt 
ccttagaagc 
tagagtttat 
gaggaaaagg 
ataatgccaa 
ctatactcca 
ctgcattact 
tgtttaacct 
ctcgagacgc 
tcgatatcgc 
atattggagt 
aacacatagt 
gacgtctttt 
gacagagaag 
aagaaatgga 
atcaagaaga 
aagatagtga 
gtaatacttc 
tttcaaccaa 
caattcaaaa 
ggcaaagaaa 
ttgaaaaggt 
acagcataca 
tttgtgatta 
ctgaatgtat 
cttgtcctgt 
tggaccctcc 
atgagatatt 
aaaaaggaga 
ctattgcttg 
aaaaccttct 
ctgcagcaga 
gtcattgtcc 
aatccatcgc 
accatgcctt 
cagatacatt 
gggataggaa 
agggaacaaa 
tggccgcccc 



16620 
16680 
16740 
16800 
16860 
16920 
16980 
17040 
17100 
17160 
17220 
17280 
17340 
17400 
17460 
17520 
17580 
17640 
17700 
17760 
17820 
17880 
17940 
18000 
18060 
18120 
18180 
18240 
18300 
18360 
18420 
18480 
18540 
18600 
18660 
18720 
18780 
18840 
18900 
18960 
19020 
19080 
19140 
19200 
19260 
19320 
19380 
19440 
19500 
19560 
19620 
19680 
19740 
19800 
19860 
19920 
19980 
20040 
20100 
20160 
20220 
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tgttgatttt 
ttcacggaaa 
cgagagcagg 
gttggaagtc 
gggaggagaa 
agacgttcaa 
agaacaacat 
tgcttcatta 
ttctgaggag 
attattggat 
cagtgtcacc 
ttactgacac 
atccaagcgc 
gcaacccact 
cactaaaatt 
tgctggctct 
ccaataaaag 
tactgaataa 
actacaaggc 
tgacagtact 
atgtttctgc 
aaaagaaagt 
gtaatttact 
aacttgaacg 
tgacaagtga 
ttgacgagga 
ctagaaagaa 
aagaagatga 
atgcaatcat 
acattaaaaa 
aaaaatacgt 
ccactgaatt 
gtcagtccac 
atttgatgca 
tcgtgctgtt 
aaaaatccga 
aagagacggc 
gtgctgctac 
ttagagatat 
ggaagacatc 
tgatagattt 
tcttggaaag 
tgcaactctc 
acaacaacac 
gagtacgttg 
cgatgactaa 
gccagagcat 
aattgaggct 
atgcaatata 
gtagacgtaa 
acctgatggt 
gaataaaaat 
cagtcagttc 
aagacgacga 
aactacttct 
tgtctgtaga 
ttcttgcaaa 
tcgccgcact 
tgggagaaaa 
atggacgatc 
atggctaaga 



tacacaattt 
ccaaatgacg 
gacgcttcat 
aacggagacg 
gaagaagaag 
gtacaggaca 
caacagggtg 
ccagctgata 
ggggaaaaaa 
ttaatagata 
attcaagaat 
gtggtctcat 
tatcccagct 
ctttaaattg 
caaaatgtat 
cgattcgttc 
tgaaaacaaa 
tgttggattc 
tgcaaacttg 
aaaaaataac 
ctctgcacaa 
tggagtttgt 
ccaccatcag 
cgcacaaaat 
ggaggaagaa 
agatttctat 
gaagaaaatg 
agaagaggca 
tttccccaat 
gcggtcacgg 
tggaaattgt 
tacttcccta 
cagaggggct 
ggtaaacagc 
tgataagaaa 
gtattttgat 
aaaaataata 
cccaaaaatt 
tgttggattt 
tatttctgca 
aatgataact 
aggagtggtg 
cacctccata 
tggtagcaat 
tggattgagt 
ttcttcttcc 
agatctttcg 
taatggatta 
cgactctggc 
aaggaaaaga 
gagtcagggc 
tgggggcggg 
cagaagaaga 
ctgcgaagaa 
attccaacaa 
agaatataac 
ggatcttctc 
caccatatcc 
tggacgtttc 
tccagcgcgc 
aggagggaag 



tgaaatacac 
tgtattacag 
gtgaaagcta 
gacgtgtccc 
aggaaataac 
gtaatctatt 
gagaagaaga 
acgattcctc 
gtccagaacc 
gcgatggaag 
gaaaatatta 
ttggtcaaca 
aacgaactga 
gaggaccaca 
atagatgata 
gtttcgcttg 
ttctgcgtaa 
cccgctggat 
agcggtaaaa 
accaataaaa 
caacttgacg 
tatgatataa 
aatataatag 
aaaaaagtca 
gaggaagatg 
gaggatggtt 
gaagaagatg 
gaaacttgtg 
ggacaaaatt 
aggaaggggg 
aagagcctag 
tttatgaagg 
agccgtataa 
ccttcgagtt 
actaaggaag 
gtcgtatctg 
aaacatttgg 
cccctcaatt 
aaaataccag 
atggggctcc 
aatggcgcct 
cccagagatg 
tttagaagta 
tcttctagtt 
ataagcaagc 
ccgtcatcct 
ataacgacaa 
cagaaaaaca 
gcattcaagg 
aggactttat 
aatgacgtca 
gcttctaggg 
gtaaggaaca 
gaaggcgact 
cggcaacaag 
aacagagtaa 
tctactgtat 
cggcactctc 
ttcctataag 
cacctacaag 
gtacgtccga 



aatgattgat 
tactattgaa 
ccgcccgacc 
ttacccttcc 
aggcgttatt 
cgatgtggaa 
ggaacttcct 
ctcacccgcg 
gtacaatatt 
aaacaataac 
ccagaatagc 
tttctggagc 
atagatactg 
taacgaggga 
aaaatggact 
catctttttc 
aaattcccca 
taagtgggcc 
gcggtataga 
gagcaactga 
attctgaaat 
atgtgtccag 
gacaacatct 
aggaagagga 
aatatgaaga 
acgatgaaga 
aagaggatga 
gtgctaatgg 
caaaaaggaa 
agtgctctgc 
gtataaagcc 
ggagtgaagc 
ggtcactact 
ggaaatgggc 
aagttgaggt 
aactccctag 
ctctgattga 
tgattaaaac 
aagaagtgct 
tctctgtaca 
ttgcgacgtc 
ggagtaacac 
ttcgcaacag 
cttcttgtgg 
gtggtataac 
cggccatgat 
tcatccaaga 
tgtctgacaa 
cgctcctaac 
tggcatctgg 
atgatgccca 
tgtataaaag 
aaccccagtt 
tttcttccga 
atagctgcac 
gcggttcttc 
ctccgaacga 
ttttcaacgc 
agcactattg 
gatcgtatgg 
tcgttggcca 



gatggtgtaa 
agagcggatg 
ttgtttgacg 
agtgaaaccg 
gacgacagta 
ttattcgata 
tcagcaatat 
catattccct 
tttgattctg 
cctaaaagag 
gtctaattac 
tcctctaact 
gaccaaaact 
tgaagatact 
atatcagtct 
ccatggagct 
cgacactcgc 
ttttaagaga 
tggcctttca 
tattttacat 
gtctcgtact 
ttcaaggcaa 
gatcgaattt 
aaatggtgag 
aggaggttgc 
agagggcgat 
agaagaagaa 
tgttattgat 
gaaaaatggt 
taacacttta 
agtagggtgt 
tgacagctgt 
caataaatac 
taaccctccc 
taagtttgaa 
taatattaaa 
agactttctt 
tatgacgagc 
cagttttatt 
atttgatcgt 
atgcttgaac 
gtggctccac 
aggagtgaat 
agggaataag 
cctgaaacca 
ctcattgcct 
tttctcagaa 
gagcaaagac 
atgcacagtc 
agagggtgtg 
ccagttccag 
agcccagcgc 
tactatagca 
gttaaatcca 
agaagacgac 
caccacagcc 
aaagaggaac 
tctatctgca 
actaccacaa 
agacggaatt 
ccatggacga 



taagtgttcc 
acttactcac 
ccagagcggt 
tagaagactt 
ctgaaataga 
tccctgaaat 
ctgaagtgtt 
catttgggaa 
ccctcgacca 
tcgactggaa 
gtccgagcct 
gctgagaaga 
aacgtgttat 
ggtacaataa 
gccgttttaa 
gatttggttt 
gctgaatctt 
tggagcatta 
ggttccatgt 
ttggtgaata 
tttaaccacc 
gtgaaccagc 
agaaccaagc 
catgaagaaa 
ctatcagata 
gacaatagaa 
tatgacgatg 
tgtgaagacg 
aaaaaaacaa 
tcctttgtgg 
ccgcccccat 
tataatactt 
tctgttaaag 
gaccgacggt 
attgaatgtg 
gtatggttaa 
ccagctatgg 
attttctctg 
cctatagaat 
ataatagaag 
aacgcattct 
acggaccttg 
attggcggta 
ggcgattatg 
ccacctgcag 
cagcccacgc 
gtttctggga 
gtgtttaatg 
aacgataaaa 
gtacgaagaa 
gaagaatgcg 
cgaggttcgg 
gtctctgacg 
acgcactctc 
gacgttttag 
ggagacagag 
tctgccgccc 
aaaacaaagt 
cattgaagat 
ggtcctcgag 
attggaggta 



20280 
20340 
20400 
20460 
20520 
20580 
20640 
20700 
20760 
20820 
20880 
20940 
21000 
21060 
21120 
21180 
21240 
21300 
21360 
21420 
21480 
21540 
21600 
21660 
21720 
21780 
21840 
21900 
21960 
22020 
22080 
22140 
22200 
22260 
22320 
22380 
22440 
22500 
22560 
22620 
22680 
22740 
22800 
22860 
22920 
22980 
23040 
23100 
23160 
23220 
23280 
23340 
23400 
23460 
23520 
23580 
23640 
23700 
23760 
23820 
23880 
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cctgaagaac 
ccaccaaaac 
gcaccatccc 
acctaccctc 
gattatcctc 
tgtaccacac 
aatctgaaga 
ggttgtacaa 
cattcttttg 
gtatttttgt 
aaagtttttg 
ggtggtcggg 
gtcatttctg 
ctagctcctc 
ataaggccta 
gtttttgaga 
gtcagccg tg 
agtcatttct 
tctagcccct 
cataaggcct 
gtttttgaga 
aacccagaaa 
ttgtaccaaa 
ctcaataacg 
ttttggctta 
tttgtcacac 
atagaa tgtt 
aacactgtt t 
tccacagtcc 
gcaattcgac 
ggggaagaaa 
caggcggttc 
attggggcgg 
ttcaaagact 
ggttactagc 
ctcccaca tc 
tacattattt 
gcggtccacc 
tgacccagaa 
gaatacaggc 
tggtatacat 
tcggaact tg 
aattt t tgaa 
tccagaca tc 
tttctcccac 
tccacca tea 
actccttcca 
aaca tgacga 
ctgcaaactg 
tactacgcta 
ccctctcaat 
ctgaagaaaa 
gcgatggcat 
aaggaggaac 
agacattgat 
atatggagga 
aggtg tc tgt 
taccaaaaaa 
tatgtgtgtg 
aaaatttttc 
ttccgagggt 



cagccacttg 
gcaagtcaat 
ccgtccgttt 
cctacgacga 
caccatacga 
gtcaagtcct 
gggagaagaa 
gaagaaaata 
cttgctgagt 
gtattcttta 
agatggagat 
tccagaaacg 
gaaagggttg 
tctgtaatct 
gtcagcggtc 
tggagatgaa 
ttccagaaac 
ggaaagggtt 
ctctgtaatc 
agtcagcggt 
tggagatgaa 
catatatcaa 
aactcagaaa 
tacgacactg 
gatgcattat 
atccacaaag 
acacccttgt 
ccgaagattt 
ttgatctcct 
ttgatctgga 
tacgccaatt 
tccataatct 
gtaggaactt 
gaggccattt 
aaggagctca 
tttcataccg 
ctgttataca 
cttcgaactt 
aaatttttga 
agttctgcct 
ttctagcccc 
acattcgatc 
aagtttttga 
tctagagtgg 
tgacgcagac 
ctgccgaata 
gaaatttgct 
ccctttacgg 
tgtttcgagg 
cgggaaccca 
ggatcaaggt 
ccccatcgac 
gacccacctt 
aagaacgtgg 
gatgtcccat 
aggatttgtg 
tttgtgatat 
tttattatga 
ttcatgctag 
tgggtcgctc 
ggaccgctgg 



ctacacttgc 
attcagagag 
agaagagctt 
cggatcttct 
cccatcagaa 
cagttctatg 
ggccctgggt 
ggattgattg 
gtacatagct 
ttactgccta 
gaagggaaaa 
tctgtctgtt 
taatttatta 
gacgttgggt 
caccccctaa 
gggtaaaaag 
gtctgtctgt 
gcaatttatt 
tgacgttggg 
ccacccccta 
gggtaaaagg 
aagggatgga 
tctcatcaca 
gacctgggaa 
caacaaacac 
aggccatatt 
cgatccagtc 
ctttgtcccc 
ccaaccagtc 
aaccaatgtt 
gtgtatctgc 
ccttgccaga 
tgccagtcaa 
ttttggttgt 
cagtcttata 
gggtcggcta 
tttctagccc 
gacattcgat 
aaagtttttg 
tgctccagaa 
ttcgtgtaat 
gactcagcgg 
gatgaagatg 
ctagagccca 
tatataagct 
cactcttgtg 
ccagttatca 
cagattgatt 
aagagaaaag 
tacaagtacc 
gaagtaggaa 
aagtgcggag 
tacaacgacg 
gtaaagaaga 
tctataaaac 
cacaaactcc 
aataaagatt 
tgtatttcat 
cccctctttc 
gagtttagag 
gtcgacccaa 



ggctacacct 
ccttgcgctt 
gtcgacgtgc 
acatcgtctt 
aatccacaga 
acggatcacg 
cttggccgtc 
tatcgatgac 
ttatatttgc 
tttttatggt 
agagggcgct 
ccagaaatgt 
tagttggtat 
cgacccagcg 
actggagtga 
agggttgaaa 
tccagaaatg 
aaagttggta 
tcgacccagc 
aactggagtg 
ctagtaatag 
aacaactatt 
tgacaaatct 
ctcccaccta 
ttttaccaca 
ctccatagaa 
catggcgaca 
atccatgtat 
gttcaaaaaa 
gagtttcttc 
ttcaaatgct 
attggggagt 
gtccatgttg 
tgctgctgag 
tatagactaa 
gaacccagaa 
cttcgtgtaa 
cgactcagcg 
agatgaagat 
atgattctaa 
ctgacattgg 
tccaccccct 
aagggggaaa 
gaaacatccc 
aaccactaag 
ttgctccgca 
agactgagaa 
ttagagatag 
ctggatctgc 
gtcgacccaa 
catgcctccc 
tggcgttcct 
aagagtatat 
accgccaaga 
aatattccaa 
acacatttat 
gatgttttta 
tgggatggct 
acccttcatc 
ggtggaccgc 
tgtcagatta 



ttattagacg 
acccagaact 
cagaaggagc 
cacaagccga 
ggtcccaagt 
ccagggccaa 
gcaacaactt 
gaactttgca 
caataaaaca 
ttaacggaaa 
agttcatata 
gtttagagat 
attatttctg 
gtccaccctt 
gctgaaaaaa 
ggatagctga 
tgtttagaga 
tattatttct 
ggtccaccct 
agctgaaaaa 
aaggttgtgt 
ctttttatac 
gaatgtccca 
catgttttac 
ttggcattca 
tcagatccat 
atagtaaagt 
tgggaaaaag 
tccctaataa 
gtattttctt 
ccacacttcc 
aggccttcga 
gcaaagaga t 
agaggagcaa 
cacccccccc 
acggtttgac 
tctgacattg 
gtccatcccc 
gaagggggaa 
agatttctgt 
gccgacccag 
aaactggagt 
aagagggtgc 
aactagtttc 
catattttgc 
tagcctaaca 
gaaggaagaa 
aaagacatta 
acatgatcga 
tagaacacat 
tctgcgaccc 
gtactccaac 
aaagaagtgc 
atacttcaga 
ttttattttt 
taatatggta 
taaatgaaac 
tttcactcac 
tccatctcaa 
tgggtcggcc 
cagagagggt 



cagggcaccc 
tctccccgat 
gagttttttc 
atgtgaagat 
gtgtgattat 
cctcataaaa 
tagctactag 
tgtttttgac 
gaccccgaat 
aaatttttga 
ctgcccaata 
ttctggacaa 
gtatacattt 
cgaacttgac 
tttttgaaaa 
ggccacctgg 
tttctggaca 
ggtatatatt 
tcgaacttga 
tttttgaaaa 
aactcggcat 
agtagtttta 
ttccgtggcc 
atacagaaaa 
ttctcttcag 
caacaaatat 
cttgtccaaa 
tacagtatgg 
agaaagacag 
cattgttgga 
ctttatggat 
agaacaggat 
cagcagggtc 
gtagtagtgt 
cccaaaaatt 
tcacatctgg 
ggtcgaccca 
taaactggag 
aagggttagt 
gcattatttc 
cggtccaccc 
gacccagaaa 
tagggtgcgc 
tgggacattt 
acacat ttca 
atggccccaa 
gaaagggatg 
atctgcctca 
gtatacaaag 
cgaggattgg 
atggaagaga 
tacaatgaag 
aaaacaat tg 
caagctctag 
ttcaaggagg 
caccctaaaa 
gtattattta 
caatctggcc 
aaat ttttag 
taatgtcaag 
ctagaaatat 



23940 
24000 
24060 
24120 
24180 
24240 
24300 
24360 
24420 
24480 
24540 
24600 
24660 
24720 
24780 
24840 
24900 
24960 
25020 
25080 
25140 
25200 
25260 
25320 
25380 
25440 
25500 
25560 
25620 
25680 
25740 
25800 
25860 
25920 
25980 
26040 
26100 
26160 
26220 
26280 
26340 
26400 
26460 
26520 
26580 
26640 
26700 
26760 
26820 
26880 
26940 
27000 
27060 
27120 
27180 
27240 
27300 
27360 
27420 
27480 
27540 
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gtaccagaaa 
agaaatcttt 
tgttggcggc 
caaaaatttt 
gcctaatgtc 
ggtctagaaa 
aaatggct tg 
cggcccaggc 
cttcatctcc 
ggaccgctgg 
cagattacag 
aacttttcaa 
aacatttagg 
taataaaaaa 
atctattata 
ttttacaaga 
gggaagtacg 
ccaatctgtc 
tggtactact 
cattcttctc 
gttcagccca 
aagaacgagg 
ggagagagaa 
gggcaaactg 
aagtactaga 
cagagacatc 
ccttctagca 
aagagcatca 
attttcatta 
tgtggtaaag 
aggcttcaga 
aatgatacaa 
cggattcttg 
tctaacatag 
ccttttcaca 
caaacagatc 
caacagtatt 
aatattgtcc 
ttgaaaatgc 
ttcagcaaca 
ggaaagaaaa 
attcaatcta 
ggatgatggt 
tatcagtcgt 
agatatgtca 
gatattgacg 
gaaaacaa ta 
caacttct tg 
aaacttgaat 
tctcaaaaat 
gtggatgttt 
agtcattccc 
gataatatct 
aggagaat tt 
gtccagaaat 
tcgtggagct 
gcgtacgccc 
ataggggtgt 
atcgtaccgc 
ccatcatcaa 
gaggagtgaa 



tagtgtacca 
gaacccattt 
tgtcaatagt 
tagaaaattt 
aagttccgag 
tatgtaccag 
tccagaaatc 
agcctcacag 
atctcaaaaa 
ctcggcctaa 
agagggtcta 
ca tgtggcat 
tgagaaagat 
tcatacagtt 
ccctattctt 
cacta t tggt 
ttctctgcat 
ccaacatttt 
gaaatcttct 
atgttcttct 
aggataattt 
tgctttattg 
atctaaagca 
tgttaatgcg 
t t taccatac 
gtgtttgtga 
ttcactatac 
agaaaattgc 
ataaaggaaa 
ggggaagata 
atgttaagtt 
taacacaa ta 
gtaagaaaag 
cctttaactt 
ataagattat 
ttgaagtaat 
gcaccacaca 
a teat ttcgg 
tggattgttt 
gt ttttgtag 
gacttgataa 
gctataga tc 
gatgtttcta 
tttgcctttg 
aaaggt tcaa 
attcttgaaa 
ccacttcttg 
a tcct tctca 
ccagttctgc 
ccatta tcca 
tcgtaaaggc 
tcgcagatgg 
ctaaactct t 
ctggaggtaa 
gacata tatg 
gcttcagcca 
gcttttatac 
gtcccagttt 
ttattggctg 
ta tggaggga 
gaaggacga t 



gttataagaa 
ctggaacaga 
agcctatgta 
ttctgggtcg 
ggtggaccgc 
aaatagtgta 
tttgaaccca 
ttacctgttc 
tttttaaaaa 
tgtcaagttc 
gaaatgtata 
acaccgtatg 
aatacacatg 
tcggactcat 
ctacgtttac 
tcaaccaaca 
acaggacacc 
tcatcaatac 
aaacacactg 
agagcccttt 
gaggatgcaa 
acatcgagac 
gctttcaata 
gtatcattga 
aacatattaa 
ttggcgatgg 
ttctatcgaa 
acgaggttgc 
aggccaagct 
ttgcagcgca 
tttcttgttt 
gagtattccc 
cttcaagatc 
catgagggtc 
cgcacagttt 
tgtaatcaat 
aaccactacc 
cagagaattt 
cagccttctc 
tagtctcaca 
tcacactcct 
tagcgtccat 
ggcaagaaaa 
taacacaagg 
gtggtgcaga 
agagcctgaa 
cagaactagt 
taaccttctc 
tgtcatcacc 
tagagaccag 
ctctgaaggt 
gggaagaatc 
ctttattgtc 
ttatgtcatt 
gttcaatgtt 
tgttgatgat 
aaagatcccg 
tacatccaaa 
ctataaaaga 
gaacatcaat 
agaactggaa 



aatgtgaccc 
cgtttctgga 
ttcaatggtc 
ctcgagttta 
tgggtcgacc 
ccagttataa 
tttctggaac 
acacccttcc 
atttttctgg 
gaagggtggg 
ccagaaatag 
aacgtatagc 
tataaatttc 
ctgtccacac 
aagatacttg 
tctctttaac 
gggtaatttc 
aatgtccaca 
gacaattatc 
taatttcttc 
tataccccaa 
cggtggcagc 
tggtgggtat 
acatcaataa 
tctttgcccc 
caatcattag 
atcgtatatt 
aacagctgca 
gtttgttgtt 
aaccattaga 
gactagtttg 
cttttcatca 
ttccatgaaa 
tgttatcaca 
gtcggtcttt 
ctcttcaaag 
cactgcttcg 
tggcatcgtg 
tataaacaca 
gtactcttct 
gtaagcacta 
gttcatttct 
aggtctcccg 
agattcgtcc 
ttttttcatt 
tctaataaca 
agactttttc 
tctttcttcc 
agtgccaaac 
aagagaatat 
tttgcagacg 
agacttggta 
taatttcttg 
cagaagggcc 
ttcaagtact 
gtctcacata 
tgtaagaaac 
gttaatatat 
gggagcacct 
atttgaacct 
caggaactct 



tttccagaaa 
gcatggtcga 
ttttaccctc 
gagggtggac 
caatgtcaga 
gaaaatgtga 
agacgtttct 
tactagcacc 
ctcactcgag 
ccgctgggta 
tgtccgatta 
caaatatgta 
aaatacttta 
atcatttatt 
ttcatcaggc 
ttgatcagcg 
tccacgttgc 
gcgtgtcgtc 
gctgtcgaaa 
cttatcccca 
gcgacaattc 
aacaaggggt 
actaggctcg 
tgctgcttca 
atgggctcca 
g99cgttgca 
atattcttgt 
gcatggaata 
agaatttcaa 
ggtgacatgc 
ctagtatggg 
aaataagtgt 
cacgagactt 
acgtttttat 
tttcttacca 
ggcctactgt 
tttattgcaa 
tcaaatatac 
tcaacaaact 
accatttctt 
gccctttctt 
tctctgaagt 
ataataaaat 
acaaaatact 
tcagccacgt 
ctcgaacatt 
aggctagcca 
tcagcctgtt 
ttgatgccgt 
tttacgaaca 
gttgtcaatg 
ttgttgttga 
aaactacttg 
aattcccttt 
tcttcaagca 
cgactgttga 
tccctccggt 
ttttttaata 
gctcacttgg 
agtcagggag 
atccattttt 



tggcttgtcc 
cccagatatc 
ttctccatct 
cgctggctcg 
ttacagagag 
ccctttccag 
ggagcatggt 
ctctttcacc 
tttagagggt 
gggcccatgt 
ttaaaaatt t 
ccacagtcaa 
tttgcataca 
aggaactggt 
actgccattc 
tcaggtctta 
gtactcaaca 
ctgatga tat 
cgcacattag 
gaaattatgt 
agtattggcc 
tcttcttctt 
tatcctctct 
gttacact ta 
actagtgcat 
ccatcgctcg 
attttttagc 
cgttttcatt 
tacctactct 
ctttcgcaga 
aatatttgtt 
gtttgagcca 
ttacacacaa 
gtgatacgat 
taatctccct 
taatcatcca 
cattttttag 
tgtaaagggt 
ctactgccat 
cttctatcga 
cgttggtcca 
cagttgacac 
tgccattgga 
tgtatccgaa 
aatcagaggt 
tttcaacgta 
aaacaccgtc 
cctttgaagt 
gcgtctcgcg 
aaaagtcgtc 
cgttgataaa 
taaagaagta 
aaggaacagg 
ctgtgaaaac 
cctgacggta 
gtttatccat 
tcagttcagg 
taacaaaaaa 
acatcattaa 
atcctagaaa 
ggaccccaaa 



27600 
27660 
27720 
27780 
27840 
27900 
27960 
28020 
28080 
28140 
28200 
28260 
28320 
28380 
28440 
28500 
28560 
28620 
28680 
28740 
28800 
28860 
28920 
28980 
29040 
29100 
29160 
29220 
29280 
29340 
29400 
29460 
29520 
29580 
29640 
29700 
29760 
29820 
29880 
29940 
30000 
30060 
30120 
30180 
30240 
30300 
30360 
30420 
30480 
30540 
30600 
30660 
30720 
30780 
30840 
30900 
30960 
31020 
31080 
31140 
31200 
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tgaggttctc 
gaggagttgt 
ccaagaagaa 
ggttatacga 
ttggtgctga 
ccaatatact 
ggaatcctat 
atgtggctaa 
gcgtgccctt 
gtctaaaacc 
ttgaggtgtt 
taatggctcc 
aaccacacag 
ttgttactca 
tatataagag 
actagctctc 
gccaatatgg 
tttgaaagtc 
ggaaagtgtg 
aatgcagaag 
ttattccttc 
tgtacttttc 
ctagccatga 
actgtagaaa 
gtagtaaagg 
aggtgcaagt 
gatttgacat 
aatctttttt 
gtatgtagat 
agaagatatg 
gttacaggtt 
attttatagg 
tgatgtagat 
acctgacggt 
tagagttgca 
ggcaatgaat 
ttcagagcca 
aaactttatc 
tgaagtgaaa 
taacatccca 
gtactctgaa 
attttataaa 
tgaatctgaa 
tgttgtggac 
ctttaaagta 
tatagtggcc 
ctatggcact 
cactgaattt 
ctttatgagt 
caaggcagta 
cagaaatact 
gggcctctcc 
tggaaaaaat 
aggaagaaag 
ttaaagaaaa 
gggaatttat 
ttgaagatac 
caactactcc 
tcgaagaggt 
ggggcaggta 
aaagacccaa 



tcttcgagac 
ggaagaactc 
gatacacatc 
tagagcagag 
atatgatacc 
aaagaccctg 
ggatcttcac 
tggagaattg 
caatattgca 
gggagagttt 
aaagaagcag 
agaaaaaatt 
tggaaacttg 
tttattccta 
cccgtgttag 
tctctccact 
acttgacggc 
tactccccaa 
ggaattttga 
atatggagaa 
aagagtttat 
cccaatacat 
aatggatggc 
atgcggcatc 
tgattgttga 
acccaggccc 
gtgatgagat 
attcaacacc 
gtctactttt 
gattctaata 
ttagggatta 
gtaggtaaac 
aatggggtag 
ggagatggat 
aacgtcatct 
gattcaactg 
ttcagagaag 
ctttcaactc 
aatcaattag 
gagactgatt 
aaggtattag 
ggtaaaaaat 
acaataaaag 
gaacagacct 
aataaggaga 
aatattactc 
agcacagcaa 
gtttgtggtc 
gatactgttt 
gaaattttag 
cagtgagatg 
tagttatggg 
taggggtaaa 
agaggacact 
tgtaaccgaa 
gcatgatttt 
accgggtttt 
aaaatacgta 
ttataatcct 
cgaagaaaat 
taatgtagct 



gacactattc 
t tgtggttca 
tggaacgcta 
ggagatttgg 
tgttcttccg 
agagaaaatc 
cttatggctc 
tcgtgccagt 
tcatactctc 
atcctcactc 
ttgtgccgcg 
gaggacttta 
cagatgaaaa 
gaaatggtgt 
ctcctcgatt 
caagatggcc 
tggcacaaca 
ctctggtatt 
agcaagtgga 
tctcaacagt 
cagcaactgg 
ggacggtggg 
tagggatgtg 
catatggatg 
caatgcatca 
agtgtcatac 
t tctggattc 
catgatttta 
cgatagctcc 
cttctatttt 
taataacggt 
gcaaatatta 
aaggaggagg 
acg tagatct 
ttcaagaagt 
ataagataat 
caaccgtaga 
tagatttaac 
aaaattttga 
atttatggac 
ggttctcaga 
atgaggtaac 
agaaggcagg 
acaaatctt t 
aaagggaagt 
ctctaactac 
aggaaaagac 
tgagtaacaa 
tcccttcatt 
gtaaaagact 
gaaggggtta 
tcta ttgatg 
ataaa tgaag 
ataagaagta 
aaaattagag 
ggaaaaatt t 
gaaagtgctt 
tctatgaaca 
tcacctatca 
gcaggt tcga 
gaaaatctca 



cagttctcac 
tcaggggcaa 
atgggtcgcg 
gacccgtata 
attatactgg 
cagatgatag 
ttcctccatg 
tgtatcagcg 
ttctgactca 
ttggtgacgc 
tccctagacc 
ctatcgacat 
tggctgtttg 
aatcgctgtt 
cagtcacaag 
tttaattttg 
acagacccta 
gaggtgatga 
ggtgctatgt 
ggttcccatg 
attgaagaga 
gatggttcac 
actttctttg 
taccaaaaac 
aacccaatgt 
gttattgaag 
tttgtataat 
gtttttatag 
ttcacctata 
accgccaagc 
agcattaata 
cccttcttca 
aggaacaaca 
ttctcctcaa 
gtcaaaggat 
ggaagaaact 
acgtgaagtg 
agaggaacaa 
ctatgaagac 
aactcatttc 
tataaatagt 
tactggaaat 
aaatagtctc 
cttccctgca 
tactgtatcc 
gccagttgga 
ctatctattt 
gtcaactcct 
tgacgaagca 
aggtgtagga 
ttttggataa 
tgggtacggc 
aaaccacaat 
tagtggctaa 
caatgacaga 
caactggaga 
taaaggcaga 
gtttacgtat 
tgggtattag 
aagaattggt 
ttattagagt 



taccaagaaa 
tacagacgcc 
ggaatttttg 
cggattccaa 
aaagggtatt 
aaggatgatt 
ccacatgact 
aagcggagat 
tctgatggcc 
acacatttat 
attccctaag 
gttttatctt 
aatcatgtta 
gtgggcggag 
agcgcacaca 
aagactctac 
cccgccccaa 
agaggcgtct 
cgtatttctg 
tcaagacaaa 
ctgatcgaca 
gtgggggata 
tgtttgttga 
tactagcaat 
tttctgtatg 
gccatggagt 
aaaaccccat 
tatataaaat 
tagaggaggt 
aaacggccag 
gcttccgttt 
tcctcttcct 
acgacaccaa 
aagaaggctg 
caaggagtgg 
gaggcgagaa 
tttaaggatg 
tttaaggaca 
atgacccgtc 
gatccgaaaa 
atagaaagaa 
gtagctgtcc 
atccgtaatg 
ttcaatcaag 
atcaataatg 
gcagcttccg 
gtaatagaca 
ctcatggctc 
gaaagacctc 
agatacacaa 
gatagaaaca 
tattttgcgc 
ggagaagatt 
tgttatcaaa 
taaggaatta 
tggaggaacc 
atataaaaac 
cgatgcgatt 
agaatacggc 
ttttatgacc 
tgccaaccag 



attttctgga 
aaagaattgg 
gacagtagag 
tggcgtcatt 
gatcaattgg 
atgacggcat 
gctcaatttt 
gtcgggttgg 
agtatggtgg 
aatacccaca 
ttgaggattt 
gaggggtatc 
aggaatttcc 
catatttgtg 
cacgcttata 
aaatctcttt 
tatcatattc 
cgtacggcaa 
gctcgaagat 
ctgcttggca 
tggacagtac 
ttttacttcg 
taggaataat 
tggtgcaaag 
taatgcgtgt 
gggtcattct 
aagaaacaat 
caagaataaa 
ggataaaaac 
ggttaaatct 
catcctttat 
cagaattatc 
ctcaaccttc 
aactaagaac 
cctttagacg 
tcaataactt 
acacagacaa 
ttgttatggc 
tcatctttga 
aatatgacac 
tatcctctac 
tcgttgattt 
tcgagtttat 
ttttcttctc 
gatgtgtagg 
gacactacat 
agtacgatac 
taaatattct 
tgacggatgc 
acgccaacat 
attgcaaaaa 
cgtcaattca 
atgggcacaa 
gagaatactg 
aatgacaata 
ttccatctct 
gttccaggag 
aatggaaaaa 
accattcgca 
aagattgaaa 
cagtataatg 



31260 
31320 
31380 
31440 
31500 
31560 
31620 
31680 
31740 
31800 
31860 
31920 
31980 
32040 
32100 
32160 
32220 
32280 
32340 
32400 
32460 
32520 
32580 
32640 
32700 
32760 
32820 
32880 
32940 
33000 
33060 
33120 
33180 
33240 
33300 
33360 
33420 
33480 
33540 
33600 
33660 
33720 
33780 
33840 
33900 
33960 
34020 
34080 
34140 
34200 
34260 
34320 
34380 
34440 
34500 
34560 
34620 
34680 
34740 
34800 
34860 
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ttatgaggat 
tgtttatacc 
cattcgtaag 
tttattgagg 
atagggaatg 
cacgataagc 
agacaaattg 
aatgtcacaa 
aaaaaaaata 
aatgacatga 
gaaagtataa 
tatactgtat 
gcaacctatt 
gaaataccag 
acctttaata 
gtgacaggtt 
aactgcaaca 
acagaggcat 
tatgaaatat 
gtagttttat 
atcagagcta 
agatgtcctc 
acctgggttc 
agaaggtaca 
tgtggatgcg 
ttcaagtgcg 
aacaccataa 
ttttctttcc 
ccctacagcc 
agggggcaac 
atatgaacta 
aggggcggcc 
ccttttcata 
gaaaaactta 
tgcactcaaa 
tgctgagcag 
ataatcaacc 
cttatgtttc 
attcttccct 
gctgctgccg 
aaaacaaggg 
taaaaggcag 
tttatagatt 
tcaacaatga 
acttaatgag 
gcttcacctt 
ggaacaggtt 
ctctaagagc 
tattatcgag 
aaatattaga 
tactactact 
cttcttcgtc 
aggttgaggt 
agtcaggcaa 
tactaatacc 
caggaagtga 
cctcatcatc 
tattatcacc 
tgggagctac 
tacccattaa 
cctgtctaga 



ggtgtttttt 
atataatgtt 
gaaaattcct 
aaataataat 
tttctgctgc 
taaacacatt 
tggataaaat 
ctaatgggac 
taggtgataa 
agagtctagc 
caggaacaaa 
ttgtcaataa 
ataacacccc 
attcgtacag 
taaaatggag 
atcttgcttc 
actgttttat 
catttaccct 
gtatagtagt 
ttaactctta 
tagtacagaa 
cgtgaagtgt 
aaattaacat 
ggactagcat 
ggagggggca 
ttagacacaa 
tcttccatga 
tcagggtcat 
aagtcaggca 
accttgacag 
ggtttatgga 
ttacagcctc 
ccttcagaca 
taaaaggact 
acatccccta 
gtactttcta 
ttcttacagc 
ttcatcattt 
tttattgtct 
ctgctgctgc 
ggtaacacat 
tagaaaattt 
gggcagggaa 
actgtgtaca 
acaggaacca 
caatttctta 
acctctagat 
agaaagttca 
tctacacata 
catcctgtcc 
acttcctgag 
atcagagtct 
aggggcggat 
agacattgtg 
actaatagta 
ctctgagtca 
ttcaatgaaa 
ttcatcatca 
atacttaagc 
agctgcctca 
tagagtttca 



atagactacg 
cagaaaacaa 
gatgaaccag 
aataataata 
cctacgagag 
cctggacaga 
acgatcccaa 
tgcattttcc 
ccttcagagt 
tactggagcc 
ggactggctc 
ccttgcaaga 
tatatatggc 
caaggtcgaa 
aggcgatacc 
catctctgaa 
agaggcagat 
caacttaccg 
agtttgatgg 
atcttatata 
actggtgctg 
tgcgtctgcc 
ttggtgcatc 
catagtccct 
tattagccgt 
acctctcagt 
gggcgtcacg 
cagagaagag 
caaaaactgg 
gagcaggagt 
actcctggca 
caaaaacctt 
acatcatgaa 
tttcattctt 
tctccccttc 
tagatgctct 
tggcacactt 
ctgatgaaga 
ttcctttctt 
tgctgatgat 
gtaataaaat 
gtacatcatt 
gatatttgat 
tgtgcaatt t 
attagctgct 
gagcacaaac 
atacaatctc 
gccagatacc 
tacctcttga 
tcgtcacgcc 
gaagtagagg 
gaatcaacat 
gaggtagggg 
tcatcatcat 
tcaaaggtaa 
tcattatcat 
ttgaaggatg 
tcatcatcat 
cattcacctt 
tgattatctt 
agcatgaaac 



aaacaaagaa 
aggctcttaa 
aagggagtat 
ataatggcat 
ttgatgtact 
aatgttgaat 
acaacatctg 
cttttcgaag 
ggggacttta 
atcttatcta 
aaggctataa 
tcaacactcg 
gggtattgta 
gcggaacata 
ataaacaacc 
gacgcagatg 
atgagtagcc 
gaaatcgaag 
aggacaaata 
catttcatgt 
gage tga tta 
gtctctgacc 
aaagggggtg 
cacaatcctt 
g tctagcaac 
gtcttctgct 
ccttcctgcc 
tttgtgaacc 
cttatttttt 
ggggggtaaa 
aggtatt t tg 
cttagctatt 
cccattctta 
tgcaagaaga 
cccatactt t 
cagagcatca 
cttaaccata 
ggaagttttg 
gctgacgata 
gtcagt tccg 
gcatta taat 
tacaggcagg 
cagct tccga 
cgtcatccaa 
tattagcact 
tcttaccaat 
caagcttatt 
tattggcaa t 
catctacaca 
cataccttgg 
gctgat tcac 
taaaacaaag 
cggatgacgt 
catcatcatc 
tattgttact 
ctacctctgg 
tcacaccaga 
tactcaagca 
ca tattcata 
caaaggtctc 
tcaacttggt 



gggggtgtcc 
agggcgtagt 
cataatacac 
tacaggaaaa 
cacccacaca 
catcttcaga 
acatatctga 
ataccttaga 
ttgatggccg 
gacagcgaga 
tgggttgtgg 
ataatgatga 
aaatggctat 
cagttgaagg 
taataacaat 
tgcaggcgcc 
tctacatgga 
gagctgatgc 
aaaaattgga 
acatttcatt 
atcctcctgg 
ctatcaacca 
tgaagaggtg 
gttcttcccc 
tccaacatct 
tcattgtcat 
tgatgcctct 
cagcatggtt 
acattcttat 
ggagtggtat 
ccctccctgt 
ttagtatgca 
tcaaactgaa 
catcccgcca 
atttcaggaa 
ttttccaagg 
tcctttattt 
attgatgtct 
caccctccct 
gccattttta 
gtaacattct 
catatacaaa 
agggtcagta 
cctagcatta 
ccactttgaa 
tacatcattg 
ctcagtttcc 
ctcattatac 
cagtctctta 
cctctttcta 
tgatgacgaa 
gttaacatta 
agaagcacta 
attaacatca 
tctaggggga 
ggggacaaac 
caacaaacta 
ttgagtaatg 
aggggggata 
t tcactgaat 
ataagccctc 



aaggaggaaa 
acttactttt 
gcactagggt 
ggatataact 
tatgcagcat 
agagaaaata 
aacagtcaat 
aggtatggtg 
taaaaagctc 
ttttgttgca 
tattataagg 
tgacaaggca 
aaaggactat 
aagaaagatg 
catcccttca 
attactttta 
tgagaaaaaa 
gaatgcagtc 
tacatttcat 
aaaaatcact 
ccatgacggc 
tcctacctga 
aagaataagg 
ttctaacagg 
gtgcctggta 
tattaggaat 
caaagagccg 
tcttagcctc 
agggaagaag 
attcaccctt 
cctcttttag 
ttttctcaat 
aacccctcaa 
ttagcctaat 
acctattgag 
ttgatgagct 
ccattctttt 
taaagacgat 
tcttgctgct 
atgactctaa 
tttattttaa 
aacacatcac 
tcaaacccat 
ttagaggctg 
gcgtacttac 
acagttgtat 
tgtctccttt 
tcctcatcaa 
ctctcagagg 
ctgttactac 
gaaggtatta 
atgttaagag 
ggcatattca 
taaagtgaac 
acagtcaagt 
actacagtat 
ggggagacat 
tcaatagtat 
tccttgaatt 
gcatgcattt 
tccatataaa 



34920 
34980 
35040 
35100 
35160 
35220 
35280 
35340 
35400 
35460 
35520 
35580 
35640 
35700 
35760 
35820 
35880 
35940 
36000 
36060 
36120 
36180 
36240 
36300 
36360 
36420 
36480 
36540 
36600 
36660 
36720 
36780 
36840 
36900 
36960 
37020 
37080 
37140 
37200 
37260 
37320 
37380 
37440 
37500 
37560 
37620 
37680 
37740 
37800 
37860 
37920 
37980 
38040 
38100 
38160 
38220 
38280 
38340 
38400 
38460 
38520 
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catcctcatt 
ttgacatctt 
aaaatgagtg 
cagggggtat 
cttcaaatct 
ctccatttga 
aggtaggcac 
taaagaggga 
acagtcccct 
cgattttgat 
gacatcaaat 
ctggaagcca 
aagaactcgg 
tcatcgttgt 
ccagatatgc 
tttgcaactc 
agt ttttcaa 
actgtatcaa 
tacaacaagc 
cacctaatga 
ggacacacac 
ttaacacact 
cccctattca 
atgaaagtgt 
cgagcagtct 
gtgctagtct 
gccatataca 
agaatgtgtc 
aagccaactt 
acaattgacc 
caactcttct 
tgcctcttac 
ggaggagagg 
aggttgatga 
ggaacagttg 
acaggccgtt 
gta ttgatgg 
aaatacacga 
aatagcccat 
ta ttggtact 
gttttactgt 
agtccactat 
taagctcggg 
ctctgctgag 
tcacccccac 
atctggctac 
actaaa tggg 
ggatctggat 
accacctcac 
eta tattccc 
cccataacca 
cccaggcgct 
aagccagcag 
caatactcaa 
gttggactct 
tagctctcgc 
gatct tccaa 
tttctatgtt 
tttactcttt 
cgcctcacct 
ccgaaggcgc 



ctcattaaca 
attaagggta 
aaaagaggca 
ataggcccta 
ccaggtgaaa 
ccagttatag 
ctcatcaacg 
gaaaagccaa 
taaaagggaa 
agaaatcagt 
taaaggcagg 
ttacactgaa 
gaggtgtgtg 
aacaataacc 
tatctataac 
ttgtaattac 
acatacttct 
ccccctttac 
gtgcgtccct 
catcatcaat 
cagtcacccc 
caaaaacaaa 
caatgtaata 
taaagtcctt 
tacggaagtt 
tttgtctaat 
ttatgtccat 
tagtgtcaaa 
tagggtcctt 
tatggccaat 
caatgccttt 
actgcctaat 
gagcaacttc 
cgtcatccca 
tggcttcgga 
ttttattccc 
tagtagtaat 
ttaattcaac 
aatatttaac 
tacttttgat 
tgggcactac 
tgtcactgca 
ttggctgcta 
actgatgctt 
ttccagaaat 
gggttcactc 
gaggacctaa 
gacccacctt 
cccatatctc 
ccaccttatt 
gatctggacc 
agcggtgtat 
aagagcaaca 
gctacaagaa 
aagt tactta 
cggtctgata 
ttcatacaag 
atgctgaata 
gacagaaatc 
gaacgctgca 
ccaaacttcg 



cctttggcat 
tcctttacac 
tacttgtaca 
cacatgggac 
agacactcca 
caatccgtgt 
gcgacggttt 
cgctatagca 
tatgtgcttc 
gtctgttccg 
gccttccatg 
actgtacgtc 
aatgtactgc 
tagagggtat 
aaagttatca 
attacaagtc 
tacaaccttt 
ctctggtaaa 
tgactcataa 
actgatacct 
aactaccccc 
ctcttcatag 
gaccccatat 
cacatcttca 
ttcccagttc 
tcgctcacag 
aatctgggag 
aaatctcatc 
tccatagaaa 
tctcaactgt 
aaagtagtaa 
caccttggac 
agacttcttg 
agtcttgata 
ggttgaacca 
ctgatggtaa 
tacagtgggt 
gcccggcaca 
gttacgttaa 
gggtttggag 
actgcactgg 
ctgcactagt 
gagctggagc 
ctggcctgcc 
ttgccgtcga 
ctggccatac 
gggtgttata 
gtccttctta 
tcaccccatc 
cactcgctcc 
ccagtcttcc 
ataaggcggc 
caacaagcac 
tggcctctcc 
gttctgagga 
tgaagaaaca 
gtaagcacaa 
ttttgttggt 
tccaaccagt 
gccgatctct 
tccccggctg 



tggggacaaa 
cttcatcatt 
cctcttcata 
ataacaggtg 
ggtggacact 
ccgaatcgta 
cagccatctt 
actcatttct 
atggcaaggg 
ctttgcaaca 
ttagaacaag 
attgaatagg 
ctgattagtt 
gagagtccca 
gcctttatat 
tcagccaaaa 
cgggcaatga 
cgtcccatcc 
tactccccac 
gcaaatggca 
ttaatattct 
ttcatatcct 
ttaatctcaa 
ggcacttcga 
tcacactggt 
gcacttgaag 
gacggaatat 
tcttcatcgc 
caggtatcat 
tgacaggtat 
ggggaatcac 
gcacacacgt 
aaacgtttag 
gtgatgttgt 
gacgcgtagt 
acatactgct 
ccccccattg 
ctaataaagt 
tcaaaaaaaa 
cggagactga 
tgacgtagta 
agtagtagta 
tggaactgga 
ccggctcgct 
acgccagttc 
cctgtttagg 
acaacatgta 
tcctatcctt 
tcacctatac 
agtt tcaaca 
cttttatccc 
gcggccaggc 
tctctctcct 
agcccccgcc 
actaaaggaa 
cttgctccat 
gttctcacta 
tgaagtgaaa 
acggagacta 
tggataaggt 
cttgtgttgg 



gaaagtaggt 
aacatagtca 
cactttacgg 
attttcattc 
tttcctgcaa 
atcacccata 
gttgttgttg 
ttactttaaa 
catattctgt 
acatcaaaaa 
caacaacatc 
cacctagaca 
cttccgtgtt 
agtcaatcat 
cgacattaat 
caaaaggcat 
aagagtctgt 
tttgaagcat 
catctaaact 
tctctaaaca 
ttctgtatac 
tcataaactt 
aaaaggtctc 
ccaaa tcaga 
cagaaacttc 
gcaattggaa 
tacgactaga 
aagaagaggt 
ggacatactt 
cataattgtc 
gttcaatcca 
ttgaaaaggt 
ggggaggggg 
tgttgttgtt 
tgttgttgtt 
ggtggtggtg 
cctatggaag 
gatgtgtttt 
caacaacaat 
agtaaatcca 
gtagaagtag 
gtagtagtaa 
get tgtatgc 
tatatacaag 
tccaacagag 
ggtctaaatc 
agcgttgggt 
atccttgtcc 
ccccatctca 
ccctgttctt 
taaccggcac 
cagaagca tc 
tctacctaga 
gcaccaagtc 
ctaacttcat 
ctat tcgagg 
tacactttgg 
aatattctaa 
tggatgtttc 
gatgtccgga 
tgcgaaggt t 



gctagaatag 
gcagacaaca 
aagaaccaca 
accatatgct 
cacggcatcc 
cagatgggac 
ttgtttactt 
caggttcatt 
gattgaaggg 
gtgcatgttt 
ttccagtata 
gtgtccccta 
cctaaaatat 
tttaggttgg 
aattccctca 
gttcaccttt 
tattggtgcg 
gacaacccca 
atccttgaca 
taacccctga 
agaaaggcca 
gacaacagtc 
ccagggacag 
caaacaacca 
cttaattagc 
cacctctaca 
caactcctta 
gggccaatag 
gctagactta 
ctcaagacta 
ctcattatac 
gcgagtagtg 
cgtgatgcta 
gttgtaagcg 
atattcttcc 
gtctccacca 
aaacaacaac 
ttatataaaa 
aatagtatat 
gaaaacaaca 
tagtggtaat 
gagcgttgga 
ttggctgtgc 
ttgtcccccc 
tggggtccag 
cacccccggc 
aagaagatct 
cctgtctacc 
cccatcacct 
gccgagccaa 
catttatgcc 
agttctctgc 
agagacctgc 
cttacaccat 
acgtctcgac 
agcacgagaa 
actttgaaat 
gtccaattcc 
acaatggccc 
cctctatctc 
gtgaaggcac 



38580 
38640 
38700 
38760 
38820 
38880 
38940 
39000 
39060 
39120 
39180 
39240 
39300 
39360 
39420 
39480 
39540 
39600 
39660 
39720 
39780 
39840 
39900 
39960 
40020 
40080 
40140 
40200 
40260 
40320 
40380 
40440 
40500 
40560 
40620 
40680 
40740 
40800 
40860 
40920 
40980 
41040 
41100 
41160 
41220 
41280 
41340 
41400 
41460 
41520 
41580 
41640 
41700 
41760 
41820 
41880 
41940 
42000 
42060 
42120 
42180 
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tggtgagctt 
ccatggagtt 
gccccgtgac 
tgtctgcaaa 
gacaagaagt 
gtttgcttga 
tcttagtttg 
atgtgaacgt 
caagtaataa 
ctaataatag 
gaagatcaaa 
atcatggcat 
aagttgaaac 
atttgagtaa 
tgttcatctc 
tgactgcatg 
atatcctcac 

^gggg^ctgt 

atctggacaa 
tggcattcat 
ttgcaaagca 
caaaggttgc 
acgaaaa ttg 
acgccaaaca 
gtgcattccc 
gtgacacaat 
ttgaggacga 
aagccgagaa 
aggaagaatt 
ctaatgtgga 
tagaaaggga 
tggaggagga 
agaaccaaga 
atgcagatag 
gtagtagtag 
agaagaaaga 
catcagaagc 
tatcaccatc 
cgcctgagcc 
gatttagtta 
ctactgaacc 
ttttggacat 
tgaccgcaaa 
gtttcgatga 
cctccccgat 
ctattaagag 
aggttgaaaa 
accgtggttc 
aggatgaaag 
aacattcccc 
atagtacaac 
ataaaagcca 
aatgctcaat 
gaagttttag 
cagttaggcc 
ggtctgcctg 
acaagaagaa 
aaaaccaatg 
gtgtaaggaa 
agcttctcct 
acaaaaaaca 



ttgccagaag 
ccaaggagac 
tgagaggcag 
cttctctcca 
cgactcatca 
agagtgtggt 
caccctgttt 
acttaaccag 
taatagtgtc 
taacaacatc 
gaaagttaca 
gtttgccggt 
tcttacaagt 
aatgtgccgt 
tgagattaac 
ggataagttt 
tcgctcaaac 
ttattataat 
gagctggggt 
gggtattgac 
attccaacat 
cgcaggattg 
tcagtttatc 
acttctccgc 
atttgcaaat 
cattgaattg 
aaacatcatc 
gaccctgtat 
cttcaggcct 
ggaagaggaa 
agaaggatct 
gaaggaagaa 
agaagaacaa 
tgacagcgac 
tagcagcagt 
agaggaagta 
tgct tcctct 
attggacata 
tatcgttgct 
ctct teat tc 
agcagctgtg 
gttggacggt 
attctttgag 
tcttcttaaa 
tcacccttct 
gtctagggca 
ggaagcccct 
ctctggttct 
ggatatgaaa 
taaatatgat 
atgcatgtcg 
ctctcgtaaa 
caatactcaa 
tcctaagaaa 
ctataacaag 
accgactcaa 
ctatattttt 
tattatatgt 
agaaaaaaaa 
ttctttctgt 
acacaaat tc 



acccgcttca 
gattttaact 
atgttcgccc 
gtctctgatg 
acttccgctc 
agtggctcta 
ataatccaag 
ttgtttggac 
ccatcaactg 
agtaacaagc 
gccacagcca 
gcccctgttg 
aagagcacca 
aaaactgccc 
ccctccaaat 
gtagaggggg 
tatgcctcag 
gacaagtcaa 
aacaagaatg 
attgtaaagg 
ccggaagtgg 
aagtcgggga 
aagtatgaca 
acaggactta 
attgtgacgg 
atgtacaaga 
aaggtgaatc 
tttgagattg 
acatctgttg 
gaggaagaag 
gataaggaag 
gaacaacaac 
caacaacaac 
agtgatagca 
agtagcagca 
cctgccaaga 
cccaagagaa 
ctccagactg 
acaacctcac 
gtaagaggag 
cccgctgctg 
ttacctggcg 
ggcatcaccc 
gccaccgaga 
agcaactcag 
ggttcacttt 
aaacgtaaga 
gcttcttctt 
tcaacatcaa 
gctgtaaata 
tcatcagata 
cactcttctt 
aatgtcaaga 
gatgagcttg 
aagcgtgata 
atgatactca 
ataatatttt 
ttaaaatcaa 
catgattatt 
gtctttgcag 
tctactgctc 



ccaccaacat 
actctgcctt 
ttatgaagag 
acgtcatcaa 
aaatttctgg 
atagtaatag 
tgttcaagtt 
atgttgtttt 
ttgttaacaa 
gtgttggtgg 
aaaatccctt 
atgttaattt 
tccctaaaga 
ttacccccct 
atgaccgttc 
atactgctgg 
ccgccagggc 
agtttctgtt 
ggaagaaacc 
tgtgcgctaa 
aagaacttgt 
ctttgtttga 
cagaagggtg 
atcaatacaa 
taacatccgc 
caaaggatgg 
cagcagaaga 
attccgatga 
ttgctgcccc 
agcagatgga 
atgacggaga 
aacagccaga 
aacaaccaga 
gcagcagcag 
gcagtagtag 
ttcagaagag 
tgagagtaga 
cagttgatga 
ccaaggcagc 
atgacctttc 
ccacttgcac 
atatagtaat 
taccagatgg 
ctgataacat 
cccccagaaa 
ttgacactga 
agcacttgaa 
ctcattgtat 
aggttcacaa 
gtgatgtaaa 
gtgatgcaga 
cctccacaag 
agactgttgt 
gtgatttctt 
atgt taacac 
atcaatgtac 
tacatgtctt 
ccccatttgc 
tctgccatta 
attcgggagt 
tccctcacca 



tgtgatgaga 
gtgtgcaagt 
tgaggacgaa 
cccttcaagc 
tatgtttcaa 
ttcccctgtc 
tttggtgact 
tggatcactt 
caacaacaaa 
tagtaataac 
taataatgta 
ggatgacttt 
agaggtaaat 
agaaattcat 
aatgttttgc 
cgttaaacgc 
cgtgtatgaa 
ccacgataat 
tagactccca 
ggggattcaa 
gcctcctatg 
tgactgggac 
caaaaagcac 
taaactggaa 
ctctagtgat 
cgtaaaggga 
aaagaagaat 
tgaggtgtgt 
gacaacaccc 
agaagaggag 
cgcaccagca 
agaagaaagc 
aagagaagag 
tagtagcagc 
cagtgaaaat 
aaagaggcta 
agaagaacaa 
gatgatggaa 
gacacttgca 
agtagctggt 
ttccgatgtt 
gcaacctggc 
tactgataat 
tataaccacc 
ggatattgat 
tgatgatagt 
aaagaggcgt 
gagtagtgat 
gtcaccaaaa 
taactcatac 
agcacagcct 
tgataagaaa 
acagtctcca 
gtcacgcaag 
cactaataat 
aataataatc 
aataacaaac 
atgattaaac 
aaacaacaac 
ctttctaaaa 
atccccctat 



gaagttaaag 
atgccccaac 
gaaatgggag 
ctcccctctg 
aacgtgtgga 
tctaggacag 
aaagtgtcta 
gatgtggctc 
ccctcgacct 
agtggcggcg 
gatggggaca 
gtttttcccc 
gtagatgaag 
acctttaatg 
aagggatttt 
ttccgtaact 
gcgtcaatta 
gttaaccctg 
gctaacttga 
aagtatatgt 
gctgtatacg 
ctgcctgaat 
agtgagttat 
gagggacaga 
gatattcacg 
gtctcaaaaa 
aa tagagtac 
gagagaacag 
ctcgtacctt 
gaagaggaag 
caggaagaaa 
aatggtaatg 
gagaataagg 
agtagcagca 
gaagctgaaa 
agtgaaaggc 
caacaacaac 
gaaattcctg 
ctcaagacag 
aatacttccc 
ggaaatgact 
gaatgcgacg 
gaa tgcacag 
acatgcttta 
aattgcagtt 
gaaacaaatg 
aacaagtccc 
gaagaatcag 
gctcatgtta 
aacaatgtta 
aaaagccata 
cagaaccaac 
cctagtttta 
cacacaaagc 
gtagtacaga 
ttagtactta 
aaaataaaag 
taactatagt 
aaaaattc ta 
ccacagcaat 
cgcccttatg 



42240 
42300 
42360 
42420 
42480 
42540 
42600 
42660 
42720 
42780 
42840 
42900 
42960 
43020 
43080 
43140 
43200 
43260 
43320 
43380 
43440 
43500 
43560 
43620 
43680 
43740 
43800 
43860 
43920 
43980 
44040 
44100 
44160 
44220 
44280 
44340 
44400 
44460 
44520 
44580 
44640 
44700 
44760 
44820 
44880 
44940 
45000 
45060 
45120 
45180 
45240 
45300 
45360 
45420 
45480 
45540 
45600 
45660 
45720 
45780 
45840 
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ctccttctcc 
ttcaacaaca 
cgacctctcc 
atgtcaagca 
ttagtgataa 
atatgtttgg 
ctacaacaac 
acgaagaagg 
tcgggatctc 
aactcaaaaa 
tcgcagcaga 
atatgtgcgg 
tgtatacttc 
ccaagaaatg 
tatgcatggc 
tcttggctga 
acaaagcttt 
atactaacct 
gaatacatgg 
acaagaggga 
acactagtaa 
ctgatatgca 
tgggtgtgaa 
cgaatgatgt 
attctagaat 
gcttcacagg 
tagaatgtga 
tctttcatcc 
acaagggaca 
agcgtagagg 
tggatgtatg 
gtaatagatg 
gcagagacgc 
actgattgat 
actaatgttt 
gatgtatttt 
cacaggggag 
cacacgcaag 
ataactgggg 
gacaagaaaa 
acacgtgtga 
tttcctctcc 
ctctccagat 
ccagatatct 
actgaattag 
taccagtgtg 
cctttcatgt 
tttagtagtc 
ctggaacctc 
tacacctgtc 
ctcagaagaa 
ttttgaagag 
gtttgaataa 
actacaaaat 
tactggtatc 
actccggcag 
gcagcttcac 
gtgcgtgtaa 
ttatcgcgct 
gtcatagcca 
gcgtaggcca 



tctctatact 
atgccaccaa 
atcgtctttg 
agaagtctcg 
cgatgaggac 
tgattcagat 
accacttcct 
agcagcagca 
tgaagctgaa 
gatgatgaaa 
ttataaaaca 
tgtcgcccct 
ttcctatgta 
tcaattcctt 
ccctcccggt 
acatggaga t 
ccccaatctg 
ttgttggata 
cgatgcgtca 
agaacaaaag 
tatgggaaca 
agaattccaa 
taccaaccca 
accagaattg 
gagtgcaaat 
aggtgcgaat 
cgaagatgat 
attttatgag 
gacttgtaat 
aacagtcccc 
tactgccatg 
tatcgtaagg 
ttcatgtgat 
tggttgatat 
tatatatgat 
tgaaactaca 
tcttagagag 
taaaaacacc 
aatttcctct 
aaattagtga 
gcgcttctgg 
agatatctgg 
atctgggtaa 
ggctgtacac 
ctctctaaac 
catataagaa 
ttatatttaa 
ggtacctttt 
atggtagctt 
aatctggatc 
gcccgtttat 
tatgaagaag 
aatatagaga 
ttgtacacat 
tgatttct tg 
cagaagttgt 
ggcgttgttg 
agatgcctct 
tctcaaaaaa 
taaaatcaaa 
tacagctagc 



tcctccccat 
tacaattact 
tgggatgatg 
aaccgtcccc 
gataacaagg 
aacatgccat 
actcctcgat 
gcatcagcac 
ttgaaaaaaa 
gatcctctcc 
agggcaaaca 
cctcaatttt 
tttcctcctc 
aaggggagaa 
aaaaggggag 
gaatacaagg 
at tgctcgtg 
gttaatggag 
gagtatgcaa 
accaatggga 
agcactgatg 
tcaatgggaa 
cttcccaacc 
gacatgatgt 
acaggaacat 
gtagtagtgc 
ccaagtattc 
tatatggcca 
gttgtccagg 
aaattctgta 
tgcaaatggg 
aggtcatgta 
gttcccaagt 
gtgacatttt 
tctatttttt 
ctctggcaga 
gggtgtgaat 
act tcctaga 
ccagatatct 
tatcataact 
gcgcgacaag 
gtaagaaaaa 
gaaaaaaaat 
atgtgagtgt 
ttttcctcat 
ctggagagat 
taaatataat 
ttctgaccac 
catttgttgg 
ttgtcggaaa 
tgaggattgg 
aagaagatag 
taca t t ta ta 
acattgagag 
atattcgaga 
agggataggg 
ttgttccagg 
gttgtattgg 
ct tggccatt 
ttggtcagag 
ggcacgttga 



tcacttctga 
accccacacc 
acgatgatga 
ccattttttc 
gagaagaaga 
caacttctac 
ccatcatgga 
cgtctattgc 
tggaaaagaa 
ctcacctata 
tttcccttta 
gcgctgaatt 
ccacacctgc 
aggctttgag 
gtgtattttt 
tcccaaggat 
cagacacact 
ttgtcgtatg 
caggagaaaa 
gtgccagtaa 
tgcaagaatt 
caaataccaa 
ctcccccaag 
ggctttattc 
ctcccctgtc 
ctaatggatt 
ccaattctta 
aatatctatc 
agtggttcaa 
gtaacatttc 
cgaagactgt 
catccatgct 
gcagggaaag 
ctgtatattg 
taaaaacctt 
atcagaccag 
ctggctaggg 
aagggaggga 
ggctgtacac 
ggggaatttc 
aaaaaaatta 
aaattagtga 
tagtgatatc 
tctaatctca 
ttcttactcc 
gggagtttta 
caccatggat 
ttgtgcatct 
atttctagga 
aagcttcgta 
aaacgtatta 
tggtattgaa 
ttgttttatt 
aaaaaattga 
gagttagtag 
gcggtggcac 
ataatctttg 
accaccttat 
tctccacaac 
gtgatattat 
atattggagt 



ctatcaattt 
ccctccatct 
tgacgaagaa 
gtacatggaa 
atgttttgga 
tgcccctttc 
tactgattcg 
cgcctcttct 
aaagaggaag 
tgtaggagga 
taaagttgac 
gcccacccca 
catgcataat 
gaaatggatt 
ggctcacttg 
gtttgtttca 
gtgcagtgat 
ctttgataaa 
ttttgatact 
gaagaggcgt 
ccaaacgatg 
ccccatagag 
attggtaatt 
gccttccaga 
taacacccca 
tgtccctccc 
caattacgaa 
ccctcttgtt 
gggatccttc 
ccacgctttc 
aattagacat 
cgcatatcac 
ggttcgcaac 
tgtaaataag 
taaaaatata 
acccctgacc 
gttcatccct 
aggggtgact 
gtgtgagcgc 
ctctccagat 
gtgatatcat 
tatcataact 
ataactgggg 
ttttttatat 
tctttgtttg 
aataacaacc 
atttcaaata 
tgcagtccaa 
cataaactac 
ttctctgcaa 
agggactata 
gaataataat 
tgcattatat 
tacaatttct 
tagcagaaga 
cggtggcagc 
cgcgtttatt 
tcctatattc 
gttgacgggc 
ggttgtttag 
acttgctgtt 



tctgattcat 
atgtcctcga 
gacgaaaaag 
actgtatctt 
tcaaactttg 
cctcctccct 
gatgaatgtg 
tctatccctg 
gaaattaaga 
gaacctcctg 
cctagtatcg 
tccatagatg 
aagaaagggt 
cacgagaatg 
gaacaaagat 
agagtattga 
atgacattct 
gatgatggag 
gtagtgt tec 
ctcacgcctg 
ggaacaaata 
acttcatcag 
actcctttaa 
ggaggtggaa 
attcctacct 
acgttcccct 
gaggataaag 
ccatcatata 
tctcttgcaa 
ttttgtaata 
ggacaatatt 
tacattgttt 
gacatggatg 
ataccaataa 
catataaaat 
taaagcagac 
caatgttaat 
tggtgatatc 
ttctgggcgc 
atctggctgt 
aactggggaa 
ggggaatttc 
aatttcctct 
agaaaatata 
tatgtcctac 
cttcttaatc 
agacatta tt 
ctcaaatcgt 
ttaaaaacat 
gtttaaccat 
atggcaacaa 
aataataatg 
ataaaaaagc 
tccttttttt 
agtagcacca 
ggcagcggca 
catttcagtg 
tccgatgtcc 
tgcccatgca 
aaggtccaca 
tcttggttta 



45900 
45960 
46020 
46080 
46140 
46200 
46260 
46320 
46380 
46440 
46500 
46560 
46620 
46680 
46740 
46800 
46860 
46920 
46980 
47040 
47100 
47160 
47220 
47280 
47340 
47400 
47460 
47520 
47580 
47640 
47700 
47760 
47820 
47880 
47940 
48000 
48060 
48120 
48180 
48240 
48300 
48360 
48420 
48480 
48540 
48600 
48660 
48720 
48780 
48840 
48900 
48960 
49020 
49080 
49140 
49200 
49260 
49320 
49380 
49440 
49500 



wo 01/38351 



15 



PCTAJSOO/28888 



acggtatgga 
gatgtgcctc 
tcgcaatctc 
ccaactccag 
ttacttagcc 
ctaggtgggg 
caagaatatt 
tcgtcctcgt 
ttagacttgg 
gacatcctcg 
gtaacagaga 
atacttgtaa 
atcatagggg 
gtagagttga 
ttctcagaga 
cgattttcgg 
ctataagggt 
ctttctgaga 
taagcagtaa 
tggcggtgtt 
tgctcacaat 
tcttcgtatc 
tccatctcca 
tggcgccggt 
tcatcagatg 
gaagaatcgg 
gcttccaatt 
gtattagggt 
tcaagaaaag 
tcctttttgt 
tctgtagatc 
ttagaactag 
tcttcctcga 
attggcacac 
tccacatcaa 
gacacaattt 
ttggccagtc 
gaaccgggcg 
ataaacacga 
ggatgtaata 
ccttttataa 
atcacccctt 
tgaacaggat 
atacagactt 
aaatggttgg 
cagggatatt 
agatatactt 
ctctgtctca 
tggtccaggg 
cagaggcgtc 
tctgaaccag 
tgaaggttac 
ttcaaaaaga 
acagtattat 
tttaatgttt 
caatttaact 
agatgcgggt 
caacgtattc 
tcagtacgga 
taatagggaa 
cgctgttatg 



tcaccttact 
tgctaactgt 
cgtaagaagt 
aagtgagtgc 
ctgcgtcaat 
cgagagtgtc 
tcttgttctt 
cactgctacg 
acttgttctt 
ttccagtaga 
taccttgtgg 
cagcagaagc 
ctgtagtggt 
tgtgagacat 
ggacttgtag 
gggttgccag 
cgaaagttgt 
atcgagagtt 
cgtcagaaga 
ggtggtggcg 
aatagtcctt 
tgggttcgtc 
ttttctcaga 
gatactgatg 
attcagagtc 
atgagtcaga 
ctctgacgac 
atgtcacacc 
gaatgtcaat 
tctcttcttg 
caaatccacc 
agtcaataat 
catcacaata 
ttttttctgc 
tcgttccagt 
gtccataaca 
ccataggttc 
tggcacgtct 
cagatgcaga 
agtgttcaac 
tcgtgaaacc 
gtccttgaag 
cataaaaatg 
aaaatgtctg 
gaaaaagaat 
tcttatgact 
cattctgata 
tttcctgtca 
tgttaccgga 
ctgatgcaag 
agcttagtca 
aacacaacac 
gttaaatatg 
gacgcagact 
gggcagtatt 
ggaaatacat 
gctgctccgt 
ccttctgagc 
gatggtgtag 
ggtgtagtag 
gatgcggcta 



tccatcattc 
cccgtcgtcg 
gtcagcaacc 
agcgctataa 
ttcagacata 
gttgtttctt 
agactgtctt 
gttgccattc 
ggggccagga 
ttcagtcaga 
tactcggaac 
agaagtccga 
gcaaatcggc 
cttgatttct 
tcgactgtgc 
atactaatat 
ataccttgag 
gcgatcacgg 
attgttggag 
gtggtggtga 
gggcacttgc 
ctctcttttt 
atttcttccc 
acgtcttatt 
agatgaatca 
agagtccatg 
atctatattt 
agtttcgttg 
aaaaccttcc 
ttggtttctt 
agttccacgc 
cgttctctct 
tcttaagaaa 
actatggttg 
aggtacacta 
gccgtcggga 
gatatccatt 
gggcggaagt 
tgagtccatg 
gtctggagga 
cccacacccc 
gaggaaacta 
cgtttttata 
agccttcagt 
tcgggctttt 
caagcaaact 
atgaaaaaag 
agaaccctga 
ggtacaagca 
atcacgttga 
tgtgggctgc 
acgtgcctgg 
aaaatgtggt 
ctggctcatg 
ttactaatct 
ggtctgatgt 
ccagatctga 
agacatctgc 
aaatagaccc 
gaacagagaa 
tgcgccttca 



aagaattctg 
aggacaaaag 
gtgaagtggg 
acttctccgg 
tttccttctt 
tcagccgtag 
ccagaaccgt 
tggttcttct 
acaggtataa 
gatgtaagtg 
attgcagaag 
gaggcgatgg 
agagtgttgg 
tggttggttg 
tctacccaga 
actcctccac 
ttgatccggt 
catggcctcg 
cggggttggt 
gcagcagcag 
attcttcttg 
atgtgttcag 
cctccattga 
cttcggacct 
gacgaatcgg 
gagccactcc 
ttagtatctt 
ctttgttctg 
atttctacga 
gagacggtgg 
acgcttcttg 
cccgtggttt 
atcaactggg 
cgcagaatca 
gtgttgttct 
aacttgtcta 
tcttcagagg 
gcagtttcct 
gctggataca 
caag tgagta 
cgctaaggca 
gaagataact 
tgctaatgag 
ttatgctttt 
agtacaacca 
tgat tgtgac 
agtaggagag 
gggaaaaaca 
aagagacccc 
ccatgaaact 
tgtcccttgg 
ctttgcattc 
catttcaaag 
ttatcgatct 
ttct tacaat 
agtttctgta 
aatggatgaa 
tcgccagaaa 
atcttctgtg 
aaaatccgac 
agtcatgggt 



cctcttctct 
caggtccagg 
aaggatatag 
tgatttggcg 
cgggtgtggg 
tagtagcaag 
cgtcgataac 
tcctctttga 
catccacatt 
ctggcacggg 
aagacgtaga 
cagcagctct 
atgaagtcgt 
gttggttgtg 
tccacagccc 
gccctcttct 
tataaccgcc 
tgggggaatt 
taaagtgcct 
gggcgtcgta 
gatgaatata 
tttttatgtg 
cgtcatccga 
tattatcccc 
atgagtcaga 
caattccact 
tttcttccac 
cgaggatatt 
tctccccttc 
tttcagtaaa 
cttgattttt 
cattaatata 
caatgctggt 
ctttcaattc 
tccatgtcat 
ttccatatcc 
gaaataggtc 
ccccgggagg 
gggagtacac 
ccttcctcgg 
gtaaaaaaaa 
tttatcatgg 
aagctgaggc 
attgacataa 
ggacagaaat 
gcattttctt 
tgcaactttg 
ttgcgccatt 
catactggtt 
ggaaataaaa 
atacgacctg 
aaggaggatg 
gcttattgtg 
ggatggatga 
t tagctaacc 
ttgacagata 
attatcacca 
gcagagaata 
gatgctttaa 
cgcctcatgc 
ctggacgaca 



gtgcagaaga 
tatgggtgtt 
tccattatct 
agtgactgga 
aaggaagggc 
agatgtgacc 
aagagatggt 
atccttcttc 
gggtgtagaa 
aggcatggcc 
attagagttg 
ggcggcagca 
ggcagcagag 
gactattgtg 
ctttcatatc 
tgcgccatat 
attgttgggt 
ttcattgacg 
ctggtgttgg 
gtgttcgtat 
tctttctctt 
agttactctg 
ataactcaac 
tccttctgaa 
ggaagaagaa 
actaaattca 
atcttgattc 
ttcgaaacct 
ttccaattct 
atttgggtta 
gttgtccttt 
cacaatctgt 
tccctttctg 
tcccctataa 
cccagaacgt 
agtagagatc 
gtaggcgaca 
ggcgaatctc 
agtggagacg 
ttcctcccac 
ttgacccacc 
caacatttac 
aggaaagaat 
aagaaataga 
tagctccttt 
gcataccttc 
ccgaacacac 
tcacggcatg 
tgccagtagc 
tgtgtgaata 
gagaccttac 
acgaaaggga 
atttctttaa 
aatttgtcca 
caaaacctta 
accctattgt 
agaagaagtt 
taatacgttc 
tgcagtttgt 
gagtggcaga 
gtcaatctag 



49560 
49620 
49680 
49740 
49800 
49860 
49920 
49980 
50040 
50100 
50160 
50220 
50280 
50340 
50400 
50460 
50520 
50580 
50640 
50700 
50760 
50820 
50880 
50940 
51000 
51060 
51120 
51180 
51240 
51300 
51360 
51420 
51480 
51540 
51600 
51660 
51720 
51780 
51840 
51900 
51960 
52020 
52080 
52140 
52200 
52260 
52320 
52380 
52440 
52500 
52560 
52620 
52680 
52740 
52800 
52860 
52920 
52980 
53040 
53100 
53160 
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acgattattg 
tttttccagt 
taaaggcgct 
aagcagatgg 
tatcactgaa 
caattcatct 
ttcttcgacc 
agtagtgaga 
ggacgataat 
ggccagtaat 
ctttatagac 
gttctccgta 
agcgatacag 
aattgttata 
ttggtacgat 
agagtttgca 
aatcgtcgct 
aggagcacga 
gggaaagaaa 
aacaatgaac 
caccacacaa 
tctagaaaca 
tctagattac 
caacactact 
atttctggtt 
caatattggt 
atcatatata 
tgctctagtt 
aggtgtcggg 
cgagagtgac 
tcaaaccaaa 
aggtcccaaa 
actcctcgtg 
tgtggaaaat 
tgttttcatc 
tgtgcaagaa 
acacagatca 
ctcaccttcc 
tgaagatgat 
tggagaagat 
gaatgaagag 
tgccgccgct 
aaaccacgca 
actgctccag 
ctccacaccc 
cgagctcttt 
agacatctta 
ggaaactccc 
taaaactagc 
tatacttcaa 
agacgattct 
aaactt tcga 
cgatgacacg 
tgtatcaaaa 
agatttgttc 
ccacccaaaa 
ctctaaaata 
ttcagacgcg 
taacaacaat 
taatataagc 
cacaaacaat 



ttaaaaaata 
tctctcaaat 
tcagctaaaa 
tcacccgaga 
gacccaaaac 
atcaagagcg 
agaaaactgg 
cagttcttgg 
attcctctga 
atgtgcgcta 
tcggcgagtg 
ttcttggagg 
gcatttgccg 
ttcgtaatac 
cacattttta 
aaggcaggaa 
atcaacgttt 
aaatcaaaga 
agtaagtttg 
gcttttgggc 
gagggaagga 
ggtcaaactc 
gtgggagagg 
agtgataacc 
ctaggaatag 
ctagcttttg 
aacatgaatg 
ggagatatca 
gctaaataca 
taaaaagtga 
ttcataatca 
aatattcctc 
aatgaccgtc 
actcttaacc 
aaagaaatgc 
atattcgaag 
gtttcaccaa 
cctcctccat 
gaagaaatat 
tcagaagagg 
ggagaaaaca 
gccgccgccg 
gttagtgcgc 
caacagcaac 
aacaaattca 
gtttgtttcg 
Gccattgttg 
ccagtgaaac 
tttgaaaatg 
aggcctagag 
gtagacatta 
gtgtccgaat 
tcttcaaaca 
ttaaatattg 
gagacgggca 
gctagagttc 
actcgcaaag 
atcttttctc 
aataacaata 
agaaaaatga 
tgcactaacg 



tgattaaaat 
taattggagt 
gaaaggaaac 
cggtgacaga 
accctgcgcc 
gttcttcttc 
aagaaaaggc 
acattttaga 
tgataggcgt 
gatttgcctc 
ctatcacttc 
agtctgttat 
atctggcaac 
aagtacttgg 
gcccagagga 
atgtagatgt 
tccttcaaac 
ttgattttct 
tctacataca 
agcgtataat 
gggatgacga 
ttaaggatta 
aatgggtaag 
tattcaagaa 
gtgtattggt 
cgttcgcagg 
ctatgggtgt 
aaacagaccc 
acatgattac 
gggttataac 
ttgtgaagac 
aatcaaaaaa 
ttggaaatca 
ttaacagtct 
agtcctattc 
atggactaat 
gttcccctat 
cgatgccttc 
acgaaacaga 
aagaagaaga 
agtacgttct 
ccgcagctgc 
atgactacac 
agcaacaaca 
acaagttttt 
atgtggataa 
ctgaatacat 
cgtgcaggag 
atgctgtgga 
atttccctat 
aaaacccctg 
attcgcgtca 
cttgctttat 
aatttgaggt 
ttttgagcga 
gaaatgttgc 
aaactataaa 
ccccaagtga 
ataccagtgt 
ccatcacaaa 
gtaactatcc 



gagtagaaac 
aactttggcc 
ggccattaat 
agaagatgca 
ctttgtggac 
ctcaatatgg 
gagtgttttc 
aggtaaactg 
ggaccaaata 
ctcagctctg 
taggttagcc 
agaatttgta 
tcttgccgcg 
gttgattttg 
tttaaagaag 
gggtgtagct 
tgaagaaaac 
tcaaaaatac 
agaagcagct 
aacagctgct 
aacagtgact 
ctcgtctgct 
aaatactgcc 
aactgtttct 
agcgtcccat 
tctcctagca 
agtgaattcg 
cagaagagta 
agatttcgtc 
ccccccccca 
atcgacggtc 
taccatggat 
tcgaacaaac 
agaaagggca 
tggatgtata 
tactttcgaa 
acctctttct 
tattaaagaa 
tgaaaatgtg 
agatataatt 
cgcattctct 
tgctgctgct 
actgtccgcc 
ccagcagcgc 
actgccgagt 
aattgcgcaa 
tatcaacggc 
aaaggaagtg 
agataaacat 
ccctaaaaaa 
gggttcgtgt 
ttttaatgaa 
atactctcaa 
aatgatggag 
ttcttcatta 
attgttttat 
atgttcagag 
taatcctaat 
gaatatcgag 
ctaccaatgt 
cgatcgtggt 



aacccagaat 
ataaaaagat 
aatggcgaac 
cttttatttg 
at tttacact 
aacgatatac 
gttaaaaatc 
ttttcggacg 
t tgagagaag 
gagtctagtt 
gtacaactgg 
gt tgccgcaa 
tcagccttaa 
gatcttgcgc 
caggttctag 
caaccagtca 
ggggaagaaa 
ttccattcta 
caagaatact 
gatgatagtg 
aagaaaatga 
g t taactata 
ctaaaagaag 
cttgctagta 
attacacttt 
ttta t tgcac 
gacgcaatat 
ggaatggtcc 
tctccaatgt 
acaaaggctt 
gaggaaggtc 
tcgttgacta 
aaaccaatca 
agtcttttga 
cccaagaaca 
tggagagatg 
acaaaaaaat 
gaagaatttg 
gaagatttca 
gttgatgacg 
aatcatctga 
gacattgaga 
ctccagcaac 
tcctcatctg 
aatggct tct 
tataatggac 
cttggtctga 
aaagatgtgt 
ctcgcattcg 
atcaccgcct 
ccgcttttga 
ttc tcaggag 
aaaaacccca 
ggaattataa 
gctacggcta 
ttttctgtat 
acggataagg 
ataagtgctc 
gacagaccta 
a tggca tgca 
aaccagcact 



atgcaagaca 
ctgttttctc 
agcatagaag 
caagagaaaa 
caccagatat 
tttcaagaat 
ttgtggtgaa 
gatacgaatg 
tcattaaggc 
tggttactgg 
cagccagaac 
gccttcggct 
ccgttattgg 
tagggttagg 
tgtttaggag 
cccctgaaga 
agaaggaaga 
ctcctttgat 
tgggaggaag 
acaccaccac 
ggagtattat 
acgcctcccg 
agacaaggag 
tggccggcgc 
tacgtttcac 
ttatgagtat 
acaggtctac 
agcgccacgt 
tagacgagat 
taaattagac 
cgtgctcaaa 
atacagtaac 
ccgaacaaga 
agttatattc 
agtacacgaa 
gaacaaaagt 
cgcctcggtc 
aagaagaatt 
taaatggtga 
aagaagaaga 
ggcgccagac 
agaaggacaa 
agcaacaaaa 
agaaggtcac 
cagaacagac 
tcgtggagct 
aatgcagtat 
ggtgtcagcc 
cagaatcgcc 
atttttgttt 
aaagtggatc 
ttaaaaatga 
acattgaaat 
cccatagaaa 
tggcattttg 
atttaccatt 
tacatattgg 
accagaacaa 
tccgaaataa 
aggaaagatg 
tgtcacatag 



53220 
53280 
53340 
53400 
53460 
53520 
53580 
53640 
53700 
53760 
53820 
53880 
53940 
54000 
54060 
54120 
54180 
54240 
54300 
54360 
54420 
54480 
54540 
54600 
54660 
54720 
54780 
54840 
54900 
54960 
55020 
55080 
55140 
55200 
55260 
55320 
55380 
55440 
55500 
55560 
55620 
55680 
55740 
55800 
55860 
55920 
55980 
56040 
56100 
56160 
56220 
56280 
56340 
56400 
56460 
56520 
56580 
56640 
56700 
56760 
56820 
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tgtaaaaggg 
attgagcaga 
tagaagctct 
ctcatcggac 
caatagttat 
tgaggaaaaa 
ggatatactt 
attatcctct 
cattatgttt 
agaatcttta 
caagagggca 
tggtatcact 
aagtgccctt 
atgtgttata 
ttgggaactt 
ttccgatgaa 
acctaatgat 
aacaaacact 
taataataaa 
tatctctatt 
agtcgtggcg 
caacagcagc 
accattgtat 
gttattttat 
tgattttgta 
aaaattgcta 
gtatgtgcaa 
gtgtaacaaa 
aattgccggg 
cactacctca 
catccactcg 
ctactactgc 
gaactaagat 
gacgtggaag 
cccgtgatgg 
gtgaagcagt 
accctgagca 
cacttggacg 
ggcgcgaagc 
gcgaagcagc 
cacttaaagg 
ttggacgtga 
gcgaagcagc 
tggaggaaga 
ttaaaggtgc 
gacgtaagag 
aagcagaa tc 
cagctccccc 
aaggtgctct 
gtaagaggcg 
aaaagcgcga 
ctcccccact 
ctccccctga 
tatggaaagc 
ttggataatt 
actctttcct 
gcagacatac 
acatgtttgt 
ttttcttgtg 
cagagttgct 
tctaatccaa 



gaagatttct 
gtactgattc 
atgtgccata 
agttttgaaa 
tctgccataa 
gataataaga 
gttcacgaat 
aaagatgggc 
gagaaacacg 
aaggatattt 
attattggag 
aggattatta 
ctcgacaaaa 
cgtaacaata 
gagattagac 
gatttggaac 
aagctacctg 
aaacgtaata 
ataaaaaatc 
caagaattta 
tcagctgctc 
agccctcctc 
attcctcccc 
aatgttgtgt 
tatccttgtt 
gaggccttga 
gtcatatcct 
gtcgtatctg 
agcaatgggc 
ctattgcacc 
cagagacatc 
cctgctgttt 
agacgaccgc 
aagatgtgtt 
agccaaagtc 
atctctaaca 
tcccattgga 
taaaaggcgt 
agaatccttg 
tcccccactt 
tgcacttgga 
aaagcgcgaa 
tcccccactt 
act tgtgtct 
acttggacgt 
gcgcgaagca 
cttggaggaa 
act taaaggt 
tggacgtaag 
cgaagcagaa 
agcagctccc 
taaaggtgca 
agacgatctc 
accagaacct 
gaaacaattt 
acctaaggtt 
ctgctatttt 
gacctcttga 
tgttagaatt 
cctggtgttg 
agcgctctcg 



ttaagatttt 
ccgctcctcc 
gcttcttttg 
agactaaact 
aaacttccca 
ctatcccctc 
gtaataaaga 
atctatctaa 
tggaagattt 
ttaatgattt 
cacaagaccc 
gtacattatc 
caatgctctt 
gtgtgttttc 
actatgtcat 
aattcacgcc 
taggggcaca 
gtctatatga 
gttcattaaa 
atgcaaataa 
caaagtcacc 
tttcaccgct 
ataaaagaat 
gatattttat 
ctgtgttggg 
atcctggtca 
ccagtcccac 
cacatgtgca 
gtgagagccg 
attttttggt 
accgtcaaga 
gctgtggtgg 
tgcgccttcc 
agagctgctc 
tttggcgaaa 
ccagtccatg 
gggcgtctac 
gaagcagctc 
gaggaagaac 
aaaggtgcac 
cgtaagaggc 
gcagctcccc 
aaaggtgcgc 
gctgaagaag 
aaaaggcgtg 
gctcccccac 
gaacttgtgt 
gctcttggac 
^ggcgcgaag 
tccttggagg 
ccacttaaag 
cttggacgta 
gacttctttt 
acaggttaaa 
gactttttag 
tattaattgg 
taacgctttg 
ataaattttg 
agaaggagat 
ctcctcttct 
aattttctcc 



aaataatagt 
ctctggaaat 
tagagggatc 
tgttttgtat 
taataataga 
tagagccgaa 
acgagctgtt 
caagtggtgg 
ttacaagaaa 
tgaaaaaact 
ttctacttct 
cgaatttcat 
gggatcgagg 
gggctttgaa 
ctctatggga 
tgtaagaggt 
tcagacatgg 
ctcttacaat 
actatcagat 
agatgatgtt 
ttcaccaacg 
ctcaccaaca 
gaccactact 
acaataaaaa 
cggaaattcc 
cgacaaggtt 
gcatgagaac 
taaattcctt 
gttcctggaa 
gtataaaaag 
tgacgtctcc 
ttacggctac 
cgccatgtga 
gaagggggaa 
tcgcaaatgg 
aagatatgcc 
cacgtgaagc 
ctccacttaa 
ttgtgtctgc 
ttggacgtga 
gcgaagcagc 
cacttaaagg 
ttggacgtaa 
aacgtgaaaa 
aagcagctcc 
ttaaaggtgc 
ctgttgaaga 
gtaagaggcg 
cagctccccc 
aagaacttgt 
gtgctcttgg 
agaggcgcga 
acgcacctgt 
atgtggctga 
tacttaatgt 
tacattgtta 
tttaacgtgc 
ttagcatcaa 
gtggaggttg 
tgattcacat 
tttacaccct 



aaagtagatt 
tatacatcta 
gaaccagtgt 
ggtaaagtcg 
atcagggtct 
agtgcaaaaa 
tcatattttg 
atcgaactta 
tgttctaaag 
tgtgataaat 
actccctcta 
tcaaaagatg 
acaataatgt 
aataagaaca 
ggtgctgcag 
gctgtctctg 
aaggatgaac 
tcaaaaagga 
tttaattgga 
aacaagaaga 
agcagcagca 
gtgaagaata 
gctgtttgac 
tatataagaa 
tccgcgccca 
aacaacaaaa 
ctgttattgc 
cacgctaaaa 
aaccccacct 
gcacggagag 
tactagtgct 
tgtgatttct 
ctacttcccg 
caacaacaat 
actgattggg 
agatttcttc 
agctcctcca 
aggtgcgctt 
tgaagaagaa 
aaagcgcgaa 
tcccccactt 
tgcacttgga 
gaggcgcgaa 
gcgcgaagca 
tccacttaaa 
acttggacgt 
agaacgtgaa 
cgaagcagct 
acttaaaggt 
gtctgctgaa 
acgtaagagg 
agcagcagca 
tgctttgcct 
acatcacata 
ttattaattg 
acacttgctc 
tttttacaat 
tatttatttt 
ttgatgtagt 
tgtcctcttc 
caccacaatc 



cattgaaaaa 
agttttgtga 
ctacttcttt 
ttgacgttat 
tttttaactc 
atgcattcaa 
agcaaaacaa 
atgacttgaa 
taaatgatgc 
acaaaactgc 
aaaaggagaa 
aagctacagt 
ctggtgttag 
ctaataataa 
tgacaaagat 
tcactacagc 
aaacactaaa 
ataataggga 
gaacacccaa 
ggtacgcaga 
gcagcaacag 
gtaataataa 
caatttatgg 
aacaaccaaa 
ttatgatgaa 
aaaattacct 
accacgattc 
tgaacgacaa 
ccgttccacg 
gtagctaaaa 
ctccgcgttg 
gaagctgggg 
gaacctgaat 
aacaacggaa 
cgtaagaggc 
ccctaccctc 
cttaaaggtg 
ggacgtaaga 
cgtgaaaagc 
gcagctcccc 
aaaggtgcgc 
cgtaagaggc 
gcagaatcct 
gctcccccac 
ggtgcgcttg 
aagaggcgcg 
aagcgcgaag 
cccccactta 
gctcttggac 
gaagaacg tg 
cgcgaagcag 
gcagctatgc 
ctacatggag 
tacaagttaa 
aaacaattgg 
tttgctcata 
tgatattggt 
att t ttatcc 
agatgctgtg 
aacatactcc 
tcctatcgaa 



56880 
56940 
57000 
57060 
57120 
57180 
57240 
57300 
57360 
57420 
57480 
57540 
57600 
57660 
57720 
57780 
57840 
57900 
57960 
58020 
58080 
58140 
58200 
58260 
58320 
58380 
58440 
58500 
58560 
58620 
58680 
58740 
58800 
58860 
58920 
58980 
59040 
59100 
59160 
59220 
59280 
59340 
59400 
59460 
59520 
59580 
59640 
59700 
59760 
59820 
59880 
59940 
60000 
60060 
60120 
60180 
60240 
60300 
60360 
60420 
60480 
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gactgtgcta 
ttgttacttc 
tctttgtcct 
ggcaatgtac 
ccctcttgcc 
ttctgagcca 
aggggcaaaa 
tggcataggt 
tgtcactaga 
ttcttccccc 
tcctccttct 
tagtctcatt 
tgtaatatca 
attagtatca 
tgcaccagcc 
atacttttca 
ccggcaattc 
agtttcaatt 
ctgtcctcct 
gagactactc 
ttcagtgatt 
gtttaattct 
agaaaagtga 
cgtgctaagc 
tcctacaaca 
gtaatcttcc 
tctcctcgca 
atttaaaaga 
taaagaggct 
tttcttccac 
aaggcgtctt 
tggaagaaga 
ttcctttata 
ttgttctaga 
ggatgtggta 
tgccatttgg 
agaggatatt 
tggtgaccaa 
atcaaccatt 
gattccttta 
atttgttaac 
ggaacgtctt 
taaaccaacg 
gttcggcagc 
aggcatggac 
attatactga 
ttttaaagag 
tagaggatga 
gagccaatat 
tatctgataa 
ccaagaataa 
tgaaacaccc 
gtatcggagc 
tcaccatgtc 
acgttctcag 
cactcttgtt 
ccactattgt 
ttaacaacaa 
caaggactgt 
tgtataatag 
atttcatcat 



atataatttt 
tgcccatagc 
tttctataca 
acctatagaa 
ccatagtttc 
ggtgttctgc 
ggcccccca t 
gacaataatg 
gattctagat 
gacgtggacg 
ttattatttt 
ttcttattgt 
gatgag tagt 
gaattgccat 
gtgaaagtat 
tcgatagttt 
gcacatgcat 
aaagctgata 
ttctctagtt 
ttgagaaaag 
gggtcgaacc 
tccgtgtcag 
atatttttaa 
attacatcta 
ttttctggtg 
tcagcaacag 
gtctcattat 
tctagg tagc 
tgtctgaaga 
ggtctgctct 
tgtgcggatg 
ggaacaggag 
ggaactgtct 
ctacaaaaaa 
tatccaagt t 
aataaacaac 
ggttataagg 
caagaacaaa 
gaactgatag 
agagaaactg 
cacaaggact 
attgattttt 
gacaag taga 
taagcaagca 
gacttcagca 
tgatgatatt 
atatgcaaaa 
tgtcattata 
eta tttcgcc 
agtgctgtgt 
gtggacgtat 
cgactacaag 
atcggaaaaa 
tatatacagc 
taataggagg 
cttaatcaga 
aaagaagact 
aaagat t tec 
tagttctctc 
gacagattat 
tagt taaca t 



tgtagtagaa 
tggtagtatt 
aaaggataat 
cagtgttatt 
gtatctccat 
ccataatttc 
cttccagtgc 
aagggagtgg 
ccgggggagg 
gttgctgttg 
ctgcttctag 
tattttttat 
atttgcctgg 
gtgcattatc 
ttgccagaag 
tattaaataa 
tattattctt 
tt tggcccat 
gaattgatcc 
agtgtaaaag 
cgaatttgat 
aggaagggaa 
gtacatttcc 
atgataagaa 
tcaagagatc 
tttggttgtt 
ctgcatcatt 
tactcatctg 
cgcgagtgat 
ccaaggccct 
attctagaaa 
agagcaatac 
catctattgg 
cgtcagacag 
tgaagaaaac 
ttccaaagtt 
attatgttgt 
tgatta ttga 
gcatgtataa 
cgttgcacgc 
ctatacctct 
ctgagagaca 
gaggatcttt 
ctggctattg 
gcagcagcag 
acacaagtgt 
c ttatagata 
cacactagag 
ataattgacg 
ggttctctca 
ggatgtaacc 
gaagagacta 
ctgaacatta 
ccagaaatgt 
gctgtaatat 
c tagatatag 
agtcccattt 
cacattgaag 
cagagagatc 
atgaaaaaat 
ctttttaaat 



gtagagaata 
tttgaaggca 
cgttcatccc 
tcttctatat 
tttactattc 
tgatgtttta 
actattccca 
tgaaaacaca 
aggtgtaggc 
ttctactcct 
ttccttcatt 
gcgttcttta 
acaaataagg 
ttttgttata 
agccgaggct 
tgtaactaaa 
ctttttatta 
gattttaagg 
ccattcatta 
tgtttgagta 
ctgatttctt 
agaaattata 
gtctgttgaa 
aggggtaact 
gtctgtttct 
gttatagaaa 
agctcctcct 
ggagggaagg 
cacaacttca 
tctcccagat 
taagggagag 
tgcgtgcctt 
tacaattatc 
ttatgcggca 
aactgaagac 
gtatgaaaaa 
ttcaattgaa 
tgaagaatct 
caataacaag 
acaatctctg 
attttatgaa 
actacaacaa 
tcaacacgta 
aaaaacacgc 
cttcttctaa 
ctattgcaaa 
atttagcaat 
atgcctccaa 
gtgatttatg 
accgggaaaa 
taaactttga 
catctacaaa 
cccactattt 
cagccatgat 
tgatcttttt 
atatcaaata 
ttaaggaacc 
atggaggaga 
ctcgtgtact 
aaaagtacca 
ccacaatcag 



atggtacaac 
tgccttaaaa 
catgaaaaag 
cttccagtat 
ctgtctctct 
tcaacaacaa 
ttagaaggaa 
ggataatcat 
aacataattt 
tctacctcta 
accatttcca 
gcttgctttg 
ttatcactca 
aactctttgg 
gatgaagagt 
ctggccgaac 
tccattgagg 
ctcgcacaca 
aatttttctt 
gttatacaat 
acactatcca 
tcattagctg 
gaagggtgct 
ctttctcctg 
gtcactccca 
gacccataag 
cctcctcctc 
ggagggttaa 
acgatggacg 
gaaaaacatc 
gatggctgct 
gaacaactaa 
aattctaatc 
ttatcccatt 
gtattgcctc 
actcttcaac 
gacgacgaca 
tataaaacta 
tttggtaatg 
aggtacgaca 
aacagcacgt 
ctaaaacaag 
tacagggaat 
agcagaaaga 
ttttaataat 
cagtgttttg 
atcttcttta 
ctctacagtc 
tgtataccct 
ggcactgttc 
tatcgttgac 
acatatacgt 
aaactacttt 
gaatgagaat 
cataggtgtt 
tgggaatgtg 
tatccagaca 
agaaagtgat 
tttaaccagt 
aataccatat 
agctaaaaca 



gttattgatg 
aagggatata 
tgttagatgg 
ttataccaga 
ggtgaaccca 
tatcagctga 
gtggactagc 
ccacaccctc 
ccatctgcat 
cttctactct 
cgcctgttgc 
ttacttcata 
tggctgtata 
gtggtttatt 
tttgcttaac 
acttactatc 
cctcctttgc 
gtgccatttc 
gtagagtcag 
ttttcaacgt 
gtaaatcctt 
gaattgattc 
ttagatcaca 
tatcgagatt 
ccaatctctt 
tctgaaaaat 
ctcctgccac 
gggttgtata 
tcgagttcgg 
aacccgttat 
gctcgttctg 
tagacgtttg 
tttcaaccag 
ctagttttct 
attctttacg 
ccatagaaga 
atgttgatga 
ttggagaaaa 
aatttataag 
ctgaagctaa 
gcacatgtaa 
atggaatgga 
gcagtaatac 
agaggagaaa 
gtacaacaag 
aataacccct 
cctcctgata 
agagtagatg 
aaacaatata 
tataatagct 
gctgccatca 
aaaatattgg 
atccaataaa 
gaagaaaaaa 
gtcattttat 
ggcgagagag 
gccatcaact 
gttatagctt 
tcatttggat 
caattgtttt 
tagtgcagtt 



60540 
60600 
60660 
60720 
60780 
60840 
60900 
60960 
61020 
61080 
61140 
61200 
61260 
61320 
61380 
61440 
61500 
61560 
61620 
61680 
61740 
61800 
61860 
61920 
61980 
62040 
62100 
62160 
62220 
62280 
62340 
62400 
62460 
62520 
62580 
62640 
62700 
62760 
62820 
62880 
62940 
63000 
63060 
63120 
63180 
63240 
63300 
63360 
63420 
63480 
63540 
63600 
63660 
63720 
63780 
63840 
63900 
63960 
64020 
64080 
64140 
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atacaataca 
tacttttcaa 
tttctccagt 
gggtttatac 
aacacatctc 
cctcctattt 
tggacatcat 
ttgatatcta 
gattctggaa 
tctgggacaa 
gtttgtggat 
ccttgctcgt 
tccagtagta 
aaataatcca 
tctgtggtgc 
caggacgagg 
gctggaggta 
cgagcactgg 
ttccctcttt 
tgtctttaaa 
tgattctggg 
cagacataaa 
cactatgtat 
tgttcatata 
atctttcatc 
cttctctctt 
tgtacgttat 
tttcttcatc 
gttcatcctc 
ttacagcttc 
agtcgtcagt 
cctcctcctc 
tgctgttatt 
acgatactat 
aattattacc 
caccttcagt 
ttttagtaac 
tttgagggtc 
cagattctac 
tactgatatc 
tcaaatatga 
tcaaactaac 
ggtctcgtac 
gtactagttt 
taccaacact 
tctcccatcc 
cccatgtgtc 
gcataatcaa 
tcgtatttgg 
cacccgaaga 
tttcagagtc 
tcaggttcca 
gattttcatc 
cttttctgtt 
gggagaagtg 
catcctctgc 
ttagaacagg 
gcatcatcat 
tttcattcct 
tcgtcatcat 
taaggttttt 



ataatacatt 
tcaaagccga 
acatgggacg 
aacttcttga 
ttataataga 
tagatttatt 
ttttacctaa 
tataatgtac 
cagggatctt 
cacccatgga 
taaaagtctt 
caccaattac 
cttcctttcc 
cactctcctc 
acatattatt 
agaattggga 
gaatgaagac 
catgaattgc 
attgagagtt 
atcgtccaat 
agtctttttt 
attagttcta 
tatgggcatt 
cacgataact 
gtcgatgcct 
attggcgtcc 
attgtcaaga 
catcacgggt 
ttctatatta 
taaaaagtca 
ttctagatct 
tacatcatcc 
ttgttcagtt 
tcctcctcct 
aaaaacatca 
gtggctctca 
aggtgcactt 
attaccactt 
atccgaccct 
tgataaggtt 
catgacattt 
gtcagtttcc 
aaaatctata 
cttatcacat 
atcatcacag 
gcccacacaa 
gtactcacag 
tagtgtactt 
ctttgttctt 
attactgttc 
cttaaattgt 
tttcaccagc 
gttggtgtgt 
accagacata 
tttcccagga 
ttcggtgaat 
gatcctcatt 
ccgctcgcac 
tgattttgcc 
cttga tggct 
agacttgagg 



tctccctcca 
gtcttgttcc 
aaacactaac 
tgtctttaca 
aacagactca 
aaagggtgca 
tacacttaaa 
atcttcaaat 
ctttgcagtc 
tagggcgata 
cttctttcca 
acgtattgtg 
ttcaacaaga 
taaagacaag 
attcaaacct 
ggagttatgt 
cacagcgtga 
ccctgagagg 
gccaaaagtt 
acttttggaa 
agttcaggca 
tctgtgcata 
aattgctggt 
aaaaggttca 
tgaactggtt 
attaaaactc 
gaaatgcatt 
tgtttattgt 
tttttagcga 
gaatctttat 
gcctcctctt 
atttcattga 
tttaagattg 
cctccatttt 
atcctttctt 
aaaaatccac 
tttctggttg 
ttacgtttat 
gtatttttac 
gttattgtac 
gccgctgcat 
aaatcaatca 
gcttgagaca 
tttctacaat 
tctttattat 
gtacagtacg 
ttactaacaa 
tcaggtgaaa 
cttt tagctg 
atgcacctaa 
gtaccagtaa 
acatcagaat 
tgct tcaaaa 
ttataagaat 
gtctccagat 
gatgcattag 
atgtccagta 
gagtagaata 
ttttccttac 
ggagcctttc 
agatttttca 



atgatacaat 
ttggataatt 
ctcatactag 
tcctcagtag 
gaatcatact 
aagaaagggg 
cctatctgtt 
ttttggatat 
gtcatcattc 
gcattggcca 
aaacgaagaa 
tcccttttgt 
ggtattccaa 
gaagaggatg 
tgagaagagg 
tagttagtga 
ggggcgacca 
gtagccattt 
tattatagtt 
tcctctccca 
tatactgcaa 
ttttatcctc 
acacaatagt 
tggtttttac 
catagagcac 
cacttagacg 
tgtcgaattc 
taagttttga 
gcgatgacgt 
actcgaacgc 
cctctttttc 
attttttaaa 
acgtctgctc 
cttctcccaa 
tagtcaatat 
gaattgattc 
cattaacctt 
tattagaaaa 
tctcattatt 
caaaaacacg 
ttacagccag 
agtgtgttgg 
gatagaaaaa 
tactt ttaga 
tcttcctagc 
tgttttcaaa 
ttgcactatt 
gcatgagcgg 
caaaaccacc 
agaacttgac 
gacgacacac 
cgtgggagaa 
cattcaataa 
cgtactccaa 
tcaaaaggca 
aaaggtacat 
atgcagatgc 
gtagatcgtt 
ttctttcttg 
catccgtaaa 
tcactcggat 



gttgatgttc 
tatccatgac 
gattcgtatc 
gcatttcacc 
ttaaatcttc 
tttcttcctt 
ctcttacaaa 
tgcgattaac 
tggaacacgt 
gtttcactag 
agaagacgct 
ttaatatttt 
gttcttcaaa 
tgcatttggg 
acatttttct 
tacgtcttgc 
tggcttagat 
agtgttttca 
gtctgtgcat 
gttcttgcgt 
cattgtgtat 
ttctgtttta 
attatcctct 
actataagga 
cacgttacta 
tgataataaa 
gttaacaatt 
tgatacagaa 
catttcgttc 
tttccttgcc 
accttcctcc 
gaaatctaaa 
agtagataga 
tgcttctgga 
tcccctatca 
gaaaactgac 
acttttctta 
tgggataaga 
attagcaccg 
ttgattaaag 
ggctctatga 
aggaaaacac 
tttgattgtc 
caaattcccc 
ttcttgttca 
tactgcaact 
gtagtctgat 
cgattctgcc 
caactcgttt 
atggccaatc 
gtttttatga 
aagattcata 
atcttcccac 
tatcttcctc 
tgatgttaaa 
gcatttgctc 
catcagacct 
ggtcttccta 
tctgatctgt 
gagtacagta 
catcctaaaa 



tcctcccata 
cagcttagaa 
attgtgggaa 
caataaatta 
tatgcattca 
cttcttcttt 
ggactgaaaa 
atttttgtat 
gcaggcaggt 
ccacattttt 
acatacgtct 
gtaggcaaat 
atccttttct 
cagcaagatt 
ctttactgtt 
cgctgctgat 
atggccacgc 
ctcggtattt 
gtcgatacct 
tttatttgct 
tctagtgcag 
ccacaaaatt 
agaccgcaca 
acaccagaga 
ttgttcccgt 
ctattgtcat 
tctgcccact 
catcttgaat 
gtggctgcag 
tcatctaacc 
attttttcct 
aagtcactcg 
cctggtattg 
ttcgtatcta 
ttaattatac 
gatgataatt 
cgggtagatt 
tcttcagtat 
gaaagatgta 
gaccccattg 
agtgcgtgtt 
aactgaagcg 
cagaaaggaa 
atagaaatat 
ctcaaaaaaa 
tctttcctta 
cctttagtta 
aaattatata 
gcttgatttt 
tttaagaacg 
tgtccaacca 
gaaagtgttt 
ctcttggatt 
ttttctagta 
cgtacgatat 
cccgtagtct 
tgaggaagac 
ttgaggcaat 
cgtttggctc 
tacctccctt 
gcgtagtaca 



64200 
64260 
64320 
64380 
64440 
64500 
64560 
64620 
64680 
64740 
64800 
64860 
64920 
64980 
65040 
65100 
65160 
65220 
65280 
65340 
65400 
65460 
65520 
65580 
65640 
65700 
65760 
65820 
65880 
65940 
66000 
66060 
66120 
66180 
66240 
66300 
66360 
66420 
66480 
66540 
66600 
66660 
66720 
66780 
66840 
66900 
66960 
67020 
67080 
67140 
67200 
67260 
67320 
67380 
67440 
67500 
67560 
67620 
67680 
67740 
67800 
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agatgacccc 
ctgtcgggga 
aaaacttcct 
agtcgtccat 
cagttagcca 
tagccttaat 
catccctgca 
tatcttcttc 
tagtattgat 
caatttgcct 
cgagtgtgta 
ggaagttctt 
aatcttcttc 
cagttttctc 
tggcttctgg 
atgctcactg 
tcctagaaga 
gcagctctct 
gggtactttc 
agtttctaga 
acacctcgtt 
ccacgtaatc 
tacgaatctg 
cactttactg 
aaaaaataac 
gctacccaga 
ccatattatg 
aggggtaaat 
cctaaagaaa 
gagtttgaac 
aatttggcag 
acaatgacaa 
cagcagacat 
gctgcagaaa 
gaagaagatg 
gtatcacaac 
cccgctcctg 
ccattaccac 
gatttcttag 
aatattcctc 
gcttcaattg 
gaaagagaag 
caacgggaag 
gctgctgctg 
gaacaaagag 
aaaagggaaa 
gcccaacaac 
gaactagcag 
cgtcaagaaa 
gaaa tcacgt 
cggtccgcag 
caggcagtcg 
tacgatctca 
gtgaacaaaa 
gccat tgaaa 
agaataataa 
ctcggggtct 
aaagaatcaa 
gatctcaata 
ttatacacta 
gaaataagta 



ctccatctgg 
agaattgaat 
cttcttcata 
tttgtgatta 
atcaacttcc 
tttctcgtgt 
gtaaaaattt 
cctgataaca 
aaacaggatc 
gcccaacatg 
gtcacaggta 
ggagttgatg 
tcctgaacaa 
cttgatgact 
gacacccatt 
atgggcgccg 
acttcgtgga 
acagagactt 
ttcctcttcc 
tcgaagaaca 
gaacacttct 
aaattcggat 
gaacacctgt 
atgttttata 
gttatacacc 
aaaggagacc 
ccaactttgg 
ttgtattaga 
aggaggaaag 
agaaagtgct 
agaaaggatt 
aaaaaataat 
ctgacaaagg 
gagaacaacc 
acgaagatat 
ccgttcaaca 
atattcccat 
ttgatgatta 
tggcaccacc 
ctccacctcc 
cagccttgga 
tagaggaaca 
cagacgaaaa 
ccgctgctgc 
cggccgctgc 
gagaagtaga 
ttcaagaaat 
ataaggagga 
tattagctaa 
ccaaaagagc 
ccagtgcaca 
aacctcaagg 
ataagaaatt 
caaacgaaca 
aattaactag 
cagagaggga 
attttacatt 
gggatctgga 
caatactgga 
aatttacaga 
acagatcaag 



ctagcaaaaa 

gaagtagtag 

acatctgtgt 

gaaactggag 

tcttgggatg 

ttatattttc 

tcataaatat 

ttaaaaccgt 

atctgaccca 

accagatctg 

ggtgaagaag 

ttggcaagag 

taatcctcgt 

ctcccattct 

tctgttgatg 

taatttttac 

gctgctttaa 

ctcggagact 

tgttgttgtt 

gtgtagtctt 

agggccaatt 

tcactcaata 

ctcctccttg 

ccctctcccg 

aaaaatgagc 

tggagatgaa 

cgatgacgcc 

gccacctaaa 

ggaacaacgt 

acaagatcga 

gcaagaacgg 

caggtttcgt 

tgcagcaacc 

agtagaaatt 

gggagaggat 

aatcgttgtc 

acaagaagat 

tgaggacgaa 

aacagcagca 

tcctgttgtt 

aaaggaggcc 

aagagcggcc 

aagggaaaga 

ccaacgggaa 

tgctgctgct 

ggaacaaaga " 

gaaagaacaa 

agaaaaacgt 

agaagagcaa 

actcgaacaa 

gttagcgatc 

acagttagtt 

agaagtacag 

gaaccaattg 

tcaaaaacat 

tttttcttta 

aaaaagggac 

ggtgcgtcta 

aaggttggat 

ggcagacacg 

atctgcctta 



atgcatcttt 
gcacgttcct 
gtttgttgat 
gagccatcat 
aagggattat 
tagaaaagaa 
gtttcttgtt 
catgtaaaac 
tatttttctg 
a'tacattata 
aggaagattt 
aagtggcttt 
cgtcttcgtc 
gtgcaccgta 
ttactcccac 
acactctgcg 
aggttgtagt 
tgccagaagc 
tgtagagtgt 
cttcctcttt 
cccacacacc 
actgtaagga 
actgttaact 
acggaagatg 
gatacagggc 
gaagaggaag 
acgtactcca 
gaaagaagtg 
tctaa tgttc 
tcgcgagaac 
caaaagaaat 
gaaggaggaa 
aatgttcttg 
acaggagata 
gaattaacta 
agttctccta 
atagtgggga 
gaagacgaac 
gcagcagct t 
gctgttgcag 
gaggaacaaa 
gctgctgctg 
gaagcagagg 
gcagacgaaa 
gccgctgctg 
gcggccgctg 
atgcgcataa 
cgagaactag 
cttgaaaaat 
atgttagaag 
caagcaa tag 
cctatggata 
aataatacat 
gttgctcaat 
cttcctgtgg 
aagaatctgg 
ctttttaaat 
acagatgtat 
gtatccgtta 
gcattagcag 
ttgcca t ttt 



gaatctggat 
gaacaagaaa 
ccagtaatcg 
catgtcttcg 
aataggcact 
ttcagaaata 
ctgcatccgt 
gtctattatc 
tcccttcaac 
agtgggcaaa 
cttgcctctt 
gagaagtgag 
atcgtaacaa 
cgcatt ttcg 
gggatgttag 
tagtctggtg 
gcttggaatc 
gagtttcctc 
tatagattca 
cacagtctca 
ttgttcctta 
ggaagaatct 
gatgtctgac 
tatacgtcac 
agatggaaga 
aaactggtag 
tgtacactgg 
tacaaagggt 
ggacacggag 
ggtcagaaaa 
atactccaaa 
gaaaattcaa 
aaagggaaga 
ctatattagg 
tacaacattc 
taccgccaaa 
aaaatattag 
atttgtacga 
ccacaagacc 
acgaaacctt 
gagcggccgc 
ctgctgctgc 
aacaaagagc 
aaaaggaaag 
cccaacggga 
ctgctgaaag 
aggaagagga 
cagccaagga 
tgaat ttcca 
aagagaaggc 
aatatgagga 
cgga t ttgta 
taacttctgc 
cccttgaaaa 
atgatcctgc 
gaaatgttta 
cgaaggcatt 
cgacagatct 
acatacgctc 
atcaagttcc 
catc tgcagg 



ttcaaaactg 
ctgttagtca 
agaccctcat 
attggaggtt 
ctttcaattc 
accttctttt 
gaatgcatag 
tgttgatttg 
atggtggttg 
tcaaccatgt 
ccaaaatcca 
ctagttgtgc 
tcttcatatc 
tacacttttt 
acgataatga 
tttttacccc 
tgcagcaaac 
ttcttaatag 
gtagggaagt 
atagaattgg 
agtgtctcga 
ccatccatga 
cccttgtaac 
aatttttttt 
aaataggcct 
tagtaatgtt 
agaaggaaaa 
gcaaaaacca 
acctggtcaa 
acttgggcaa 
ggtagcacaa 
ggcgccgcaa 
aattgagatg 
tgggctagga 
atctatggct 
gcccactagg 
ccagttacca 
agaagtgaat 
tcccaggcct 
gaagaacttg 
agttgaaagg 
tgctgccgcc 
ggccgctgct 
agaagtagag 
agcagacgaa 
ggaaatactt 
gaggcggaaa 
ggaagaaaag 
gttgggtaca 
ctcacgctca 
tgaacttcct 
cggaaaaatg 
atttgaagac 
atccgctaaa 
ttttatgcag 
caaaagagtt 
aattacagat 
cagggctaat 
tggtggaaca 
ttcgaggatt 
tttggatact 



67860 
67920 
67980 
68040 
68100 
68160 
68220 
68280 
68340 
68400 
68460 
68520 
68580 
68640 
68700 
68760 
68820 
68880 
68940 
69000 
69060 
69120 
69180 
69240 
69300 
69360 
69420 
69480 
69540 
69600 
69660 
69720 
69780 
69840 
69900 
69960 
70020 
70080 
70140 
70200 
70260 
70320 
70380 
70440 
70500 
70560 
70620 
70680 
70740 
70800 
70860 
70920 
70980 
71040 
71100 
71160 
71220 
71280 
71340 
71400 
71460 
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aattttacta 
gaggctatga 
gtcaccaatc 
tctgatgtgt 
atatcccctt 
atttcctctc 
atcaatacaa 
tctccagact 
tctcaattta 
aaaaccactg 
cataaaaatg 
atggaagaaa 
gattctagcg 
cagcaacaac 
aatcaacaac 
caacagcagc 
cagcagcaaa 
agtgaccagt 
caacaaggaa 
ggggctgaaa 
gctgacttga 
gcaaaattgg 
ttgagagagg 
aatagtatta 
aatttaatgg 
gtgaagaata 
caacaacaac 
ccttctacta 
ctagaaaagg 
aaatccgctt 
gggaaaacat 
gaccaacaaa 
gaagaaaact 
aagaagaacg 
gtgcagtctt 
gaagatactg 
gaaca tgagc 
cacatcaagg 
gcgt tgaaca 
tcaagaacag 
gctcaaggac 
tgtagtaacc 
aaat tagcgg 
tgtcatctca 
tatacgttaa 
acaactaaag 
ttttcagtat 
aagggtttca 
tctctctcca 
gttattgaaa 
aaggaaactg 
tacagtaagg 
gttaaggctg 
tctgatattt 
attgtggtgt 
tttagacaga 
caaaaacatc 
acaaaggcag 
gatcttttat 
ggatgggccc 
gaagtaaacg 



atagttccga 
atattttgaa 
tattaacagt 
cccgaatgga 
attttaaagg 
agataaaatc 
cctcaagaat 
ttttaaatgc 
cttatgatgt 
aaaaggttac 
ctttgagttt 
gtggttcatt 
ctgaacttag 
aacagcaaca 
aattattaca 
aaagggacca 
gggaacaaca 
ttcgacaaca 
gaagaagagg 
aggatgattg 
ctaccttatt 
gaacccttgt 
aatattccac 
tgcgtatttt 
gaaaatactt 
tattgtctgg 
aacaacaagc 
ctcctcctgt 
ctgccatagt 
cagctccttc 
ctatccgtat 
aatggatcga 
ttgtagaaac 
atccgacccg 
tcccatatta 
tacatcctga 
gtgaagatac 
actactattc 
ctctattgta 
cctttattta 
ctgttcggtc 
ttcttcgatc 
ccgctgctgc 
tgtacaattt 
atgtattgag 
ggcatacatc 
tgtacaagat 
atgcatggga 
tctcagatgc 
aacatgaaaa 
cactcacact 
ataccgatat 
ttctcagatt 
tcaagctacc 
tcactttgaa 
aagacggggg 
aacaaactgc 
cacacgtgtt 
ggggtgcttc 
aacaatacaa 
cacctgcctc 



caagtacaat 
agaaaatatt 
ttcaagctct 
ttctgaaata 
cgattggacg 
aaacgataaa 
acgtagtaat 
tgttaacgac 
cctttcaggc 
agaattgtgc 
aaatttgcct 
ggctgttgac 
tcgcacactg 
attccaacag 
acaacaaata 
acaacaacag 
acaacaaagg 
attattgcaa 
gggtgacgat 
tgtgcgtaag 
ccaacgagaa 
ctttgccacc 
attcacaacc 
ccccgagagg 
gtacattata 
tattttcaat 
tgcttcagct 
tacaagcatg 
agaggcaatt 
cacggcggct 
ggaaaaagtg 
cgaaagtact 
tgcacataat 
tgatgccaat 
cgtactccgc 
atatttccgc 
attaaaggca 
tcccagtgtc 
caacatattt 
ccgtaaattt 
tcaactgtgt 
gtctccctta 
tgtagacacg 
tatagtggct 
agcgtcgggt 
ttcttcccac 
tcttcaacta 
atcgtgtgtt 
cgaccaatcc 
cgatgcagaa 
catggcaaag 
ggaaaagttg 
aacaaataaa 
tgtgattcca 
taatgtaatt 
agtttcttca 
ttccatttta 
tatttcagga 
aatgaaattc 
taacgaaagc 
tggactctta 



gaaatagtga 
gtcccaacat 
cgtcaatatg 
agaaaattcc 
gatgaaagac 
attaaagaga 
cccctcctgc 
tttagaaatt 
caaaatattg 
ctcgaattat 
gcaattactt 
attagacaag 
tcagaggcgt 
cagctgttac 
gaagaacaac 
gaacaacagc 
gaacaacaac 
caacagcaac 
ggtgatgaag 
gttgcagaat 
gaaaataact 
cctccttcac 
cagtgttttt 
attgtagaag 
actaccgcac 
caaattgaag 
tcttctacta 
caagtttgtg 
actctggcca 
gagcgagaaa 
gatctaagtt 
tccaaacaag 
gaaatggata 
cttaggctcg 
acatggctag 
caatacattg 
ctgggtgttt 
aaaaatgatg 
gctatcgacg 
ttaaggcaat 
gaagcgacaa 
gccgataaag 
tcaaccggag 
tatgtgaacc 
ctggcaaaca 
cggtttggat 
caaaaacaga 
gccgccatgg 
atactcttcc 
aaaaatgtcg 
gaactaaatt 
gcgagagtgt 
gccgagagtt 
attgatgata 
aaaccatgga 
gcctatttct 
gatgcatggg 
tatgaaaatc 
cctgccgacg 
gtattggaag 
ataccgccag 



accaactaag 
tgaaccaaat 
ctattgaaga 
tcgctataat 
aacgctctat 
gtgttgctac 
acaaatcctc 
ttctcgatat 
atgacctttc 
ccataatttt 
accccgcagg 
aaattggtaa 
tacaaatttt 
aacaacaaca 
aacgggttca 
aaagggaaca 
agcagcaaca 
aatttcagca 
aaagagagga 
cagtagcgac 
tccaatctaa 
ctatca tgaa 
caaaactaac 
tatgcaagag 
aaacagaaat 
agttttcaaa 
atcctcctcc 
agttggatga 
atgctgtact 
ttgctctaaa 
caggagctac 
aattggaaga 
ttggattaat 
tgaaacctca 
gagaaacaga 
atcgcaattg 
ctttatcaga 
catcaaaa tc 
gaggaatgat 
ctatgacaga 
tagcgtctct 
tagaaccacg 
acatgtttcg 
tatgcaataa 
aaaaagtcgt 
cttatgatgt 
atatctcttt 
cagcct tcac 
cactagaagg 
atatggttca 
catactacaa 
gcaggatgat 
tagtagacac 
ccaaaac tct 
tggtttcg tt 
cgtttcaaaa 
cttgcgcgcc 
atatcaaatt 
gacgaggcac 
a ttttaccga 
atcctt tact 



cagtataaat 
caaaattgat 
aagggtgtat 
gaacagtaaa 
tgctgacagt 
actacacgat 
agttttatca 
ccaaggaggt 
actggcatca 
agacgtgatc 
ggaaacatct 
gaacatatca 
tcagcaacaa 
ggaccaacaa 
ggaacagcag 
acagcagcaa 
gcagcaacag 
attactacaa 
aagagaagaa 
aaaatatact 
aatagcatca 
cttgacaaaa 
agctgaaaat 
taagaatctc 
ggaagatcga 
caatgtaaaa 
tcct tctact 
tcaacgtacc 
tcaaactaca 
gctagagaat 
tggtgtttcc 
tttcattgca 
tttggatgcc 
tggaataaat 
tatattggat 
gaaggtggaa 
tacgttggca 
agtaccattt 
t tctagcctt 
taaagaggtt 
tttcacggca 
ccttcaagaa 
tatacgagtt 
tcgtataaac 
ggccggtaaa 
caca tatgat 
acttttagag 
tgccgaccct 
gggagagatt 
agaattgtgg 
ctggttcata 
tatcggtata 
taatgctctt 
ggcaataaat 
caagcaaatg 
ta tccaacaa 
cggtaagtta 
aaaaaaagat 
tgtggtcgaa 
cttttcaatt 
ttcttccatg 



71520 
71580 
71640 
71700 
71760 
71820 
71880 
71940 
72000 
72060 
72120 
72180 
72240 
72300 
72360 
72420 
72480 
72540 
72600 
72660 
72720 
72780 
72840 
72900 
72960 
73020 
73080 
73140 
73200 
73260 
73320 
73380 
73440 
73500 
73560 
73620 
73680 
73740 
73800 
73860 
73920 
73980 
74040 
74100 
74160 
74220 
74280 
74340 
74400 
74460 
74520 
74580 
74640 
74700 
74760 
74820 
74880 
74940 
75000 
75060 
75120 
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ttcggtaaag 
aagggaggat 
acgtcgtcgg 
ccttatagtg 
gaggatagtc 
atagggcaac 
cactacagtt 
ggacaatctt 
tctctggagc 
aagagcgaaa 
aataaagcta 
aagaggtatc 
cttatgaata 
aataaaattg 
tttggcaata 
ggagtaagag 
actggtggca 
gttgggaata 
aattaggaat 
ataaaggtta 
cccttccaaa 
aattgcacgg 
gagtgatgtg 
aaatgtgttc 
atacgagaat 
tagctggtgc 
ttcgcaaata 
cgtgtttctc 
agagagcaaa 
ttctattgtt 
tgtacatttg 
tttggtggcc 
ctgggtccac 
aacgaatttc 
atctcagaat 
attccccatc 
tttctctatg 
tgggcgtaga 
gccggatatt 
ctccattttc 
ttgtgatggt 
gggggaggga 
cagaaatggc 
gacattggcc 
caccccctaa 
taaaattctc 
agaaacgcct 
ctggtgtaat 
gagctagcgg 
gatagaggaa 
tcgaccagtc 
aaccaacaat 
tgacattcgg 
aaaagttttt 
cacatccttt 
aatgtacgtt 
gtccaccctc 
ccctaaaaaa 
cagaaggtta 
atacccataa 
ggcccgaccc 



gaaa tggtgg 
taatt t taaa 
atactaaaaa 
taattaaggc 
cagaagattt 
gcatgactta 
cacctaaaat 
gggctccttc 
ttttatatag 
gggaagaaga 
tcggaactat 
agcaacactc 
gtaaagatgc 
cagcagcagc 
gattggatga 
gaggacaaac 
atagtagttg 
atttagttgt 
aaggaattat 
tgatcataca 
aatgaa tttt 
agcgaatctg 
aggattcagt 
ca ttattcgt 
gttgttgttc 
aaaacaactc 
gaaaacagaa 
agataataat 
gaacattttc 
gtgtttgatt 
cttgactgcg 
attatctggg 
aaacttgaaa 
gtctgcgcct 
cccattcaag 
tacataaagt 
agaaaatatt 
ggccaagagt 
gtctaccatg 
tggagagggt 
t tgaatgagg 
gggggagagg 
cataagatac 
cgacccagag 
actggagcgc 
tagtgaaaac 
gttccagaaa 
ctgaca ttgg 
tccaccccct 
gagtaaaatt 
tccagaaacg 
ttctggtgta 
tcgagctagc 
gagattgaag 
ctaggcacgg 
tctggagcta 
cgaact tgac 
tt t ttgaaaa 
taccccacct 
gtcctt tcct 
agaggtccac 



aagcagcagc 
ccgccaagtt 
aataagacgt 
aagtaaagga 
cgcccttaaa 
tacaagacct 
tattctcgaa 
ttcatcatcc 
agaacttgcc 
taaaggacaa 
ccaacaacaa 
tgctgatcaa 
tacttccatg 
agcaagggat 
agcaataaac 
accccagagt 
gtggactact 
gcttgtaccc 
acacatttat 
taatcaactg 
gtacctccga 
ctcacccatt 
ctgttgtcag 
ctacgcaaat 
atcccaaaca 
ccaaaatctg 
atggtcacat 
tttcccgttt 
atttcttcac 
gaaaattgtt 
catcttcttt 
tggtatccat 
gcctctctat 
tctgccttgt 
gtgtgttttt 
ccagtaacaa 
gccttgttta 
agagccttgg 
ttagacatag 
ttgttgaagg 
tgtggtacag 
gttgccatca 
ttcctctatt 
gtccaccctc 
cctaaaaaat 
agaagggtat 
cacacaaaag 
cccgacccag 
aaactggagt 
ctctagtgaa 
cctgttccag 
atctgacatt 
ggtccacccc 
aggagtaaaa 
ccatgtccag 
ttacatttct 
attcggtcga 
gtttttgaga 
tttctggatg 
ctatttctgg 
cctcggaact 



agcagcagca 
gttggacaag 
gatgcaaata 
gtatcaattt 
acttccatca 
atatttgatc 
ggatcagatt 
ttgactctgg 
acaaaacaag 
aaacttaatg 
catcaatatt 
gctagtaatg 
agaaaggcaa 
ggggaaaatt 
cctggagcac 
tcaatgctaa 
aatacacccc 
aacctattgg 
ttgtatgttt 
gttgttttat 
atacaatttc 
caccaacaac 
cgagttcaac 
cgagtgcatc 
actcaaaatt 
ttttcatcag 
gtggaggcaa 
ctgaatagaa 
ctttaaaatc 
tcaccacttc 
gcacctgtgg 
tcaatatgtt 
taaaccctct 
ccattaacat 
ctggatagag 
caatctttaa 
ctttggtgga 
aggaaaattt 
aagtattatc 
atagatgttc 
tcattatata 
agcaatatct 
tctggagcta 
ggaacttgac 
ttttgaaaag 
accctctcat 
ttaatgtacg 
cggtccaccc 
gccctgaaaa 
aacagaaggt 
aaacacacaa 
ggcccgaccc 
ctaaactcga 
aactcactat 
aaacgcctgt 
ggtgtaatct 
gctagcggtc 
tggagtaaga 
cgactagatc 
aaccaaccat 
tgacattcgg 



aggataa tac 
aacaagcacc 
t tgaaccaat 
cagtactgga 
tcaatgatgc 
atcaaacaca 
taaagaatgg 
cctccgattg 
tagagaagga 
aaaaattatc 
ctgaaagggg 
gtggcataga 
aactggcatt 
catcagctaa 
ttttattacg 
caatgttccg 
ttattcaacg 
attcacaccc 
tacatgtatt 
tacatagttt 
ttccctcggt 
accctcatta 
atttctccct 
atctccaaca 
atccccagtg 
agtattgtat 
ctgagaaaat 
aqgaagtaac 
aacttgattt 
aaacatgcca 
ttgagtagat 
gtagccggca 
ccctagagga 
catatttaga 
gcaaataaca 
cgctttagat 
gagttgttcg 
cttgtccata 
actaaagtca 
ttccatggtg 
cacgtgggcc 
tcgtccagaa 
ttacatttct 
attcggtcga 
tttttgagat 
ttctgggtcg 
tttctggagc 
tcggaacttg 
aatttttgaa 
tataccctct 
aagttaatgt 
agaggtccac 
gtgccctgaa 
atgaaagg tg 
tccagaaaca 
gacattggcc 
caccccctaa 
gtaaaattct 
cagaaacgta 
ttctggtgta 
tcgagctagc 



aattattgga 
accaataaac 
aatagggacg 
tgatttcaat 
catacgagaa 
gaaaaatata 
acaacgttca 
gaatctacct 
agaagaagaa 
atttgtcgtg 
aggaggaatg 
tgatatagaa 
agccgttact 
accgtcaaac 
tagaggagga 
tcctggacaa 
cacaactagt 
tcctacattt 
ttattcaaca 
ccattactga 
ttacaaagga 
atcttcccat 
attgcatctg 
catactttgg 
tcgattttga 
tcttccatat 
gatttggaga 
tcttcgctgt 
atagtgaaga 
ttgatttgtt 
gaaggggaga 
caagggataa 
ataagtccac 
tgcacaacat 
tcatccacat 
aaaacaatag 
tcactgggaa 
ggaagtctgt 
aagaagacta 
gtggtggtgg 
ctttttaaaa 
acacctggtc 
ggtgtaatct 
gctagcggtc 
ggaggaagag 
accagtctcc 
tattacattt 
acattcggtc 
aagtttttga 
catttctggg 
acgtttctgg 
cctcggaact 
aaaat t tc tg 
tgtagaacac 
cgcaaaagtt 
cgacccagag 
actcgagcga 
ctagtgaaaa 
tgttccagaa 
atctgacatt 
ggtccacccc 



75180 
75240 
75300 
75360 
75420 
75480 
75540 
75600 
75660 
75720 
75780 
75840 
75900 
75960 
76020 
76080 
76140 
76200 
76260 
76320 
76380 
76440 
76500 
76560 
76620 
76680 
76740 
76800 
76860 
76920 
76980 
77040 
77100 
77160 
77220 
77280 
77340 
77400 
77460 
77520 
77580 
77640 
77700 
77760 
77820 
77880 
77940 
78000 
78060 
78120 
78180 
78240 
78300 
78360 
78420 
78480 
78540 
78600 
78660 
78720 
78780 
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ctaaactcga 
tttcttgcct 
ccatcattaa 
tcaccagcac 
tcttgtgata 
ctaccagttc 
agcattcctt 
tggaagggga 
acaaggctat 
attccattga 
agattgaaag 
aggatttaga 
gttcaagaat 
catttgagaa 
aaacaaccta 
ttttcttgac 
tgaatgcagg 
tgtgcatgtt 
agacgcagtc 
tggagaggaa 
actacagtgg 
ctagttggtg 
caattagtga 
atgaagaaca 
atgctagtcc 
cctctccaac 
aagaagagga 
ttgttatgaa 
aacctgcact 
atgatggtga 
cagtgaaaaa 
ctatgcaaat 
tattcaccaa 
caaccatttg 
tcacatcaat 
gaatgtttaa 
tgtatggaaa 
ataatgcact 
tattctcctt 
gtgcttcatc 
aaggggaaaa 
aacaacaagc 
tgccacgcag 
tttgtggact 
ttgaaaatga 
ctcgcttgta 
ttttgtctca 
aagaagtact 
aaaacaagta 
accatgcgtt 
aaggagacaa 
taatgaccag 
acgaagaaac 
aagaaagtat 
aaaaattcaa 
attcgatggt 
agaaaa tccc 
caaggtttgt 
atgaaactgt 
tacctgttga 
acaccataga 



gcgaccctaa 
gaaaggtgtc 
taaaaaatgt 
accacagaca 
ttattcaaga 
taccccctca 
tgccaatctt 
gaggaagaaa 
gttggacgat 
atcgaccatc 
tgaagccaag 
aatctactcc 
tggaggcaag 
caacaacaac 
tcctcttgtt 
tggtgttggg 
tattcaaacc 
tggacgtaca 
ttttgacgaa 
gaagcgcatg 
agacgggaaa 
caacgcaaac 
tatgaatgac 
tgaagaacgt 
ttcccccatc 
agttcctcct 
agaaattgat 
gaaattagcc 
caaaattact 
tgatgacgat 
ttccttcatg 
ttctcttgct 
caaactccaa 
gactgaagaa 
tgtggcacat 
cgccaactct 
caagcacatt 
atttgcgtgg 
caaaatggca 
atctaacgat 
ggttgcacaa 
tggtcatgac 
taatgtcaat 
gtctgatgct 
aatcatgagg 
tgagaaatgc 
ccaaggggaa 
ggagaatatt 
tgcaaaggta 
tagcaagttt 
ctatacccat 
gaacaggatt 
cgatattgtg 
ggacgtagaa 
cagtattggg 
agtttctatt 
tcccggtttc 
gt ttagcgat 
ttttggagtc 
tt tttcaagc 
cggaaagggc 



aaaatttttg 
cagaagtgta 
cccctcccgc 
tcattctcaa 
aaacctactg 
aaaatgtccc 
catgacaagg 
atgtctaccc 
attatcaacg 
aaggaaattg 
aacagtgtaa 
aaggggcgag 
tatgcagtgt 
caagttttct 
gggttcgaaa 
ctagaaagat 
ctaaacatga 
gaatatgaca 
gaggaaaatg 
aatgacgcac 
gaacaattgc 
ggtgtggtca 
ttgtgctgtt 
tctatgaata 
aagacggaag 
cctacaccat 
gaagacgaaa 
aagctttcta 
tctgggggta 
gccgttgacg 
gccccaaacg 
aaaatttgta 
gagtgtttga 
agtcaacaat 
cagatggctc 
accaacatta 
tcttcctcgt 
gtgaaa tcga 
tcagaaaagc 
gaacaccaag 
gaatcatcat 
aaggaaacta 
ggctcggctt 
tccagcacat 
agctcattca 
aggtctcaag 
ttgt ttgccg 
aagagttgga 
ctggtgagcg 
gtaccaatgt 
gacgttcact 
tgcgatgtgc 
gaggaagagg 
caacagaagc 
gatcaagtca 
gcaattaata 
ttgaaggatt 
gtcaaaatca 
agtacgcgtg 
gcaggactag 
attttgacaa 



aaaagttttt 
aacacacaca 
tcccccctca 
gacatttcaa 
aaatcgtccc 
agttgtattc 
ctactcttgc 
caaagtcctc 
agtatcaggc 
aaaatgtact 
cttccagccc 
tgtgcaaagg 
caatgagtat 
ctgaagaacc 
cttctacaga 
ctctacctaa 
ctggtttgag 
acttggaaga 
atgccagaat 
cagcgattac 
tctatatgat 
catctgacag 
ttgctgactg 
ttgtggtcga 
aagatggaga 
acgaaggtaa 
gtagcaagta 
ctatgaaaca 
acaatagtag 
ctactgcatt 
acgagagaac 
ataatccatc 
ataccatgga 
ttgctaaggg 
aagatatttg 
agggtaaata 
gcttcaaaac 
aactccattc 
cctcaaaaat 
aaccatcaac 
ctccttcttc 
tcaatttaat 
catatttgtc 
ttaagagaat 
ctagactgac 
ctgtagatat 
agttcttgga 
gcctagaaaa 
ttaatgctat 
tcttgttcaa 
tttcttccac 
tccagtacat 
aggaaggaga 
aagttcgcaa 
ttagaaaatt 
gtttgatctc 
ctagcacaat 
ataggaaaat 
tggattcaca 
ataaggcctc 
tttcacccaa 



gagatggagg 
cgcattctcg 
aaatcgggta 
gtactgagaa 
taaaatggat 
atcagtgaag 
atcaaaggtt 
ttctgatgga 
cgttaagagc 
tgaaagtgtg 
agaaaaagtg 
tctcaagtta 
caaaaaacac 
cagggattgt 
agatggaaat 
atatgtacca 
gatggccaag 
tttttacatc 
gaggtgtcac 
acctcatgtg 
aaccgagtat 
tggattttct 
catcgatgtt 
atctgacagg 
aaattcatcg 
cgcagttgtg 
tgaaggttca 
aatgagaagg 
cagtagtatc 
atgcccccaa 
tgaaaatatt 
atctatgagc 
cgatagtatc 
tttgttgttt 
caagtctgaa 
tgaaggacaa 
caacacggaa 
tggcacagtc 
gaagcgcaag 
aaaaatgatg 
atcgtctaca 
tcccctcagt 
tgaaatattc 
gtgcaagact 
tagatatgag 
tgaggaaaat 
ggacccgatc 
cgttaacacc 
tcgtaggaca 
tctgattaag 
ttgcctgtgg 
caacaacaac 
aggagaggag 
gggagggaga 
tgtgaaaagt 
tggaataagc 
caataccctt 
caatggaaca 
tattgtaggc 
atgtggcaaa 
atatgattca 



aagagtaaaa 
caccttgtca 
taaatagagc 
catcctctcc 
tttgaaggaa 
aaagttgcag 
attaaggacc 
caaaaactgg 
actgcagata 
cgcagaacca 
ttttctgtcg 
aacgccaact 
aacgtctcct 
tttatgcttg 
acatatgcag 
gttttcgaca 
cttcctgttc 
acttcaattg 
accgaagatt 
gccgtgtacg 
gaaaacacgg 
aacgaatgtg 
actgttaata 
cgtctttttg 
tcatcgtctt 
gagggggagg 
gaagatgctc 
gttaagaatg 
aataacgaag 
actgaagcta 
ttgtatgaaa 
agttaccgtg 
cgtcgccgtc 
gatgaggttg 
atatttggag 
aagaagagtc 
tctaatgtga 
atacctaacg 
cgtacctcta 
aaaaatgatg 
ccagaacaac 
ttcataaaaa 
ggtcaacgtc 
tttgaagatc 
agggaggtaa 
gaaatggatg 
gcttactttg 
cctaagcgca 
tatgaagaat 
agagaattgg 
tacctgactg 
aataatgata 
gataaaatgg 
aagaagggtc 
ttgtgtgaaa 
tggatgaaca 
gatgaggtct 
gatgataaat 
ccctttagta 
ttgtacgtta 
ttaaacgatg 



78840 
78900 
78960 
79020 
79080 
79140 
79200 
79260 
79320 
79380 
79440 
79500 
79560 
79620 
79680 
79740 
79800 
79860 
79920 
79980 
80040 
80100 
80160 
80220 
80280 
80340 
80400 
80460 
80520 
80580 
80640 
80700 
80760 
80820 
80880 
80940 
81000 
81060 
81120 
81180 
81240 
81300 
81360 
81420 
81480 
81540 
81600 
81660 
81720 
81780 
81840 
81900 
81960 
82020 
82080 
82140 
82200 
82260 
82320 
82380 
82440 
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aggatgttga 
atgacacctt 
gctcttctct 
tgagcaataa 
tctcgtctat 
caaagaaaaa 
gtggatttgt 
aacttgtagc 
ggaagattag 
agtctacata 
gtatttatac 
accaggacgg 
gtatcaaatc 
atgatggttc 
actttatgca 
agaaatcttt 
aaaattatat 
atcgtctaac 
gccatagatg 
ttgacgaata 
cgtttccttg 
agcgggagga 
atcagtagcg 
gggtagaact 
ggaaataggt 
gagttgaacc 
ttgttgttga 
ctgctgctgc 
gtactgttgt 
aggatttaag 
aagaggaaag 
ttgttgttgt 
cactgcactc 
tgcacatttc 
caaaagatcc 
acgaggtatt 
attcttatca 
agcaagttgt 
actgctcatt 
tgaacttaat 
ccaacaacta 
agtgtttgtt 
tgaagtagga 
tctgtactgt 
gtattgtggc 
caatggtcga 
gtcacttctc 
cttcgataca 
ctctccatct 
ctccccttcc 
accttcttga 
aacacaatcg 
cttactagaa 
ggattcggcg 
agtattaggt 
aaaaatattc 
ttcaaaacca 
tttggacaaa 
caccttttct 
tagtatatgg 
cccatctctt 



ttctactaca 
tttcaatatt 
cactgaaaag 
taatggcatg 
tgtagctcct 
gagcatctgg 
tgtaccaccc 
tgaaactgca 
atttaacggt 
tttgctcaat 
ttcatcttta 
aaatgctttg 
ctgctcgtcc 
tactacttct 
tgtaatgatt 
ggcaaagaaa 
gttttacatg 
cccatatccc 
gattcattga 
attggtgcaa 
acagcttgac 
agatcaaaat 
ttttcttctt 
ggtgtaggtg 
ggagaaggag 
ggaaagggct 
ggagtagggg 
tgctgcagct 
tgtaattgtt 
tatccttggt 
ggaggaggtt 
ttgggcacaa 
tttatcattt 
tccagcatgg 
atagtcgatg 
gataggccac 
aagttcttgt 
tcaaagtata 
aaattgttta 
tcttgccact 
ttagccagaa 
ctttttcctc 
aaagccttgg 
atagccttct 
gcattttcac 
gacacggtca 
tttctttttt 
gatgtagtag 
ttttcttctt 
ccttcctctt 
tcagtgtctc 
ccactccttg 
gttggcgcgt 
tcatcctcct 
ttatgtttag 
aagacattat 
tcttccatta 
taactatcac 
aaagcagcct 
tcagggttag 
ctaccgctac 



acagacaagc 
aataagaata 
aaaaagacaa 
tgtgtacaaa 
tcatcttctg 
aacgagaaca 
aaactttgca 
cctaaaacaa 
ctaaagaagg 
aagaatatga 
aatgacccat 
gatgactatg 
aaagatgaaa 
acctcttctt 
aagaaaattg 
acaaagaagc 
ttgtgtttta 
agtcaaacgc 
catctaaaag 
taaatgcctc 
gttgcgcttt 
caatactgcc 
ctgctcttgc 
gaggttgagg 
cagtattagg 
gctgctgctg 
gaggaggaat 
gaggcggcgg 
gaggagcata 
aagtaggttg 
ggaaaggggg 
acactccagt 
tttcaaccgg 
agatagaatg 
acatttttac 
gtacagtggc 
tcggtaatat 
ctcgttcttt 
tctctgttgc 
gaatctctcc 
gagctttcac 
cgaacttttt 
ttttattggc 
ccttgaccac 
tccttacacc 
tcatttttcc 
tagagggagg 
gtgtaacatg 
tctcttcact 
tcccttcctc 
tttttccctc 
acagaatttt 
ccaatttaga 
tttttcgttg 
gtggcatatt 
tttattttgt 
catatatctc 
tacattttct 
ttgtagactt 
ttatttcatc 
aatcataaca 



tagagaagga 
aggttcttcc 
aattcaatag 
ctccttctag 
ttctggctct 
tgtttttgac 
gttttattgt 
agttgtgtag 
tatgcaagag 
ctgcaacttc 
tgtatacttg 
gtgcagt ttt 
ccgctgacga 
caactgatac 
a tgccatgaa 
attaaatgta 
ttcaaattca 
tgctgctgct 
ttcatctcta 
ctcttcttct 
tattatctct 
aagctcagag 
tgtagacgga 
ggtagtagaa 
tataggagga 
ctgctgggga 
attattgggc 
ctg tgattgg 
tgggttataa 
t tgaacctgg 
aaattgttgt 
cgataaatta 
tcttgctgaa 
ctctgattta 
ccacacgtta 
gttagagtta 
acgtgtacca 
cacagttcgc 
atttccatcg 
tgaccgta ta 
caattcagca 
cttcccttgt 
ctttaaaagg 
tgcctcgtct 
taaaatagag 
agcaggcact 
aaatatgtgc 
ttcgtcttca 
ctgttcttcc 
ttct tct tea 
atcattt tct 
actagtacga 
gacctcctga 
ttggttccta 
tgtaaggtgg 
aggcaccgct 
aatctct tct 
taaagctaga 
acaatcagaa 
ttcacatatc 
ag tcatggta 



tattttgcat 
attctataat 
gaagaagatc 
ttcaaattca 
atcttgctcc 
atctaggaac 
taaccataga 
gaatattatt 
tgttagcgcc 
acctagtgat 
taaattgacc 
tgtaaactat 
caatgctgct 
tgatgctgct 
agacattaga 
tacctttatt 
tatggttcat 
gctgctggtt 
aaatttcctg 
tcttcttctc 
tcaacgttcc 
tctatatcag 
ggaggaagag 
cgagagtcaa 
ggaggagggg 
ggaggaatat 
ggctgtggtg 
t tgcctaaaa 
tactggttat 
aaaggtgcct 
tgttgttgct 
aatgcttgac 
gctcttgcta 
ctcctcatcc 
ttatttatag 
tcagaagcta 
aacaggttat 
tgtatacgat 
ttaagagatt 
tctttcctat 
cccacatttg 
tcagatgaca 
ttagctaaaa 
gccaagtatg 
cccgaagtta 
gctaatgcct 
tttttcattt 
aca ttctcct 
ttttcttccc 
ttacctcctc 
aaatctgaac 
atcttacgcc 
cccatgtttc 
attatatgct 
gataaacatt 
gttttagtat 
tcttctcctc 
caacatagag 
tacaatagtt 
caacaaaaaa 
atacatccac 



ttgtctaagc 
atttctccta 
tcatctggta 
gtctcttccg 
ctttcttcta 
atgtggaggt 
cacgctgtaa 
gataggaata 
tttaccggcg 
ttgaacctat 
catgaagagt 
acatttaaga 
gctgctgatg 
gcaatccaag 
ggaaagtaca 
tttaccaata 
ccccatttaa 
caatgtctga 
gtgtacgcgg 
ctctgactcc 
tccctggagt 
taaaggacat 
gagtaggctt 
atcttacacg 
gactagaaat 
tactgggtga 
gttgctgctg 
agtacagcgg 
attgataata 
gtggtggagg 
gacgttgctg 
ttgtttcctc 
aaacaccagc 
ctcctccaac 
ctctaaaata 
cagatacaaa 
tacacacagt 
cctccacgat 
tcaatatttt 
gggaaattgc 
tgacaacaaa 
ataaagatga 
gagaccttat 
acacgtaatc 
tagcacctct 
ttgattctgt 
gttttgcccg 
cttcttcctc 
gctgctgttc 
ctcttccacc 
aatcgccagc 
tttgcctatt 
gttggaatct 
tttttaaagc 
acaagaaaac 
gtacatcaaa 
ctcctctttg 
tacaattata 
tatggtcaat 
ctgtagggta 
ctaattttat 



82500 
82560 
82620 
82680 
82740 
82800 
82860 
82920 
82980 
83040 
83100 
83160 
83220 
83280 
83340 
83400 
83460 
83520 
83580 
83640 
83700 
83760 
83820 
83880 
83940 
84000 
84060 
84120 
84180 
84240 
84300 
84360 
84420 
84480 
84540 
84600 
84660 
84720 
84780 
84840 
84900 
84960 
85020 
85080 
85140 
85200 
85260 
85320 
85380 
85440 
85500 
85560 
85620 
85680 
85740 
85800 
85860 
85920 
85980 
86040 
86100 
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gttaggtgtc 
cttccattct 
tgtcatgttt 
aggtctgata 
atggaacaca 
tttcgcccat 
tgcaattaga 
aaaaacactg 
gattacagat 
ccagttggct 
cttcttttcg 
atttggaggt 
gagtccggga 
a tcttctaca 
tggaggaggt 
gggtagtaaa 
catcttaaag 
cttgtctagt 
ttccacagta 
ggctgcaatg 
tgaaaattcg 
aaaaacgtcc 
ttttatagta 
gtccatgaaa 
gttgaagaat 
ggtattccca 
ttggtacacc 
catgcatttg 
gaaagatcct 
attttctaag 
ctttggctcc 
gtatctgtac 
ttcttggggc 
acccttattg 
ggccatattt 
aacactaaac 
catgtaccac 
ataaatgaca 
gagtaagaaa 
cgcat tcttc 
gcgcgtattt 
tggcccatct 
aagag ttgta 
tttcttcatt 
tccctccaaa 
aagatcacca 
tctgcctccc 
ggatgttaca 
ct tatatacc 
tacatgtatg 
gaacataaaa 
tct tctcgga 
tggagtacat 
taa tagagag 
agttcgggtt 
tgaatagcca 
ttggtcgatt 
tgtttagtta 
accacttgga 
aagga ttcag 
at tgaa ttgt 



ttacatttag 
aatgtcctct 
tgacgtacaa 
cccgataagg 
cattctggcc 
aatcctaatg 
ggtatactca 
tttaagtaat 
atgtcgttga 
agcgctagaa 
ggagttcttt 
acaaaatttg 
ccagtatcat 
tcctggaagt 
acatccactg 
cactgggtac 
agtttaacgg 
gcaaagtcct 
tagaagcatt 
tattctgggt 
tgacatctac 
tttagttttt 
caatcagggc 
gcatgtacgg 
tttggtttca 
ctcaatatct 
atcgccgctt 
gcgattgctt 
agttttctgg 
agcgaagcgt 
tctactcttt 
actatatcaa 
acatattcca 
tagtggttgt 
tcccaaatcc 
gttctagcgt 
tccttatgaa 
cttttcatgt 
atgcgtatca 
tcctccactg 
ttcatgacaa 
tgttttttgc 
aatggaactg 
aatagagata 
agttcatccc 
ctttctccca 
attaggtcgg 
ccttccatgt 
cttcagatgt 
tctttattgg 
gagcagcagc 
caggagcaga 
tcttgtcgac 
tcttgatccc 
cttccacaaa 
tgtccaaagt 
tagggatatt 
ctgaattcca 
atgatcctga 
agttgcccag 
gaacaccata 



ggcattttac 
tttcatccac 
tatttacctg 
cacgcctatc 
gctccaacac 
cactctcatg 
cacctacaac 
ggactttaac 
gaagcccctt 
gaaccttgcc 
ttagtataga 
acgccaagaa 
atcttactag 
aatagtcttc 
ttacaatgtc 
agatcaggga 
gcggtctgaa 
tgtccatctt 
tctctgctcg 
cgacattttc 
ctataaactg 
cactaccagc 
atagttttgc 
ggaaataatc 
tgtccaaatc 
ttgtaacgtg 
gactttctcc 
cttctagatg 
ctgaaggtat 
tcggggagat 
cagtatctgc 
tcatcttggc 
tagcgtcata 
tggtggatgg 
tgtcattgtt 
ctcttatttt 
acacatcctt 
ggcggtgctt 
aatccatatc 
catcgcctgc 
acatgccact 
aactctttaa 
ggtcgtcgtt 
tctctttcaa 
tataagaat t 
ctatacccac 
tgatcaagat 
tgaacgaggg 
gaaaaaaaac 
ctaggaagaa 
aggatctgga 
aaaagctgca 
tgtgaactgg 
cttctcccat 
caagttgagg 
agtacgagga 
aggaagcgtc 
ttctcccagt 
aagtacatta 
gatctgtgca 
cttcataatg 



agtcttaatt 
agtgaacact 
tttaacaatt 
tacacaatga 
catttcccct 
gttcaatttt 
ttgttcaaaa 
cctattctga 
catattctct 
ctccggagac 
aacagcctgc 
accgcaggac 
ctcatcggtg 
cagttgttcg 
ttccatctca 
acatttgcta 
tctattatcc 
acacatctcc 
cttgatgagt 
tcctgctaaa 
taccatatat 
cgagaccttt 
agaacacgtt 
gtaagaggct 
cgcgtagcac 
tgcttgtatt 
ccaagactct 
catattttca 
gccgtgtcta 
tattagacca 
tgccaaaaga 
tatttttgtt 
cacgggcctg 
accacttaaa 
tcttccttca 
tgtccccact 
caaagtgttg 
cttgttctga 
tgatatcatc 
atttccacgt 
ataaatacga 
atgtccaatg 
acttttcatc 
aagaccggtt 
gcggttgcag 
aattcgttca 
tgcaacttct 
aatgaatgcg 
aacatgagga 
cacatttcac 
acaacaatag 
acaggagagg 
acagctctag 
gca tacatag 
gat tgagctt 
ttaatttccc 
tgaatagatc 
tttatgagct 
cgattataca 
gtggatgcag 
tccct tctca 



tcctgtcctt 
ggatccttta 
tccatttttg 
aggacaaaag 
ataagattct 
tcatagacat 
tctctatcat 
aacctctccc 
ggcatacgat 
tctgggttcc 
ctacaatttg 
aatctcatgg 
ggagggtcat 
ggaggagggt 
ttaggctggt 
tcaccagttt 
accacagtaa 
a tgaattcgg 
tcggtgatga 
atcgccctct 
tctccacaac 
tcgtacacat 
tcttgtcg tg 
ttcatgtttt 
aacccacatg 
acctccactt 
tcttgcagtg 
gcgttttcat 
gataccaggg 
gaaggcgaca 
tagggaacat 
ccaaaaagaa 
gcagcgtaca 
aatgcttctg 
tgcatatcag 
tgcaaagcag 
agagaaacta 
tctcgtttga 
tcggcgtacg 
ataatttcat 
tgaattagat 
aattcttgtt 
t tgatgttca 
ggattaatca 
atgtccagta 
taggtggacc 
gggacagcag 
ataaagtctg 
aaaaag taaa 
aagcaccagg 
aggacttctt 
gagcttcctt 
ctgcgccct t 
tcattgaccg 
ggtcaacaaa 
atacagtttt 
caccacttgc 
ctctaa tcac 
tgttggacgt 
taggcatggg 
attgctccca 



ctttctcaac 
ttgattcaga 
caccttggtt 
aatgggctct 
tgacaaacat 
aaaaataggt 
caaaaccccc 
tggtgttgtg 
cagtcgagtg 
atcctcgaat 
tcacctcttc 
gccatttatg 
caaattcagc 
acggcaatac 
cacatacatt 
ctccctccac 
attttgaatc 
cctgaagaag 
gaagtttgtt 
cgaataaaga 
atgggcacct 
acattaggtg 
cttcaaaagt 
catgagaaca 
ttttattggt 
cgtcagatgg 
tctggctaac 
gttcgaggat 
acagtggtgt 
ttacggattt 
tgtagataga 
tatggtgtaa 
tgcgaagatg 
ttacggagaa 
aaccaaataa 
agaagggcca 
ccacatcttc 
tggcaacaag 
cctcacacaa 
ccaacatcaa 
ccttgtatgc 
cgttttggtc 
tgtcgagggc 
caactgtttc 
tggcctccag 
tcttattatt 
ctgcaactat 
gggcctccat 
aaattcaata 
gttattgata 
ctcctcttct 
cttgacttct 
tgtgcgtaga 
caccttgctg 
catacctctc 
gaatagttcc 
cataatcctc 
atattcgttc 
taaaggctca 
agcaacaaac 
atcgtgaatt 



86160 
86220 
86280 
86340 
86400 
86460 
86520 
86580 
86640 
86700 
86760 
86820 
86880 
86940 
87000 
87060 
87120 
87180 
87240 
87300 
87360 
87420 
87480 
87540 
87600 
87660 
87720 
87780 
87840 
87900 
87960 
88020 
88080 
88140 
88200 
88260 
88320 
88380 
88440 
88500 
88560 
88620 
88680 
88740 
88800 
88860 
88920 
88980 
89040 
89100 
89160 
89220 
89280 
89340 
89400 
89460 
89520 
89580 
89640 
89700 
89760 
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ggcaaagagt 
aaaatccctt 
gcaatttcac 
attagtgccg 
ccctgcacac 
gcatagaaat 
ttcatctccc 
ttgttgactg 
acaatttcag 
ctattgatgg 
atttggtcga 
ttctcgtaca 
gggcacatca 
cagatagctg 
tcaataaagt 
ctcttgtctc 
atgagaccag 
tgaaagtgga 
tcataaatac 
ggtgtgactg 
atgagatcat 
cgcataatca 
ccaatcttga 
tccattttgt 
taggtatcat 
ttctgagttg 
atgaatcttc 
tcggccagaa 
tgagatgcaa 
actggaagac 
gtgccattcc 
attgggatgt 
agatgtgagt 
ggatattttt 
agaaatcaga 
actgttcctc 
gctgtctagg 
tgcaccagaa 
gacgt ttctg 
gaattttact 
tagggggtgg 
gccaatgtca 
tgtttctgga 
ctgttttcgc 
gggtcactcc 
gaccgctggg 
attaactttt 
agcttgtagc 
gttttatttg 
agtctgtaaa 
ccctctagtg 
cattctaata 
tccattggat 
gtggtagtga 
ctcccacgca 
acacccgca t 
aaacccccat 
aagtggccct 
attacccacc 
gaaggagcta 
cccgaccacc 



tgaaatatat 
tgctcaaagg 
atgaagcttc 
cttcttcaga 
ccaatcccat 
tgacatcaat 
ggtaatcaac 
cgatagaagc 
tgcacaaatt 
tatccttaaa 
ataatgcacg 
acgccttaaa 
gggaccaatt 
ggaaaagatc 
ccttcacgtc 
ctccctggct 
ggtgggtacc 
tgccgagtcc 
cctcaatgct 
ttccacaatt 
atgtttcaat 
tgtcctgagg 
tcaaa taaga 
aatcgataac 
taactactga 
t tgcactaaa 
ctgccagttt 
aatcgtccat 
gttcttgagg 
aggcattttc 
tctttgagat 
tgtta tttgg 
gaagaggctt 
gaaggagggg 
sggtgaggtg 
aaaaactttt 
cctgatgtca 
ccgactgttg 
ggcatagctg 
ct tcctcact 
accgctgact 
ga t tgcacca 
acaggcgttt 
tagagaattt 
ag tttagggg 
tcgggccaat 
gtgtgtttct 
ctagaattta 
agtaatatag 
agatgggcaa 
cacgtatcac 
gaggcagatt 
tgacactgta 
cgcacaaaac 
ctagaggcac 
tcttcacacc 
gtgcattgac 
gggtggcatc 
tctctctgtt 
gtgtgagagt 
gctgccaagg 



gtccctattc 
actaccctca 
caaggcacca 
ttcgaagggg 
tggcctagtt 
caccttgtcg 
atagggcctt 
caaattgcac 
tgaagacttg 
gcacacataa 
tgccttcacc 
ctcttcgcca 
cttcccagcc 
cctcgtcctc 
cagatgccaa 
aacctttttc 
actccatgac 
tccagcagtc 
atcatcttga 
aaacagtgtg 
gacagatttg 
gcgctcaacc 
gtattctagc 
agcatcgaga 
tgctggttta 
actattccac 
tccaaaatca 
ttcttggaag 
gttaattgcg 
aatcctcttg 
gaatgattgt 
agtattttcc 
ggacggtagc 
aagaaatata 
acagctgctg 
taaaaatttt 
agttccgagg 
ctccagaaat 
cgcccagaaa 
ctcacaaaat 
aggcctgatg 
gaaacgactg 
ctgggcatag 
tactctttct 
gtggaccgct 
gtcagattgc 
ggacaagtcg 
gtacaccaag 
tatccaagtt 
acgtacacct 
gttttcgccc 
gtttgtcaaa 
gttgttcgaa 
tcatgcaaga 
agaaaaaatc 
ataaaaggac 
gtcatgggga 
ggttagtaga 
gttcgtcgtg 
gaaacggtgt 
gcgactgtgc 



ttaatatttt 
aacagctcat 
tagtatatag 
attctgagtt 
ttcatattgg 
agatttctgg 

actgcagttt 
atgatgccga 
ggtgttccag 
acctctttac 
tggacgtcgg 
ttcactctct 
aaatcttcat 
tcagaaatat 
acagagacgt 
gaaatggggc 
ttggagatga 
aggcccagaa 
ggggaagcat 
atgtctgatc 
aagattctct 
gtcctcagtc 
atttcagcat 
ccagttccag 
tctttggtgt 
gggtggtcaa 
gagatggtag 
ttcttgtcaa 
attatctttt 
tgctggttag 
tatacgtctt 
tgtatttata 
tagtattata 
ctgctgcagt 
tctggatcac 
gtggaccgct 
gtgcattaac 
gggaaggtgc 
tttgaaaaat 
tcaagttcgg 
ttgctccaga 
ctgcgcccag 
ccctctcaca 
gggtcgaccc 
accagaaacg 
tttctggtct 
gataaagaaa 
aaaataacaa 
gagagtattg 
tattaccatt 
ataacactcc 
cggttttcct 
atgtgcgtgg 
aactgctgcg 
atgattcgtg 
attgctattg 
cagaaacaaa 
gctgtagtag 
gctgttagcc 
cgcaggtcta 



ccttccccat 
atgtttctcc 
tttcaaaaat 
tgaagaacaa 
aaatgcgggt 
tcatgatttt 
tgggagaata 
cctccgaatc 
cattttcttg 
tttcgatacg 
ctttgccttc 
aaaggccagg 
ccatgaagag 
taccggcatt 
agatggctgc 
taaagatttg 
ttccttttgc 
ttgccgcctc 
ggaagcacga 
gggtaaaata 
cgtgaatgcc 
tctttctatc 
cgaaacaggt 
tggccataac 
ggtggattgc 
ttttgtggat 
caatttttgt 
cgggaagacg 
gcttgggcac 
caagactaat 
aacccatttt 
ctcggctcga 
ccggcccggt 
cgtagttctt 
attagtagtg 
tcgagtttag 
gggtcgaccc 
ctttgtgtgt 
aacatactgt 
ttttctgggt 
agggtggacc 
aacgtacatt 
aaagggaagg 
aaattttgaa 
aatgtcaagt 
gaatagctcc 
aggaggcaac 
tataacaaat 
tcacgcactt 
tcccgacaat 
gatgcacaaa 
ctatagtaac 
gtataacatt 
gtagtttcat 
tggcagtgac 
ttagtcgtca 
caccacctta 
ccgtcaagat 
cttccgtcgt 
cgtgccccga 
ctcgaggagg 



gtcaaattgg 
cttttctttg 
cctcttgtta 
atctgctagt 
cttgtcaact 
tacaaccctc 
cttcacaaac 
actgtactgg 
gttagacttt 
tgcagaatta 
agcctcatat 
gcactcgtgg 
atcagatacc 
ct ttctgcag 
agctcctctc 
gaggaacgcc 
cctcaaatca 
cttaagagta 
ggaaagttgg 
gtgcctcgac 
gacagccaca 
agtgggggaa 
gaagagataa 
attttcatag 
gtgcctcaat 
gttcgaacag 
ctttgcataa 
gtccatgatg 
atactggttg 
ttcttgctta 
atgttattaa 
ggagggtgaa 
gaaggtcgct 
tgatgtagaa 
caggtgcaaa 

ggggtggacc 

aatgtcagat 
ttctggaaca 
tttcgctata 
cactccagtt 
gctgggtcgg 
aacct ttgcg 
ctgtatctta 
aaatttttct 
tcggagggtg 
agaaatgtgc 
tagaaatata 
acca tactat 
gtacattaat 
cgtacatgaa 
tttcctcctt 
aaccaggat t 
tcccagctgc 
t tctaagaat 
gcacggccac 
catctctcag 
tggtgtataa 
ggtgtcgtct 
ttt tacaact 
cgttattgac 
tgacgacgac 



89820 
89880 
89940 
90000 
90060 
90120 
90180 
90240 
90300 
90360 
90420 
90480 
90540 
90600 
90660 
90720 
90780 
90840 
90900 
90960 
91020 
91080 
91140 
91200 
91260 
91320 
91380 
91440 
91500 
91560 
91620 
91680 
91740 
91800 
91860 
91920 
91980 
92040 
92100 
92160 
92220 
92280 
92340 
92400 
92460 
92520 
92580 
92640 
92700 
92760 
92820 
92880 
92940 
93000 
93060 
93120 
93180 
93240 
93300 
93360 
93420 



wo 01/38351 



27 



PCT/USOO/28888 



gacgacgatg 
aagcgtgccg 
gccgcacctc 
cctccacctg 
cctgaggatg 
gatgaagaag 
gaagaggatg 
gatgagttct 
ttctaccgca 
cgcaaaaagc 
aagcgtgccg 
gccgcacctc 
acgcacgaaa 
ttttcatgaa 
tttatctgca 
gtgtccaaag 
cagagtcgta 
ttctgagaaa 
tgcctctgcc 
atttttactg 
atcatggata 
acacagttgg 
gtaaatagtg 
acagagatac 
ttggatgata 
gccataattg 
cgtatagcgg 
tttagacact 
ttgtgtgacg 
cagaaattca 
ttgataattt 
gctgtaaaag 
agggacaacc 
atttttgatc 
aattacttga 
tcattgactt 
atttctgata 
aaccaactcc 
ttttctcttt 
cttgaagctc 
caatatggat 
ctggatcttc 
agttatctcc 
gatggcaaac 
aactggaaga 
aaaacgagaa 
agttatgcaa 
ttctgtaaag 
agtgaaaatg 
agactttata 
tttgctcaag 
tctcatgcta 
aatctccagc 
caagctccgt 
acaaccgcct 
ctacattaat 
attccctcct 
aactatcctg 
gaca tgcaat 
cctttgccca 
taaaccttcg 



acgatggagg 
cacctccacc 
cacctgagga 
aggatgaaga 
aagaagagga 
aggatgagtt 
agttctaccg 
accgcaaaaa 
aaaagcgtgc 
gtgccgcacc 
cacctccacc 
cacctgagga 
gtgacggtgg 
atgtgtacac 
tctgtatttt 
atagtggtgc 
tgaccattga 
cattgggcgt 
agttcagtat 
tttttctgtt 
acttgaaagg 
atagatctat 
aacaaacagc 
aacaattcaa 
tcatctacat 
aagcgaggga 
acatgttaac 
ctagtgaaca 
acttgtgcaa 
ttactttcca 
tcaaagatgt 
aaggtgtggc 
tcaaggataa 
acgggggaca 
aaacactcat 
ttctaacagt 
gagagatcaa 
ccaaacctaa 
ctgtccaaaa 
ataagtattc 
agttgttgcc 
ataccagaaa 
caattcagat 
attaaaatcc 
gaaggacttt 
gagttattgg 
gtaattggta 
gctgccctct 
accttttctg 
tttcgtcata 
aattccagga 
tggacccttt 
gatagtgatt 
atttttaggg 
ccggaggtgt 
gcaggcgatg 
atacctaaga 
tcaaagat tt 
aaatgcaata 
ccttgcagaa 
tttgtgcaaa 



aactttcgat 
tgaggatgaa 
tgaagaagag 
agaggatgag 
tgagttctac 
ctaccgcaaa 
caaaaagcgt 
gcgtgccgca 
cgcacctcca 
tccacctgag 
tgaggatgaa 
tgaagaagag 
ttgaagaata 
acctgctata 
tttattagta 
catgatgcca 
cgtcaatgcg 
atgacgtcaa 
tttatttttc 
catccctctt 
ggaatttgtt 
attggtattt 
taaaaaggaa 
ggacgaaata 
ttttgatcat 
atactctaaa 
cttgactttt 
agccactaat 
tcaattacca 
tgcgaaccta 
catacaacaa 
cattagatgt 
ttacaataca 
ttt tgaagaa 
gggattaaaa 
gtacactaat 
taattatctt 
acaataccgt 
acatctgagt 
aacctctcct 
tga tatcgag 
acaact tcaa 
at ttcgacca 
aagacaggaa 
tatgtagaag 
ggaatctccc 
ctctcttgga 
gtgctggttc 
ctatcacaaa 
aacaattcct 
aggaacatgt 
ccaagatgag 
acggca tctg 
aacgaaaatg 
ataattggga 
aaattatcgg 
tggttgtacg 
tgaagttcag 
ggattttcag 
agaaaagtgt 
aaggaacaaa 



acagtagggt 
gaagaggatg 
gatgatttct 
ttctaccgca 
cgcaaaaagc 
aagcgtgccg 
gccgcacctc 
cctccacctg 
cctgaggatg 
gatgaagaag 
gaagaggatg 
gatgatttct 
gactaatatt 
tatacgtgca 
tcctttttgt 
tatacaaagc 
tgggcagagg 
ctacattatt 
ttcctataca 
tcctattgta 
gcgcttaaaa 
gttgatgttg 
ggtctagcaa 
aacaacaaat 
ggagggagtt 
ccgctgaggg 
atgactgtgt 
agtataaatg 
aaagaagagg 
tacatgctgg 
cttcacgtga 
gcgaaacaga 
ttttcaaaca 
gtaaaacaca 
tgcatgttca 
gttatagcag 
gtccaacttg 
cccctcagtt 
gggtttcttt 
actggtaaca 
gataacacca 
gattgtccag 
tagacattgc 
acaaaacacc 
aaccatacct 
tgaaggtgca 
caagaaacta 
atcatcgtta 
catgaaggat 
caaccctgaa 
tgcccatcta 
gttcacagaa 
tgaaaaatgt 
ctgcgataga 
tggaaaaata 
catgctaaac 
aagagtggtg 
gcaggcaaat 
gctcactatc 
tgcagtaaat 
acgtctacga 



ctggtatact 
atttctaccg 
accgcaaaaa 
aaaagcgtgc 
gtgccgcacc 
cacctccacc 
cacctgagga 
aggatgaaga 
aagaagagga 
aggatgattt 
atttctaccg 
accgcaaaaa 
gttgatatgt 
tatttgaata 
gggataaata 
acgtgatact 
cggaggtggt 
cttcctcctc 
ataaaaagta 
aaaaaaccaa 
cagacctcac 
ttggtagatt 
ctagagtggc 
ataatgctct 
tcaaaagagc 
aattagagtg 
acaccaatat 
tcaccctcgg 
aagaagagga 
acacacgcct 
ttttgcaaaa 
tgaacgaaat 
ttttgaatga 
aagccataac 
aacgcatatc 
aatttataaa 
taacatgtaa 
tgatagataa 
agtatgtgga 
tctcttcctc 
gaactggctg 
aactcactcc 
tatacgttta 
acagccatat 
gtatttttgg 
gaaattttca 
gaaattgacg 
tacctaatca 
ataaacgaag 
ttcttcaagc 
gtaagacgtc 
atggaagaaa 
gcacgaaaaa 
tgttgccgtc 
acccaacaat 
tcaaatgata 
gacggtgtct 
atccccacgt 
ttagggccta 
acacaacaga 
gtggataccg 



tggacgcaaa 
caaaaagcgt 
gcgtgccgca 
cgcacctcca 
tccacctgag 
tgaggatgaa 
tgaagaagag 
agaggatgag 
tgatttctac 
ctaccgcaaa 
caaaaagcgt 
gcgttaaact 
taaccccttt 
aggaataaag 
atcggtattt 
ttccaagaag 
gataaagcgt 
ctcctcctat 
tcagatgaat 
taactaacta 
ccattacaaa 
atatgttata 
aaagcaagcc 
aacaaatact 
aaaacataag 
catgtttacg 
cattactgaa 
acgtcttttc 
agatttgaaa 
aaagaaagat 
ggatacctat 
aagtcaatac 
aattgtctac 
tctgactaga 
cgaaatgttg 
cgctagcaat 
cgaattgtgc 
catagcttat 
gtattattgt 
tctataatcc 
gcaagttaac 
cagacgacca 
tggagatttt 
gtgaattgac 
gttcagagga 
ggcctagaga 
acggtatagc 
tgagccacat 
aatatttcgt 
accttatatc 
tagaacactt 
actacttccc 
ctcccaaata 
tttatcacca 
ccaataaagg 
agggaaaaac 
acgggcaagg 
gtctattcgt 
caagaaacat 
aaggagaaaa 
gtagcaacaa 



93480 
93540 
93600 
93660 
93720 
93780 
93840 
93900 
93960 
94020 
94080 
94140 
94200 
94260 
94320 
94380 
94440 
94500 
94560 
94620 
94680 
94740 
94800 
94860 
94920 
94980 
95040 
95100 
95160 
95220 
95280 
95340 
95400 
95460 
95520 
95580 
95640 
95700 
95760 
95820 
95880 
95940 
96000 
96060 
96120 
96180 
96240 
96300 
96360 
96420 
96480 
96540 
96600 
96660 
96720 
96780 
96840 
96900 
96960 
97020 
97080 
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gaacacgtta 
gcttggctac 
taccagatat 
ctcctacata 
gggtgtattt 
tgattccgat 
cgcatgagta 
taccgtcaac 
aactgttcac 
agtcagaata 
ccaaaaaact 
accgttttgt 
ttgcctgttt 
aactcactca 
acggaattgt 
cgaggagttt 
aactgattga 
aacacttccc 
acgatttggc 
cattcgcatc 
atgagttcat 
agggttttgt 
gaattgaaca 
agttgatgag 
aaaagcacta 
agaccaactt 
tcaataagat 
ctgcacctcc 
gggactacat 
tattattatt 
ttacttccaa 
atctgccttt 
gtggtatacc 
gatcttttga 
ccttgtccaa 
tgtaaagggt 
tccctttgtt 
ggtatagtaa 
tcttctaaaa 
gagaggatta 
ggccacttgg 
ctagagaatg 
aatcgtgcaa 
cacaagaaga 
tacaagattg 
tggaaataaa 
atttgttgta 
tatgtctgga 
ggtgacgtcc 
ttgtccagtg 
ttgtttcaaa 
gtctaataga 
agtattttta 
attatagttg 
ttttccactc 
actggaatct 
ttctaaccag 
attgagtaaa 
atccgtgcga 
aacacaattt 
cttttgctct 



gaaaaattct 
actattgagt 
aaggaactgt 
aaaalitgaag 
gagaccgtct 
accgactaaa 
taaatatggc 
ttccattatt 
cgttgctggt 
taaaaccgag 
cattatgcaa 
atcccgcccc 
ttggacattg 
agatgagaag 
aattgaaaat 
ctttgacttc 
tagactggtg 
cgccatcaag 
ggaactaatt 
cattttctgg 
ttctagggac 
tgatacccca 
agaatttctc 
ccagtacatt 
taatgttacc 
ttttgaaaag 
caagaaggac 
tccagtctct 
cagttatgac 
gttataatat 
tattcctgac 
tacctgatgg 
ctacagatgt 
ccctctgtat 
ggatcaataa 
ttagaaaggg 
gggtagacag 
ctgatgatga 
tttttggtaa 
agatattaaa 
ctccggctgg 
cagtacctca 
gaactacgcc 
aggaatatct 
tgtacgacaa 
atacataatt 
tgttttaaag 
gctctgatga 
gtttgagaga 
acggtaaatt 
gcttcataat 
acaataacag 
ctgctccatg 
gctctcccca 
ttattgttat 
gttagaattg 
aaatctgata 
gggtgtaaag 
ggtaaaaatg 
cttgcatttt 
gttgtctggt 



gttcctggga 
ctaagtggca 
gggcctttgt 
acatcgacca 
gcaaaataaa 
aaggctgatt 
cacttctcac 
actccaataa 
ctgaatatgg 
gaggaaaaga 
gtagatacgt 
attgtccatg 
gaagagattg 
gatttcattc 
cttacaacac 
caagttggaa 
cccgacgaaa 
aagaaggagc 
gttgcctttg 
atcaagaaca 
gaaggtcttc 
tcaagagaaa 
acagtttccc 
gaatttgtgg 
aaccccttcc 
agagtcgccg 
caacaaaccc 
tcctcatcat 
gatttttagt 
ttttaaaaaa 
aacctttttt 
gaagaaacaa 
gtgtgggcca 
aggaggtaaa 
cgctgtcgtt 
gttccgaatt 
agatatgcac 
agatttggta 
aggattcaat 
gtctcatttt 
agattttttc 
aatacagaac 
aggtgccgcg 
agacaagaag 
gaataacaag 
aaagaa ttta 
acaatacaga 
atgtcttgcc 
gggttcgctg 
gtgtaccgtt 
cttcctcctc 
agttggggga 
gtgtaggatc 
tcatatcata 
cacaataa tc 
tatgagaaac 
cactatcatt 
gttccacgcc 
gcctgtcttc 
ttaatgcctc 
tattaaatcc 



aagattcaat 
gaactgggaa 
gaacaaacag 
gt tattgagg 
gagcagagat 
gatggcaacc 
accacagca t 
taccaacaac 
agcaagccaa 
ggattgccca 
ctaaacactt 
aagatctctg 
atttcgaaag 
tccaga ttct 
gtcttcgtca 
tggagagtat 
aagacaaggc 
agtgggctat 
ctgcagttga 
ggggtatttt 
atcgcgact t 
ggattcttga 
tgcctgttaa 
cagataaact 
cattcatgga 
agtatcaacg 
aagaaactgg 
ccgaacaaga 
tccactattg 
taaatgttct 
gttgcggtag 
cccttggttt 
aagggagctg 
aacaatggag 
gaacgagatt 
cctggcactg 
gccaagtggg 
gattctggta 
ccgagacctc 
aacaagagga 
ctcgcttcag 
cataacaatg 
tgggctgaga 
aagaag taca 
gaat tgttcc 
tattgtttta 
taat tttcct 
agatgcca tc 
ggtttcaact 
at ttagtgtg 
atcttcatag 
aaaatcagtc 
atgctttcta 
cagtgtacac 
ctcatcttct 
taatgcctct 
acatttattt 
aaataaaggt 
gttg tatgat 
agctgatata 
tccacacaac 



actgaagttt 
tcttttctgg 
gaaatatctt 
agtatcttgc 
ggtttgtgaa 
cccccccctc 
cattccctcg 
cccagaaatg 
ccaagtggct 
ggaagtgttt 
acttacaaga 
ggaaatgtac 
ggatcctaaa 
ggcgttcttt 
agtggcgcag 
tcatggcaac 
tatcttgttt 
taattggatg 
aggaatcttc 
gcctggtctc 
tgcatgcatg 
aattgtcact 
attagtggga 
attggttgaa 
caatatttcc 
tgcccaggtc 
ttctcctctc 
agatgttgaa 
tgtcaatagg 
ataagactaa 
atgcatgctc 
ttgattctgt 
gcgaattatt 
gtgaatggtc 
actcccttga 
accattttca 
tgcgcaacaa 
ttaggactaa 
tttactccct 
cgggtaattt 
agagatgggc 
gtgaatggaa 
ctggaccaat 
tccctatccc 
gtgtacagag 
ttttgtcatt 
ccatgacgaa 
ctagtttttc 
gcagccaaga 
atgacattat 
tcatctgcaa 
gtgctacgtc 
ggcttgtctg 
actagtgtat 
ccaacaagag 
ttaaaaatat 
gcagtattat 
ataggcacaa 
tttgaaaagg 
cgtagacgta 
gagtacatgg 



tgctcccttg 
gttattcgag 
ccatgaaaga 
aagaccagaa 
ttggccacac 
cagactcagc 
tcatcggtcc 
gagtcaatca 
gaagaaatca 
gacaaattca 
gaaaacccca 
aaaaaagagg 
gattgggaga 
gcatcctctg 
attccagaag 
gtctacggag 
aacgctgcac 
caaagcaata 
tttagtggtg 
acctcctcca 
ctgttgaaaa 
gaagccgtcc 
atgaactgca 
atgggactag 
ctcgagaata 
atggcttcta 
ccaattctga 
gacggcgtcg 
ttgtgtattg 
aacaatgaat 
ttgctctacc 
attagaagag 
cactggtgtg 
aggaaaaggt 
cgaggaggat 
tactgtcttt 
aataaaccct 
atttaaatac 
cgactatcaa 
ctttgctcga 
aacttttgct 
agatattgaa 
attttaccaa 
tcatgccctc 
tgatatgtct 
tatttgatac 
acactggtga 
tcctggggaa 
aaatatccct 
ccctctctga 
cattctcagt 
cagactggcc 
aatgtagacc 
taccaatgcg 
ttgagtacac 
cgtcactctc 
caataaatga 
actttgcctc 
ttgttgcaag 
tactgtttaa 
cattatcgat 



97140 

97200 

97260 

97320 

97380 

97440 

97500 

97560 

97620 

97680 

97740 

97800 

97860 

97920 

97980 

98040 

98100 

98160 

98220 

98280 

98340 

98400 

98460 

98520 

98580 

98640 

98700 

98760 

98820 

98880 

98940 

99000 

99060 

99120 

99180 

99240 

99300 

99360 

99420 

99480 

99540 

99600 

99660 

99720 

99780 

99840 

99900 

99960 

100020 

100080 

100140 

100200 

100260 

100320 

100380 

100440 

100500 

100560 

100620 

100680 

100740 
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attctccctc 
gtctttaccg 
ttcacagtgc 
attggaacct 
agaagggaac 
gacgaatgtt 
agctggaaga 
gtgtagagat 
ctcctttgtc 
cttcccatct 
catcagggac 
cgaagagata 
tctgtttgtc 
gtatgtgagc 
attaacatcg 
caaggcataa 
gtctgccatg 
tgctttctcc 
atttttgtca 
gtcattatct 
caattttgag 
gaaaccgagc 
ataagcaaat 
ataattcgtt 
tggaaggatt 
aatatctgac 
gggctttttt 
gtttgccatt 
t tctggagca 
ttcaacgtgt 
ggtgcaggcc 
tattatggaa 
actatttact 
cgaaaacagt 
gtgccgtata 
gagtgccttt 
gaggtacttt 
taaagacgtt 
atttttttaa 
aagtgggagg 
taattatctc 
agaaatcgcg 
ccaataattc 
aatttgatcc 
ttactgggtt 
aagaagacga 
taatgctcga 
tgtctataca 
ttgtcccaac 
aagaaggaac 
ccttaatcat 
aacagtacat 
cccctttttt 
gcacaa tgaa 
tgcagaccat 
cacaaattta 
taatatctgt 
accagaagat 
tataggaggt 
tcctgcctat 
tcctcctcca 



gcatgatcct 
agaggaagaa 
acagatctct 
agcctattcc 
aaaaattgaa 
tgacgttgtt 
gggtgttgaa 
tttgtgtatt 
tttaattctt 
tgccctcgtt 
atttcttcca 
aaagacccta 
atccctgaac 
ccacaagcaa 
ttcgtaaaac 
gaagcagaca 
tagccatctg 
actctatctg 
ttagagttta 
tcattggagg 
aagggagtat 
aggtaaagtt 
attaatcccc 
tcaacgttaa 
tcaaaattct 
cttatgtatt 
gcatttttaa 
gcttcatcgg 
attttttcag 
ctagattcta 
aatctctcca 
gcaactccat 
ccatcacgag 
cttctggatg 
actctgtcta 
tctgcttcta 
ggcacgcacg 
ggctgtttta 
ttaacacacg 
agaactttcg 
tactatattt 
agtcagtaaa 
tccctacatg 
caaaaatatg 
ccctaaagat 
agaggaacag 
atctctgaga 
aggttctcct 
ccctaaaacg 
aggagaaaag 
gggctcttgt 
gtacacattt 
agaaaaattg 
aagttgatca 
gtaatgaaat 
ttaaaacaag 
ctgttaaaaa 
gttgcattct 
aggcccaata 
ggttatctca 
tctccagctg 



ctatgaccaa 
tatctgcatt 
ttatgaattt 
aaggaagtga 
gtaatctggt 
ttagtagctg 
agccacagac 
cctcttcgtg 
ccaagaagtc 
tcatcacgtc 
actttttagg 
taacattctc 
caaagcctat 
ctgtcaattt 
tctgtttata 
cctttcgtcc 
attcgtgttc 
ctacaaattt 
ttcttgcata 
atgaattttt 
ccagtacgtt 
gtacacatga 
tgatgtggtt 
agtacctttc 
cactaaacat 
tttcataaaa 
gagtatccga 
gcaataattt 
catataaacg 
ttgtcttcac 
tcttcttgtt 
ccaacatgtg 
cagtgaagaa 
atgttttcag 
cgaattccca 
tagttgtagt 
tgtgcactga 
cacctctcag 
caaaactgac 
cctgtggtcg 
gccatctact 
gaccccttgg 
atgttcaata 
aatgaagagg 
gtggaaaagg 
acgacagata 
ggggataagg 
ggtacaaaaa 
gaaaagtatg 
gcagaacatt 
ggctaccctg 
gaaccaataa 
gaacaacaac 
aacggttaca 
ccatatcaga 
ttgaatatgt 
tcctagaaag 
tcaattctaa 
tgacggtgcg 
cagatgataa 
caagaagatc 



accagccttg 
gaattttgaa 
agaggctaga 
attttgatgt 
ccaattaacc 
aagagacact 
gccaaatctg 
gatttttctt 
ttgtacattg 
gttgttctta 
gtcttcatgg 
aatgcaatga 
tactggaaca 
cttcagaagg 
cttgtgagca 
caaaagttta 
ggcgataggt 
cactgct tta 
ttggagttca 
agttttgttg 
ttctctacat 
aaaggcacct 
gctagggaat 
caatctacga 
acccgacaac 
gtcaaaa tec 
aagatgcacc 
gctcacttca 
ttgccatatt 
atcgacaaga 
gtagtagtct 
gtacagggct 
atcttgaagg 
tctcatgtca 
tgtcaggttt 
agaagttgaa 
ccgagacaca 
aggaatccct 
gctgaaatcg 
tagggatact 
acttttattt 
ctatacgcgt 
caaacat taa 
aacgagcaga 
aacaggtgca 
ctaaccccaa 
ctggattttt 
ctgtctgtaa 
ttccttcaga 
attccttcac 
tgatgaagga 
aggaaaaaaa 
aatgtcgttg 
aacctctgta 
ctacataa t t 
tttcgatgaa 
atgcgcacag 
gatgggggaa 
ggttaatttt 
tgatactact 
cccttttt tc 



gtcattagaa 
attatatcat 
gatatgaaat 
cttttattat 
gcagctgcag 
tgggttggct 
aacgtcatct 
ctcctttcat 
tcctcattcg 
aacaggcgcg 
atgtccttct 
actgcatact 
accatgccgc 
atctcgtatt 
acccagttca 
gaaatttgaa 
gtcctcatca 
acaatacttt 
caggggtaat 
tcgttcttta 
gcactagccg 
ccaaaaatat 
ctaaagttgt 
ggcactttaa 
ttgtttgatt 
gtcatcgtta 
aaaccagttc 
ttgtatgcat 
ttcttatcag 
aagtatcctt 
atcctcttcc 
gattgggtac 
aaactgtata 
gttctctgca 
tcatttacgt 
tccataatgt 
acggtgtact 
ctaccagaat 
tttcgcaata 
ttcggtggtg 
ccgtccaaag 
agatgctaac 
ggctagaaat 
attt ttcaag 
gaagaagaag 
aaataagaaa 
gatagatgag 
gaaaacgacc 
tgaaagattt 
gtctgttaag 
tataagcaaa 
gtaatgtctt 
gctgtgacag 
tatcacaccc 
tctcgtcgat 
gaaacaggag 
aaaggaggaa 
gtaacgcgcc 
aaacatgggc 
actgttactc 
acacgcactc 



aatggtctgg 
tgacaacacg 
gtctatttaa 
ccctctcgtc 
tttctctctg 
tgtcactact 
ttagattggt 
tgggcacaaa 
cgaacgttac 
cagaaagctg 
tcctcttccc 
gtctcaattt 
acttgtggtt 
tttcacttcc 
ctataaataa 
gggagagcca 
ttgtgacgga 
ttctgcacgc 
attcttcccc 
tatactgttt 
cagaaagagt 
taccaatgaa 
gctccagact 
tgtttatgtc 
ttcccaatac 
aatcagttgg 
tgtagttgta 
cgacaacttt 
gatctagtag 
cacgtatact 
tgaagcaaga 
accactcgtc 
tagagatgaa 
taagtttctc 
attcttttag 
aggaatgagg 
gaacaagggt 
aagtcct ttc 
atgggagagg 
ttaaccgcca 
gaagggttaa 
catgtcctct 
acactaacag 
atccattatt 
gaggaagaag 
gtggaatatg 
aataagacca 
ctaccaaagt 
ttcaggatgg 
acaggaagaa 
ggtacaagcc 
tgtttatttc 
aagattacgg 
ctctattagg 
tcatgaacta 
cagttatagc 
tatatgatgc 
tatttactat 
agacaaataa 
ctcctgttac 
tcatatccga 



100800 
100860 
100920 
100980 
101040 
101100 
101160 
101220 
101280 
101340 
101400 
101460 
101520 
101580 
101640 
101700 
101760 
101820 
101880 
101940 
102000 
102060 
102120 
102180 
102240 
102300 
102360 
102420 
102480 
102540 
102600 
102660 
102720 
102780 
102840 
102900 
102960 
103020 
103080 
103140 
103200 
103260 
103320 
103380 
103440 
103500 
103560 
103620 
103680 
103740 
103800 
103860 
103920 
103980 
104040 
104100 
104160 
104220 
104280 
104340 
104400 
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gtcgtcttca 
ggtgtatgat 
ccctaaaacc 
acgagacagg 
tttagtggat 
cggtctcatt 
tggcatccac 
cacaacccaa 
ctgtgga taa 
tttctagtac 
aacatcttct 
ctattgaaac 
tatcattatc 
ccaacgcgtg 
gtaaaaagta 
atggtcctct 
acaggctggt 
taattaatag 
tatcgagggg 
acagtaatcg 
atacattttc 
tagtgcataa 
cattatcgga 
aggtgatttc 
attctaaggg 
cgtttttgcg 
ttatatgta t 
tccctctct t 
ttagagatta 
agtctgcttc 
atgatgatga 
ccttaataag 
gagacaagca 
atacactctt 
agaatgtttt 
gaacatta tc 
caacaccttt 
gtctttacaa 
agtttgtcc t 
cctacgata t 
acgaaa tagc 
ctatgaacac 
aaaagaatag 
ctgaatgtgc 
gcggtca t tg 
atggcagccg 
tcggaaaaag 
aaaa tgtcg t 
gatatgaat t 
aatattggta 
ttttgagatt 
ccaccctcgg 
cagaaaaat t 
gaggtctgaa 
gaaatgtgt t 
ccagcggtcc 
gagtgaccca 
ctgaaaggga 
ctgt tccaga 
ctgaca ttgg 
tccaccctct 



g ttgaccatt 
attcacgcag 
tcgtttgaaa 
gatgttttag 
gttgtgcacc 
agacctgaaa 
ttcttcatca 
aaagaaaaaa 
aaaggtggta 
cgaactgcaa 
agacgaagga 
aatatacgaa 
aatgttgatc 
cgttcaaaat 
tctgatattg 
t tctcacatg 
gcgttacttt 
tgtcttgtgg 
tgttttcatt 
gccacaaaat 
atccagaatc 
ctttgattac 
cagcgtcaac 
cgactggaaa 
g ttgatgaca 
cgccttgaag 
actgaaaaag 
caaacacttt 
cagtctcatg 
acttttcatg 
aggaggaaag 
ccaacttatt 
ctttactccc 
g tgccttata 
gcatatagcg 
cagcgaacaa 
aatgatcgcc 
gcccaaaata 
cttaaagggc 
taacatcata 
agcagcagga 
gtggaacttt 
gcagaaattt 
aatatgcttg 
cttccacaac 
cgcaagagga 
actaggggtt 
gccttcggta 
tcttgtaggt 
ttatttttac 
tctggacaag 
aact tgacat 
1 1 taaaaaat 
gaggctatga 
t tgagatttc 
accctcggaa 
gaaaaatttt 
gatctgaaga 
aacttactaa 
gtcgacccag 
aaactcgagc 



atgtattgat 
aaacctttcc 
tttctgacgt 
acaatgttca 
gtggatctag 
aatacccttc 
actaagaaaa 
tcaaagaaag 
aatctcatac 
atgataactg 
gaatttaacc 
atactctctt 
cacatacttt 
gttttgggga 
gaagacttac 
ctccattctt 
agcacgtatg 
ggaattgata 
gtgtcgtacg 
actccttcac 
tcagtgtaca 
ggttccaaag 
acttttgtga 
atgccttatg 
ttatcagtgt 
gaaagggaaa 
tctttgttcg 
ttaaaaattg 
tgcaaaactc 
gaagaattcc 
ggacataccc 
tcacacccaa 
ctcatgcacc 
ataaatggag 
attgaaaatg 
attgaaaaaa 
ttggcgaggg 
aaggtccgtt 
ctaaaggaat 
aaggatgcag 
ctgcgaggca 
ttcaccaaaa 
gtaaagattg 
gatagtctgg 
gtctgttggt 
ggaggaataa 
gccgactatg 
gaagaaggaa 
ggagtgtgga 
accttacaca 
ccatttctgg 
tgggtcgacc 
ttttgagatg 
cggttctagg 
tggacactcc 
cttgacatca 
taaaaatttt 
ggctataccg 
agttaaagta 
cggtccaccc 
gacacagaaa 



gcatgataac 
ccataaagct 
gactctcgat 
cgactatatt 
tctccgttga 
atttgttaga 
gagtacatga 
tactaccatt 
acaagatatt 
aatgtaatgg 
ctactataat 
cttctgctga 
tcttcgctga 
acgactataa 
tgaacggcgt 
caatccctat 
atggagatgc 
aaagtgttct 
taccgatgcg 
aaaatatgtc 
gaaacaatac 
atgcatcatc 
ggttgtacac 
ggttctttaa 
ctgaatatac 
ttttaatcta 
gatttaattt 
aagaggttaa 
aaaaagattt 
cttggcttgc 
tattaacatt 
ttttaaaatc 
tcgccaacac 
ctaaaccaga 
ttaactatgg 
tggtcaatgt 
agaatattgt 
tcggttcttc 
cagttgcata 
taatggacaa 
ataactgcga 
attcaaccaa 
tggatggtat 
acggggatct 
tatccttgat 
aatgcccgtc 
atattgaaac 
gaagggaatg 
caaatgaaat 
tgttccagaa 
agtaatctga 
gagcggtcca 
gagaatgagt 
tgcccgccct 
atttctggag 
ggccgaccca 
ttgagatgga 
gttctaggtg 
cgtttctgga 
tcggaacttg 
aatttttaaa 



ccaaaaagat 
ccttctgttc 
tgttcaatgg 
gctaacgacc 
gtaagtactt 
aaagtgtgct 
agaagatgaa 
tcctgttgac 
agatcaagaa 
tgcgcgagaa 
tgaagtagta 
cgacaagaag 
taagggtact 
agtggaattt 
ttcaaatcat 
tgtacaagac 
tcaattcgat 
caacgaattg 
tgtacgaaca 
agcactaggt 
ctttaaaaaa 
atcatctcct 
caactatgac 
gaaaactttt 
actcaaaaaa 
taaaatggag 
caggtgttta 
acatatagca 
gcagagtttc 
aaaaacttgg 
tgctatagtg 
gttagtgaat 
gtctataatg 
gttcataaac 
agtcatcact 
aagaagaatg 
actcgctcag 
aaagaggcta 
tttggaaacg 
cagtcttttt 
ccctgaagca 
atgggcaagc 
gaataggaca 
tcct tcaggg 
aaggatgagc 
ctgcagacaa 
agaggaagaa 
gaggaagatt 
aaaactataa 
acgtctgttc 
cattgggtcg 
ccctctaaac 
aaaatctctt 
ctccagaaac 
taatctgaca 
gcggtccacc 
gaatgagtaa 
cccaccctct 
caagccattt 
acatcaggcc 
aaatttttga 



cttcatttaa 
ctaccttccc 
agattttttc 
ccgtaccatt 
tagttgcagc 
tagatcaaaa 
aatctcatcc 
aagtatagag 
aaggaccacc 
gatctgctta 
tcatccatgc 
tttgtacaga 
atgtgggtat 
gaaaatatac 
tggtctgaac 
atgttattga 
atatcattca 
acacaattga 
ccttcaaagg 
atgaaactca 
ctaaccgagt 
cctcctcctt 
atattcttaa 
gacgtccttt 
gagttggtta 
aagagagaca 
aaacaattac 
cgttttgtct 
cctgccatac 
atcgacgacg 
cacagatatc 
actacatgta 
taccaatgca 
aagttcaacg 
gaattgagag 
atggataata 
ctttttgacg 
aggataccag 
aggaatatat 
gaagaggagt 
gacgagaaga 
tctattttcc 
tatgaagact 
agaacaacgt 
gggccgaa ta 
gtcacctgcc 
cgtgacacga 
ggtgttgaca 
aatgaactag 
cagaaatgtg 
acccagcggt 
tcgagcgacc 
ccctgaaagg 
gtctgttcca 
ttgggtcgac 
ctctaaactc 
aatctct tec 
ccagaaacgt 
ctggagtaat 
gaccgagcgg 
gatgtagaac 



104460 
104520 
104580 
104640 
104700 
104760 
104820 
104880 
104940 
105000 
105060 
105120 
105180 
105240 
105300 
105360 
105420 
105480 
105540 
105600 
105660 
105720 
105780 
105840 
105900 
105960 
106020 
106080 
106140 
106200 
106260 
106320 
106380 
106440 
106500 
106560 
106620 
106680 
106740 
106800 
106860 
106920 
106980 
107040 
107100 
107160 
107220 
107280 
107340 
107400 
107460 
107520 
107580 
107640 
107700 
107760 
107820 
107880 
107940 
108000 
108060 
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gagtaaaaca 
cccagaaaca 
tctggagtaa 
cgacccagag 
agatgtagaa 
gcgcagctat 
gagatgtgcg 
ggaaaatgct 
tgagtaacca 
aacaagtacc 
aaacgtcacg 
tataaacaga 
caactcgggt 
acattcttat 
aaagagtatg 
aaaagattgc 
atgaagaagt 
gattgggaga 
ataatagtgt 
caccaccatc 
atgtcttcca 
agatggacaa 
ccccaacaac 
cctccattcc 
gaccagtgca 
agggaaagag 
agatcttccc 
aaccaacaca 
cttgtgaatt 
gctcgcgccc 
gttgtcatcg 
ccagatattc 
gattcttcta 
gtttattctg 
atgaaattta 
agtgcgcctt 
ctctttggtg 
tcaccctctg 
tcatatccaa 
aagtagaggg 
gttgtttttg 
cctcttgtac 
ggtgtcaata 
tcatagaaac 
ctaacgcgga 
tcttccaaat 
cttcctccta 
gttggttcaa 
tctatgtctg 
tagggggaaa 
aagtccacta 
atgtggaaaa 
aaaatgtcat 
tcgaacgtac 
taaggagtgt 
ttggaagata 
gttttagttt 
gctgcagaga 
ctatttcctc 
actttgggac 
attttatcga 



ctctagtgga 
tctgttccag 
tctgacattg 
gtccaccctc 
cgagtaaaac 
acccagaaac 
cacgtatgca 
atttaa taaa 
tgtctagcgg 
ttctagatgg 
ccggttccct 
gacggggaaa 
cagtatcagt 
ttgctcagta 
ttgagagctg 
gcttgtatta 
acaacaagat 
agtaccatgc 
ctcaaaggat 
gaggcgggta 
accgattcag 
ctgtcaagaa 
aacaacaaca 
ccatcattag 
gaaattgcga 
ttcgctcatt 
ccttccttaa 
tgtaccccct 
ggcgagaccc 
ctacacccac 
aggaagaaac 
ccgcaa taac 
cagtat ttct 
cattat ttgc 
aatctttatt 
ctggcaacac 
ggaatggtcc 
aaacagaaac 
aatggtgatg 
ttgaaagatc 
ttccactaag 
aagcactc tg 
caggaag tac 
cgaccatgtc 
gattttctgt 
ccacataatc 
acatttccat 
tgcttgcgcc 
taaagcaagg 
atgttcccac 
tccttctagc 
tttcagcctc 
tagatacaat 
ccttcct ttc 
aagccg tact 
tatcccatat 
ctccgatagt 
gatgaacatc 
tgctaggtgt 
agaataaacc 
atatgttttt 



ttagggtgtt 
aaacttaata 
ggtcgaccca 
taaactcgag 
tctctagtgg 
atctgttcca 
tgcatgcata 
acaaccacca 
aaaagtaacc 
aggtgcagca 
ccacgataac 
aacaaacagc 
tgtccaacac 
caatatgggt 
caagaagcac 
ctttaacaag 
aatcttccct 
ttctattcga 
gtaattaaaa 
taaataaggg 
tcagctgagg 
caggaggaac 
tcaaaaaaga 
ttggggagcc 
tgaaactggc 
aaacaacact 
taagaagaag 
caacaagtac 
ttcacaagaa 
tccccaagaa 
tgcaccagaa 
tgcaactact 
cagaaatgtt 
aaaatgtatt 
gtttctttcc 
tcccatgtcc 
tctatctgat 
agttttttct 
gtgtgcaaat 
atcaaatggt 
atctgaaggt 
agcagctgta 
atggactctt 
t ttttgtcct 
agcctttcct 
tgaaatagat 
attgctgtta 
ttcatttatt 
cgt tgcatac 
gttaaatgta 
ctttgcaccc 
cgcaggaggc 
atctgtcata 
gtccaaaaca 
cctacgcatg 
attttctgtt 
atttagtatt 
gtttttaggg 
tgccgcta ta 
gttggtgcaa 
actattagga 



aacagcctac 
aagttaaagt 
gcggtccacc 
cgacacagaa 
attagggtgt 
gaaacttact 
cgtacctatc 
ttctccgctc 
tacgaaatcg 
atctgtctgc 
ctcttcaaga 
aaaaatctgg 
ccgacagaac 
aatggaagaa 
gaaagggtcc 
tgtcttcacg 
gccagaattg 
gatttctcca 
aataaaaccg 
gcgctggcac 
ggcaacgagg 
agacagcaac 
acatcaacca 
ctcggcagct 
tattacaatt 
ccaagtgaag 
gacgttgacg 
agtttccgtg 
aaacaggaca 
aaggaaccag 
ccagaaccag 
actactacta 
attctgagta 
agatctaaga 
aattactcca 
acaagtgcta 
tgacaaacaa 
cttgtataat 
ttccttagat 
tctgggcctt 
ctcatgaaat 
ggtgcgcaag 
ctcatgactc 
ccctctacat 
tgcttattga 
aatctcctct 
actacttcca 
gtcgcagaat 
aggctcacta 
gaggcagtaa 
tcttgacccg 
acgcccggca 
ccaacatccc 
gcagcaggta 
gcacaaaatt 
gcagctttta 
ctagcgtctc 
agggtagtca 
gggtcagttt 
ttacagttct 
tcttgcatct 



cctttctggt 
acgtttctgg 
ctcggaactt 
aaatttttaa 
taacagccta 
aaagttaaag 
tgtatttttt 
actatacagc 
ttgaaggggg 
agtctaattg 
tgctaggatt 
ccataattga 
cagaaaggtt 
aatgttactt 
acaaatcttc 
cgatcgccaa 
ggtgcgcggc 
caatcattga 
tcgctggcaa 
atgggtggca 
agatggttgg 
agtattccca 
attctcctcc 
actcaatgta 
tccactctta 
gcatgtggcg 
aagctccacc 
aatatactcc 
agatcttaca 
aagtagaaac 
aaccagcccc 
cagttgcaac 
tcgtgttttg 
aggaataaat 
ttgaaattgt 
aaggttgtat 

^ggggaatct 

aatatctgct 
taggtccccc 
taactcttac 
ctgcataggt 
attcgaatcc 
tagggactct 
actttgatga 
ttgtgaaaac 
tctttgtact 
ccacaacttc 
cggccgatat 
tatccttgac 
ctccatgaga 
tttttgctgc 
agcgttcata 
ctaacaattc 
ccttttcttg 
tattaggggc 
ggtttatctt 
cggattgagc 
tgaggggaaa 
tgttcctacc 
ccttttctcc 
caaaaatact 



cgcagctaga 
acaagccatt 
gacatcaggc 
aaaatttttg 
ccctttctgg 
tacatttctg 
taaaagtagg 
tgaccttcgt 
attgttgaac 
tgttgcaaga 
tggcgacccc 
agatagacct 
ttgctccatg 
ccctaacgac 
cacagaaatg 
atcacctgca 
agctggagga 
taaggaagtg 
cccccgccac 
cactcgcatc 
ggactattca 
tagtttccgt 
tgctccacct 
tcgactggat 
tgatagaaag 
gcgcacaagt 
tcctcaatca 
ttcatcaaag 
agaggaagaa 
taaagatgat 
agttccagac 
acgtcacgac 
gtttctgggt 
aaaatggtat 
cctccttgtg 
agttccatcc 
atccttcaaa 
cattaatcct 
tacgcttcta 
cccctccaat 
tatttccaca 
tccatggtag 
tttactcgct 
tgatgatgtt 
ttccgcatct 
accggcagtg 
actcccatca 
tgttggcgaa 
accctcaacg 
agactcgaaa 
atcccttata 
atcctccat t 
tgcgttccta 
tcttgtggaa 
caacataaca 
tcttccat ta 
tctcaggagt 
acgtgaatgc 
ccctccattc 
caattccccg 
cttttctcca 



108120 
108180 
108240 
108300 
108360 
108420 
108480 
108540 
108600 
108660 
108720 
108780 
108840 
108900 
108960 
109020 
109080 
109140 
109200 
109260 
109320 
109380 
109440 
109500 
109560 
109620 
109680 
109740 
109800 
109860 
109920 
109980 
110040 
110100 
110160 
110220 
110280 
110340 
110400 
110460 
110520 
110580 
110640 
110700 
110760 
110820 
110880 
110940 
111000 
111060 
111120 
111180 
111240 
111300 
111360 
111420 
111480 
111540 
111600 
111660 
111720 
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atattggcag 
aacgaagaat 
agggcacccg 
gagtacacgc 
aaccgattag 
agtagcgact 
ttgacttcac 
tgcatggacc 
ctggaaggta 
gccgcagaca 
gaattgaagg 
gttactcgat 
ctcgcgttga 
atttccatag 
aggattgctc 
tcgcgcatac 
tcattagaag 
tcccccactt 
gaatttgtag 
tgcgctgtaa 
tcattgtata 
aagttgggga 
tttgtaacca 
gtcaaaagag 
ggtctatagt 
ttttccttta 
tcgtgagatt 
tcagaatctt 
gtctttactc 
tgggacgcta 
ctatggggta 
ccagctccaa 
cttcctgttc 
tgttcgtcag 
cgcaataatc 
ccaaaaagag 
ctagggaggc 
agcaataatt 
agatctatac 
tttgttccag 
tccttttcct 
ctctgctggg 
tcgaaatcga 
cccactcctt 
ggatctttat 
tttttggtga 
tagtgcaaat 
gtagcaacat 
agagggacac 
tctctactgg 
tcgttgcctc 
gaactaattt 
actccagcca 
ataattcccc 
gtatctacaa 
attataaatt 
tgttgtctag 
gccgtggaag 
caatgaagaa 
gaaagaagtc 
atttaattga 



gaagtagccg 
gtgattgcgg 
agcacgggta 
cgttcaagta 
ggatcaattc 
caaagagagc 
attgttgcca 
cgaccgtgcc 
tgggtaaagg 
ctcccgcagg 
aggttttaat 
cataagtgtt 
ccggcgatct 
gtcttacaag 
ctgcaaaaat 
ccgtaccttc 
ctatgcgcca 
tcttaaacac 
acaatatttt 
caggttgcag 
ttgatcttgt 
aaaaatagtc 
tcatagaatt 
agtgcttgaa 
gtaccaaggg 
ttactctatg 
ttctcagttt 
taaggaagga 
ctgaaggatc 
ttatgcccac 
aaggcacgac 
ccagcccatt 
cactgtaggg 
aatcttcaat 
ctcccgtaat 
aagcggcata 
acgcgtcctt 
ttgcaccctc 
cggccacttc 
tatctcctac 
gtataccacc 
agtagccagg 
tatcgtccac 
ttcctccaca 
tgagaaggga 
tataagtcgt 
aagcctcctc 
catcgtcgtg 
aaactccgtc 
atttgacgca 
ctacgcgaca 
cccctggtaa 
taccgaaggg 
cattttcttt 
aagagctcgc 
gggaagaccg 
gaacaaagaa 
caagaagagg 
gcgtgcagga 
tggagcccgc 
ttttttttct 



tgtttgttcc 
gcccaggggt 
tccgaaagat 
gtggccggcc 
tggcctaata 
cataggagga 
taattcatcc 
gttcgagact 
aaatccccct 
aggagcgaga 
gtctgttgtc 
gttgatggct 
gcgttcgaac 
aggtgaagat 
atcgggcttt 
tattccatta 
cgcagttgga 
tgcagatgaa 
ctcactgaat 
aataatacct 
attgatgggc 
cagaagtaaa 
taatgtactg 
aat tgtctcg 
caatatctta 
gagtgaagga 
ctgtaagggg 
tgtttctatg 
gcctgctctc 
agcatgatct 
cctgagcgat 
atcgt ttagt 
cactacacct 
tcttgagtgc 
cagatggcct 
agaatagccg 
agtactacca 
tacaaagggt 
ttccagtctg 
aaccccatac 
ttctttcttc 
ttcgcacaaa 
atcatcaata 
cagtacaata 
aggatccttt 
tccctttttt 
gacagtagta 
cttgtcgaac 
atattcatca 
agttcctccc 
ttttttagga 
tagagattct 
ggcgtccgta 
taatcctttg 
ttaaccacag 
atacagtct t 
acattaaaaa 
tccaccactg 
aagaagagct 
aagtcaaggc 
tctgaataat 



tctacataac 
ttggggacca 
tggccagttt 
gtgtgggggc 
aacgtttcag 
gggatgaaat 
ccttcatgga 
gattcttgga 
aaaactccca 
ggcttagaaa 
gaatttagaa 
gtcaagttgt 
caggtgtgtt 
gcaccattca 
tttgataacg 
t ttagtcctc 
gggtgagcaa 
ccagccttgt 
cttgcaaagg 
ccaatttctt 
aggccagacc 
tttgtaacgt 
t tcgcaaaag 
atgggagtgt 
tttactgttt 
acgacgtatg 
ttatctttgt 
g tcgtatcgt 
atgtgaaccg 
atgctatcga 
gcgtttggat 
gcatctccgt 
cctacagatt 
caattgccgg 
ccgttaaacg 
agagccgaca 
cctaaagttt 
tgagcataat 
a tttgtgtac 
ctgaacatga 
tcacattttg 
caatccatag 
a ttgagaatg 
tctgtgcacc 
cgcaccacag 
cctcctccag 
aaagggttga 
gcagaagagg 
tcatcttcta 
ccgagagaac 
gttttaaaac 
acaaaacctt 
tcgacatata 
acgcgtaaag 
t tgcactcac 
tcatgacaaa 
tggttgcccg 
ctggacgcat 
ccactgtccg 
gttaattctt 
tggaaataat 



tctttataga 
tcattgtagt 
cttccctgac 
tcagacccca 
aggaaagagt 
tgagggctcc 
attttagagg 
cactcttttg 
aaaatgatgt 
agggtcgtgc 
tagctctaat 
cggccgaacc 
cggggagaga 
ttaaaactcc 
tttttgcgac 
ttcccgagaa 
acatttcata 
aatgatcaat 
tggaactttt 
tgaatttggc 
tctcttttcc 
cgtcaaaact 
attggtctct 
gggccattcg 
tatcattttt 
ctgttgcgca 
acgaatcgta 
tgtgcgcatc 
tctcggggaa 
ttcctaaacc 
gttcagtaaa 
tccagatgtg 
cactgactgt 
cggcgtccat 
ctgtaatttc 
tcctcgtatc 
cgtacctttc 
ctgtttttgc 
tttcatcaca 
cagggcagcc 
gggcgttggt 
tagaagtcat 
ttgtagggtg 
caagaatggg 
acgaatctac 
ttatgactcc 
cgtttctgta 
ctgaaaaaat 
taccgtattc 
gacagctaat 
agtcaaatgt 
tccctatgat 
aggggtcgaa 
agttticgaga 
ccgggtacaa 
ataccccgag 
aagctccaag 
ctccaagcgg 
tcgccgttcc 
ccctgtacaa 
aaaacatcca 



aaatttagat 
gtactgagag 
acaaggcggg 
atctgctctg 
cctaaccctg 
ttcagtttcc 
gactattcca 
aatgagaaaa 
agcttgagca 
tggagcgtac 
gtctcccttt 
agggtctcct 
agatgcagga 
atctccagtg 
cctttcggcc 
aaatgtccct 
tgcttctttt 
tattttggga 
cttagcaccc 
attgtttggt 
ttcgccgttc 
attactagaa 
ttcttgtgcc 
cttagcggta 
agcagaaggt 
tatgccagag 
ccctgccttg 
ccttctgtca 
gattttaccc 
gggaatattg 
atttccccct 
agcggccaca 
gagttttcct 
ggcagattcg 
aggtgctctt 
gtttcctggt 
tgttatttgc 
cgcatcggga 
aagacagccc 
tagaccttgt 
ggcagggtct 
actgcttatg 
tataggtcgc 
ctctaatctt 
acaccctctt 
tccagattcg 
ggctgcatct 
agtatcggcg 
agttttcaat 
gcattcaaaa 
gttataatgg 
gtctgacttg 
agggttggga 
catggctcct 
taagacaaaa 
aacaaaagat 
accaaatccc 
aggagcccat 
tcaaagagcg 
caactatgtt 
ttgaaactta 



111780 
111840 
111900 
111960 
112020 
112080 
112140 
112200 
112260 
112320 
112380 
112440 
112500 
112560 
112620 
112680 
112740 
112800 
112860 
112920 
112980 
113040 
113100 
113160 
113220 
113280 
113340 
113400 
113460 
113520 
113580 
113640 
113700 
113760 
113820 
113880 
113940 
114000 
114060 
114120 
114180 
114240 
114300 
114360 
114420 
114480 
114540 
114600 
114660 
114720 
114780 
114840 
114900 
114960 
115020 
115080 
115140 
115200 
115260 
115320 
115380 
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tgcagtattt 
agggtacaga 
aagaagtgtc 
caaataccag 
cattttgagg 
agagacggcg 
aatgcttctt 
acgacaccac 
tgtttgcacc 
ctagacgtac 
tggctagccg 
cgcgtaggcg 
tatttgctcc 
ctttatttac 
ttgaagtgtg 
tagcgtacac 
gacgttcttg 
gagtatctgt 
gccgagggta 
gtagggggac 
gaggaagtgt 
cccttacagg 
cagacatgtc 
ctttatcgtc 
cgtcaataaa 
tagaaccaga 
catcatcatc 
acgtgatact 
aatctaaaac 
gctgttcacc 
cagtcaatgg 
cggcatattc 
tatcagtttt 
tgggtccagg 
agcacccgac 
cagtacttat 
tattgtaccc 
ccaacatttt 
gagctgggtt 
cctcggattc 
ctggtgtgtt 
ctgaaagttg 
gggaaagacg 
gagtgtattc 
catttctaca 
aatcgaataa 
cagtgttgaa 
ccacgtgttt 
cattggtgat 
gaccactatc 
cttttatctc 
gtatcctcac 
tcattcgaat 
gagaaatatc 
gtaacaattt 
caatattttt 
ttttcgggtc 
cagctttttc 
taaaattgtt 
gaagcgtgaa 
caaacattgt 



ttattcaatt 
tatattataa 
tatgaatttc 
agggtctact 
gattttaaca 
atgtagtgct 
tccttcattt 
cttgtcttga 
attattattg 
cagaaattgt 
agtctgtaga 
gccagtgttt 
aaaaacaaca 
cagaagattt 
cacaatttgg 
tttctccccc 
ttccttatcg 
attgacaatg 
ctttatttct 
aagactagtc 
agtgcgacag 
tgccgtaaag 
tgaaaactgt 
acctccaaat 
tatagggcgg 
cacgggatcg 
atcattgacg 
gtgaatgggt 
agggaaaaca 
tcccagtaat 
accaactgaa 
tgtggcgggg 
caagatgata 
ccaatctggt 
tctcgaaggt 
aaccgagaac 
agaagatgtg 
ccattgcgct 
agatggattt 
atttccctcc 
aacagtgttt 
attttcgcga 
gggaaaatta 
cacattatca 
agcaaaaact 
cacggtcttg 
aatacttccc 
gaatacggct 
aggaggccaa 
gctgctgcca 
tgccttaaag 
gatggtatct 
tggcagcaat 
atcaccaata 
tggatcctgc 
agcgagtttt 
ttttctaaca 
tattttcgca 
ctggacaatt 
ttcaaaggca 
tattgctgtt 



ttttaaccaa 
gatgatgtaa 
ttgtagaatt 
agtacacctg 
cataatacaa 
gaatttctat 
ctagctacaa 
ttcccaacta 
tagaacccgt 
ccatcggcta 
ggggcaactg 
ctccagttcg 
ataccccatt 
tcctttttgg 
tctaa tgttt 
tgtgtatacg 
atttgttccc 
tttgtagtat 
cccccgtaat 
tgtttcatga 
tagactatgc 
gttgagggat 
ccttcaaaat 
acttccatct 
ctggttattc 
gagactgtga 
ataacatcag 
tggaatgggt 
gcgtacagtt 
gcagtttgaa 
acgtttaaag 
gaagaaagca 
catggtctta 
gcagttgcag 
gtattatgta 
aatttatcag 
gttaaaacgt 
tctatttcac 
ctaaggtggg 
atattcttta 
gttttcgagt 
ggtagtttag 
gcagggtaat 
tagtttattg 
tccgttattt 
tttttatcct 
tttgttgata 
tgttgtttat 
cctgctggga 
cccccgtctt 
ggaagaggga 
tttccacctt 
gatgatgatg 
tcataaccga 
acatcagtga 
tttgacgccg 
gcagcggaat 
ggttttctta 
ttcataccaa 
ccccaccacg 
cttaacggca 



ctataaatcc 
cagatgcacg 
cttccctgtc 
agctagtgcc 
tatcatcatg 
ccactaaaac 
ttttcatgta 
tagcaaatag 
ttgcactacc 
accgagtaac 
gttgtctcaa 
tgtatctgtc 
ttcctaaaat 
ttccagactc 
cttcagaaac 
cttcactagg 
tcaaccagtt 
cattcgttct 
cagcgaatgc 
aagggtccct 
ccttgttggt 
cattccatat 
gcccgtccat 
ttacactgaa 
tcgaagaaag 
tgtggtagct 
atacggtcaa 
tatcactcgc 
tagtaggttc 
cgggggtaat 
cacgaggcag 
cagccgttat 
gttggtagga 
catctactct 
caacaacttt 
attgagtaac 
cagacgacgt 
tttcaggtac 
ttatctctgt 
acgtttcatt 
acacgaacga 
ctgctaaatt 
ctgaaaca ta 
ctttaggcat 
ttggtatggg 
ctgttacaat 
gtgttatacc 
tgccattgtt 
caaattcagg 
ctccttcata 
cattatctgg 
t ttctaataa 
a tgaaccatc 
tctgccgtgc 
cacgttttaa 
gttctaaccc 
catcgatagc 
tgtgtatcac 
tataggaaa t 
agccaaaaat 
gggttggttg 



acatgtggta 
gtcaataaca 
tgtacttgac 
gggttctatt 
gtcgaatgtc 
tagaccatcg 
gaatcttact 
taacgaggaa 
aattcctcca 
agggtcgccg 
ttgagaataa 
tgcaggttcg 
agccgcaagt 
gtctaccacg 
gccgtctccc 
tgctggcaat 
ttctattgaa 
tatggaagaa 
ccattcaggt 
tatccctcta 
tatgtcggtg 
agccaattcg 
cgtaaagggc 
aaccctatcg 
aacgttttct 
tttatcttca 
gacagtagtt 
tttcactagg 
tgcgggtgcc 
aatatctcca 
tagagatgtt 
tcgaggttct 
gaagaaatct 
aaagtttacg 
gttcgtgtta 
ttttgatgct 
tttgaagccc 
agacacacat 
ggcgtttaga 
gtatagtact 
cactgaaggt 
ggacgctatt 
tttttcattc 
a tcaagcctc 
tgaagatatt 
ggaaattttc 
gtcagctaga 
gcgagagaac 
gaaagtttcg 
tttttcatcc 
taatattccg 
ttcttctgta 
tctatcagga 
agtgcgtttc 
ttcctcagag 
a ttcttcaag 
cgctgctata 
accagatctt 
aatagaaata 
acccgtgact 
ttcggcaaat 



taaaagttta 
acagggcctg 
gctacattgg 
ttagacatga 
gtcctgttcc 
tcatttttca 
ctttctgcat 
gtaaattcat 
catctagggt 
ctaggccatg 
gaacccactg 
tcagggtatt 
ttcttgtctt 
tatgtagtat 
atgtacattt 
aactgtcgat 
aattcagttt 
atgaaaatgt 
atgcttgtag 
ttagaaatgg 
gcgggtgtgt 
ttagatggta 
atctgtatgc 
tctcgtgcac 
tttgcaagta 
aggacaacat 
ttgttccttg 
ccgttatgcg 
ttaaatgtag 
cctcgtactt 
gtgcgttcac 
ctattatcat 
ggcccgttcg 
ctagaaccga 
gcattcacat 
ggaaatacaa 
acaatccagc 
ataaaggtcc 
atcttgtaac 
ttgtttaata 
ttgttaatat 
tcagcttgtc 
aatgtatcag 
atgagggcta 
gttacctgag 
agtctattcc 
gaatttactt 
gcattcgggt 
aatagagacg 
actaaatcaa 
tcaagaatat 
cggatattaa 
gccgattgag 
attatactgt 
gatatccttt 
ttccttttct 
ttcccgaata 
tcatcgggcg 
aacacaacga 
gctgtttctc 
caaggaaaca 



115440 
115500 
115560 
115620 
115680 
115740 
115800 
115860 
115920 
115980 
116040 
116100 
116160 
116220 
116280 
116340 
116400 
116460 
116520 
116580 
116640 
116700 
116760 
116820 
116880 
116940 
117000 
117060 
117120 
117180 
117240 
117300 
117360 
117420 
117480 
117540 
117600 
117660 
117720 
117780 
117840 
117900 
117960 
118020 
118080 
118140 
118200 
118260 
118320 
118380 
118440 
118500 
118560 
118620 
118680 
118740 
118800 
118860 
118920 
118980 
119040 
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aaaaagttac 
atgtatatca 
aatgactcgc 
ggaatacttg 
atatgcggca 
agaaagtata 
tggaaaaaaa 
tctatcgtca 
cagattgaat 
attcttacgt 
tgctactaca 
tgacataata 
tatagactgg 
ttctgtagat 
tatggcaaaa 
ggaaatttta 
atggtgcttc 
tgtaattttc 
aagacttctg 
caagactaat 
tgtgacagaa 
tat tttcgac 
tgcagaattc 
aaaaatgcta 
ctcttcattt 
cgataacgat 
tatccgtaat 
agtcgagctg 
tctcctattt 
accggccata 
aaaaacta ta 
ccgttttaaa 
tacgataagc 
gattttgtgg 
aaaggggaaa 
cctagtcaat 
attcgaagag 
ccaaaacaag 
attccaagtt 
cacccacaat 
ccagagcctg 
a tactacagg 
gtccgtaaaa 
gtgca tga tc 
agaagt tgga 
tagccaacac 
gaaaacacct 
ctccctctac 
actcaagtct 
acccaacatt 
tgtggaagaa 
caagcattac 
atgccccatg 
atttagatgg 
aaagcaagaa 
tcgcaggaca 
aattgccacc 
tgttctcatt 
tgaagaggag 
aatacagatg 
caatgataat 



ccgggcatgg 
gacgtggaac 
ctagccaact 
agatacgcgt 
acagttgaca 
ctgaattcta 
ccttctgaac 
gagttggcaa 
tcagcaatag 
gaaaatgctg 
gatacaggga 
aagcgttcag 
cgccccaccc 
gatactgttc 
tttatcctgt 
tacggtaaca 
tggttgaacc 
ctccactcaa 
gtggtagttg 
ataacgtcat 
gacaccttaa 
gaaattaact 
tgcacaaaga 
gtaaaatcag 
ggtttattac 
gatgagacta 
aggacatatg 
aagcgagaga 
ccgtatgtgt 
attaaaagtg 
aatgaaaagg 
aataaaaaaa 
aacagggagc 
aaacaagtcg 
gtactcaaca 
cgcagcttca 
gatgtaaaga 
gatattgtta 
ctatacccaa 
aatgcccctc 
ctgacgtacg 
cgccccatca 
gttctggaca 
aacaagaaga 
agcaagatga 
ctatccgacg 
gcccatctag 
ggcgcaccct 
aacaagtgtg 
aacaacaatc 
gatgaaaatg 
atccattgcg 
tgccgtgagg 
ttaaaactgg 
ttctataagc 
tttctcaccc 
aacgccatcg 
gacgggagcg 
gcgcttgctt 
cgccagtttg 
cctcatcgac 



cagggaatag 
aaggcatgga 
taaaattggg 
ctacgcccct 
tcaacctaat 
tacggagagt 
caatctcacg 
atgcaataag 
tgacagcggc 
taaactacat 
atgtagtcta 
aaatattaga 
ctgacaattc 
ttatagatct 
tcatatgtaa 
tttcttcttc 
tgtacaatag 
aaaaggagtc 
tggagagttt 
ggcctgttat 
tgagactagc 
ccttgacaac 
ttttgctcgg 
tcaaagagac 
tgaagaacac 
cagatgtggc 
cgtttaaaaa 
gtgacgccat 
ccgactggag 
tgtggttaca 
tacaatctct 
ttgctgccga 
aagaagcagc 
ttcagaacac 
tgttcactca 
ttgacgttca 
atccggaatc 
caatagttgg 
tcaagaaggt 
tctgggtaca 
agattgtgaa 
actatcaaac 
ttttaatgag 
acaacaccaa 
tgctccacat 
ctagtagttt 
caatccaaga 
ggtttcaaga 
tcaaggtact 
aactagaacc 
aagggaaaac 
aatgcctgat 
atgtgggcga 
ctgagagatc 
agatggaagc 
cagccagaag 
ctgaagccac 
gagaagaata 
ttggaagacc 
ctgaactgtc 
acatctctac 



aacccagttc 
gtgttgtggc 
cgattctctt 
ccttggggaa 
ggctcatttc 
tgtagtggct 
ccctcacccg 
ggacaagttc 
cttgggggct 
gtacgatgta 
tctttccacc 
taaggtatca 
gttcccttac 
catcaccaat 
ccatttaagg 
atccgataat 
actggaatgg 
cttttcagga 
tcctcccgtt 
taataacagc 
gataaggacg 
agcagtgacc 
gcgagctctg 
gggagaagaa 
aaaaaatgaa 
ttgttgggca 
aatatggggc 
tacatccttt 
ttgcttacta 
aatcctgaaa 
ttcatctgag 
acacgttcgc 
actatctaca 
t'Ctcaacctt 
cttgacccga 
ccgagtggtg 
atccatatat 
tgattacatc 
catcgaacac 
ccttctggac 
tgccaagtac 
tgggaagagt 
atgtggatca 
ctttcttcac 
tgctgaaatg 
ccctgatgct 
agataatgct 
taacaactcg 
atcctttgca 
agataccatg 
aacaagtctt 
gggaatgtgt 
cgaagtactg 
tgaacacaat 
aatgaaacct 
aggcgaacga 
agctcaagga 
tgaagaagag 
atttctggaa 
tagacgaggc 
agtaaatatt 



gtatcgtccc 
agacaagcac 
aaagaaactg 
ttaaactacg 
tcctacgctg 
aatcatcaac 
ctgggagggg 
atcagcatgg 
attgccagtg 
gaatttgcag 
aaaatggacg 
aaacgacccg 
caattgattt 
gcgatcgtgc 
gcagttatta 
tattttgagg 
ttcatgttag 
gctgacaatg 
ctcttggaca 
aataataata 
agtagcggtg 
aaccgtatta 
gacgaagaag 
aaggataaga 
gaattggaaa 
cgtacttcct 
cttgaggatg 
gtcaccgata 
ttacatccct 
gatttttccc 
atttgtcaga 
agtgtaaaaa 
gaacactgta 
ctggagaatt 
gccttccgca 
gccgagttat 
agaactccga 
ctctctccga 
ttcccagtaa 
gaacgccatc 

^ggggt^ttg 

ctactgatga 
tacaaattca 
tgctgtgcaa 
ttctttgcca 
gcagcagagg 
ga tgcactcc 
tacatgaaat 
gctgacaagt 
tgtggagtgt 
tcctggtacc 
gctgctgctg 
gaaaga tgcc 
cgtgtacttt 
cccagagttg 
gccatcagaa 
gatgtcaact 
ggagaagaa t 
gatgaggaag 
gtttctgtca 
gtgcaaccag 



tgattgctaa 
aggatgcatt 
atgttaattt 
acaaacaaca 
ctttgggtat 
gtagaaataa 
tagaacctcc 
gggcgttgga 
aacgtgaact 
aaagagatgc 
aagatgaaga 
caaaggaagg 
ggggcgatga 
ctaatatttt 
ggagtatgag 
atggacgtaa 
tagttagatt 
ttaacgtgaa 
ctgaatgggt 
gtacactccc 
cccgacatcc 
ccttccagtc 
aagctggaac 
acaatacgtt 
taaacatagg 
cgacatcctt 
caagtgatgt 
agagcagtcc 
gt tgtaaagc 
aggaaaa tat 
aatcaaacga 
agttattaaa 
tttggttaac 
ttcccgtata 
agatgaacaa 
cctatccaga 
tatccctctt 
agacggactc 
tcttccactg 
atcgcctgct 
ttgtcatccc 
gcaaactggc 
tctcattaat 
gtaaatgggg 
accctactac 
acgacaaggg 
tgttcctgat 
ctgcccttga 
acgagatttt 
gtgcaacatc 
agatgaattg 
gcaatgtaca 
ctcctacaat 
ttgaagcaaa 
ttgttcctcc 
tcgcaagaga 
cctacttccc 
tcttcaattc 
aagccagaca 
atattattaa 
tttatggagt 



119100 
119160 
119220 
119280 
119340 
119400 
119460 
119520 
119580 
119640 
119700 
119760 
119820 
119880 
119940 
120000 
120060 
120120 
120180 
120240 
120300 
120360 
120420 
120480 
120540 
120600 
120660 
120720 
120780 
120840 
120900 
120960 
121020 
121080 
121140 
121200 
121260 
121320 
121380 
121440 
121500 
121560 
121620 
121680 
121740 
121800 
121860 
121920 
121980 
122040 
122100 
122160 
122220 
122280 
122340 
122400 
122460 
122520 
122580 
122640 
122700 
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tgaaaagtca 
tatacgctca 
tgacaagagg 
aaactctcaa 
ctatatcgag 
cggggcccag 
agaaatggca 
atccccactg 
gaggcgtaga 
tctagcttca 
cattctcatt 
taatgcactc 
aaacttgcgt 
ccgct tgttc 
catgaagatc 
tattattgaa 
atgtttttca 
ggcacacttg 
ggctatccag 
ttatcttccc 
tcctacacta 
gaaactcttg 
agatgctgaa 
attgaaaaaa 
gtacattgat 
ttcatcaata 
cagaagaacc 
aattggagca 
tgcatacaca 
tgatgcatat 
attgagtact 
acctttccgt 
ctttaatgaa 
aaaacaccat 
agaacggtct 
tgtaattgta 
aataaataca 
tatggattgc 
tcgtataagg 
caacaagaag 
agaacctaag 
gaccctcgaa 
acttggacga 
tatttctaac 
taactacaaa 
aaagatgaag 
tgctataatt 
tggtggttgt 
tgttaaagtg 
cgaacgcaac 
acgagaactg 
taatgtccac 
aatacctaaa 
gttgaatttt 
gttagtatac 
aaaatggtgc 
aacgggggag 
tcacactaag 
ttctctcgat 
tgacggaaaa 
aacaaccaaa 



cctgctgctt 
agagatactc 
gcattattcc 
attgttggat 
acgtttggta 
gactat tatg 
agtgagttgc 
ttcaagaaaa 
aattatgctc 
cgcgaacacc 
gaa tatatag 
cattgtatga 
cctactgatg 
aatgaacaat 
ttcaaccatg 
atgtactgtt 
tccctcctcc 
tccagtagcc 
gaaacaatcc 
cataaaactg 
gaaatgagga 
gggcattttt 
acgtgcgtgg 
ctagccctga 
agaaat tcaa 
cagaacactt 
gagtgtcttg 
gccagaaagg 
gaaacgtttg 
tttaatgcgg 
ttgcacaatt 
cttcctccct 
gatacgacac 
acgcgaaaaa 
atatttttac 
ggatctacta 
gtcaaa tcta 
agatct tatg 
gctgcaattg 
aaaacgagca 
tggtacaaca 
caacacgtcc 
gaaaagaaat 
aatctcaagg 
tcttcttcct 
attatcgcct 
tcatcagaga 
gcgtt t tcaa 
aataaataca 
aagtgtaggt 
ta tagggctt 
t tgaca tggc 
tgtaagctac 
ctgagaacga 
ggagaggagt 
tcgaaaaaag 
gaagataaag 
gatataaaaa 
aaaaatgtgc 
acgtctatgg 
aaacgaaaac 



ccttcatcta 
gagttggagg 
acgcagcttc 
tggactttca 
gtcctcttca 
cagagagtat 
acatcaccga 
tcaggactgg 
gagatattgc 
cccctgttca 
tagatttcat 
gaaagtacat 
aaagggtgga 
acaacactag 
atttgggtgt 
ttactttaat 
tctgtggtgt 
gttgtcgagt 
agtctgaatt 
gcatctgcga 
acgaagttga 
gtggcaactt 
gctttttgag 
ttgcccgtga 
tgatgaaaca 
ctcacacaaa 
ccctggattg 
tttgtttcct 
ttcttgatac 
agaataaaga 
tatttgttag 
ttggacaaca 
ctattctagc 
atagtagatg 
gtggagtttt 
taatgcctag 
ataacttttc 
ggtttgagat 
attgctcgaa 
ttattaatgc 
aaaatgccat 
gtgttagtga 
ggacgtgtaa 
gtaaaattga 
tatactataa 
ctataaacaa 
aaaaggatga 
agtataagaa 
ttgaggcttt 
ttaaagaggc 
cggccagaag 
aacgccctta 
acacagtatt 
gggatggaaa 
acgctaaaga 
gtgtgattaa 
acgaaaggga 
aaatagaaaa 
aaacccacgc 
tgggctggga 
tggaaacgat 



caacatgctc 
agaaagagtg 
ttccatgctt 
agcagtgtat 
cgcctaccca 
cagatacgac 
agcattagat 
aaaatcttac 
tgaggaattt 
tgtacatccc 
ccaccactgc 
tgaacacgag 
agttttaagg 
aatgtccctc 
gtctaaattt 
ctaaacaata 
gtatatataa 
gaacatcatg 
ggaagaaatt 
agaagctgca 
ccatttctgg 
gtatgtggag 
gtcgcaagca 
ggataaatca 
agtttttagt 
atcttccccc 
gctggacgca 
ccagaacctg 
aggcaacgaa 
taaagtggat 
gaaaagtctt 
ccctatcatc 
gtcaatttct 
gagatgtccc 
aacagtcagt 
tgtacttttc 
tgccataacc 
tatagacaca 
cacagatttc 
ggtaaagaac 
gtgtggtgaa 
tgagtatatg 
ttatctagat 
caccatgtgt 
gcaaataact 
gaggaggtat 
ggtggaggaa 
aaaacaactc 
ttcgttaatt 
tgaaattaga 
ttatgtccac 
cgatgaaaac 
gtacgataaa 
ttataaccca 
tgtcagtact 
atatgaagac 
attctttaga 
tgtactaaat 
atcttctagt 
ttatattttt 
agatatatcg 



aagaatgacg 
cccgtcatga 
tgtgactttg 
gatccccatc 
ggagccgtca 
aatgatattg 
gtctttgagg 
tctaactgga 
gttagggtat 
tttagagatg 
atcacctggt 
aatacaaatg 
gtatctcaac 
agcaccaaaa 
ggtgtataca 
aaaataatgt 
ggcgggtagc 
gcagcagacc 
gccgatactg 
gctaatggac 
tcccaagata 
gcatttatag 
actggactag 
aatacaacta 
gctgaaattg 
gtgtacttga 
tccaagagga 
attgttgcca 
ctagaacagc 
gaaatgtgtg 
ccccatcatt 
aacattgaaa 
ataccatcaa 
aataacttga 
ggagattacg 
tatggcgacc 
tgttcttatt 
cctgaaaata 
cattcaccgg 
ccttttttta 
gtattggaaa 
gacagatttg 
agaataaagt 
aaaattctag 
gctacgtctg 
ttgtgtaata 
gatcacacta 
gaacctaaac 
aaga tgctga 
gagtgcgcca 
gatctggtat 
gctaacacta 
gattcgcgcg 
ataagacact 
gttacctgtt 
tttttggatc 
ctaaaaaaat 
tctgatacac 
acagttgtaa 
tcaatcggta 
agtagtgacg 



tctttgagtc 
acctgtccaa 
ccacagaaac 
acatatccaa 
cttttctgga 
tctcattctc 
gtagtttatt 
acgaccattt 
gtgaaaactc 
gagcaatccc 
ctatgcaagt 
tgcacctgtt 
tcagatggag 
gattgagcct 
aactcctaga 
aaagaattcc 
ctcctccaca 
tcctagagtt 
aattcctcaa 
ggccatatct 
acaggaagct 
ctggttctat 
gataccctct 
attacaactt 
ataagcgtcc 
aactgatcga 
cggccaagga 
tattaatacc 
aagtattgga 
tagtggccat 
tgtacaatgc 
attcctcat t 
gtatggtaat 
tgactgcagc 
gatggttttc 
gaaaacatct 
ggaacaagta 
actgtggttt 
taacgcgagt 
ttagacacac 
atgttggcgt 
gtagtctatt 
ctctagaaac 
aaaccaagta 
atgatcctat 
ttctagaatt 
aaaccggcaa 
aacatttaat 
gaaatgattg 
acgaactggt 
tgaagcgaac 
tcatgtcttt 
atgttaagt t 
ctatgctgga 
ttgaatggtt 
gttacgagaa 
gtagtagaga 
tttattctta 
aaaatgacac 
aaggagaaaa 
acgacgatga 



122760 
122820 
122880 
122940 
123000 
123060 
123120 
123180 
123240 
123300 
123360 
123420 
123480 
123540 
123600 
123660 
123720 
123780 
123840 
123900 
123960 
124020 
124080 
124140 
124200 
124260 
124320 
124380 
124440 
124500 
124560 
124620 
124680 
124740 
124800 
124860 
124920 
124980 
125040 
125100 
125160 
125220 
125280 
125340 
125400 
125460 
125520 
125580 
125640 
125700 
125760 
125820 
125880 
125940 
126000 
126060 
126120 
126180 
126240 
126300 
126360 
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agaagaagag 
caagaacaag 
agaaccttct 
tttgttatga 
acacctgagc 
tttgtaaagg 
gaaagaagac 
tttcttgctg 
tgttgaggca 
gggatctact 
aacgctggag 
gagactaata 
gtttaaaaat 
gatatacaat 
agtgttgttt 
ggtccttttt 
cttttgttcc 
agcttccttt 
ctccatcatg 
atcaagttct 
attcataaag 
atcacttgga 
gttttggtgt 
gatgagtctg 
cctcaaacca 
gtagaattct 
cgcattggaa 
gaaaagttgt 
ttctagttcc 
ttttgtggtc 
caactcttcg 
tgtcgaaagg 
cgccaccata 
ggctatatat 
atccttgaaa 
cattgtactg 
tttttctact 
gtccttgtct 
ttcttgcgtt 
tccaggttca 
ttcttgtaat 
aacaaatgtg 
catgacttgc 
taactccttt 
ttccatattt 
ggttacaata 
gttcggccca 
actccttttg 
gtatattttt 
gttattttac 
catattttta 
ctcagaaata 
atcaatagca 
attacgggaa 
actcttgaaa 
ggccacaggc 
taatttttcc 
aacaaaacgt 
agttggactg 
tacttcctgt 
gtttttaagt 



gaagaagatg 
agcaagaaca 
ctacctaata 
tgttaaaaat 
ggtttgggcc 
tgctgaaatg 
ttcttgaaga 
gtgggggatg 
gtcaggaaga 
tttgtcctca 
attcaaccca 
tctttcttag 
ttgtcttggt 
atcccttttt 
ccatcttctt 
actccccttt 
tccctcttct 
gtggtcaaat 
gaatggagct 
tgaggcgaag 
attcggatat 
ggggtctgga 
ttgatactgt 
ggcctcatac 
ttttctgttg 
gtagaggata 
gatttcttgt 
tcagtttcga 
ttttcaactt 
ttttcgacct 
cgtatctggc 
tacatttcaa 
aacaagggaa 
tctcctttgt 
ccatgtttct 
aaacctaaac 
gcttcagaaa 
tttcgtttta 
gcaagccatt 
ggaggggcac 
ctttttaacc 
tcttcggaca 
tcatatttac 
tctgatacac 
atcttcttca 
ctgtggattg 
ttttgcacca 
aacaaattca 
aaaaattagg 
cttaattttt 
aaatgaaggt 
tcctcacata 
aacgctaacg 
aacatgtacc 
atggggttta 
acccatgtat 
atgacatcat 
tccttttccc 
aaataaaacc 
acgtctttgt 
ataacgtgtc 



aaggaaaaag 
agaatgggag 
ctttatcgtt 
aataaaaata 
tgttgacgtg 
tacttgtttt 
ggaccgataa 
atactagtag 
gtgatctaga 
atggaggcat 
tgattgtgcc 
tttcccgtca 
taaaaaaata 
acacagaaat 
cttcttctac 
gctcctgttc 
ttaccttctc 
gtctttcctt 
tcctttgttg 
agatgtctat 
cagaacctgt 
aaactatact 
cgttgcaagc 
ctgcagtatc 
cagtttcaaa 
tcgagaagac 
gtacatt tct 
tatacgagtt 
ttgtggtctt 
ggttttctag 
gttgcttttc 
tgcttgcagc 
caatcaagca 
aatgggtacg 
gcatagttct 
cgattaagaa 
gactattgct 
atttttcctt 
cctcgtcagt 
aacttatccg 
ctcttgcgaa 
attcgctttg 
ttgttttctt 
tttcattact 
aaacaacaac 
ataat t ttac 
gatctgtttt 
gatttatatg 
atggggacag 
tttccctata 
tatcatatca 
atcgcatgca 
ccaacagata 
ctaga tacac 
caacaa tatc 
atttcagacc 
cagcactaaa 
cgaaagg ta t 
ctctaaacaa 
cttgagaagc 
ctttctcagg 



aatgaaaatg 
aatgtgttgc 
caattgtgta 
ttgtacattt 
cctaactttt 
ttatccaaat 
aaaaatggcc 
ata tgaagaa 
tgaccgtgtt 
cgaagaattg 
aacaacagaa 
cggtgaaaat 
aaacgaaaac 
ggcatgtctc 
gtcgctatca 
actggtagac 
aaaccacttt 
gttgtaccta 
ttcaatttca 
cctctgccca 
cactttttgt 
tgatcggtta 
gataggtgcg 
ccttttcttg 
ggaaccactt 
gagacaagag 
gagagattct 
tttgtactcc 
tactacttgt 
ttcctctttt 
atcactgtat 
agaagggtgt 
atgtgaagaa 
gtgacaaatt 
ctccattgac 
atcggaactt 
catgtacctg 
gactattgta 
aacgtgccca 
ttcaattctt 
tttgagttgt 
tcttatagat 
cccgatcatc 
aacagggtta 
ggaaaataca 
tcacagcttt 
aaaggaactg 
tatttgtcga 
atggtgtaga 
atacaaataa 
actaaacact 
agacttgcgc 
cgtctccact 
cagtggaata 
acccttgtat 
acttttccgc 
accgagacca 
ttgaggaaga 
catccgtgtc 
aagccactcg 
cttaatagag 



aataactgca 
acagatattc 
aaaagtatgg 
tattacattt 
tgtcaccctc 
ttcctgtact 
accttccaga 
gtgatgaaga 
tacatggtgt 
cgtcttttga 
taaaataaag 
gttggttatt 
tgtcaatata 
tttacacgct 
aattcattgt 
tgttctccat 
tcggtcttgt 
gtgacaattt 
aattgggtca 
tcttgataga 
acaataccct 
cattttaata 
tcacagccag 
tttatggggg 
ttgtctgtcc 
ccgctgtcgt 
ttttgttcct 
tctttcttta 
gtttctagtt 
atttgtaaat 
aagataacag 
gcgctccttg 
tggctcttga 
tcaaccaatg 
tttaaagcgt 
attatcatac 
tataattgac 
ttgattttac 
gcatcatctt 
aaagagtcat 
ttatctttct 
tcgatagctt 
aggaatgacg 
ttgtttttct 
gttttctttt 
gttcaaaact 
ggtaaaatac 
tttttattaa 
ttgatacagt 
ttggtatcag 
tttatttacc 
atttcttccc 
tctagagatg 
actaggcaat 
tttttgagat 
atatttccct 
atcaaaaaat 
ctttttgtta 
ttttccttta 
tcatcagtag 
gataataatg 



gcagcagcat 
tcaatgttgt 
atgtgttgaa 
gtttgttttg 
atgaatacaa 
gaagaatatt 
ctgacgccga 
cttttgatac 
gcctaaagca 
ctggagattc 
acggtgacgg 
tcttccctat 
ttgttttatt 
gttgttcttg 
cgttttccca 
tttctcgctt 
ttctcgtacg 
ttgatgtgtt 
tattttcttc 
cggtgccatc 
tcaaatcttc 
ccttgaactt 
tcacggccat 
agaaaatgta 
tgcacaacaa 
atctgaaggt 
cagagacttt 
ctacttgtgt 
ccttttcaac 
ttttcctagc 
cgtaggctag 
caaaaacgcc 
aaataggatt 
ggaccatcat 
gcgtgacgtt 
aatcgtgttt 
tgaatttaaa 
gcacgtcttc 
tgaataaatg 
cttctaccaa 
tcacatgtac 
cataaacacc 
atcttagttt 
ctccaatgta 
aagaaatata 
taactacgtg 
acatgaaaac 
tattaataaa 
gaccgtccct 
tttaggaaaa 
cccttttcac 
tgccgtacaa 
ggggttgaac 
gggaatgata 
ggcacacttc 
ctatcgctat 
ctgtgctaac 
tatacctgta 
cgagcttatt 
ttgtagacgt 
attctatctt 



126420 
126480 
126540 
126600 
126660 
126720 
126780 
126840 
126900 
126960 
127020 
127080 
127140 
127200 
127260 
127320 
127380 
127440 
127500 
127560 
127620 
127680 
127740 
127800 
127860 
127920 
127980 
128040 
128100 
128160 
128220 
128280 
128340 
128400 
128460 
128520 
128580 
128640 
128700 
128760 
128820 
128880 
128940 
129000 
129060 
129120 
129180 
129240 
129300 
129360 
129420 
129480 
129540 
129600 
129660 
129720 
129780 
129840 
129900 
129960 
130020 
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gagggaattg 
tttattttcc 
ttcaactgct 
cattagaaaa 
tatcagtttt 
ctttgacaca 
gttaaaagtt 
atgaggtagg 
cacataataa 
tcaatatgtt 
cggatattaa 
gagccaaggc 
aggctattca 
tcgtggaaga 
acgcgtgggt 
attctttgta 
atgagataga 
ctggggatat 
aaatatacag 
ttaaacttgc 
aaaataacaa 
gtgacactac 
aatatgagga 
tattgttata 
actctgtttg 
tcttccttct 
cttttaggaa 
acggaggaac 
gaacaaatgg 
gtggtggaac 
gaggtggtgg 
atggaaatgg 
aacgtctact 
cagaggtaga 
atgacgacca 
tgttcaagct 
aagaccgtgc 
gagaaaaaca 
atttttccaa 
taggagaaac 
tagtagagga 
cttacaaaat 
attgtttggt 
ctaacagatg 
atgctaccat 
ccaataagaa 
tagaattaaa 
ccgttccaag 
ttggtgcctt 
tgattgtttt 
cttttaaggg 
attgttgaga 
gttgatgaaa 
atcaatgaag 
actagtagtg 
agagataaaa 
agatatgtac 
catgccaatt 
attaataacg 
tccgagtggg 
gatttacagg 



tcccctgtga 
tttaattgta 
tcaaatgtac 
gaagagcgta 
tgttccccta 
gtttttatgg 
tgagtgtatt 
tactgattcg 
aaagggaaat 
gtttgatttc 
tgggaatgat 
tgtcatggat 
ggaagccgta 
aactgacagc 
agacagagcc 
tgataaactc 
tacaccaata 
cgaagaagag 
agatgaaaga 
taaattggat 
tgaatataca 
taaatcaggt 
gcgagagatc 
ctttttggtg 
aagagaaaaa 
tttcgtctca 
aaggaggaag 
tggaacaaca 
aaatggaact 
aaatggaaat 
tggaacaaat 
aggggataca 
taattccata 
aactgcctta 
attagaattg 
cgcaggtgtt 
ccgtaatata 
catcaacgat 
tagtcttaat 
tattgaaaag 
ttttaatagt 
cagagtagta 
ttttgccgta 
ccaagacata 
ggtaataagg 
tgacactact 
catgctcatt 
acctttgact 
tacagacaaa 
attttcaaaa 
aattaaaggc 
gatttgtcac 
aatacgatga 
catccgaaac 
ttattgctcc 
ggaaagaata 
aagcaattac 
ccgatacata 
accgcgccag 
gaaatgataa 
aaaaggatct 



gctctcttaa 
caatgaaaac 
tcattacttc 
gctgtaactc 
tatactccat 
tactctggga 
cttttgtggt 
tgatggataa 
aaaaataaga 
tttctaaaaa 
acagatttat 
agaaataggg 
gcggccaaga 
ggacaaagtg 
atgccttcgc 
cctcctttta 
tcttaccttg 
tacaatcttg 
gaccagattt 
agtatatcag 
gagtttgtct 
ataggtattc 
tttacatttt 
gacgtcaggt 
atgttcgtca 
ataacaattt 
aggaagagat 
aacggaggag 
ggaacaacaa 
ggttctggaa 
ggaggaggaa 
gacacagacg 
tcttcaaaac 
caattatctc 
gacgcctcag 
tcatcggcct 
aatgaggaag 
gaatacgatg 
cttgttacta 
gctttccctc 
ggaactatta 
gaaggatcaa 
gtagtaaata 
tgctttgtaa 
aaaggtgatg 
catttttcaa 
ttctcagaaa 
gacgccaacg 
aatttattgt 
aaaataaaac 
agaggctaat 
aactaaatct 
attattagaa 
atatgataaa 
tgcaatagct 
tgaaaagagg 
agtgatggaa 
tttaacattc 
ccccgtgaac 
cggagt tgga 
ttacctgtct 



ttctttcagt 
gccgtctcta 
ttcatacttt 
gtcttcttca 
ggtttattta 
ttactttact 
aaactaacaa 
cgatggcgtc 
actagtaaca 
tggctgctgc 
ccaagctaat 
ctaaaatgga 
aacaaaaagc 
tccctccaac 
atattgaact 
acgtacaaga 
ccatggtagt 
ctcctacctt 
ttacaaaggc 
gtaaaagtag 
gtagcgtctt 
gtgaatatga 
t tat tccaat 
cccgtattat 
taagcatagc 
tagatggtgc 
a tcgtaccag 
gaggaggtgg 
acggaggagg 
caacaaacgg 
atggaaatgg 
att ttgagcc 
ctaaagaata 
gagatgattc 
acactctaca 
ttttagaagg 
aaattgcaca 
gaaaatatgc 
aatacacaaa 
atgagattga 
cttcaaacac 
caacaga tec 
aggaacaaca 
ttat tcctcg 
aaattaaaca 
tcatcacaga 
gaattctccc 
tactgtcagc 
ccagccaata 
accatgggag 
gatgattggc 
gatgagacag 
gataaggttg 
cttgctgctg 
ggcacagtgg 
ctatggacat 
tttcgtttat 
ccttttttaa 
tgctccttgt 
cgtaaggt tg 
gtta tttgta 



cccccttcaa 
aattcactct 
tcatattgtt 
ggtgtctcat 
gtcagttgat 
caaaaattta 
tgaagtacat 
ccatgtctag 
tattttttat 
aaaaatggac 
cacagacgtg 
catgaataga 
attagtggta 
attatcggga 
tgtagagagt 
catagacgac 
ggtaaaagtc 
tggtgtgaca 
tgataaatct 
acaactgacg 
tgcagagttt 
caaacctaaa 
acaacccgca 
ataggactat 
tacgtcattg 
aaaaacaatc 
tgagagtggt 
aggagaagga 
aggtggagga 
aggaggaggt 
aggaggaaat 
tacgccagcc 
ctatgaagca 
tacccaaact 
aggaaaacca 
taccactatc 
aacaatatta 
cacacccgag 
ccatgaagtg 
gtttgagaga 
tatgcagtac 
aggggaagtt 
ttctctagaa 
tttatccgca 
agaaacctat 
caaggatgaa 
cactttgagt 
ctacggaaag 
aaaaaaatat 
ataagcaaaa 
tatcggctaa 
cccaagttgt 
aagaaatgag 
atatgataag 
caagaactat 
tagcctacaa 
catataaaga 
gaataaaaaa 
cagtgtctta 
aca tacacat 
tgttagatac 



aagtaatgtg 
ttcggatagt 
tttctccaat 
ccaggagatt 
ttatttttct 
ctcacatgtt 
ttcttagtaa 
tgatgtgagt 
tttatccttc 
gctatccttg 
attcaaaaga 
agagtagatg 
tttgataaac 
tccgattacg 
gttgagggag 
caaatcggtg 
gactgtgaaa 
caaaataata 
gtgcgtattt 
tatgcggtaa 
gaatctgaca 
aatgaattcg 
gggacgaaat 
aaggcacaac 
gtgttattct 
gactctcaac 
gacggcatag 
ggtggtggtg 
gaaggaggtg 
ggaggagaag 
ggaaatggaa 
cttctaaaag 
ttcgtatctg 
ataataattg 
agggattatt 
cgcaaagcgg 
agtcaactaa 
gaaagagctg 
ggtcttttag 
tgcataattc 
aggtccaacg 
gtccctgatg 
atatctgcaa 
ataggaaaaa 
ctgtttgtgg 
tctgttggaa 
gaccctgcaa 
cgcctaggtg 
aaaaaaagat 
ggtggaacag 
tgttgatcca 
taaacaggcc 
accggatata 
agaggtagac 
caataattta 
accatggaga 
cctgactgtc 
gatcgcctat 
tcccaacaag 
cagaagaaat 
tgatttttca 



130080 
130140 
130200 
130260 
130320 
130380 
130440 
130500 
130560 
130620 
130680 
130740 
130800 
130860 
130920 
130980 
131040 
131100 
131160 
131220 
131280 
131340 
131400 
131460 
131520 
131580 
131640 
131700 
131760 
131820 
131880 
131940 
132000 
132060 
132120 
132180 
132240 
132300 
132360 
132420 
132480 
132540 
132600 
132660 
132720 
132780 
132840 
132900 
132960 
133020 
133080 
133140 
133200 
133260 
133320 
133380 
133440 
133500 
133560 
133620 
133680 
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ggatatgata 
actatgtttt 
atatgcacta 
aatgtctatt 
tataaaaagc 
agtcaacgag 
aaagacacca 
gttataaaaa 
tcaatcaggg 
cctgctgcaa 
gataccagtc 
gaaaagggga 
gcctccagtt 
gataaattag 
gcagttaatt 
aaat tgggga 
ttaagatatt 
cgaaaagagt 
acagaccctg 
gattttttct 
gaattaatta 
gatttaaaac 
tggatagaag 
acgaacaaaa 
gacaagcgct 
ccccctcccc 
gctgctagtc 
agttttactg 
aaaattgaga 
agt tctatta 
aataaagtag 
agagtactag 
aaggatggaa 
cagaattttg 
tgccctgata 
gaaatttcga 
tatttcatct 
cgcctttcac 
caca ttttaa 
gtaggtgatg 
aggacagctc 
aaggtaaaca 
acatcaaaaa 
ggtcccacta 
gacctgcttt 
tttttaaagt 
agcacat ttc 
agaccctggt 
ttactcttcc 
ggtggaccgc 
tgtcagattg 
ctggaacagg 
tacactagag 
cactcgagtt 
gctggctcgg 
aaat t tctac 
ctgtaaccta 
aaaatttttc 
ttcggagggt 
act cccatgt 
gaagccttag 



aagctgtgga 
tacccaaaac 
tagtgttccc 
acaggatcac 
gaggttggca 
tacagaggag 
agttaagagt 
gtgctctcca 
aggcttatct 
aatcattaaa 
ctagaagtat 
attttaggca 
ctagagtgtg 
ggggtattta 
tttctcccat 
agaaaagtgt 
atggggataa 
ttgagcagtt 
aattacgttg 
gctcaaaagg 
tagccatgtg 
gtttcaataa 
gacatgttaa 
acacacatgg 
ggtccccagg 
cagaagtggg 
attgcgtaac 
ctaatattaa 
ttaaggatga 
cttctactcc 
actctataca 
aagtacacaa 
ataaagttgt 
ttcaagtact 
gcggaccata 
gtacagctgc 
ctgggtgtgt 
atatgggacc 
tttttagatt 
gaggaaactt 
tcgaaaatag 
aggtaacaaa 
gtgataaaat 
ctttggatat 
atgacaacaa 
gtataaaatt 
atcccttttt 
taggtccctg 
tccatctcaa 
tgggtcggcc 
caccagaaac 
cgtttctgga 
aatttttact 
tagagggtgg 
gccaatgtca 
tgtttctgga 
ccatatacac 
tgggtcactc 
ggacctctgg 
ccagaaatat 
gcgagtcatg 



agttgatgcc 
ctctaatcta 
tcccgcttct 
atgcacatgc 
cagactatca 
gatcggagtc 
atcatatctg 
gaaaggatgc 
ctattaccat 
tacaaatatg 
ggtggcttct 
acccagttac 
ctcccacatg 
ttatgctaat 
agaaagaatt 
acagttatta 
agtgaaagat 
ttgggggtta 
ttattttaat 
gttctttaga 
cataggtgac 
aggagaagaa 
acggatgcct 
agtatctttt 
ggtgtggctt 
ccaatttctg 
gaggaatatt 
ttctgaaccg 
taatactact 
acctcctaca 
attgaataat 
ccaaaactct 
gtttaaggag 
aaaggatgaa 
cagaggatta 
taggatagaa 
gactagacaa 
aaataaacaa 
gttgactggt 
gttctctgtg 
aaagataaag 
ggaagatata 
aatgaatggt 
tgtaaagaat 
ataaaggaat 
tttctaagac 
ccccttcttc 
ctaagggtag 
aaacttttaa 
tgatgtcaag 
gtctgttgct 
cacaccagca 
cttcctccat 
accgctggct 
gattgcacca 
acaggcgttt 
tagagaattt 
gagtttagag 
gtcgggccaa 
ctagatcctt 
tttcttgcac 



cataaatttc 
tttaatcgat 
gccagttctg 
tcctagaggg 
aacattcagc 
ctcccagaag 
ggcgtgggac 
aggaggaacg 
cttggattga 
gtaaacagaa 
aatgagtgtg 
ttgatggatg 
agagctttgt 
tttaatgaac 
aaacacgtct 
aatttaagaa 
acaaataaga 
tgctttaaat 
gagttgacat 
gaagaaagtc 
cgtaaacagt 
gggagaaaag 
caaatgcctg 
gccctagaga 
cattcttata 
gatcaggctt 
tgtactacta 
atggaaatta 
gctactgtta 
aagaaacaaa 
ctaccaactt 
aaaaaggg tg 
atgagaaaaa 
gatgtgtgta 
tacagatgct 
aaagtgaaat 
gaaggaattg 
tatgtgtatg 
gtgagtgata 
gacgagaatt 
gaactacaat 
gatagctg tc 
gtatgtaata 
aattgtactt 
aatactttat 
aaaagtgagt 
ttcttcttct 
gctgtaacct 
aaaatttttc 
ttcggagggt 
ccagaaacgt 
cccagaaagg 
ctcaaaaact 
aggcctgatg 
gaaacg tctg 
ctggacacac 
ttactcttcc 
ggtggaccgc 
tgtcagattg 
tctggttctc 
catgtatttc 



attttgaagc 
cgcatatagt 
cttctacaac 
ggggggtggc 
cgtaaaccat 
aggctgcctc 
attggggata 
atgaggatat 
attacattga 
ttaaaatcat 
tgagaactct 
ctgccatgag 
gttgcaagga 
ttgaaactca 
tcagggaga t 
gtgtagcagc 
aacatagtgg 
ttgttactca 
atgccataaa 
tattccttac 
ttgccaagat 
aagaggctgc 
ta tgggttct 
gtagtatggt 
caaagatgcg 
ttaa tacatt 
caggatttat 
aggaagaaat 
ctgttagtgc 
aaacaactcc 
taaatatgga 
tagctgctac 
get ttggt tg 
aattggacta 
atttcaaaat 
ggggagaaaa 
ggaaaattat 
a taactatag 
ccaacacatc 
atgtgggcgc 
tgctactgaa 
ttccatcttg 
t tggtaaaaa 
gtattttagg 
acaatttttc 
ttgagggttg 
tcttcttctt 
acca tataca 
tgggtcactc 
ggaccgctgg 
acattaaatt 
gtaggctgta 
t ttaaaaaa t 
tcaagttcga 
t tgctccaga 
cagcacccag 
tccatctcaa 
tggctaggcc 
caccagaaac 
gcccaccacc 
taggaatttc 



agggaacagg 
aaactctaaa 
agaactggat 
tgtgtccttg 
ggaggaagaa 
ccagatcctg 
ctctgtttca 
aactgcgtgg 
aaacgtaaaa 
agctgtggag 
agaaaaatat 
gctagttcac 
agaggacagt 
gtgtgttagt 
tgaaagtgta 
ttaccatgtg 
accattcaag 
gcacgtaaaa 
ttggagaagg 
atctattgtg 
tcagaaaagg 
tacatttgat 
ggacaaacat 
ttctggagga 
tctagattct 
gaagcgagaa 
aaaggcttct 
aaagaaaaga 
tactactagt 
aagtggaagc 
ggatttagat 
agttttaatg 
gggttctcat 
tttgttgcca 
agtaaaggat 
tgccatgtgt 
aacagatgtg 
gcaattaata 
aaacattctc 
aaaggatccg 
gacttccttt 
gctatttgat 
tatgggaatt 
ggtcgtgaat 
atttccttcc 
gtcataatct 
cttatttgaa 
ctagagaatt 
gagtttagag 
gtcgggccaa 
tctactgttt 
acctaccata 
ttttctgggt 
agggtggacc 
aacgtacatt 
aaagggtagg 
aaacttttaa 
tgatgtcaag 
attgcgtact 
agatatctgg 
gctgacgtca 



133740 
133800 
133860 
133920 
133980 
134040 
134100 
134160 
134220 
134280 
134340 
134400 
134460 
134520 
134580 
134640 
134700 
134760 
134820 
134880 
134940 
135000 
135060 
135120 
135180 
135240 
135300 
135360 
135420 
135480 
135540 
135600 
135660 
135720 
135780 
135840 
135900 
135960 
136020 
136080 
136140 
136200 
136260 
136320 
136380 
136440 
136500 
136560 
136620 
136680 
136740 
136800 
136860 
136920 
136980 
137040 
137100 
137160 
137220 
137280 
137340 
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ataaactggt 
catctcgccc 
tgtgccgatc 
atctgtacag 
tcgcagccat 
tgttggaaca 
aacctaggaa 
acaaccacaa 
ctctactcaa 
acagggctaa 
cttcggttgc 
tcatgagtga 
ctcttct tgg 
atgcccgctc 
tgctaatggc 
aatatacttt 
caata tcagg 
atgaaaaaac 
cttcaaagtg 
caccaaaagg 
atgtgttgac 
atgtaaaatg 
taaaaagatt 
ctatgccggc 
ttgatgattt 
gacagcgaaa 
aagatccaaa 
aaaagcagaa 
cagccatcag 
acatggctgt 
gtgcagtcga 
ggatcaagcg 
aagacgcgag 
aggaggaaga 
aaaataacaa 
ccaccaacga 
ctgttga tat 
acaagaaaga 
gggcaatgga 
ggtgtaa taa 
cctattctaa 
ctagcaccaa 
tttccttcaa 
atttctcgtt 
cttgtgtctg 
cttttcttca 
ctacagagag 
gacgtccaac 
ggataagtat 
ttctcct tct 
agaaggagaa 
taattctgca 
actaaaccac 
ttcaccagag 
tctaggtgtg 
aaatgtagtt 
ttct tcgtca 
ttcgtctctg 
tatcaactct 
gtcgcccgaa 
tgacaacgca 



ttgtcatcct 
acacagaaac 
tgggtgtata 
ctagcagcag 
ggcagcagca 
actcaccccg 
gatctttggt 
ccaaggtgag 
ttgcatatgg 
aaagtataga 
cgtgatccct 
gattattgac 
aatgatgaac 
aactaaaccg 
agctgaagct 
tcctggtttg 
agatgatgac 
tcgcggtcgg 
tctcagtgtt 
agaatttgaa 
taaagggtct 
tttgaggaat 
tgatgagagt 
aggtgctgga 
cagaaaattg 
tcttgatatg 
gaaaaacgaa 
caaggaggaa 
agaaaagcag 
cgcagccatc 
catggctgtc 
tgcagtcgac 
ga tcaagcgt 
agacgcgagg 
ggaggaagaa 
aaagaacaag 
gaggattcaa 
agaagagaaa 
ta tggtgaac 
ggatgaagcc 
agaaattgta 
ctat tctgcc 
cga tgcatct 
caagacataa 
gataaaatac 
actcacacat 
atggtatata 
agaccaacaa 
gatacgtatg 
tctgctcgta 
aaattccatc 
acaattgaac 
gagggagata 
catgatattt 
aatctacgca 
gatgtgtgcc 
t tctcaaaaa 
ggggaatgca 
tattttttct 
atggaaattg 
agtagaaact 



gttttatcca 
aggatatgat 
aaagagcgct 
cagtagcagc 
gcagtctcag 
gacggagacg 
gacaaaagtg 
agttaccaaa 
aatggagagt 
cttctcaact 
atctacgggt 
aaggaaactc 
gccaagcatg 
tttgaaaatt 
atttatgatg 
gaatctgctg 
aatatgttgt 
gtaaatagat 
ttaaagtttg 
gaaaaggccg 
gaacaagaat 
atttgtatag 
attttgagaa 
aatgaagaag 
ttgtcccctg 
ttgtgcccct 
gatggaaaca 
gaagacgcga 
aacaaggagg 
aacgaaaaga 
gcagccatca 
atggcggttg 
gcagtcgaca 
atcaagcgtg 
gacgcgagga 
aaggaagaag 
cgtatagtcg 
aggacaaaaa 
gaagtacaga 
aagaatacta 
ccttgtttag 
aacaatacta 
agattctcca 
aataaaatgt 
cacacatctg 
ctattttttg 
aaggctttgt 
catcccctct 
aggatatctt 
aaagtgaaat 
ttgcaaaggg 
ctattatttc 
tatcctcaag 
atgaattgga 
acgtttctaa 
acagaaaatt 
atgtgtgtat 
gggatttatt 
accctgaaaa 
cagatgagga 
ggtctggtgt 



gtttgctgac 
atcatagatt 
gcggagaggc 
agccaagaga 
gagaggggag 
tgataagata 
tgatagagac 
tcgcatcttc 
cgggaggaat 
cacttttggt 
cagtgtgtga 
ttcagaccat 
gcacattggg 
tccaatatga 
gattcagaga 
atgtgtacgg 
tgaacctcat 
ccttgttaga 
ttgaaaaaca 
aaacatgtgt 
cttacaagct 
tatcacaaca 
agtgtacacc 
aaatatgttt 
tctcaattcc 
acagtgacca 
gagtgagggt 
ggataaagcg 
aagaagacgc 
acaaggagga 
acgaaaataa 
cagccatcaa 
tggctgtcgc 
cagtcgacat 
tcaagcgtgc 
acgcgaggat 
acatggcaat 
gggaacaaga 
agaaacttga 
gtaatgttgt 
gaaataataa 
agaataatgt 
atattgtaga 
gtaacatgca 
tgaacatttc 
cacatgcaag 
atctccatca 
ttcgttctct 
acttgaacaa 
agaggataaa 
catagaagag 
tccaacgaca 
tcccctatat 
ccttattgtt 
actgatgaag 
tttcaacaat 
tattcctcta 
caacggtatt 
tactactact 
agaacaatcc 
ctgtttgatt 



gtcaatagac 
tctgggaaga 
agaaacatca 
agatcggacg 
aatctctgca 
cgactctgag 
tattggacat 
tgtgttggaa 
ggctctatgg 
ccacaagata 
cagagaagaa 
atgtaagagt 
aggtaatttt 
agcaatggga 
cca tggctta 
aaacaatcca 
ctgcaactat 
ttttttaaaa 
ctttaaaatt 
taattgtctt 
ttcttgtgga 
cctaagatgt 
taacttgaat 
catgagaaat 
tcatttcttc 
cacgataata 
caaccacaca 
tgtagccgtc 
gaggatcaag 
agaagacgcg 
caaggaggaa 
cgaaaagaac 
agccatcaac 
ggcggttgca 
agtcgacatg 
caagcgtata 
tgcagctgcc 
gttaagggct 
agacatggaa 
tagtagcagt 
taatgctgtc 
atttggttca 
aactcccaaa 
atactgtttt 
ttgcacacgt 
taaaacattt 
tttatcacta 
gaaggcttct 
ttcaactgtt 
actttaatct 
ctccgtgaga 
ttcaatgaca 
actcagataa 
ggcactgatt 
aaaatatcgt 
aggattatag 
ttttctgcag 
tgtgatgacg 
actactacta 
ccaaaaacta 
tttgaagtat 



catatttggg 
gggtgttttt 
gacagacttg 
caaaccattc 
gatctactcc 
cagtacacca 
tttctcatcc 
aaattccccg 
aaggcattgt 
aaaaattggc 
aggcccatca 
gatatacgtt 
ttacacttct 
gctaatgcag 
aacccatcag 
gtggaaattg 
ggtgtatctt 
atgaacacag 
gagtcaaata 
gatagaaata 
cactttcttc 
gaaaaatgtc 
tggtggt tga 
aagaaactgg 
aaaaatagta 
ccaaataaag 
gccatcagtg 
aggacattta 
cgtgcagtcg 
aggatcaagc 
gaagacgcga 
aaggaggaag 
gaaaataaca 
gccatcaacg 
gcggttgcag 
attgact tga 
actaaaaagg 
gatctgagaa 
ctagaaaagg 
agtgttgttg 
attggtatga 
cctcataaat 
atgtctttca 
ttgtaaaacc 
aacactacga 
cttcaaacct 
t tcgtacgat 
cgctttcagg 
tcaagacatc 
ttcaacttaa 
ttctagacga 
gaaacgaatt 
tgaagca tat 
tgctttttgg 
atggtacttt 
ttaatcccat 
ctgaagaatt 
tagagagata 
ctgctccttc 
taaagagaaa 
ttaaaaacac 



137400 
137460 
137520 
137580 
137640 
137700 
137760 
137820 
137880 
137940 
138000 
138060 
138120 
138180 
138240 
138300 
138360 
138420 
138480 
138540 
138600 
138660 
138720 
138780 
138840 
138900 
138960 
139020 
139080 
139140 
139200 
139260 
139320 
139380 
139440 
139500 
139560 
139620 
139680 
139740 
139800 
139860 
139920 
139980 
140040 
140100 
140160 
140220 
140280 
140340 
140400 
140460 
140520 
140580 
140640 
140700 
140760 
140820 
140880 
140940 
141000 
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gtactacatt attaatagag gagatagagg aggttctttt gaaaaggctg tgaagagtgc 141060 

aatttcttct atcaaggaaa agagatgtaa aatcacagat attaatggta ataaacctcg 141120 

attggttatg gtgataactg ggtgttatac agaattgtac ttcaaagatg cacttaaaca 141180 

gattggagaa aacaggcgca aatttttgaa aatgaatggg aattactttt ctctgattga 141240 

tgaacaagca gatctaatcg aattcgcgat gagtgtttct ggtgccgggg agaggatttt 141300 

tgttaacggt ttggggatgt tccagaaccg taaaatgata cctgtaattg atcctctcac 141360 

atatgaaaat gttgtatgtg gtgagcatga tatacaaaaa gaagatgcta ttctttctgt 141420 

aaggagagct attgcagact ataatgactt tgtaagtaag aacaagagag ggaagaaacg 1414 80 

cagcgcagaa gaagaaaatg aagatgaaga tgcagacgct agcagcagca gcagcagcag 141540 

tcctcctcct tcttctcctc ctgcacataa aaaatcacgt cttccggatg aaggcgaaaa 141600 

atgtacactc tgttaatttt ttcaaacaat aaactaacca ccttgtatat ttttgtttgt 141660 

ttgcaacacc ctttttatga taccattaag acacgatggc ggtaaacttg gataatgttc 141720 

ttgtgaatat caacaacaag gatgaagatc ttacaaaact cgtatccgag gcaataaagc 141780 

ggcgagctaa aactgtattt gacactaaaa atcaagcagg gtttgacatg agacgtcaag 141840 

ttgaagctgc attatatgaa gcaatatcca aaaagaaaga aaaggccata aaggcattcg 141900 

atgagctcat acaagaaaga ggtgatgaaa ttacaccttt gactacaatg cagtatgaag 141960 

^gtgggtaaa ccgtacaata actccctcat tgacgactga aaatttatta ggtgatgttg 142020 

agcacgccga ttttttactg gaccgaatga cacccgtaag cgaggaagat attgaaggtt 142080 

tcgctgcttc tacttttaag gaggtatcag attcaaaaac tgcaacagtc atagttaagg 142140 

cagattgtga aacgggggat atcgatgaag tgtataatct tgcaccatca ttcggcgtca 142200 

ctcaagaaat taaaatatat aggtcaaaca attcctcgga attggataat gtcgcagatt 142260 

ctttccatat ttataaaatt tctgcaacag atagcgacag tggcaatact aaaaaattgt 142320 

tgtatgggtt aaggaataaa aaagcaggtt atacgtgttt gtgtagaatt tttgcagaaa 142380 

ttgaatcaga tgggattatg gccaatacaa atatcggtgt cgctgaaaac aacagagatg 14244 0 

aaattgatga aaacgaagaa ggtaaatatg gttttttaat acccaaacaa ccagctggtg 142500 

caaaattgat catctacttc tttttaaatt gttggacata aaaaatgata ttttatacaa 142560 

tgcaaccctt ccttgggttt ctggtctttt ctgtgcttat tgtaattgta atgactgtat 142620 

tagctgtata cactgcacct caaataaaga aatcgaagaa gagaaaaatt gaagatgaaa 142680 

atgaggaaga acccgtaaag actttggaag attttgtaaa gggtcggctc cttaacgctg 142740 

tcaaggaaaa acctgcagag tactttgagt tgctaatatc tgcagacact gaagcagcat 142800 

taaaaactgc cgaagaaaca gcccttcgag attttgttat tgagaacgac tctgtcgaaa 142860 

tagatgtgga ggaagtactt gaagagaaac caagagaata tgtcttcaaa ttggcaggcg 142920 

caacaagcga aacgctaaca aacacaatca tcgcagaggt acaaaaaaag gcagcattaa 142980 

taacagaaga agatatcact attaaaatgt taaaacaatt cagggctgcg aacaaagata 143040 

ataaagacgg ggaagcaact cctgaagaaa aggaagattt taccaataat tcagatcttg 143100 

tggggttgta cttgaacgaa gtagtagaaa aaacaacaaa tattgtcatt aacaaaatat 143160 

tccctcatga gatggttttt gaaagatgtg ctattttaat tgaagatttt gatactggtg 143220 

ttgtgactga tcaagccatt cagataccct ccaacaaata caaaatcaga ttagtcgaag 143280 

gggatgaacc tgaagtattc cctggtgact gcttggatct tgcagtttca gttgataaaa 143340 

taaaccacgt cttgaaaatt tctgcaaaga acggatgtga aaacaactgc ttcgttatta 14 34 00 

ttccacggtt ctctcctgta ggaagtgttt cttccatgat attgggcagc actgaccaag 1434 60 

tcaaacctaa aacattctta tttttagcca acaaaaatga cagtacacat tttcaattca 143520 

caatggataa gcaacattct gtagggtgtg agttggacat gttaattttt tcagaaagga 143580 

acttgaggaa tttacccgat tcaaaaccta gacctctaag tgatgcagac atattggcct 143640 

catatgggaa gcgtctagga actggtgttt tcacaacaga aaatttggta gacgattaaa 143700 

taaaatttat aaccacttta tctgtagact tttatgtcct ttgtgcaaca aaaataaaca 143760 

tgtcttcttc gtcttctgaa actcctaaga cttccaccga tactggggaa gaaaggatta 143820 

aagacattgt aaatgctcta gataataatg gcgagtggtt gtcttcctat attgatccga 143880 

ttatcaataa tcacatctca cgaaaaacgg cagaaactgt ccaaaaaatc aaccaagaag 143940 

ttgatgaacg gtacgataga aaaatagccg acaaaatcaa cgaaataaaa tcatccatct 144000 

ttacaagtgc tcagactatg tatgaccaat atgcaataga cacatttcaa gaaggaaaag 144060 

gagccaacgg gactggacca gtcatggggc cagtgaacac ggttatcgat acaactttaa 144120 

ataaaatgag gggaaatatg ctcgaatacg ctgaagatat gtgggacgga gatgactgga 144180 

aacgattttc cagttctatg acaacgcttg aatttgatct aagttactct gatttaacta 144240 

tgatgcgtgg ttctgacggg tattttgcat tccctttccg tggaacaaaa aagataaaga 144300 

tggacggttc aagaaagaaa gaagacccaa ttaattgtat catttcagta acatatccaa 144360 

acaaagtagg ggatgagtgg gaagagggta aagaacgtga agtgaatttt aacctagaaa 144420 

gagtagacga ctatgaaaga gatatccatg tttcaatttt gtgcatgtta catgcacaac 144480 

ttgataattt cgaacaagca ttaggagaaa atgcaaactc tttttatttt aaaaaggggc 144540 

aaagagtcat gttcttaccc aagaaatcta aactgttcaa tagacctact gtagaagatt 144600 

ctgatatgtt ttctataata ttcccacctg catctgacca agattttgca gatgatattt 144660 
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attatcgaat aattgtaaca tgttcataat aaaatttgtt gaaataaaat acagtaatac 144720 

attcaaaatc ttttattatt tttcctttct aatccaacat atataatggt gatacattta 144780 

caggtgatac ctgaacatat ttataaaaat caggaatata cataaacatg ctattgtaag 144840 

ttgtattcat atccctggtg atcatctcgt gccagatgcc cctcagcgag atggagcttg 144 900 

ctgtaatctt tgccctgtgg agtgcttgtt tgtatgcgtc ttgagccctc ttcatctcat 144960 

gtgctgtagt ttcatccccc tcacaaggct ccatgataat attctttctg gccgaattat 145020 

acttgtaatc aatgaggtat tgcttgaaag gcccagtcaa gatgtcgttg gttgtagtct 145080 

ggatgaggtc acatttcttg gcaacattct taaaggcggc agcagcagtc ttggcagaca 145140 

ttgaattcgc ttcaatgctt ccaaaaatat cctcattatt ctcgtgcacc ttcaagtagt 145200 

tgccactatt ttctcccacg cctccgtcaa caataaggtt gcgaacgtat ctgcggctgt 145260 

gtgtcttgtt ctcaacaact tccctcttta gaagattttc aaccatctta gtgtacatgt 145320 

cagatctcca attgttgttg atgtcgttat gttgcccaga aaagttccct ggcaaactaa 145380 

ttctgaaggt cttgtagctt ccataaggag tagtgttatt aatgtggtta atgggcatag 145440 

attcacagaa tggaatagcc agtgtatgga cagtagcgta taccttggca gtcacattat 145500 

cttcacatag agcattaata agagcactct tcttgctctc acaataaggc ttcacagact 145560 

tgtcgaaaga ggaccagaaa ctcttatcat aatctacctc caccattgaa gattcgtatg 145620 

tagtggaacc catagaaaag ataataagat tgtccacatc gacaagtact gagtagtact 145680 

ggctgttttg cagatcaagt ccctgatcct cctcatcttc atcctcatcc ttcttctttg 145740 

tagactcgat gtgcacacaa tcctcagcag cctcagttcc ggcagggaac aacttaagag 145800 

acttgaggat agtggacatg ttgaatgagt tggtgatgca tccagaggag tgtctgcata 1458 60 

gagatagaca gacaggagtt gatctgttct ttgataggag agaaacatcc atctcttcct 145920 

cagattgagg atatttggtc actgaagcag atggtctgaa gctgttgtct gaagccaggt 145980 

agagagggtt cccatcgctc tcgtcgaatt cgcactctgt tacatcccca aacacaactt 146040 

ctgaacactg ggaagttgca ccatcatcat cctcatcttc ttctgaatct tcactgctac 146100 

tgctgataac actttccttg tcatcttcat cctctactac tgttgctgta tccttatcct 146160 

tatcctcctc atcctcttcc tcttcttctt cagcagcagc tacaacagtg acagatgaag 146220 

acgaagagga agaagaacac actgatgagg cgtcgtcatc atcatcatcg gagtcttctt 146280 

cttcttcctc tgactgctca ccctcttcct tttcctcctc tacatcactg agaccgatag 146340 

tttttctaat aagattcttg atgacatttc caactgaact ttcaatagtt tcattgttgt 146400 

aatcgtaatc aacttcctcc tcgttttcat cttcgtcttc atcctcactg tcgctgtatg 146460 

catatgctga cttttcgtca tcctcttctc caaatacagt attgttcagt gacttgaggt 146520 

tactctccct cttggccaac cacttgtcgt tcttgaagca acaacatcca tttctcatag 146580 

attcagtgac actttcttcc acaatttggt tagtggtctt gaaatgtgtt cctgccttct 146640 

tggtagacga agaactggga gttgacatga ccaatccagt tgacttgatt gcactatcaa 146700 

tatagtttcc tccaagttcc ttaaacagtg acttatctcc attacttcca aatatctgcc 146760 

tcttcttctc caaatccatc tcgtccacat atacagcatc agagagcttc aatgtgaaga 146820 

gagggcctcc cttatcaagg gccttcttgt tgagtgtttc aatagtcacg tgctcatgga 146880 

actggagatt catacaacca aggaatagat tggccagttt tgtagtatca gtggcgattg 146940 

cagttttctt cacagcttct ccctcaacct tgacaatgtt gctaaccaga tcctttcctt 147000 

caaatgccga cttgaccttt gagacacacc catcacaaaa ggtggaaggg gacagttctg 147060 

tgcgtccaga gttcatcaca ctattgaaca gtaccgattt ctgcgccttc ttattcttct 147120 

ccaattcttc gtcctcatct tcgtcattct ccagaaattc agcaacagat ttggccagaa 147180 

cactcacgtc aagatgagct ggcaccttgc ccataagagc gtccagttct tgaagagatt 147240 

gctgcttctt ctcggcgggt gtgatttctg gaggagtctg ctgctgctgt tcttcttcct 147300 

ctcctacaac ttcttccaca gtttccgtgt gtggactcct gatagtgttc ctcctctgct 147360 

tagagaccat tcttcttcca gcagcaaact tggccttgga aggagttggt ttcttggcag 147420 

aagtagtagt agtatcctca aattcataaa cttgatctgg ttcctcagag aaagtgacag 147480 

ttggctgctg ctgctgctgt tccttgattg actccatttt tagaatattg ggttattagg 147540 

attcagtagt tctgttattc ctagacaaca cgtcttgtct gtacgattag tcgagatgat 147600 

gtgatgtaga gcccttgccc gtgcctcaat atatacccct cagtagacct tgataatcac 147660 

ggttgccaaa caacaacggt tggaaatctt agtagaaaga accaatggaa caagtcagga 147720 

taataatata cagtataagc ttttggggaa caacattcat ttattgtcat agcgctaaca 147780 

ttagaagtac aagaacaggt gattctgcat ccagctcatt ttgttcaacc cacttcattt 147840 

gcaggggaca acactcttct ttcttgaaaa atatggatga agacacttgg aattcttcaa 147900 

taggccctgg ggcgttgctg gaggaggacg cagaagaaga agaagaacag ctgccgttga 147960 

acgagtttat agtttccaga agacaacata atttagctgt aattggacga ggagacttct 148020 

tgtaatgaga gaaccacaaa attcttcttc ccgtcttttc tggtataaaa gtagtcccat 148080 

aatatgttcc tttctggact accacattat cttcgtttaa gggaacatat ctgaccgact 148140 

ttctcatcaa gggctgatct ccatttcttg ttgtgcttag agccgatttg tattccccgt 148200 

cgttcacgtt atagttgcat atagaacagt gggatgaaag gaacttgtcg gttacgtgtt 148260 

cagtttgtac aagataactg gacgcatttg ctagttctgg agggacgtcc tggtttttgt 148320 
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ttattgttga 
gtatgcgatt 
aatcggtagc 
tctcttcctt 
cagccgtgct 
tgccgtgtcg 
gtgtgcgagg 
caacctttaa 
cctctaaagg 
ttaagggtgt 
gaaggtgatt 
caatgacata 
atttggctag 
gccatattat 
atatatcagt 
Gcaatcgaac 
catcaacaat 
accctggttt 
tggcacctac 
taggtgcttc 
gaacttgttc 
tgtgccgctt 
gttgtttttt 
cctccaggaa 
ggaaatttaa 
ggcgattatt 
aatatactat 
cgcagttata 
ccatgagtct 
ccttgtttag 
atttaacgtc 
gcgatccttc 
ggcccagcaa 
caatttggaa 
tcctattatc 
tacagtctgg 
cacacccacc 
cattatccgt 
gcaatccaca 
gaagaaaaaa 
aacgacgtaa 
cagtcatgca 
ctaaaattgt 
caacatttag 
ttagaactgg 
tccaaacgga 
aatactatca 
atcattaatc 
ctaaagacca 
gggcgccaag 
acccacgtct 
gagttggaaa 
aaggaagaaa 
ccggaggatg 
tactacatac 
aacattaaaa 
cacatcacta 
gtt tcaaagg 
acgggtggca 
atcaacgctc 
gcgtgcgtcc 



agtaggtttt 
cgcattgatg 
tggttcaata 
atttttgaca 
ggtagcagaa 
tgaggataaa 
tcgttttcta 
ttgccatggt 
gaacaagtgt 
tgttttattt 
ttttgataga 
atcattcgaa 
tttggagagg 
tttagaaagt 
agaatcattg 
tagaggaact 
ttttttggag 
cttttcccac 
tcttccttcc 
tacgtacaaa 
catagtcttc 
attgaatttt 
ccctacttct 
caaatcacag 
tcctgttatt 
gacattaaac 
aatatttccc 
gagacatctt 
tgctcctcca 
gcgctgtatc 
tcttacagat 
tttgtagcct 
ttcttttctt 
tccaaatccc 
atatagtctt 
aggcatgttt 
agaatatttt 
tgtttcttgg 
accaagtatt 
atcattcgtt 
caaaaccaac 
aagagaaaaa 
ccaatacttt 
tggacatgac 
atggaagtag 
aaagaggtag 
aaaattggct 
ttccctcttt 
tatatgactt 
acctgtttga 
cgttgctccc 
aaaatagcaa 
tcaaagaact 
caaattttgt 
aaacatcaga 
tattggagaa 
atcccaggaa 
attatccgtt 
tttctgatct 
tcttttccct 
taatttccca 



atctttttcc 
tatatgcaga 
atttctggtt 
cctagaactc 
ggggaagatt 
tggctaaagg 
ggaccgtaga 
gttatgagag 
ttccccatag 
tccttttcat 
ctatcggaca 
tcaaagttgt 
gatttattgt 
acaggcaaaa 
aaccttataa 
aaatccccag 
cctatgtaca 
ctagaacata 
atattactga 
tctgaaaatc 
tcttctgaaa 
gtgccagggt 
ttagtttcat 
atggctcgta 
ttataaaaca 
ccacaacact 
gcacttgaac 
gccaattcgt 
tatccaattt 
tttttcgcct 
tccataattt 
tcaaagataa 
tcatcaactg 
taggtttatc 
caaacagtgt 
catgtctatt 
ctcctttctt 
acgccacact 
ttttcatgat 
ctgtaccatg 
accagatgtt 
ggatcaactt 
aaaggcaaat 
aaacgctaaa 
tagcagtgac 
tggtgctaga 
caatgatgct 
tga tgatctt 
gtatagacag 
tcacaagtca 
ttcggatagg 
gggcccgaac 
tttatgtcat 
gttcaatagt 
gatggaaagt 
gataatctcg 
cattatttcg 
cagagtggat 
ccctatcaag 
cgagcgccga 
gtgcctaacg 



tatttaagca 
ttgtgagtgc 
caacttcctg 
taaggttggc 
gttgggagat 
tatatacggc 
tgaccatctg 
gtaaactcgc 
gggacaattc 
cgtactccgt 
ttatatccgc 
cgtgtgttga 
tgcgcaattt 
caacatccct 
catctctctg 
ttatgagatc 
tctcacactt 
agagtat taa 
tagctctata 
cactagcggc 
acatttgctg 
actgggaccc 
gccaggcggg 
ctccatcaca 
actctctgtc 
cgcctctga t 
ccacgagtct 
ccattgattt 
cagcgtttat 
tctctgccaa 
tttcctcaag 
aagagtgtta 
catagtagaa 
tagtccagct 
tttattagca 
gaccagatct 
gagtttggat 
ttcgtagca t 
taatttgttg 
gagacaacta 
gctactgtta 
aaggccgaat 
tttggaaaat 
cactttaagg 
agtgaagaag 
agtgcttctt 
caaggtgtat 
agagacgaag 
gacatggaaa 
gaaatagcca 
tcggctgttc 
agtaatattt 
catgtcaaat 
tctgtaaagt 
gatgaattta 
tccgataata 
tctttacaaa 
acggccagag 
cccgttacat 
aatgttttca 
aatgcgaacc 



cacacccttt 
tggagaaaat 
ttgttcttct 
agtactaatt 
aacaggagac 
atgccctttt 
ccttcctctc 
ctcatagaat 
aaaaatataa 
tattgtttta 
tgcaagttga 
caccatacca 
tagacgtgca 
cacgtcaata 
ta tatct teg 
ggtgaaaata 
actataaaag 
aactctcgac 
catattaacc 
agttccttca 
aattatttta 
gaattcgttg 
gtgttcttga 
tgtgcatttt 
tattattttt 
tcccattata 
aagagttgta 
cctgcccgtg 
tgaagtcctc 
tacattcata 
tgttagcggc 
ttcaacttac 
ggtcgatcgg 
tcta tgtcca 
gacatgttaa 
tcatcatctt 
acattttctg 
agcgtcggta 
aggattgtgt 
tggataatgt 
caacagcaac 
gtcctcaagt 
ccatgtcggc 
acccaaagac 
aggaagaaac 
caaagaaaga 
tccgccagtt 
taaaggatga 
aggtggtgga 
aaggtttggc 
tagactcgtc 
t tgacacact 
atttattgca 
atgtaaagaa 
agtcccttct 
atgttgctac 
aag tacgtga 
atattgtact 
tattgcagtt 
ccgacggctt 
tattatctaa 



gaattaatgt 
ggtatctcaa 
tcctcatctt 
attctggctg 
atgttgaccg 
ttccgttact 
tttttctgga 
ccgcaaactt 
ttccaacata 
ctaaaatttg 
cgtttatgtt 
ttccatttcc 
gaaacgtttt 
ctgtccatta 
tttgatgggg 
aagctctcgt 
tcatttttat 
aaattataca 
acgtattgta 
ctaaaggcta 
gtggatagtt 
aaatgttgtt 
agagcagcca 
acagtttcag 
cccattcgta 
aaggctaaag 
tattctccat 
aagcgcacat 
caacctttat 
aaagtgtaaa 
ggcggtggcg 
agtgtttatt 
aaagaatgac 
tactctttcc 
tcatttcaat 
catcccatgt 
taatcttttt 
ggggagagtc 
tctttgaatt 
cgttcagaat 
tgagaaacgt 
actgagagca 
tatttttgct 
aaagaagatt 
tagttcttca 
aaaatgcccc 
tgcagatatc 
acaaactgag 
agaagtttta 
ccgtttcgat 
catatccaaa 
aaacacactc 
aaatcttaca 
atcatatcaa 
cactggagtc 
tccttacaaa 
aactaaacct 
act tccagag 
ggtgtcctac 
ttttaatgca 
cgactttcct 



148380 
148440 
148500 
148560 
148620 
148680 
148740 
148800 
148860 
148920 
148980 
149040 
149100 
149160 
149220 
149280 
149340 
149400 
149460 
149520 
149580 
149640 
149700 
149760 
149820 
149880 
149940 
150000 
150060 
150120 
150180 
150240 
150300 
150360 
150420 
150480 
150540 
150600 
150660 
150720 
150780 
150840 
150900 
150960 
151020 
151080 
151140 
151200 
151260 
151320 
151380 
151440 
151500 
151560 
151620 
151680 
151740 
151800 
151860 
151920 
151980 
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aaacccattg 
caagaaggtt 
aagggcggtg 
tcttcttctt 
gcagtagaaa 
gatttagata 
gaactcggca 
tatgcagtaa 
cttttagatt 
act tcctcca 
gtggttacaa 
gtggtggtgg 
ttaccctcga 
acaattagta 
gacgaacaac 
gtgggagaag 
agtgtacaat 
aatttggcgg 
ctattttcta 
aacctattga 
acgctcgtaa 
cctccaacat 
caggatatga 
atcggggtcg 
gtggactcaa 
atgtctgaag 
tcttttgtag 
acccctctag 
aatgataaaa 
gttacaaata 
gttataggcg 
gcagcaaatt 
ggaatgatac 
atgtcaaagc 
agcagcagac 
gcatttctgg 
acaaaggatc 
gacaaaaata 
aatataactg 
gagatgtcta 
gtcgttttta 
tccccttgta 
ccacccgtct 
ctcttgttga 
tggagggagc 
cttcttcagc 
caaagaagaa 
catctccaga 
taatcgataa 
cgctactcac 
cctttaatca 
ataacaacta 
aagctgtttt 
ctgaagggga 
atattgtcca 
tccacaagat 
agaaagtaag 
gggattctct 
agaggccatt 
cttccac tgg 
tttctaaggg 



aactggcagc 
catccagtga 
gtggtggtga 
cttcatcctc 
aaggcaagaa 
caattagtaa 
ataaaattag 
caatcactga 
tgataatgag 
ccgccaaatc 
tgatggtgga 
ataatactgt 
atgaaaatag 
acaaagtctc 
aacaacaaca 
tatttctaga 
taacggctga 
ccaagataaa 
acctcccaga 
aaaacatatc 
acaatacgat 
caaaagaaac 
ctacaatgtc 
tacgctcgag 
atactatgga 
ccttaacaaa 
acggtgctct 
caaaatatac 
atacacaaga 
agcctacaag 
cagcatcaat 
a tcaagcaac 
gtaatactac 
tcgccaacaa 
gaggagacag 
aacagacaag 
atgtggaacg 
ctactactac 
gcataaacac 
gcttcagaac 
aactttaagc 
cccaaattat 
a tcattcaat 
gaataatacc 
agaagttgcc 
t tcggatagt 
acgtaatatc 
aatgctcagt 
tagttttggc 
tgaaactatt 
actcgaaagg 
cagatactgg 
ggacgcaatc 
gggaatcggg 
ggctgccgat 
ggagaaggcc 
gaggaacaag 
tcaagagcgt 
ctccaattct 
tgttgttaat 
atgt ttcact 



taatgttact 
aaagaagagt 
cgattctgat 
ctcctcctca 
aactaaacgc 
actgattcta 
aaatttaata 
gatgcaatct 
actaaaagaa 
taataatgct 
ttctggagct 
agttaatcgc 
aggacttaaa 
tacctcgaat 
atgccctcaa 
aatgggaaaa 
cgcgttcaag 
aacggctata 
atctgtaggg 
ccaaagtgcc 
ggaccaacaa 
ttctatcttt 
aaatctggca 
ggaatttgat 
catggcagtt 
taatgcgaat 
ttcttacatg 
aatgaaagat 
tttatataaa 
tgttgtgcat 
caagctcatg 
atcaaaatct 
tcacctgtgt 
ccattttatg 
atcgtccctg 
aggaagagga 
tatgaagaga 
tacacccagt 
gataaaacaa 
ttaaagtgcc 
gtttgtggtt 
aacaaagtgt 
atcccaactg 
caagaagaaa 
cctagaggtg 
gaagatgatg 
aatagcggaa 
gctgtaaatg 
gttcagttta 
gccgtggaat 
accggagagc 
caaattcgaa 
gttgaaggaa 
cttcaattta 
attgaatttg 
gatgaatctt 
tcccagccta 
attgataacg 
gctgctgctg 
ccacgaggga 
gtgaggaata 



cgccataatc 
aaaaagaagg 
tcagaaacag 
tccgaagacg 
aaaacaaaaa 
aaaacaggag 
gacaaggatg 
acgccaatga 
caagtaaaat 
ttagatagtg 
gaattggcaa 
catgaagcat 
actgttgtag 
gaagaaatga 
catgaacaac 
tcaatagtga 
caaaactact 
tctatcggat 
aataatactg 
caagctaata 
aactcagcag 
cccggaaacg 
cgaggatttt 
gaaggaggag 
aattttgccg 
atgaaccctt 
tttgaaaaaa 
gtatcaaatc 
aatagagcag 
tcccagctag 
gaagcagaag 
acaaatgcta 
acgaccatcg 
agtgtattaa 
atgttgcagc 
ggaggagtat 
gattggatat 
aatgccggcc 
gacgacaaga 
aattttcata 
ttgggtcgtt 
cagtatggtc 
aaccaaccaa 
tgattctgga 
tcaagagacc 
aaggaggaga 
agtactggaa 
atattgataa 
aaaagagtgt 
atggaacaat 
ctcttaagaa 
ttgaagccgc 
atgacaccgt 
acaagagtgt 
gacaagttgc 
ctaattcttc 
caaattctta 
caatcaaagt 
ctgaagaaga 
ttaaggatat 
ttgttgcagc 



ttctgagtat 
agaaaaagaa 
attcttcctc 
aagaagaaga 
agaaaccatc 
gttacttcca 
attttgcggg 
atcaaagatt 
atagtgttga 
ctaaattgac 
ggctagctgc 
tcattctaac 
agtcattctt 
tgtcggtgat 
aaccggattt 
actca ttccc 
cgcctatggg 
ctaatatctc 
taactggtct 
atataatcaa 
ccatgtccat 
atccttcatc 
attccatcgc 
ttaaagcgta 
ctcagtctct 
ctaatgtatt 
atggcagtga 
gatacttgaa 
aaagagcact 
caaacgccat 
cagcggaatc 
ttaatataac 
ccgtgagtgc 
acactgcaaa 
aacaacaacc 
taggatcagg 
taaacatgat 
ggacattagg 
gtatgatgga 
taaaaatgta 
cacggtccaa 
aatggaaagt 
ccccagaacc 
aactactgtt 
ccttccttct 
gcaaccacaa 
aatcgaaaca 
cgtgtccaaa 
atctgaagaa 
cacaaatgtt 
gaagcgctca 
tgcagcagag 
gatcaaggca 
tagctctcaa 
gcaca tgaag 
tggtgaatca 
ctacactttc 
tatcgaaatg 
caccactact 
tcactttttc 
aaatggtgaa 



gaaaatgctt 
gga taaaaag 
atcatcatca 
aaaaggagaa 
aaaggacgac 
cgacacgagt 
cgtagcccaa 
agtatctagt 
tactgaaagt 
atctcaacaa 
cttctttttc 
atcaaaactt 
taaaaattta 
gccgttcgaa 
gaaaagagtg 
ttccaataag 
aagacgcata 
gcccaatata 
aaggctaacc 
aaatgccaac 
actactcttt 
tataaaatta 
agaaggatgt 
cactttacta 
ggaaaaa tea 
agaaggagga 
ctgtgaaccc 
aaaattcaac 
eg tcgaacag 
gggggtagct 
tgaaatgaga 
aaatacaatt 
agccgccgac 
taacagtcat 
aacacattca 
aactgaacag 
atctccagaa 
atatggatct 
taaactttct 
gtattatgtg 
gatgtaagag 
atctaaaaat 
aacaacatgt 
gaaggtgttg 
tcctcatctt 
acgaaacccc 
atcgaaccag 
actattcctc 
cagatcaaaa 
aaatattcaa 
aacaatggaa 
aatgttacac 
atcctcctcc 
caagctaaaa 
tgtaatttgt 
ccaaaggtta 
actatgatcg 
tctccagtta 
acgaccacta 
gattcatcca 
gttccacaag 



152040 
152100 
152160 
152220 
152280 
152340 
152400 
152460 
152520 
152580 
152640 
152700 
152760 
152820 
152880 
152940 
153000 
153060 
153120 
153180 
153240 
153300 
153360 
153420 
153480 
153540 
153600 
153660 
153720 
153780 
153840 
153900 
153960 
154020 
154080 
154140 
154200 
154260 
154320 
154380 
154440 
154500 
154560 
154620 
154680 
154740 
154800 
154860 
154920 
154980 
155040 
155100 
155160 
155220 
155280 
155340 
155400 
155460 
155520 
155580 
155640 



wo 01/38351 



44 



PCT/USOO/28888 



aagaattcgt 
ctactttcaa 
gcatctgccc 
agggaattgc 
cacatacatc 
tgctcttcca 
gtttggagcc 
gcagggcaac 
atgttttgtc 
ctatggaata 
agaagggagg 
caatcacttc 
cacacaagaa 
caacagcaaa 
caaagcttac 
aaatggtgca 
aagagaagat 
agatcaagac 
ctgctgataa 
atgttgtaag 
aaaataaggc 
agaagctgtt 
gaaaagaata 
taataaacta 
tgtaattgta 
tttacgatgg 
cagactgcaa 
aaaccttaaa 
aaatatccga 
acattcagtc 
aagtttttga 
cctttctggg 
atgtttctgg 
accctccgaa 
aaaaattttt 
atctttagta 
aaacagtcaa 
tggccctacc 
tctaaactcg 
attctctagt 
aacgcctgtt 
ggtgtaatct 
cccagcggtc 
agggaaaaca 
agagtgaagg 
tttctggaac 
ggtaccactt 
cgaacttgac 
tttttgaaaa 
cacataccct 
ttaatgtacg 
gcggtccacc 
tgagctgaaa 
aaacagaagg 
aaacacacaa 
ccttcacttg 
ataaaaattt 
aagttcggag 
gctccagaaa 
ctgatcagat 
ggtgtaaaac 



ttctgagttg 
gaagcttatt 
ctactacacc 
tacctacaga 
tactacccca 
gagtcctgga 
tatttcatcg 
acgctgcttc 
tattgttcat 
tggtgttgca 
agatactttc 
ggaggaactt 
acgaaagagc 
tactccaaag 
tttagattat 
gagaattttg 
ggaaattgta 
gaatgcaaag 
ggtgtgtcta 
ggaggataat 
aagcggagga 
gcacgaattg 
aaataaaagg 
tatagcaaat 
catagtaaca 
gtgaccgtcg 
atactcacct 
agtttggtgt 
gctgcaatct 
gacccagcgg 
gatggagaat 
tacgttca ta 
agcaacagac 
cttgacattc 
tgaaaagttt 
gaggtggtgg 
aagttaatgc 
cagcggtcca 
agtgacccag 
gaatacagta 
ccagaaacag 
gaca tcgagt 
caccctctaa 
acactaagtt 
aggctatcca 
agccatt tct 
ctggcacatt 
atcaggccga 
gtttttgaga 
ttctggttac 
tttctggagc 
cttcgaactt 
aaatttttga 
ttacacctac 
aggttga tat 
tacacatttc 
ttctgggtcg 
ggtggaccgc 
tggtggggga 
aaaaattggg 
gtactgaaac 



tacactaacc 
catgaccgta 
acaggaggcg 
atatacgagg 
gcacaggcag 
actgatattc 
ggcgaaattg 
aaattttatt 
actgatggac 
aagagcaaga 
catagcgagg 
aatagggaat 
aactcttcta 
aagacgaaga 
gt tgatagta 
gctcaggaga 
gagggagaag 
gcatacaatc 
aagcacacat 
gttaataaga 
gatgatgaag 
ttgaatgaat 
ttaatataat 
attaatgcta 
tgccttttct 
tgataacgcc 
aacaggcctt 
ttcatttttt 
gacattggct 
tccaccctct 
gagtaaaatt 
tccagaaacg 
gtttctggtg 
ggtcgaccca 
t tgaggagga 
tccttctagg 
acatt tctgg 
ccctccgaac 
aaaaaatttt 
gg ttacacac 
tcaaaagtta 
cgacccagag 
actcgagtga 
gcttgtcaat 
gatatctgga 
ggaaagggtt 
cgtgcaatct 
cccagcggtc 
tggagaacga 
ggtcatgtcc 
aacagacgtt 
gacatcaggc 
aaagtttttg 
cctttctggg 
acgtttctgg 
acttagtata 
ctcgagttta 
tgggtcgact 
gtattcagag 
tgtatacatt 
attacatcac 



ttctaaaggt 
caatgaatcg 
tcccccctgc 
acaggactgg 
ccgaggcaac 
aaaagttcct 
tgtaccgttc 
cctcttcaga 
ttttcagtag 
gtaaaattat 
aggatattga 
gtaacaccaa 
ctactactac 
agagcgcttc 
catcatttgt 
gagtaaagac 
aagcacaaga 
ttgccaacaa 
tggaagattt 
cggtcgcttc 
aaactccaat 
aatgttgttg 
gttgtctttt 
ttgacacatg 
taggcatttg 
tacaccaggt 
ggtatagagg 
tacatcgggg 
ccctacccag 
aaactcgatc 
ctctagcgga 
cctgttccag 
taatctgaca 
gcggtccacc 
gagaggaggt 
agctggtgtc 
agctatccca 
ttgacattcg 
ataaaatttt 
taccctttcg 
atgtgcgttt 
gtccaccatc 
cccagaaaaa 
tcatacacta 
tcaaccagaa 
acattttatt 
gacattggcc 
caccctctaa 
gtaaaattct 
agaaacgtct 
tctggtgcaa 
cgacccagcg 
agatggagga 
cgcagctata 
agcaacagcg 
atttgtcatc 
gagggtggac 
cgatgtcaga 
tgtgtttctg 
ttctgatccg 
ccacagtcgt 



tgaagagaag 
acacattaag 
agctgataag 
agtgttccaa 
tggtgctatt 
cgatgctaag 
aaagtggagc 
cgagaagatg 
tgtacacttt 
ccccaaaact 
agtgcctgta 
gggaatgaac 
ttccaccact 
tgctgccagt 
attttacaat 
gctaaaggca 
aacttataga 
aacatctgga 
gggggatgta 
tactactact 
ggaatttgaa 
tgttgtattt 
acattttcct 
ttactagtcc 
aggcaaatag 
gacagtatag 
aatcaataaa 
acagaaatgg 
aggtccaccc 
gagctgaaaa 
aacagtaggt 
aaacagtaaa 
tcgagtcgac 
ctctaaactc 
aaattgatac 
tccagaaacg 
tttctggtgc 
gtcgacccag 
gagtcggaga 
gggcgcagca 
ctggagcaac 
cgaacttgac 
tttttataaa 
gtgcaggttg 
tccagaaacg 
atatttggta 
ctacccagcg 
actcgagtga 
ctagcgaaaa 
gttccagaaa 
tctgacattg 
gtccaccctc 
cgagtaaaat 
tccagaaacg 
tgacagtagt 
acatggtcga 
cgctcgctcg 
ttgcaccaga 
gaacaggtac 
agaaagtaaa 
cctcctccct 



gtggatcatc 
gcatggtact 
gtttccgcta 
ttcgatgggg 
cacaagtcta 
aaggcagaag 
cccaatgata 
aacattgctg 
agaaaggata 
atcaagat ta 
aaattcactg 
agcctccgag 
tccacttcaa 
gacccatttg 
atcagtaagg 
gtcaagaacg 
ggaatcgtaa 
gtacttttcc 
cttgattttg 
acatcatctg 
actgacggag 
tgtatataga 
aaacttgata 
ctagtcaacg 
gaaaagagta 
agggtttcag 
aaatcaacaa 
ggttaaaaac 
tccgaacttg 
attttttgaa 
tacacactac 
gagttaatgt 
ccagaggtcc 
gatcgagctg 
agatagaggt 
cctgttccag 
aatctgacat 
cggtccaccc 
tagaagggaa 
agtctccaga 
aaccatttct 
attcggtcga 
attttcgtca 
gttgagatag 
tttctatcca 
tattatttct 
gtccaccctt 
gctgaaaaaa 
cagaaggtta 
cagtcaaaag 
gcccgaccca 
taaactcgag 
tctctagcga 
tctgttccag 
ttctactaag 
caaaaattt t 
gtctgatgtc 
agttgttgtt 
acatacccaa 
ccaactgaac 
tctaagagag 



155700 
155760 
155820 
155880 
155940 
156000 
156060 
156120 
156180 
156240 
156300 
156360 
156420 
156480 
156540 
156600 
156660 
156720 
156780 
156840 
156900 
156960 
157020 
157080 
157140 
157200 
157260 
157320 
157380 
157440 
157500 
157560 
157620 
157680 
157740 
157800 
157860 
157920 
157980 
158040 
158100 
158160 
158220 
158280 
158340 
158400 
158460 
158520 
158580 
158640 
158700 
158760 
158820 
158880 
158940 
159000 
159060 
159120 
159180 
159240 
159300 
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aacatatccc 
attatggacg 
gaggagaatg 
aaaaagatcc 
tcatcatcat 
tacgtggata 
caaactttag 
gactcatcat 
ccagataatg 
gaaaaggaaa 
ataaacacat 
cttttgaaga 
aagcaagtaa 
aggttggtgg 
acggatacta 
atcaagaagc 
agttctaatg 
ggaggttcta 
ttaaacaagg 
gtctgtgtgc 
atggttacta 
aaattcatat 
actggcccca 
attatagcca 
gcgacaaatt 
cataaaatgg 
cataaaatgt 
ctcagtgaaa 
ttgcggaaca 
gtaaaattgg 
ggatgtagca 
ggagccgatg 
agaaaaaaca 
gctttcattg 
acgtcatcaa 
acttcttcta 
ataccacttt 
gaccctgaac 
aaccggcaac 
tcattcgggc 
ataatcaact 
cttcctcatc 
tgtctactgc 
atctgttgcc 
ttgtgccgtt 
accaggctga 
agtttaacct 
tcattgcgga 
catcacgcaa 
cagtaacaat 
gtgaagtggg 
gaatggcact 
gagtagaaaa 
gggtagagat 
atttgaagga 
cctattttgt 
ctgaacataa 
catcatctgg 
ttgataacga 
ctgctggagt 
attgtagtgg 



gtacccaact 
atctctacat 
aggtaggagt 
tccgtaaacg 
catcagacga 
ttgcagatac 
agaacgacct 
cgtcttctgg 
tccatgttct 
aggaggaaga 
ctgtagatga 
aaaatagttt 
acgaggccat 
aggactcttt 
gtaattcatc 
gtaacattgc 
aggagaataa 
atgcatccaa 
ttatattttt 
agcgagcaat 
aaatcttgaa 
ccgttggaag 
aacataacaa 
tgttggttaa 
tagtacaatt 
cgtctgtttt 
tattaagggc 
gaaaacctag 
aattgaggtc 
tgtcacagca 
tgattagcgg 
ttaaggacgt 
ttagaagaaa 
caatgggttt 
ataaaaacaa 
cttctggtcc 
ctccacgatc 
tcttttctga 
tcttctccct 
tcattcaact 
tttcaagatg 
atcatcagct 
tgatttcaag 
agattctccc 
tctgggttca 
aaattcttct 
attgaacaag 
tactcctccc 
cccatgggtc 
tgaagtctcc 
ggagaatcca 
taatgtcgtg 
cagggccaat 
gctcctcaag 
acattatgag 
tgttgaattt 
tggatcatca 
gccaagtgga 
aatggggaat 
ttctccatca 
tgatgacg tg 



acctcaaaca 
gagctgttct 
agaaggagga 
aaagagatct 
ttctgactct 
aaggaaaccg 
ctacatgtct 
tgaagaagaa 
tggttgtaag 
gtatgaagaa 
ttgtgttata 
ccagttctct 
gaattcagca 
ggg tgacaca 
ttcatcatcg 
aatgggaaga 
tttccgccct 
caagaaactc 
tagagaaata 
gaataatgac 
tattattggt 
agatgcacta 
gagactccgt 
tgatgttccc 
tgcttctgct 
caa tag tagt 
taatttaatc 
aacattaact 
tagtggtttg 
attaatgtat 
aaactttgag 
gaagactgtt 
cgtccctttc 
gcttttcgtg 
gacagaggaa 
aacatttaag 
ttctggtcca 
agccaggg tc 
cct t tcgagt 
gtgctgacaa 
tttggaagtt 
gccgcatcat 
aaagttgctg 
aact ttaaga 
tcgaaggcag 
aaaggaggaa 
gtagaggctg 
tctaaagata 
aaattgactc 
aatgatagga 
ttcctaatga 
tggttcttca 
tccaccta tg 
aactttattt 
catgtttata 
gaaagagtgg 
tacatggaat 
ggcggaaaga 
aacaatagta 
ctttctcca t 
tgggggaaga 



acaacaacaa 
tcttcttcat 
ggaggcagaa 
tctgttaaaa 
gatagagaag 
cccaaagtaa 
agttcatctt 
agtgacgatg 
aaggaaaaat 
gaattcaaga 
cctgatcgga 
cagccagttt 
ttctcttcca 
tctaaaattt 
acgttcgtca 
gtagcagaat 
gtagtttcct 
aatagtaaca 
cacagtgtta 
aacacaaatt 
aaaattcctt 
tatttgtacc 
atccctcaac 
attacctcag 
atggtgga tc 
tat tccgtat 
ctatctattt 
cagagcgtgt 
accagtgaag 
gagggtgtga 
gatgaagacg 
ggactatctg 
tactgaaaaa 
gtgttgatgg 
agaaagggta 
agagaagcgt 
gaactgaatc 
agtcccgacc 
ataagtagag 
caactaagaa 
ctgctaataa 
ctgacgatca 
ccattcttgc 
atgtgattaa 
ctgaaagtgg 
gtgatggaaa 
aagaaatggc 
atccattgag 
agaagaatct 
gtatccgtag 
ccatcagtgt 
acagattctt 
tagcaactag 
ctggagaaaa 
acaagatcag 
acaacacgat 
acaggtgctt 
gtggtgtcct 
gtgcagctgc 
t tagcagtga 
agatgatatt 



caacaacaac 
cgtgctcaag 
tcggaccaac 
gtacatcatc 
aaaaagaagg 
gaaaactgga 
cttcttcctc 
atgatgatga 
ctccccaaga 
gaatggcatt 
ttttaaccct 
ctttcctccg 
tgttatccag 
cctcctttat 
ataattgtac 
tgctgtcaaa 
taatgcgagg 
ggcaaactat 
tcgcgctata 
ctagcggata 
ataatgaaat 
agaatgtgat 
aacaagctga 
atttactttt 
ctgcatatcg 
ataaagtcct 
tatcagctag 
atttgttcct 
agagttctct 
ctcgtcaaac 
gtgtaacact 
cattgctatc 
atatggaaac 
tgtacatctt 
gtaaattatc 
ctcctgaatt 
taactgacgc 
gttaacggaa 
gcgttgcacc 
tatacattga 
ctttaatggt 
gcagttaggt 
caacagaaca 
caaccctaac 
tagtgcaaac 
gaaatcatcg 
ctttaaacgt 
ggatgaccct 
ggaatatctt 
tggaagatat 
tgacattaga 
ccgcatggtt 
cgatgctatt 
tgtgccccag 
taaatgtgga 
tggatttgtc 
tgacacaatc 
gtcttctgga 
agcttctgcc 
tggagatgat 
caacacatca 



catgacatcc 
ttcatctgaa 
agaggccaag 
ttcttcttcc 
aagaaaacta 
tactccttca 
ctcatcttcc 
cgattatgac 
catagaagct 
accttcacgg 
cttttctaca 
gttggtgatg 
ttctggtatg 
aacacctcaa 
agatgaagat 
tattgcagcc 
accaacttgt 
tccccaagta 
tttatcgtcc 
cgctgaaggg 
gagtagagaa 
cacggatatg 
tttttgttac 
aactggaaag 
cctggctgtc 
agatcttgac 
aaataagtgt 
aaatcatctt 
aggaacagcc 
catcgaggac 
gaaatgtttg 
tgataggctt 
tactacgctc 
tatacagtac 
atcatcatca 
aagagaagac 
tgaatttata 
aaaaggtgca 
gagccaagca 
aataacccaa 
gacaagaaat 
ccccttggac 
gagtccctat 
cagatatcta 
aagaatgaaa 
cagcagaaca 
gtggctgaac 
gatgctat tc 
ttctgggaag 
cttcaagcca 
atcctgcaaa 
tctggacttg 
gcccagatct 
gcactgaagt 
cgtcagccat 
aattctgata 
aggaagaatg 
acattcttca 
cctgctgttt 
gatgacgacg 
ggagacggat 



159360 
159420 
159480 
159540 
159600 
159660 
159720 
159780 
159840 
159900 
159960 
160020 
160080 
160140 
160200 
160260 
160320 
160380 
160440 
160500 
160560 
160620 
160680 
160740 
160800 
160860 
160920 
160980 
161040 
161100 
161160 
161220 
161280 
161340 
161400 
161460 
161520 
161580 
161640 
161700 
161760 
161820 
161880 
161940 
162000 
162060 
162120 
162180 
162240 
162300 
162360 
162420 
162480 
162540 
162600 
162660 
162720 
162780 
162840 
162900 
162960 
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caggagaatc 
gagaaaatac 
gaaatgaaga 
ctgttgaaaa 
attatacgtc 
acagagtaat 
tgagtgacat 
agatggttga 
agattcgtct 
gctttgacaa 
aggaaccttt 
atgacaagta 
agattgacac 
tacttagaaa 
tggtgcatag 
atgaacgtaa 
ttgaggatat 
gtaaaggact 
atgagatcac 
atttccacgt 
atatctccat 
acgtggctgg 
agtactttaa 
taacctctgc 
attgtaagct 
gcatctctta 
actcttcttc 
agttcgataa 
taaaaatgtt 
gtgatagtag 
cagcaaagaa 
ctgccgttaa 
cagcagaaga 
acgacgatac 
attctaaagc 
aaacacaata 
atggttatat 
acagatgtct 
gctatattaa 
atttttatat 
tcaatgttaa 
cgtatagcag 
taagctgggg 
tcctccaatt 
gaatctcttg 
atatggcgcg 
ggaatcttgt 
tttgaagcaa 
gtctttatac 
agagaagagt 
gtattgttat 
cttccaatgg 
gttccaagag 
ataaaaagtc 
atagtaatcc 
ggatctagtg 
ttaataattt 
ataacattcc 
cgggcaggca 
tgttggatcg 
gctgcaatag 



ttctgggcag 
tgcttctctt 
taaacaacta 
ccatgtaatg 
aaacaaggaa 
ggttgaactg 
tgaaactgta 
acatggatgt 
tagtgccaag 
gtctagagag 
tccccgcaac 
cagggcaagg 
ctttacaaca 
gtgcttcctt 
aggagacagc 
cggaattgaa 
tacaaaaatt 
ttctttgagg 
aaaccaactc 
ctttacttct 
ttttgcagaa 
agaggacaag 
gctctttaat 
cattaacagg 
gtctgataac 
tgcagactat 
agaaaagagc 
gagtgaagac 
ccaaatggat 
tagcagtgag 
gaggaaagtg 
gaaggtgaag 
agaaaagaaa 
agaagaggac 
tattttctaa 
taaagttaaa 
ttcttgtagc 
gtgaagcagc 
cagcataata 
aaaaatcaac 
caatcttccc 
caatacggct 
atttttcatc 
gagttgtagt 
ttgcagtttc 
gattaactgg 
tgaggttgat 
taaagtatga 
ggtcctttct 
ctctaatttt 
tattagcctg 
tatttaattt 
cacctccacc 
tccccgaaga 
cattttgttc 
caacatataa 
tcatctcttc 
taaatttgtt 
cattaactgg 
ctcttcttgt 
caatgatgtt 



aacggtggtg 
tctcagaagg 
ctcaagaaca 
tgtacagatg 
ctatacaagg 
tttgtcaacc 
caaaatcttc 
acagatatgc 
agagcttgtg 
gcaaatctca 
cacaattctc 
atgaacaagc 
acagacgatt 
tctgcctctc 
ttcaatatta 
gaagttactt 
aaaatgaaga 
gaattctcta 
tgcaacaact 
gtgtgtgtct 
ttgggtgaag 
acatatgatg 
gcactcgtta 
agagggtaca 
gctatccagt 
tatacgtctc 
aatgcagatg 
agcaagaaaa 
agggttttgg 
aatgaagagg 
gaagatgttg 
caggaagaag 
gaggaacagg 
gagaaagcag 
atatcatgtg 
cattgttttt 
ttgctcggca 
catgataccc 
aatcaggttc 
gtctactatt 
ctgaacaccc 
agcttcatta 
agttacttga 
cttgtccgct 
gatgatggaa 
aacaaacaga 
attgtccttg 
tgggattcta 
tccatcatcc 
ttgcatttgt 
cctctgttga 
gtccaatatt 
gccgatggcc 
agcatagcgt 
agctgttatc 
tccatcgacc 
gttgctgaca 
ctgaagaaat 
aacaaccgct 
tccaccacct 
ttgtacacct 



gtgcatcaac 
aaaatgttcg 
ttatcaactt 
aaaatatctt 
ctattgtttg 
ttattcttcc 
cttcaaacaa 
gctatgacat 
agtgcagaaa 
cccctagcca 
acaggagtaa 
tcaagaagga 
ttctcctgca 
ttcatcacat 
actttgcaaa 
cctcacaaac 
gaggggatga 
agaaggttag 
gcaacgttaa 
acatccacaa 
aattgaccaa 
atattatccg 
aattctgtca 
tgtgcatggt 
atcacgaatc 
gcaataacaa 
tagccaagac 
ataagaacaa 
atggcatgga 
aggaggaaga 
atagcaataa 
atgtggagat 
aggcgaagga 
tagcatctga 
tataaaatgt 
tatttgacac 
gcaataagag 
ctcgatatat 
ttccaatccc 
gacctggcaa 
attacaatt t 
tttaaccat t 
gatacaatat 
tgctcttgga 
atacagttga 
ttctgtgcag 
gtgtgtttcc 
ctgtatttct 
ctaatgtaa t 
tcattgtgaa 
cgctgctgct 
gaacgtaatt 
gctgcagct t 
aaagatcca t 
ctttccagta 
ataatatgag 
agcatca tea 
agttcga tc t 
tgtgtttctt 
actatattat 
ttaccagat t 



ttacaagaga 
tctcatggcg 
tctgaatagt 
cgatgaggat 
ctccaaccca 
ccgtcttagg 
tggatcggtg 
ccctccatat 
actgtgcaag 
gaaggcagga 
cgctcacgac 
ttcaaagaag 
agataggaac 
tttctgtcct 
caacaaactc 
ggtcaacgcc 
tattatagat 
taagattgtg 
ctcttctaat 
cattattcct 
gcttgttaag 
caattacgaa 
caggaattat 
gagcaacctt 
actatgctct 
ttctgaagat 
tatggcctct 
aacttcaaat 
tgatgatgat 
ggaggaaatt 
gaagacactg 
ggaggaagtg 
ggaagacgct 
tgaagatgaa 
agtattttaa 
atgtacatca 
caagagaaga 
tattaataag 
tagaagccca 
cagccactat 
ttgatttttc 
ccaattttac 
tattccattg 
aactttgcat 
gccattgcat 
gagaaaagtt 
atgggatagg 
ttagtgtttc 
tcaaaagttt 
cctttatatc 
gttgttgttg 
ctacaggcac 
cttcactagc 
acccttcatt 
tt tcttcaat 
acacgtgctc 
ttctagatat 
ctgtgaacgt 
gttgttgctg 
tactgcctct 
ctgcaattac 



tttaggtgtg 
atgccaaagg 
gcacttaact 
caagccgagc 
gccaatgtgt 
aacccaattg 
agaacaaaga 
gcaaagggaa 
gacgtgaggt 
agggaggtgg 
tttactttct 
aaggtaaaga 
gcttttgatc 
gatgtgctta 
gagtgctata 
aaggaagcac 
gttgtgaaga 
agaaggttta 
ggagatgtgg 
gtgctcgaag 
gagtgtagag 
attactgtaa 
aatgtggcag 
gtcgggtatt 
ttgcactcta 
ggaggaggaa 
ttctatgacc 
gagatcctta 
gatgaagata 
gtaaagaaac 
ccaaaggaac 
aaggaagcag 
actgagtatg 
gatgatgaag 
aggtataaat 
tcaatttcac 
gtcatataaa 
gggagttgct 
cacattgggc 
acctagctca 
tgaattctct 
acttgaaaca 
agataaaagt 
gacgatagcc 
gtcagtaagg 
caagtcttct 
tttctgtgcc 
tgaaatgttt 
agatgtgaat 
ctcgggaaca 
ttgagtagta 
ccttctacca 
ttcatcattt 
tccagtatca 
ggcctttcta 
gcacact tct 
aatagtagca 
aggctcgaaa 
ctgttgttgt 
gagttggtcg 
ttccattttt 



163020 
163080 
163140 
163200 
163260 
163320 
163380 
163440 
163500 
163560 
163620 
163680 
163740 
163800 
163860 
163920 
163980 
164040 
164100 
164160 
164220 
164280 
164340 
164400 
164460 
164520 
164580 
164640 
164700 
164760 
164820 
164880 
164940 
165000 
165060 
165120 
165180 
165240 
165300 
165360 
165420 
165480 
165540 
165600 
165660 
165720 
165780 
165840 
165900 
165960 
166020 
166080 
166140 
166200 
166260 
166320 
166380 
166440 
166500 
166560 
166620 
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gagtcttcgc 
ggtaatccag 
gtgcccactt 
ttatagatcc 
acagcagacg 
tatttgactt 
aggaatgcgc 
acgaatgaag 
ccaatttcca 
gacaccaatt 
tatgtcacat 
gtttcagaag 
gaggatatta 
cctgttggta 
gatgatggag 
ttcaaaatac 
acaataaagg 
tgaataactg 
aataaaaatt 
taatctgtta 
aaaaa tggaa 
tccactttca 
taaagaccca 
cttccggcca 
atattgtata 
gagaaaagtc 
ttattataat 
agaattaatt 
ccgaatgtaa 
acgagtgttt 
cagtctgttt 
gtaactgggt 
gtagaagttt 
gaagatacat 
ttttgttcgg 
gttggttcat 
accaaatcct 
ataccaaatt 
ttggagtttg 
agaattttct 
tttatccaca 
gtgaatctag 
taggcgtc tt 
agatgggata 
tctttctcat 
aatctctcta 
tcgataaacc 
tctg tcagag 
gattttgtag 
gggtgtattt 
aggt t tgcaa 
atttcgctat 
gctaaacggc 
tgcgttgttg 
ttgtttcctt 
ga tggatatc 
tctgcagctg 
attctctcag 
aaaacctctc 
gacattgatc 
aaaaacccct 



ctgaaattgg 
tatcatactg 
tactggggaa 
tcattgggct 
agagaagggg 
gtttagatgc 
taacatgttc 
aagaagaaga 
tatcctttat 
cacctttatt 
ttcttgttag 
aaaatggagc 
gaggtacatt 
caggaagatt 
aagaagcgac 
tctgaggtgt 
gaggatctct 
cataatgtgt 
tgtgttgatc 
gttgtctagc 
gaagaacaac 
ctggctgtcc 
ccacgcagtt 
gtataggctc 
tggatgcaat 
ccacaacagc 
atatcagaga 
ttaacgtagg 
gaaaccttcc 
tattatgttc 
tctccgatat 
ttttcatgga 
tttgcctctt 
ctcctaaaga 
tcgtcaatga 
aaagtttggg 
ttaaagaatt 
cagtcgagtt 
aagccatgtc 
ttgcatttgg 
ccttcacatc 
aggtcgtaag 
tcaatactat 
tattgacaag 
catgcattgt 
aagacgaagc 
tctcattttt 
atgaaaatcg 
atatttcttc 
ttacccttat 
gggtagaatc 
cgccatcttg 
tattgtcttc 
acataatttc 
atacgggtgt 
aacaatatat 
cagtcaaatc 
acacgtccat 
acatgatggc 
caaataagga 
tcgtgtttaa 



tacagaagga 
aagaactttc 
aatatcgatc 
taaatgttgt 
ttcaaagaat 
atgaacagga 
aattttccca 
agaaagggat 
accatcatcg 
tttcaatcca 
agatttaggt 
agtggcagtt 
aatgaaaagt 
aactacccta 
tgaagtggca 
atcggtcagt 
agggaaagca 
aacaatcttt 
aggatcggct 
ctcctttaca 
ttgaagtacc 
tctcataata 
aatgggatcg 
ccctgggaat 
ctctcgagca 
atcagagaca 
tgctattata 
aatatcaaca 
agaatcgtgt 
agcgtcgtta 
acagtccatg 
aatggaagta 
tgtacgtttt 
ttttccttta 
ggcggttgag 
ttgaggttta 
ggtatttttt 
aaattttgac 
atttaaatca 
gttcttactg 
cattagacga 
aacaatatca 
tgaggaagaa 
ttcttccatg 
ccagtcctta 
agtttcttcc 
acgagaataa 
tcgtgcttct 
aatagtcatc 
ttttgcattt 
ataacacgca 
ttgttgttgt 
gttgttctta 
ctcctcgacg 
gaaccaatag 
agaatagggc 
tttgtagtct 
gttgttgggt 
aggaatcggt 
aataccctat 
catgcagaag 



atcttgggtg 
caaataaatt 
ctccctgtag 
gtcgatttag 
acagtttcca 
agatttgtgg 
tcaaccgaaa 
ccagtgtcgt 
cgttcagtat 
aggaatccat 
ggtacataat 
gtttgctgct 
tccagatcct 
ataggggttt 
gactttaagt 
gtatagtatc 
aggcacacgc 
gcaattagat 
gtatatttta 
gcccatacgc 
tgctggtgca 
tttcttgtca 
taccatctgt 
agtatggaat 
atatctccta 
tttgagaatt 
gcagcagaag 
tctggtataa 
atttccgaca 
agagaggtta 
attgaagtgt 
ggtgcccctg 
ctattttcat 
gacgataaaa 
gaggaggagg 
tttttaaatg 
ccaaaaaagc 
aactctggta 
cgtagccttg 
gtgtcaggaa 
aatatcacag 
agtatgccgt 
agtctttcca 
ccgtcgtttc 
atattaatga 
atgttcagct 
catcctcctc 
cgtgctcttt 
tctttattcg 
tctgtttcac 
aaagttcccg 
tgttgctgta 
ccaggcaggg 
tttctcgtca 
tcggtaaaac 
cttgtcacta 
gggcctaatt 
ggataaagta 
agaagagata 
aacgttccac 
tgctcagact 



acaacctttc 
cgaatccagt 
atagaggact 
tttttgaggc 
tgcccaattc 
gctgagaagt 
aacttgtccg 
cttgatggtt 
tctttacaac 
cttctgcgat 
ttaagaacga 
gttgttgttt 
tattcatctt 
ctgatgaata 
tttcggctac 
tgagagacgt 
ctccattatc 
ccatcacggg 
gtagcacttc 
ccgatttcaa 
tgtacatcca 
ttgtttctaa 
actgccg tec 
cgcagttaga 
ccgtaatagc 
ccctcccttc 
attccccaga 
aggctgtag t 
cgttactttt 
aaaaaaggtg 
attgaggccc 
aagggacaat 
cgtattcctt 
tcgattgtgc 
aggctgctgt 
atagttgata 
actgggccgt 
tgataatgga 
taagagaggg 
atgctatctg 
ggtgttgtaa 
gatgggcata 
tttccca ttc 
ctttaagatc 
caacaggcaa 
tgatgttgtt 
ctcctcgagg 
tttccactaa 
tctctataat 
ttcctaaacg 
cattaacaag 
gcccttcagc 
gccgttctat 
gcctgtaatt 
accaa tccac 
gcct tgcagc 
cgctctgtac 
taaatactca 
atagacccgt 
caacccctat 
gtgctccttt 



tagagttgtg 
tcctgccact 
aacacattcc 
ttcataaagc 
ctttaccgaa 
cgtaccaggg 
tacaccatca 
attaatatca 
tt taaagtac 
attctgacca 
ttttgaaatg 
ttgttcctta 
taaaaaccgt 
tctggaggag 
attatctcca 
atcattttct 
ggctgccgaa 
cttcttgaat 
acgtactata 
aacgcaatca 
agcggtctca 
acacgaccga 
ataacttctt 
ttctcttcca 
agatccgccc 
tagatttaca 
atcgtctctc 
attattttga 
tctgtcgaca 
catttctatc 
aaagaagcaa 
gttggtatcg 
agaaagggtg 
gattttttcc 
tgctgttgct 
ataccgtgta 
aaagtagtgc 
agaaactgac 
atgttgatag 
gaggggggtg 
taatttccct 
aaacacacca 
ttcttgattc 
tagccgtatt 
aagccctgcc 
aatttcgcct 
aaatttttgt 
tgcatttaga 
aaacacagac 
cccctcggtg 
cggtagagaa 
caataggcaa 
tgtttgcgtt 
cgagtagagt 
cggttacgat 
attgtctctt 
gtggttccgt 
ggtgtacata 
attacatctg 
tatctgtgaa 
ccctccctac 



166680 
166740 
166800 
166860 
166920 
166980 
167040 
167100 
167160 
167220 
167280 
167340 
167400 
167460 
167520 
167580 
167640 
167700 
167760 
167820 
167880 
167940 
168000 
168060 
168120 
168180 
168240 
168300 
168360 
168420 
168480 
168540 
168600 
168660 
168720 
168780 
168840 
168900 
168960 
169020 
169080 
169140 
169200 
169260 
169320 
169380 
169440 
169500 
169560 
169620 
169680 
169740 
169800 
169860 
169920 
169980 
170040 
170100 
170160 
170220 
170280 
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cccggcactg 
caaaaggaaa 
aagctgagag 
ttcccttgcg 
ccaccccgta 
gagcttaacc 
ttggatgttg 
cgagctggcc 
aaccaacttt 
caataggaac 
ccgtgatgat 
aaccaacacc 
aaatgagaag 
attgtgacca 
tttcagctgg 
ccaagaaaca 
atgtggctaa 
ctggaggaga 
agggagaaaa 
aaaatcttgt 
ataaaagaat 
atgatacttt 
tttccataca 
acttgggtta 
ttcttgtgtt 
ggtacttctg 
ttcttgatat 
acatatgaca 
ggaagactat 
tcttcctctt 
gtaggcggac 
acattggcca 
tctggagtgt 
ttcttcttat 
atgcacttca 
gccaatccac 
tcgagagatg 
tccttttata 
tctggtcttg 
cgagattctt 
ttatgcaagg 
tcaccttgtt 
gcacggtcct 
tttacagaat 
cgacactgga 
ttcttgtttc 
atactttctc 
cattaattgt 
gtggaccaga 
atgttatcct 
tagacatcga 
tgggtaagaa 
gtgacgaaga 
aaaaaggtgc 
cggtagaaaa 
gaatggtgct 
aatctttctt 
atccttcgct 
agaaaaacat 
tccacttgaa 
ataacccaga 



agaagccttt 
ttgaggagct 
atatccccag 
ttgacactac 
gtgacgttag 
a tatcgt tgg 
atgacgttgt 
tgtgcaagtt 
ttatataagc 
catggcctca 
ggaaagagat 
tgttagcgcc 
acttgaagat 
tagcgatgac 
cctgtgctct 
ggtgttcctt 
aactgaagaa 
tggagatagt 
tggatctctt 
agatggtgga 
acataattta 
taggtgaagg 
catgattctc 
tacgtggagc 
tcatctcctt 
gcacgtattc 
tgaagatggg 
cgtaatcacc 
tgtcagaaga 
tatttgtact 
tagcataggt 
gatcgtaagt 
tgaaccggac 
tcttattctt 
cgagcatgtt 
tcaggaaagt 
tttcttcctc 
ccgtcacttg 
ggattggggt 
cccttgatgc 
acaccagaat 
cgactgttat 
cttcaggtaa 
gacagtaatg 
gaaaggt tga 
tggagttt.ga 
ccat tccttc 
gtttcaggga 
cgtcgagatg 
aggaaatgtc 
tttcaaggac 
aacagcaaag 
cata tacgcc 
caccgaggaa 
ggctatgcta 
ttctaagcga 
cagtgttttg 
tacaaaattg 
cagtgtggct 
ctttattcga 
aacgttcccc 



ccctccatac 
tctggttgac 
aacctaccct 
cggtcacagc 
gcacggttac 
taagctcact 
gattaatctg 
ctcaaaaaac 
ctaatattat 
tcatcatcat 
gaagaaaata 
gcctgggtgc 
ttttcttcag 
gatgacgatg 
tttgtgaggg 
cttacaagcg 
ggcgcagcaa 
ggtattgctg 
ttggaatcta 
gaagaaacca 
tataatattt 
aataaaggta 
ccacatgtta 
attgttgttg 
caattcctgg 
gtagacatgt 
agaaggtggg 
gccattgctt 
aaatgacgac 
tgttggttgt 
tggaccagaa 
ggtagatttg 
gtggctgatg 
ctcccacggt 
catggccagt 
gacgaggata 
gactaccaga 
gttagaaaaa 
tctgatactc 
agtttctatt 
attgaactcg 
acatactcct 
gaaaactgca 
tttatctata 
aggaaaatct 
ctttttagga 
agcgggattt 
ggctataaaa 
acgagacatg 
gactacactt 
aatatcacag 
aaaataaaga 
accaaggaat 
aacgccatcg 
tcagaaagtg 
gacacgtccc 
atat ttgaaa 
tgtgtgatta 
aacgtgtcta 
ctcgggaaat 
ggtatgtttt 



cctggtactg 
caatctttcc 
ctcgaatttc 
gatatcccct 
cactacttaa 
gaaaagcaag 
tcaagcactt 
tagacggtaa 
aaatcaacaa 
cccctgttgc 
cactgtccct 
ctgttgatga 
atgaagaaga 
acgaggagga 
gtttctttgg 
cagccgttgc 
ccatggaaga 
ctgatgttgt 
ttgcaacaac 
cagaattgta 
tattgtttcc 
ggagaaggtg 
ttattgttag 
ttgttactac 
atacacttct 
tcttccacat 
tttgaaggaa 
ctaggtggag 
gagcagctcg 
tgttgaggtg 
aatgttacca 
agcatgggtt 
aaactggatt 
gagcaagaaa 
gcgagcccaa 
atgttgtcgt 
cgagatatga 
acgatagtgt 
tgcccagcct 
ccatccaaga 
tgaacgcttg 
tcaatagaat 
aatattgtat 
atgtcaccag 
ttccgcatat 
ttatttgttg 
tccatcttac 
caaaactctt 
gtgtgcttgt 
tctgtactac 
accaaaacat 
aggaagatgc 
tcgaacagac 
cggccgcagc 
aaggaaaatc 
agaaacagtt 
ctggatcagt 
aagccacgac 
tagtgaatac 
tcttcgaaag 
tcaagctgcg 



cagtagaaga 
ctcccccatt 
ccgagaagaa 
tcatcgatct 
tcaaccccaa 
aaaacctgaa 
tgaaggaact 
aggtatgacc 
ctaacaaacg 
cctctcctct 
caggaacagg 
agaggatgaa 
cgatgataat 
tccttcatgc 
cttcctcagg 
tgctattttt 
aaattcaaca 
ctctctcgct 
actcatcaag 
aaatgttttt 
tcttaaatga 
tggaagacac 
gtgggaattt 
tgctcctggt 
ggatagatgg 
tatcatcctc 
gcttcagatg 
atggcggcgg 
acattggcat 
tattggcaac 
gttttgtgac 
cagagatgtc 
cggtgttgct 
tgcaagtagt 
tagtcatgga 
tggtgataag 
tgctctgggg 
agttcacagt 
ccagatatag 
tatcttcacc 
acttgttgat 
cgtatagccc 
taggcacaac 
aaaagtgaaa 
tcttgagttc 
ccagacttac 
ttttgtagtg 
acgcatctgt 
tccaaaaggc 
tgacaataat 
tcagttattg 
acctgaaaca 
aataaaaggt 
tgccgctgcc 
aatggtcatc 
cactgcattg 
gatagttgtc 
tgatattgct 
agtgtccact 
aaattgcatc 
agtgcccgca 



ggaggagaag 
ccc tggaaat 
ggagaaggat 
ggagaaaacc 
caaggttggg 
caaattggtg 
tgagaagctg 
aaaccggtat 
cgaaaac taa 
gtcgcatcat 
aatgtgaaca 
gatagggagg 
aaatcatgtc 
tttaagggtt 
aagtcactta 
aagactagag 
gatg tgatta 
agtgagggag 
actacaattg 
tggtatacaa 
caatttcctt 
aattgtagtg 
cttaaacgta 
tagggtgttt 
agattgt tga 
tccctcctca 
tcgtccgtt t 
tgttggcaga 
atagacactt 
tggcctgggt 
ttgaggggta 
caggtcct ta 
cccgttcctg 
agttctgtcg 
gcagccgact 
attgtccatc 
ccctgct teg 
at t tgtccct 
gtgga tagac 
aaa taagtcc 
gatgcataa t 
ggggt tat tg 
atctggcacg 
atctatca tc 
ccgcaataac 
agcatcgaaa 
gttttcag t t 
tcattcag ta 
cgttctaggc 
tgtgtcagtc 
aacaagaaat 
aaggaaaa ta 
ctacagacaa 
actgctgctg 
aacagagcta 
aagaacaggg 
gggcttcaag 
gatattctac 
tttaatagat 
tcttacag t t 
aagcctctct 



170340 
170400 
170460 
170520 
170580 
170640 
170700 
170760 
170820 
170880 
170940 
171000 
171060 
171120 
171180 
171240 
171300 
171360 
171420 
171480 
171540 
171600 
171660 
171720 
171780 
171840 
171900 
171960 
172020 
172080 
172140 
172200 
172260 
172320 
172380 
172440 
172500 
172560 
172620 
172680 
172740 
172800 
172860 
172920 
172980 
173040 
173100 
173160 
173220 
173280 
173340 
173400 
173460 
173520 
173580 
173640 
173700 
173760 
173820 
173880 
173940 



-'Ill W,' ""i! 



wo 01/38351 



49 



PCT/USOO/28888 



tgcctggaga 
atcccaattt 
ggaaaattac 
aattactatt 
aagcacaaag 
acatgttgct 
tggtggacca 
tggacgctgc 
aaaaaataag 
gacgtgctat 
atgaccaata 
tttctgaaaa 
tagatgaaaa 
tagtggagga 
aggaaaaaat 
acaaaatgag 
atgcaaaact 
cgtaccgatt 
ttggtaatga 
atgctctaac 
gaaatgcata 
ataaaggtaa 
tgtgcaacgc 
gtacaagttg 
gagacaagaa 
tggtagattc 
agaacaatag 
caacaggtgc 
caagcaggag 
ctactcaaaa 
ccactgaaac 
ttctctgtca 
aagaatgcct 
tgtgagacgg 
gaggttacaa 
gcgtcttctc 
aatccaaaat 
ggagatttgt 
aaactggata 
agagagcacc 
gaaaaggcaa 
aaagaagaag 
gttttaacgg 
atatatcctt 
gaaactacat 
tgcaatgaaa 
gagtacgtgg 
cacaaaacat 
tttctaactg 
ccaaacttga 
gaacttgtaa 
agagatataa 
gtgaatctcc 
aaaatgtctg 
gatgtagctg 
gctctttctg 
atcatcactg 
aaaatgtctg 
gatgaaatta 
tatagcccgc 
agcattgttt 



gactataggg 
taaaatgtct 
tgtgctcgga 
tggtttattc 
agtcagagaa 
ccactcctac 
acaacatatt 
catgtctgca 
atcacaacaa 
agacacactt 
ttacggtaaa 
tacaaacagt 
cgaatacaag 
atgtgaatca 
accaacgcct 
tattaacaat 
acaggaaaaa 
tttgaataat 
tgtatatagt 
atacaacgaa 
ctttagtact 
caatagaaaa 
gtgtggtgtc 
cgatcaaaat 
gattaaacgt 
tttatcctcc 
atgcacccct 
tccaaaggtg 
agggatggaa 
taaaaccaat 
tgaagcacag 
acttgtggaa 
tccatgactg 
acagagattg 
aattctcttg 
tagaaagagc 
tggaggaaga 
ttgtgaataa 
atgtggacac 
tactgaggaa 
agaaatggat 
aagatgcgaa 
gcacacatga 
tgtcaagaaa 
ctcctcaggt 
gtatgaatgc 
ctacttgtgc 
acaggtatgt 
atcctgtact 
gggcaattta 
agacggcatt 
atcaagctga 
taaatcgcat 
catctttaat 
actcatttga 
ttgctctcta 
gagaaagaac 
ttgctcgcgt 
gaaccatcat 
tcgaggaaga 
gtggatatta 



gaatactaca 
gactggttga 
gaaggagaga 
cattacttta 
aaatacggca 
ccgtacgtca 
tctgaagtgg 
aatttagtgc 
ctttttggac 
tctttcgatc 
gagagatgtg 
atgatgaata 
gataaacttg 
aagggt ttta 
tatgttttat 
aatagtaact 
caccgtgtaa 
tacatctcaa 
ttattacatt 
agggcccttt 
ctatttgaaa 
cgtaaatttt 
aatttgaaca 
agtaccagtt 
tttaaagcag 
tcttcatctc 
agtgat ttta 
ggct tagtgt 
ga tcgtccca 
ttgaataaac 
atattcaaca 
aggaggaaca 
ggtggtatca 
tggcgc tgct 
taaccaacag 
agccaatgat 
actccctgaa 
gcgagccgcg 
tct tgcccag 
gttggactct 
attgga tata 
aaaagaggat 
ccctatgccc 
cgaa ttgaga 
tttgagtagg 
agggagggca 
atgtct ttac 
tatagactac 
tcattt taaa 
tagtccagat 
aacttt tagg 
aaatctacca 
tcaagaccag 
a ttggacgaa 
gaaaatcaag 
tggagcacct 
atctttaaac 
tggacaagaa 
ggaaga tatt 
aaataaggtg 
tacaatggac 



caaaggttgc 
ggataaaaac 
gtgggtgcgg 
tggacaacaa 
tcccgcatct 
aaccatcggc 
ataggacata 
cttcaaaaga 
atttgtgtaa 
ctataaacca 
acccgttttc 
gtcgcatttc 
atcatattgt 
ttgcctcccc 
tagcgagagc 
atttgtcggg 
ctttagccag 
cagacattgc 
taatgacaaa 
caagtaatga 
aatccattat 
ctcgaatcgg 
agggtagtga 
acatagagaa 
ctgcaaccca 
cttcttcttc 
tagattatgt 
ttaaaatgtg 
cagccaacta 
tattagtttc 
agattattgg 
aagacaataa 
aagacagatt 
tgcgagaaca 
tcgggaagat 
cttggccaca 
tcatttttgt 
tactacgaca 
ggtttagata 
atacttttac 
acacaggagg 
caatctctta 
ctgagggcta 
gaattagccc 
caacatgatg 
gaatccattt 
tcggaatatt 
gacaaatgga 
aaggcagaag 
aataaaggtt 
ggttcacacg 
tccaattcat 
tacacgtaat 
tacctcaaga 
ggagaaatcc 
ccaaaacctt 
gacaaatatc 
aataatcgca 
acagggagtt 
catataggca 
acatctattt 



aatgatgcgc 
tgcattaaca 
tgatgtttct 
cattaaaatg 
agaatggtac 
cgagcaagtg 
ctatggaaca 
agagagtatc 
accttcaaaa 
agacaggtgg 
tgttgcacgt 
ttgtcaaggg 
ggatttgtgt 
atttttgagg 
ctgtaatcaa 
ttcaagtagg 
gttgaacacg 
acctgatttt 
cttgcctaaa 
aagtacatat 
aaataaccaa 
acaagaaaag 
tgaaatcata 
tgcattatct 
tccgccagtg 
ttcttctcag 
gtacaaattc 
tgatattctt 
tagaacctcc 
tgctatcaag 
tagtgaaaag 
tgtcttcgac 
gtgaggtgtt 
cgtactctgt 
gtgccaggag 
ttataggtat 
ggtttatcaa 
cgatgcatct 
aacggatggc 
aaattgataa 
ctggcaccga 
gcgttagtga 
gagggtttat 
ttaaggaact 
tatctacgcg 
ttagcgaccc 
taacagggcc 
aaaggactgg 
ctgtgtgtaa 
tcttgtgtgc 
aaccgtctct 
ttggtgtaaa 
ttgataaaaa 
agactgcttc 
aatcacctga 
cagcttcggc 
tatcggataa 
agagagccga 
tgtccggtgc 
tcatgaataa 
cttccgaacc 



gatagtaagg 
tttaaagttg 
gtcgtatcca 
tcccccaaag 
ttgtacattg 
aaaagggcga 
aagaacagta 
tccttcatta 
gaaactactc 
tggaataaaa 
ttagtgtctg 
aaatggtggc 
acagaagaaa 
aagcaccaga 
aaaaatggta 
gcgaagagga 
atgatggcgt 
gccaagttat 
tcccgtggac 
aaaacacctg 
gaaactgcta 
agctcttttc 
aagggtattt 
gacattaaca 
aagcaagaat 
acgtctaaca 
actgacgaaa 
gcatccttag 
ttacattcag 
gaaacaggag 
ggactatcaa 
tgatttgtct 
tgatgtacac 
tgacggaaag 
tgtttatagt 
catcaagaaa 
tcacaatgga 
aagcataggg 
ctcatcattg 
agttaaatat 
agaggacaat 
aattgtggat 
ccagaaaaaa 
tttccctgaa 
tgaagattta 
tgattctgga 
tgcgtgtaag 
aagacctgaa 
atcgacaaat 
gcctgtagct 
cattgtcgag 
ctggccctat 
aagatcgggg 
agccgttctg 
ggaggctgcg 
tgtggcctct 
tgtcctattg 
ccaggcagct 
gtacaggcaa 
caaaacgcct 
tctttctcta 



174000 
174060 
174120 
174180 
174240 
174300 
174360 
174420 
174480 
174540 
174600 
174660 
174720 
174780 
174840 
174900 
174960 
175020 
175080 
175140 
175200 
175260 
175320 
175380 
175440 
175500 
175560 
175620 
175680 
175740 
175800 
175860 
175920 
175980 
176040 
176100 
176160 
176220 
176280 
176340 
176400 
176460 
176520 
176580 
176640 
176700 
176760 
176820 
176880 
176940 
177000 
177060 
177120 
177180 
177240 
177300 
177350 
177420 
177480 
177540 
177600 
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acagattttc 
tcaaaggtcg 
aatatggata 
ccagattcta 
gttaaaacag 
acacagctaa 
gaggcactca 
tttaaggaag 
atgtctggaa 
ttcgggctct 
ttccctacca 
ctacttagaa 
agcgaaagag 
gatgcagtag 
ctaagtgcgc 
ttgtggaata 
agaggtgttt 
aatcggaacc 
aatccaataa 
agcggagcga 
ctgaagctat 
gccttcttcc 
atcagcctgt 
ttgtggagaa 
aatcccccaa 
tggttggtgt 
acagcatgat 
agattaatat 
acacattctt 
aggattagaa 
tatatttgta 
cttggaggtg 
gcacacgtca 
agtgttcgac 
gtcgttgatc 
ctcggcccca 
agcagaaaca 
aggagtattg 
gactcgcatc 
aatcattata 
tgcaacgtcc 
cttcctttac 
aaagggtttg 
atctaatttc 
ccctactcct 
cgtcttctga 
ttgctagaga 
aaggtgtgga 
tgtcatttga 
cgtccattgc 
ggtacattaa 
tgtatttctc 
tagctttctt 
aattgactag 
agcctgcact 
tattcttgct 
atgtacggtc 
tcttgtcctt 
taccatcttc 
tgggagggac 
aatcttttga 



aaaaccccac 
actctgctag 
taaagtcatc 
cacctaaccc 
cagttaatga 
cctttatgag 
tgtccgatac 
atgcatcaaa 
accccagaca 
gtttcatgtt 
ttccccattc 
aatacggttc 
ccaatgcagt 
caaatctcac 
agagatcgtg 
ggtttgtgtc 
tgaggaggaa 
acccagtttt 
acacaccaag 
ccccactgtt 
tgctgaaatt 
cattgaacgc 
gcgcaagatg 
atataaggac 
agaatttgac 
aattaagagg 
catgaacagt 
ttgtgtgttt 
tagggatgat 
aaataaaaga 
caattcccac 
aatccaatgt 
atgagggtaa 
atagtatttc 
aagagatcga 
gtgccagcgg 
tccttcatgt 
taggacctct 
atctgatcat 
accatgataa 
aggtttgtta 
gtgttggttg 
tttttaaaat 
agtaacgata 
cccttcacct 
attggatgaa 
attgacgcta 
agattcttgg 
aaaagaaaac 
tgttgctgct 
gtttcgcata 
agatgcttga 
taaaagtttg 
tgtatttaat 
gtatgctaca 
gttattgtta 
attactgtcc 
ctctgaaaac 
atcctcttca 
agccatttta 
caaagtgaac 



tgtcattgcc 
gtctacaaga 
aatagattgg 
ttgttcagtt 
tatatacgct 
cgagattgaa 
tagggagagg 
taccaattca 
gcccataaaa 
taaggtaatg 
aacacaatac 
ttcattcgaa 
gaaaaaattt 
cggacctgtg 
ttcaactaaa 
ataataaaaa 
gtggaggaag 
gaagacaaga 
accaaggaag 
cctgatgaga 
ttttcaatgt 
ataaagagca 
ttggttgacc 
gatagcgacg 
ctctcctttt 
gtggccggag 
gttctgggta 
ttgactaact 
gagaaatttg 
tggtataaaa 
tttacttctt 
tcattgtagc 
tttttgaaga 
ccttaatgtt 
cgccagtgtt 
gatcagtcct 
ccttatcgga 
ctccacgaat 
aattagcgac 
cgattagagc 
ggttgccaaa 
gatccagcta 
ttaaccctac 
cctcccacta 
cctcctctcc 
gaattgtaat 
gatttgttat 
tcaatataca 
gttcttttcc 
gctggggtgg 
ggcacaacag 
tttagcttat 
atcactactg 
gcat tatcaa 
ccaaaccttc 
ctcactccag 
ccgtatctta 
atgttgccgt 
tcgactgcct 
gaagaacaac 
tcttttggcc 



aatgtgacta 
ttcgacgctt 
gcaaatatgg 
gacactattg 
tctgttggaa 
aaactgcgaa 
gcattcgtag 
aaattgtttg 
ttggtacgcc 
cctccagaat 
catggtctat 
aagtcctggg 
ggtgtaaacg 
tatgttctca 
tttctccgtg 
tggctcagac 
aaaggcaaca 
gttcatccac 
aacaacttct 
aggtcaagca 
ttgggatcgc 
tcactactaa 
acctctatca 
aaaaactgag 
ccgattacgt 
eta ttgaaag 
ctgttgtgtc 
ttatttgtac 
taatgagtca 
ttacttgtat 
cttgatttcg 
ttcgaaggct 
tgaagattta 
tcctgcaata 
gttgaggatc 
tggggctgtg 
gagaccattc 
tgacattacc 
gacgcttct t 
aatgattgca 
ttccattttt 
gttagttctt 
atcgaaccgg 
cccctacttt 
ccct t tactc 
attttgtgtg 
tcaaaaggtc 
cactgctagc 
taccaacatg 
agggagaaac 
atgtagat tt 
attt tatgca 
cgcttagcaa 
aaatagggga 
ccccgcct tg 
aatacgcgtc 
tatta tcgta 
cttcacttct 
tatcaaattg 
tagaaatact 
aat ttaa tag 



agcggatgga 
ttgttaatgg 
tagaaaatgt 
tgtccagaga 
aatcttattg 
aggctgcagt 
agttcctatt 
ttcagaataa 
gttctgctga 
tcatgaactg 
atggtacatg 
ctcattttga 
atacgaggat 
ttttagatct 
aaattaagga 
atcaaagatg 
acctttcaca 
atcttctaag 
agaattcgtg 
agaagttgaa 
tcaagacagc 
aattgttatc 
ttttaaggag 
cgtcattctt 
tgatcgcctt 
taaggaattg 
caacattcct 
atttgctaat 
ggtaacaaga 
ttttattcaa 
tccttgatat 
gcgccgtctt 
atgtccttgc 
acagttggcc 
ttcaaagtca 
acggtagaga 
ttcatggcca 
tttgcccttc 
ccaacacgtg 
atggacaaga 
ctttgtttta 
ctgtagcacc 
atgaagtgta 
catctccatc 
gtattggtcg 
cttcattctt 
cataaaagtc 
gctactcatg 
agaagataca 
atcttgattt 
tccgtgctta 
gtacacttca 
gtgtttcctc 
tctgtgttcg 
tgtctggaga 
accgaaaaag 
tactcgacga 
tttctctgcc 
tatactgttt 
atccaaaaga 
tacaattggt 



gagcattttt 
tgttgcgaat 
gatcaaatta 
cgcaagtgta 
tcgtcctgca 
tgtatgtttt 
ttacgttagc 
gctatcttcc 
ggaaacacta 
tatatttaac 
tttaacccct 
ggaaatttta 
agattgtcta 
tgtacgtact 
aaactatctt 
ggaactaaca 
aagaaatcta 
aagaagagca 
aaggatctgg 
gaaaagtccc 
aagttcaaga 
gatgcaatta 
atgcagaatg 
aagagtaaga 
aacaggattc 
ttgcagagta 
tacaacatga 
gatgatttgt 
tacatttcaa 
acaaaacctt 
cgatcacatt 
caaatttgct 
tcgagaagta 
gaacagtaat 
tgttagagtt 
tgacaagatc 
cctttccaag 
tttgaattgg 
tgttgaatac 
ttgcaataat 
gatggaagtt 
atatacccag 
ccttgccgtg 
tctaccccct 
tcttcatcgt 
ttcacttgct 
ctccgtctag 
ttggaattga 
ctgctaggca 
ttagaaggac 
ccctttttat 
atggacaatt 
ttgtccatga 
atgattgatg 
tcaaacatac 
tggtttaatg 
attgct tcct 
tcacttccat 
aagacgtaaa 
agaactttat 
atgttcaaga 



177660 
177720 
177780 
177840 
177900 
177960 
178020 
178080 
178140 
178200 
178260 
178320 
178380 
178440 
178500 
178560 
178620 
178680 
178740 
178800 
178860 
178920 
178980 
179040 
179100 
179160 
179220 
179280 
179340 
179400 
179460 
179520 
179580 
179640 
179700 
179760 
179820 
179880 
179940 
180000 
180060 
180120 
180180 
180240 
180300 
180360 
180420 
180480 
180540 
180600 
180660 
180720 
180780 
180840 
180900 
180960 
181020 
181080 
181140 
181200 
181260 
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aaattgttaa 
aaattgcccc 
ttgcagattc 
ttgtttcagt 
ccactgatgc 
cacttgatgt 
acgtttgaca 
aatttgcttg 
ctcctcctcc 
cagatgaaga 
gagaaggagc 
acgcgtatgc 
atttctcatt 
ctaaaaatgc 
tctcggatgg 
attgaaggtt 
ccatagtctt 
caaccactgt 
taggtggagg 
gaggtggtgg 
tgtctctact 
gcggtggtgc 
aaaacattct 
tcatgattgg 
ttcgaaccct 
tgtgccgaga 
gagattgaac 
cctcctcttc 
cacctgctat 
ggctggattc 
cgggttcatt 
cttcctcttc 
gttctatttg 
tggcgttcac 
ctggcgaggg 
cttgttgtgc 
cattgtcatc 
cacccccttc 
catcatcgta 
gaggatcttt 
cctcctcctc 
agttactata 
caccttcagt 
tgtg ttaacc 
ttacctctat 
tggtttttgt 
ggaaggggtg 
accg t ttata 
atcccgatgt 
aattcaaatt 
cttcggggaa 
ctgagtgttt 
caacctacag 
cagaccaccc 
gctgatttat 
cgatgaggaa 
tatcgtagta 
gctctactcc 
acgtctatac 
gatt tattgt 
gctgatccat 



ccttcccacg 
gaaacaaatt 
tttactcatc 
ttgatgtttt 
cttggtcact 
tttaaaggat 
tggttgatta 
agtttgatta 
tcctcctcct 
catggaagtt 
aggagtcgat 
tgctgaaaaa 
tgttttcctg 
cttcatggtt 
ttttactggg 
ctttgttgtt 
gtgtaatctt 
actcaaaaag 
tggaggggga 
tggaggtgga 
ttcttctccc 
tatatcgttt 
aggtgctttt 
agggaacaca 
gggcgcattg 
ttcatcttca 
agactcttta 
ctctccctct 
tgaagagaag 
aactctatga 
tttttcttgt 
ctcttcctcc 
ttggttatca 
attttcaagt 
ggtgttcact 
cgctagcgcc 
ttcatattca 
ttctccctct 
catgtcatta 
atttcttcca 
ctcctcctca 
atcaacatta 
tatacgcaga 
gtggaaaatc 
agggtaattc 
tataactttc 
gttcagagta 
ttactatttt 
tgataattgc 
acctggatca 
cattcacacc 
tagaaagagg 
taaataattc 
cctcatccca 
gttcctgtat 
gaaaatatat 
gggagtatta 
ctccttgcaa 
aattcctttt 
ccatcctaca 
gcctccaggg 



tccatacata 
tctgccccta 
tgttttagtt 
tcaaaatcct 
atgtccgaaa 
atgaggttgt 
tcgtcgtctt 
ctataaaatg 
ccagcaatag 
tcatcctcgt 
ggtcttattt 
agagggttgg 
aacagctcgg 
aattctgcag 
ttttctggat 
tctagaagag 
tcttcggcct 
gatggattgt 
ggaagaagaa 
ggaggaggag 
atttcattgt 
agttggtcat 
ggtgaaggtt 
actgcatcct 
ttaaattgat 
tattcttcat 
gtttctacat 
tcttcatcct 
aaataatcca 
gagtgcacgc 
tctataggtt 
tcctgttctt 
tcttctttgt 
aaaaagttca 
atacttttcc 
aataacgcag 
tctccttctt 
tcatcttcct 
gtgacatctc 
tttctaactc 
tcctcgttca 
ttattgatat 
tttgatgccg 
aaacactaca 
tcagggtgta 
tctaaaatct 
agaccaaaaa 
agggtactag 
gcttatcgct 
caatatcaat 
agaagatata 
gtcaatcatt 
tcaagagagg 
atcatcttcc 
gtgacgagtc 
taaaggagga 
agatgatgaa 
gtgcgatagg 
cttccatgac 
acaataaaag 
aattgttatc 



gtttgttgat 
tagaacctaa 
tagatgaggt 
cttgggcttt 
gcatattgac 
ttaggaacgc 
ctttcagtat 
aagaatgctc 
actcgtcgcc 
ttaattcaga 
tgctagctct 
acatgatttg 
cagatttaat 
cgatgatatt 
cttggaacac 
tttttaccct 
ctttcccacc 
tgttgttgtt 
tattggtagg 
gaggcatcag 
ctggaggaaa 
attgttcctc 
cctttatctg 
cgcactcttg 
gcatagtact 
attcttcctc 
gctcttcaga 
cttcttcatc 
ttccatcatc 
tttcattttc 
ctttttcttc 
cctcttcttc 
tcttctgttc 
acatgggaag 
actggttttc 
ggtcagcctc 
cttcctcgtc 
cttcctcctc 
tcacaaattt 
ccgtgtcctc 
tttcctcctc 
cgtcctcgtt 
cttccatgat 
tt tactaagt 
tattctatag 
actaacggca 
taaaatatt t 
aaaacgttgt 
gcatttgtta 
aaggaactaa 
gctcataaca 
ttagcagaaa 
aggaaca ttt 
caagttgaac 
tcgcaccgac 
agaagttagt 
acctggttca 
gagtagaaag 
gttgaatgaa 
gataaaaaac 
tgaaactacg 



gtgtttctct 
ccccatttct 
agacaaatct 
gttggtgtct 
taatctagac 
cagtctttga 
att ta tcagt 
caccacctct 
ttcagcacat 
tatactttca 
ttcccccatt 
cttcctctga 
gtctgtcacc 
tttaaggcgc 
aact ttcacc 
tgtttttaag 
taatgtgttg 
tgaaaataag 
tttgagttga 
catcgaatct 
agtcacgata 
acctccacta 
ggggagaata 
tatatctgga 
tatttcttca 
ttcttcaa ta 
tttttgtgca 
ctct tcatta 
attacctcct 
tattgcatct 
ctcttcctcc 
atcctcttcc 
ttcacctgct 
aacttgccta 
aggta taatt 
atcttcttcc 
ttctagattt 
ttcctcttct 
actttgtttt 
atcttcatct 
atcctcttcg 
gtcgatatcc 
gaaatt ttag 
tcct ttcctg 
tttggtct tg 
agagcattga 
tttgattgtg 
cagaaccaat 
ttgcagctct 
atttgacccg 
acagaattct 
tggacaagta 
ctactcctca 
tttaaattct 
gagaaagaag 
ggaggagaaa 
aaaa tcaaaa 
aaagctgaag 
ggaggacgtc 
aatcgttcta 
gtgagggagg 



atgattgtag 
tccattatgt 
gatacatcat 
ccgaacagtg 
caagaaacag 
ccaagcttat 
tccatgtgca 
cctcctcctc 
actccaccag 
acctttttaa 
atataagttg 
aacaactcca 
gcagcagaat 
tccattactt 
agttcagagc 
attatagact 
ttaacctttc 
gactgatttg 
ggaggttgat 
tggtccatga 
gatggaacag 
ccaagaatag 
tctgtctctt 
gggggagatc 
tcctcgagtg 
tattccgtgt 
ggttcctcct 
tcattgccgc 
tcactcatag 
tcatccttct 
tcctgttctt 
tcttctacgg 
gatccgccca 
tttggcccta 
gtcgcatctt 
ccttcacctt 
ccgtcatatt 
tcttcatcat 
ttagaagatg 
tcgtaatcct 
tcttcataca 
aacactcccg 
tatacaatat 
aaaaaacgag 
aacgtgcttt 
cagttgagtg 
tgaatttcga 
catgaggttt 
tcttactgcc 
ttcccttcaa 
cccttccaag 
caaaaatgca 
acaacaacaa 
ataagaaaat 
attttaacaa 
aggttgtaaa 
caaagtacag 
aatatattga 
tggcttcagc 
ggcctgtgaa 
aaatactacg 



181320 
181380 
181440 
181500 
181560 
181620 
181680 
181740 
181800 
181860 
181920 
181980 
182040 
182100 
182160 
182220 
182280 
182340 
182400 
182460 
182520 
182580 
182640 
182700 
182760 
182820 
182880 
182940 
183000 
183060 
183120 
183180 
183240 
183300 
183360 
183420 
183480 
183540 
183600 
183660 
183720 
183780 
183840 
183900 
183960 
184020 
184080 
184140 
184200 
184260 
184320 
184380 
184440 
184500 
184560 
184620 
184680 
184740 
184800 
184860 
184920 
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gaaatctccc 
aatttttttg 
catacagtat 
aaactatgct 
taagacaata 
tttttgtatc 
cttttaaaag 
tgatatttgt 
cattactgtt 
cctctgtagg 
caatattatc 
ctaccaatgg 
atctcgttac 
aaatgatgga 
cttcagaaaa 
ttttaactgc 
tgattagtag 
cgctcgtcta 
caaaaaaa tt 
ggtatggtac 
ataatgagga 
ataatgatag 
acggctttgg 
tgttatgtaa 
ctacaaatcc 
ctggttccga 
ctactggcga 
cagttggatc 
aaacaggatc 
aagtcaatca 
ctgtggaagc 
aaatatcttc 
agttttatta 
tcttcctcca 
gaccgctggg 
agattgcacc 
acatgtat t t 
tagggaattt 
agtttagagg 
tcgagccaat 
tactgt ttct 
cacccttaat 
ttctgggtca 
ggtggaccgc 
gtacattaac 
gggaagggta 
ttttcaaaaa 
gtcaagttcg 
gagctacaga 
acgcacccag 
ccatctcaaa 
cactcggtct 
accagaaacg 
tttctggagc 
attttattgg 
atccttactg 
agggttcctt 
catctttaaa 
aggggtcttt 
ttgcttggcc 
ctctgattcc 



gcttcttctt 
tttgtacctt 
gaagaagata 
tcaagat ttt 
aagattctat 
catttcgtag 
gttattggac 
catgacgttg 
gttcttaacc 
ggaagcaacg 
tttaataata 
atattctggt 
cactgaaaaa 
aaaata taga 
agaaacagac 
cctgagtaat 
agatgatgtg 
ctactcttct 
catgacatca 
tgcttttgta 
tctacacact 
attaacagta 
agggtctgat 
ttgtgacaac 
aactgaaatt 
gcctgtaaaa 
gcacgatatt 
acccattgta 
tcctctggca 
aaagggaggg 
tgcatcactt 
tgttacatgt 
ccactaaatt 
tctcaaaaac 
tcggtctgat 
agaaatgaca 
ctggagactg 
tactctgcct 
gtggaccgct 
gtcagattgc 
ggaacatgta 
ccactagaga 
ctcgagttta 
tgggtcgagc 
ttttacgtgt 
taacctacta 
tttttctggg 
gagggtggac 
aacgtacatt 
aaagggaagg 
aacttttcaa 
aatgtcaagt 
tttgttgctc 
acgt taaaat 
ttctatccat 
aaatttttca 
ggtgtagctg 
tcttcgc ttc 
cctacaggtt 
ttcttcacat 
gtttgca tct 



cttcttcttc 
ctaattgtac 
ttaaccgttt 
ctaaaattaa 
aaaaatcaac 
tcgacgggta 
atggctatca 
ggagtggcat 
acctgagcaa 
ctcgtgggtt 
tatgccgtga 
tgtacaacca 
gataaacctg 
tcggacttgt 
aaattgaaaa 
ggtcatgaca 
aaggacaaaa 
gcaacctcta 
acacagttgg 
ctacccacgg 
tggaacccac 
gaatgtgcta 
gataatagac 
cacaaggact 
caagtagatg 
tcttcccaag 
tcccatgagc 
ttcaagccag 
ttgaactgcc 
atattatata 
tcttcctcgg 
gaaggagaaa 
tattggtctt 
ttttcaaaaa 
gtcaagtttg 
gagctccaga 
acacgtccag 
ccaccttaaa 
gggtcggtct 
accagaaatg 
tttctgggca 
attttactct 
gagggtggac 
caatgtcaga 
ttctggaaca 
tttacactag 
tcgctcgagt 
ctctgggtcg 
aacttttacg 
gtgtaaccta 
aaatttttct 
tcggagggtg 
cagaaacgat 
aagtagggag 
ctaaaggtgt 
aatcatccaa 
ttttcgtgtc 
tactcctctt 
cgacaataag 
tcctgtcgga 
ctctaaatgc 



aacgtcatca 
ttctaaaact 
aaggtataac 
tagattaact 
atttatttgt 
gagatggtct 
catacatttt 
tttttgttgc 
acatggctgt 
ttatgatttt 
cagacgttta 
atgtggtttc 
ttcaatttgt 
ctcctaagaa 
gtgattataa 
gtaacaagag 
gttatgtctt 
aatacgttag 
ggagtaagct 
gatacgtgat 
cctcttcttc 
aaacggaccc 
gtgcaaaaga 
tgcttaaggc 
ttgctgcaaa 
tgacatctgc 
ccgtgaagct 
tttatggtac 
cctgcaccga 
gagatatggt 
actcttcttc 
aaataaagaa 
cttggtactg 
tttttctggg 
gagggtggac 
aacgtacatt 
aaagggtagg 
aacttttcaa 
aatgtcaagt 
acagagctcc 
cggacgcacc 
tcctccatct 
cgctgggtcg 
ttacaccaga 
tgtgtttctg 
ggaattttac 
ttagagggtg 
acccgatgtc 
tgtttctgga 
ctgttttcgc 
gggtcgctcg 
gaccgctggg 
aacacacatt 
aataatacat 
tagaggaaat 
agagaataca 
cctggcgaga 
catagacctg 
tcttcctata 
tgcggcttta 
tttccagatc 



tccacttctt 
gtatgcgatt 
ataatacata 
acatgtatac 
atcatttata 
gtagaaagaa 
cttattgttt 
tggtgtatga 
aggggattat 
aaattttata 
tagaagatgc 
ttcgacttta 
tagaggttta 
cgtgggggaa 
aaagggtaag 
gattataggg 
taagagattg 
aaaattttcc 
cgtgtatcct 
caacaaagca 
tgtgctcgtc 
aacacataga 
agaaggttat 
tcctctaatt 
acgtgtttta 
tgctcatcaa 
atcagatacg 
atctttagta 
caaggctgat 
ggggtatctt 
gtggttgaca 
aattgtgtaa 
tttacactag 
tcgctcgagt 
cgctgggtcg 
aactttatac 
gcatagtcta 
aaatttttct 
tcggagggtg 
agaaacgtac 
cagaaaggtg 
caaaaacttt 
gccgaatgtc 
aacaccagag 
gagactgacg 
tcttcctcca 
gaccgctggg 
agattacacc 
atagacgttt 
tagagaattt 
agtttagggg 
tcgagccaat 
tctggagcag 
aacattatat 
actttcattg 
atctttccag 
cggtacttat 
aataaggcta 
gatgcgttcc 
ctcccactag 
aatttttctc 



cgttttcttc 
ttgtaaaaca 
tatctgaaga 
aaggtataag 
aaattccata 
gatgctgctc 
tccctattga 
caaacggcaa 
ctttctatgt 
aactttgtca 
aagcggccgt 
caagaagcta 
gtccccagaa 
tatattttac 
aaggttggtc 
ccaagggatc 
agcaaagatc 
cct ttcagag 
caccctatac 
tacggaatgg 
ccagactcta 
atcggcatct 
gtagaaatgt 
acagagtatt 
ttccctgccc 
ctagacggag 
ggtgactatg 
aatcttccag 
ggaatatatc 
aacgccaacc 
actggtaaca 
tea ta tgtgt 
ggaattttac 
ttagagggtg 
agccaatgtc 
tgtttctgga 
ctgttttcac 
gggtcgctcg 
gacctctggg 
attaactcta 
aggctgttaa 
tcaaaaattt 
aag t tcggag 
ctccagaaac 
cacccagaaa 
tctcaaaaac 
tcggtctgat 
agaaatagca 
ctggtcatgg 
tactcttcct 
gtggaccgct 
gtcagattac 
tagttagtta 
t t tcaattcc 
tagtataaga 
tataacttag 
tattgcttcc 
tggatgaaat 
gtatcaagtc 
ttttccatat 
tgttaacatt 



184980 
185040 
185100 
185160 
185220 
185280 
185340 
185400 
185460 
185520 
185580 
185640 
185700 
185760 
185820 
185880 
185940 
186000 
186060 
186120 
186180 
186240 
186300 
186360 
186420 
186480 
186540 
186600 
186660 
186720 
186780 
186840 
186900 
186960 
187020 
187080 
187140 
187200 
187260 
187320 
187380 
187440 
187500 
187560 
187620 
187680 
187740 
187800 
187860 
187920 
187980 
188040 
188100 
188160 
188220 
188280 
188340 
188400 
188460 
188520 
188580 
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ttccactttg 
agataagaga 
tcttcgtgct 
tgtattttca 
ccatcttgcc 
ggtggctgca 
gacatatttt 
ttcttcgtcc 
tccccctctt 
attgacattt 
tttccagtac 
gatggtaaaa 
taaagaatgc 
attagttttc 
tgaaaatcgg 
aacttgtcta 
cttgttacat 
ccctaaaatt 
agtcatatcc 
tgtgacactt 
tgtcaaagga 
gatggctgct 
taccttgggg 
tatagttgga 
ggacccttca 
taggtgagga 
agattgcttc 
ttttgctcgt 
tgaatcaatt 
agctgctgat 
tggtgcttgt 
ccaagagggg 
gacggcatct 
cgttttattg 
ttcaggtgta 
tcgtttttca 
catgattttt 
aatgtcttcg 
aatttcaaat 
ggtgtcgact 
aacaggctcg 
cttgtcaccc 
attgccagta 
aaacttaaaa 
atctttggct 
aagagtggag 
tacaaccctt 
gcctgcaccg 
tttttcaaat 
gcctctgaaa 
agtatcctcc 
gatactatcg 
tctagaatgc 
agccgacata 
cgtatgattt 
tgtgatggaa 
tctacgttta 
ccggttctag 
aagagggtgg 
gttaaaggta 
cgtcaagcag 



acataattat 
ggtaagggcc 
ctaacagcat 
tctgttggaa 
aatagtaaat 
gccgcgtaca 
gcagcttcaa 
tctctttttt 
aatagacccg 
tctgctcctt 
gtctctagtt 
gtgtctagag 
atatctttca 
tgtgttagcc 
gcagctgagt 
ttattgacaa 
ctcttgtatt 
ccgtctatcg 
gtagttttaa 
tcggatggca 
tttggtttag 
gataatagta 
gggtagaatc 
acaacataag 
ataaagtctt 
ggcatggcag 
catcttagaa 
cccattgcca 
acccatgaag 
ctaaacacac 
ttcaaaaaag 
tcaatta tat 
ttagataggt 
aaccacgtgg 
ttttgtaaat 
tcggttttag 
gtggtgctgg 
gt taaaggga 
atacaatgga 
atattctgga 
gtgccatgga 
atattatccg 
ttgcgggtct 
cgtataatca 
ctctagatcc 
gggaaaa ttt 
gcaagattga 
gtctcaagga 
gtatcaacca 
catgcatgag 
tcggctgcct 
agtatgctga 
ctatggtaga 
ggcgagta tc 
actctgcgat 
agcacttttt 
acgtactaaa 
acattat ttc 
cgtacgacat 
acaacaaggc 
aaaaata tat 



tcttcaacat 
tccctggtac 
tccttaaagc 
catttcccca 
atactgctct 
acaggtagaa 
acacataccc 
gtttcatttt 
aatctgcaga 
tatta ttacc 
cctgttctcc 
attttgttat 
aaaacacttg 
ctccagagca 
cggg^tcaag 
gaaaatactc 
cacg ttgaag 
cctttggaag 
ttcgttctct 
tctgcagatg 
cgtttctggt 
ccatgatttc 
cttcctgtat 
a ttgtccatc 
ttatcctttt 
caaagggatc 
tatcacaaag 
gatcggccac 
gttcattagt 
cgtccataat 
taaagtcaca 
gctctctaac 
gtcctttggc 
gtccgaa tac 
ttgaaatggc 
tt tctgctac 
taaagtagtt 
tatttatttg 
gtacatggaa 
cgaaaactct 
cgccgaggag 
tctggcccac 
t ttcatcaac 
attcaaggca 
tctgagtatg 
ggacaaatct 
cgata ttgtc 
agccatccgc 
taataggttc 
atactctcag 
cgacagggat 
ttcagataac 
cagagccgtt 
tggagacagc 
tgaaatt ttg 
gagctatgtt 
agcctaccgt 
cggaggatgg 
ctctcaagtc 
tagcaacgtt 
cccgtttgga 



gttatccact 
aaaactagtg 
ccaaaagctt 
taagttatgt 
gtcgaaattg 
agaaagggca 
tttcttatcc 
tccttcaaaa 
gagtatctca 
accaccacca 
catagacggt 
gaaaccagat 
aagaattaac 
taaaaataat 
aacgcgggta 
caaagtttct 
agcttgaagg 
atatgggtgg 
ttccctctcg 
gggattgttc 
cctggacagc 
cctccctaaa 
tacagaaacc 
ttcttcactg 
ttcaatcatg 
taaacccctc 
agtatatcct 
cgttgaattg 
atcacctcta 
attctcaaag 
tagttcttcc 
accttctata 
taatcttagt 
gtcagaagaa 
agcaactagc 
taatgccaga 
tttgtacacg 
ttggtactgt 
gaaggagaca 
gcttgtgtag 
gcacagtacg 
tgctccgaat 
aacatctttg 
gagtcacaaa 
ctttctagct 
ttgggtgtgc 
cttctggaaa 
caggaacaac 
aattttggaa 
gacgaacgcg 
gatccagctc 
gcttgggttt 
attgctcact 
ttcaagcagt 
ttctgtgata 
aatgccatgg 
tcatgggtgg 
aagaagaact 
atcaatcatc 
acatctggcc 
atacttgaaa 



aacatcttgg 
tttgacgcat 
gcaaaggata 
gtaggaaact 
ttcgttattg 
gcaaaccctt 
actaggtgca 
cctatatcac 
accatcaact 
cctaaagcat 
ttactttgtt 
tcacaacgct 
ctgcttttca 
gcattccata 
acagagtagt 
ggtgaaacaa 
gcggtaggtg 
tccctaaaga 
acccattctt 
acatatttat 
actgccctgt 
agagcttcat 
atgttggtta 
actgtaactc 
tcagcagcgc 
tctttcattc 
agaaatagag 
tttctgttgg 
gtggaataaa 
gcattcaaaa 
gctgcagttt 
tctctgtaga 
cccgaattaa 
ataacaacct 
ccttccgata 
acatagtcct 
ctgcaatatc 
tttcgtcaag 
tcgctgaaag 
ttaatgtgaa 
cacaggacat 
ccaacaagat 
acaacaattc 
acaagtctag 
tcatgggttc 
tctttgaggt 
tgtgcccatc 
ccatggaagc 
gcgacataaa 
cagtcgttgt 
atactctttc 
cttccctgtt 
tttacgtgta 
ttgtctatac 
ctgaaaattc 
ttaacgtatc 
tggatcaagc 
acccct.caco 
ttgcatcacc 
tggatagtat 
ataaagcagg 



atgaaaatgc 
caatcacatt 
ataatgatga 
tcaaatctcc 
aaggattgga 
tagatacatg 
tgttacgcac 
cacctaaccc 
cccctaaccc 
cgagtactgc 
cctttaaact 
ggcagacgaa 
ctgcagagct 
gcacatacat 
tggagaacat 
tatcgttaaa 
tgcgtttagg 
gattctcagc 
tttccctttc 
ccagtttagt 
actctgcgtc 
ggaaagaacc 
aaacagtcga 
catcactaaa 
gtccgtactc 
tagaggcgat 
caatggctct 
ccaacatttc 
gaactagatt 
ggttcaataa 
cggtcaatct 
cactcctcat 
aaaattctgt 
tcctttgttt 
cataatttag 
ttgactctgt 
ttcgacctta 
gctgaaaaat 
gcgctcagaa 
gagtatccgt 
ttccgcccaa 
taagga tacc 
aacaaagaac 
cgttctcaat 
tgatccagca 
gcttcaaaat 
caagtgcgcc 
aatgttgttg 
gtcagcatac 
gcctctgagg 
ctctttcggg 
tgcagccgtc 
cacaatgctc 
cgtatt tgtt 
gtctgtagaa 
cgtgctgggt 
atccgtcgca 
tgaccacatc 
ttctagaatg 
caggtctgtt 
gtatggtgta 



188640 
188700 
188760 
188820 
188880 
188940 
189000 
189060 
189120 
189180 
189240 
189300 
189360 
189420 
189480 
189540 
189600 
189660 
189720 
189780 
189840 
189900 
189960 
190020 
190080 
190140 
190200 
190260 
190320 
190380 
190440 
190500 
190560 
190620 
190680 
190740 
190800 
190860 
190920 
190980 
191040 
191100 
191160 
191220 
191280 
191340 
191400 
191460 
191520 
191580 
191640 
191700 
191760 
191820 
191880 
191940 
192000 
192060 
192120 
192180 
192240 
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ataaacattg 
tttaattgca 
caaaagggga 
gatattgcca 
gacaggcaag 
catgatttca 
tctcgctgta 
gacccagaat 
aatatcgatg 
aagaaccaaa 
ttccagatgg 
aaaactgcct 
tgcactgcaa 
caaaaggaat 
ctcaatgctc 
gatatgcggc 
atggatgaac 
aatataaatt 
tttagtgtta 
taatgaaagg 
caacaccttc 
agtacgtttt 
cattataaca 
cccgtacacg 
ctcccacagc 
ccagattcgt 
acattcgtcc 
ttatcaccaa 
gttgtctagc 
ctgttgatct 
caaacaataa 
tttgtgcaac 
cgaacaagtt 
ggattgaaac 
atgtgtcaaa 
ttcttgcaaa 
ttcctccccc 
tgtatttaga 
ttcccttccc 
caataaccgc 
tcaattctta 
tcgatttcag 
ccatttgtat 
tcagaagaat 
ataccagcga 
attttgaacg 
gataacatgt 
atagtattgt 
cgctcctctt 
acatcgtata 
acaggattga 
ttgatttctt 
taggcaattg 
tcatggtgag 
ggctcatctc 
atatcgtccc 
tttagtatat 
ccttcaacaa 
aagcttagca 
tcatcattta 
aaatcgtacc 



ccaagcacaa 
acgctttaca 
gcgcagatca 
tgaaaaagca 
gtctgactac 
caagatacgt 
ttcttcccca 
cagtccccga 
aacctaatct 
aatgggacaa 
cagacgctat 
cttcttcttc 
agcagctcca 
ggagtagtgc 
agttctctga 
catccagcct 
ttcttatcag 
atatatcatg 
tgacagtgca 
aagattaggt 
ctcaaagtta 
ttccttagct 
gttggggagt 
gttaccttct 
tgctctacct 
tgctgcacta 
ttgccaagtg 
ttcctttgag 
tagttgtgaa 
ctgtaggctt 
aatattggtc 
aacttcatcc 
ttcatcagca 
ttttccatct 
ttcctgatct 
atgctctctc 
tcctcctcca 
gtctaattta 
tacgcccgag 
aaaaacaatg 
tctaaaaaac 
gcatgtgaat 
catgactaat 
cactatttct 
acgtggattt 
ataaacttaa 
gtttatcgtt 
acttgaaaga 
ttaaaatttc 
ctttttccat 
aagcgtcttc 
gccttacttg 
cacctaagcg 
cagctcccaa 
cgttagacga 
ttgtatcgcc 
gggtgtgttt 
tcacattaaa 
cgtcattaac 
tggaacgaaa 
ttgtaaattc 



catcagccgt 
cattctacct 
aactgtcaat 
ggggtcgggg 
ttcattccct 
catcttctct 
ttctaatgtt 
cactcgattc 
ttctgctcta 
actcaccacc 
cattaagaac 
ctcagcgcct 
aagtattcgc 
agtaaacaat 
agaactgttc 
gatcgcctca 
agctaatgct 
caattttatt 
accattatat 
gcacaatcta 
tagaagaaac 
gtgcttgcag 
aaactacaaa 
actacaaatg 
tgtttttgta 
ataacattaa 
cgcatatcac 
taccgggcaa 
gaaggtgttg 
tcatccccat 
aagtctatta 
atcgtttcct 
attgcctgtc 
gttattttat 
gctaatttgg 
gtttgacgtt 
ttattattat 
tatttcttac 
atagcaaaga 
tatattcctc 
aaacagattg 
tgtaatgaga 
atatatagga 
tagatttgat 
gcggttaaat 
atcaat taaa 
tattggtaca 
aaatgatgta 
agttatgaaa 
aaccactggt 
gtaaagatgt 
atcctctata 
atactttaat 
ttgagcctcc 
cattatttac 
tctttttata 
atatctagat 
gaatatgact 
tacgtcacta 
caattttgcc 
tcca tcgtca 



ccagcaagag 
tcaattaagg 
gtttttgata 
aagattcttg 
agtagtgaag 
tcaacaccca 
ctgaactccc 
cactttctgc 
actacaagtc 
agagcgttct 
gtttctggaa 
aactttttcc 
aaattcattg 
gggaacagag 
gagctgctct 
tctgaattct 
tcttagatag 
tatacctttt 
tattatctgt 
ttgtcataaa 
ttgtttcatc 
agcttatcca 
aacagtcctc 
aaaactctat 
gcacgggttc 
atgtcctctg 
gatcactgat 
ggtaatcagg 
ttgtcaacat 
tttcatcaga 
cgagttgttc 
tgacttcgtc 
cgacatcggg 
cctttattaa 
tgaagaattc 
gaagcgctgt 
tatcgtcgtc 
tcttttttgt 
agagggcaat 
cattgtttga 
aaatctatgt 
gaggccgaat 
tcatttctct 
aatatcacta 
tctaacgtga 
tccccttgag 
ctaaatgaag 
tgataaaaac 
tcttccctgg 
attcctttct 
tcaaaagtca 
tattttatag 
tggtcactct 
actgaagcga 
tcctccttaa 
eg t t taacag 
t tacatacaa 
acgcgttctc 
cgagacacaa 
gttattttaa 
ccaccaattc 



aacaatccaa 
gctgtgaggc 
attttgtcgc 
gactgctcac 
cggaatacaa 
tcaacgacga 
ctatcagctt 
tcatgatgtg 
agctagaatt 
tcaatatcga 
gcggcttcct 
aaatcttcag 
gagaatctat 
gagtagaaaa 
acaaattgat 
tgagcaacta 
ggtttttatg 
tttgtatgtg 
aataggtctt 
aatgtccagg 
atttttatta 
atagtctgat 
tgttttgttt 
ccgggctttt 
taatccaact 
tttcatgttg 
aaaacctcct 
tgtaggttga 
caccatttct 
aggatcaact 
gtaagtgata 
atggttcgtc 
gacgtttatg 
ccatacgttt 
atcctcagct 
ctcattgata 
agttttttca 
tttggcattt 
agaagctcct 
aagtgccatt 
ataaatcatc 
cctcggatgg 
ttagataatc 
cactttttgg 
gagttttggc 
cagctccagg 
ttgaagagtc 
caggtatata 
gttccatgat 
caaaaattgc 
ttacattagt 
aggcattttt 
ttcttgtgat 
tttcagcttc 
aagcagtgat 
cgtatgttgc 
tatcaatatc 
ctgttcgatt 
gaatcactgc 
aatctcccaa 
tcaaattgta 



cggaaggaac 
acttggggca 
atctcatatg 
tagcatgatt 
gaagagaatc 
actagtcaat 
gagaaatatt 
gcagcggcca 
gttgcttagc 
caggatcaat 
agatgggagt 
tggtgctgaa 
gcagcatgta 
ttatgacgga 
catcgaggag 
cgtcaacgcc 
aaataaaaca 
atacaatatt 
ggtaaatttt 
tcaatttctt 
gcaataaaca 
gcctttttaa 
gtagcgtata 
tcaatacgtc 
tgtagagaat 
acatgtacag 
cgtatattat 
ggaggttgtt 
gcgtctgggt 
gttactgtac 
ttggtagatt 
cctatgagat 
tcaccttcag 
tctgatttgt 
ttttcaagag 
acatcaactg 
tcatcgtcag 
gaatgtagag 
attaccaaaa 
tttaaaccct 
actcgcgttt 
gttgattatg 
acctatgtta 
tctgtttaca 
tgaagaatct 
gggagtgatt 
ggtaaaggca 
tcttctccac 
ccgtctattc 
ttcacagata 
taaatggcta 
tgctaccgaa 
ccaagaattg 
agtcacagca 
tgcaaacaat 
agattctccg 
tttactcttc 
aaatgcttca 
atctacatcc 
gggaatggag 
cgtagcaaca 



192300 
192360 
192420 
192480 
192540 
192600 
192660 
192720 
192780 
192840 
192900 
192960 
193020 
193080 
193140 
193200 
193260 
193320 
193380 
193440 
193500 
193560 
193620 
193680 
193740 
193800 
193860 
193920 
193980 
194040 
194100 
194160 
194220 
194280 
194340 
194400 
194460 
194520 
194580 
194640 
194700 
194760 
194820 
194880 
194940 
195000 
195060 
195120 
195180 
195240 
195300 
195360 
195420 
195480 
195540 
195600 
195660 
195720 
195780 
195840 
195900 
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ctacaccgcc 
attgggggcc 
gttattattt 
tttatgtctt 
ccatgtctta 
aattcagata 
ctctcatcgc 
catcaacagg 
gtttagaagt 
ttaacgtaga 
tcctggaact 
attacctcct 
gtagagggag 
tacccaaggg 
aatttctcac 
caaagaacat 
tggttcatta 
agagctaaat 
aacatgggaa 
ctcttgtggg 
tacaccatac 
gtttttaatg 
atgtgtaaat 
cacaaatgtt 
acaacatcca 
ggcctattct 
gtggtcttaa 
gcccatgcgc 
ttaatctggc 
ggtttcttac 
gtgatagtag 
ctcattgaag 
tttacgtctc 
agaacggatg 
aagtcattcc 
tttccaaatt 
tccaaaagcc 
tcacaaggaa 
taactgttgg 
atgcctctaa 
aattggcccg 
gccttagcga 
gacaccaat t 
cctccaccac 
aaaacactta 
gtattttttg 
atacacatgc 
ttctttaagc 
ccatcagaag 
gatgcgtcat 
gcagcttctg 
atactgtcaa 
tcgacagcta 
gaaagatcat 
gcggatgtat 
tcccccaggt 
tcggaaaacc 
gatgtattcc 
ttgtcttcaa 
ca ttcatcta 
tea tcaataa 



ctacttgacc 
gataccaatg 
gtctaacatc 
ctaggcgaga 
cagtgttctt 
gattagaaac 
ttatagttgc 
tgtaaaaaac 
attttcaacc 
cctttcttcg 
ggaactgggg 
ctatttttgg 
ttttctgtgt 
tagccattgc 
gtggtcaata 
gttttacaca 
caatgtactt 
tggtagagtg 
cagatagtga 
atgaagagga 
aggctaacat 
aggaaagtct 
ggtggcaaga 
ttgttagtga 
tatcagga tt 
ttgtggcagg 
caaagggcat 
cttgtctaga 
acatttgaag 
ggcgcaattg 
cagaagaaga 
tgtacaaatg 
tcaattctag 
atacgtcact 
atgctttctt 
tgatacatct 
cgataatttt 
aaacaataag 
acatcataat 
tcatgacgct 
gaagcatatg 
tttcggtaat 
gccttctttt 
tagagttaga 
aaaagtttgc 
atgaattaga 
gaataaagga 
aagaaaggac 
taaatccagg 
aacgttcgta 
ggtcctttat 
tattaatatt 
caaagtgttg 
cagaaatttt 
taccatcaat 
attttttaat 
tttcagtggc 
atttcaaaag 
ttttcatatt 
tatcttcttt 
tggattcaaa 



agtttcttgt 
tacaagattt 
tattccttgt 
tctcacatcc 
caacaattcg 
accacgtctt 
gccattagac 
atacaaaaga 
ttttattgtt 
tcttcttctt 
ctcggtaact 
tatcctgagt 
tacccttata 
tcaagtgatg 
aaggaaacac 
gtttgggcac 
gacaatatca 
tctaagagca 
agaaacttga 
caaaaacaca 
tacagcaaca 
cctcttacat 
atacatcaaa 
agaaccaact 
tggtaaactc 
agagtgcatc 
ttctacaaat 
acgagcaata 
aggtgatggt 
ttgttcctca 
agaagattct 
gaaaaaggtg 
gaccttgttt 
aattctatcc 
gtgatgatta 
ttccatcatt 
cttaacatcc 
tcttttagac 
aggagacaca 
gttttggggt 
tttggttata 
aatggaagaa 
aggcgtatcg 
agcaaaggaa 
aatatcactg 
ggaagagtcg 
tccagaattc 
attttcgcag 
aaaactatat 
catgttcata 
tatggacgat 
cgacagagta 
tttcttggga 
tccagtgtaa 
aactgacttt 
ttcgagcatg 
agggttgcag 
gccattgttt 
ctgcttattt 
acaatcgttc 
tgattcgtta 



tgaatttttt 
tgcgttagtt 
ttttcaattg 
agttctgcat 
tttatgtgtt 
tctacaatt t 
atgctgttgc 
aataataaaa 
ctggggcgaa 
actcctactt 
ggagaagttg 
acaatttttg 
gcctgtttca 
ctccaccccg 
aaaaaatat t 
agataatcag 
gaatggtcag 
tgcaacggtt 
ttctcatccg 
gggtatttca 
taatcaacac 
tcagtatact 
tccctaacat 
cctattaatt 
ataccaatac 
ttgatgttaa 
aaaactgaat 
ttcatagcca 
aatttattgc 
gcatcacaac 
ccgtaatttg 
ttagtattct 
ctgatcaagt 
ttctttttgt 
aattcgtgtt 
aaagatgaat 
tcattcttat 
atagcttcgt 
gtaatgctgc 
gaagtcatat 
gaactggaca 
gatccgtcac 
acattgaaac 
aat tgtgaac 
ctgttatcgt 
tccttttcgt 
ttagccgaca 
tcagaagtgg 
tttgacggtt 
aagtccagat 
aacttgcgag 
gggcccagaa 
gggggaagta 
acgcaatggt 
ttaacgcccc 
tcctgggcag 
actaaaacag 
ttctgtatta 
ttcttgttct 
caggaatgaa 
tcgcatattt 



cgtctatatc 
ttcgtttagc 
cctttaaaac 
ccttcatgta 
tggttatggg 
tatcaaggat 
tacttgcttt 
atactttagg 
gat tatatac 
ctttttctcc 
tgttggtggt 
gttttggttt 
ggaacaaggg 
tgcttttccc 
catcaattct 
ggttaagtct 
atttttgaga 
tat tcacaac 
caatatcctc 
gattactaac 
cagttttatc 
ggagagccat 
cca t tgaacc 
ccccttcctt 
aattgtcaaa 
aatctctccc 
tgagcatgct 
ttgtattcaa 
acatggactc 
tattgtcttc 
t tgcagcctt 
cctc taaacg 
ttttcattaa 
tgtcactgga 
gacgctgttt 
ggatatactt 
tctcaaagac 
cgacaaagaa 
ttt t tgcgcc 
tctttaataa 
catcat tctt 
tgttatgttt 
aagcctcttt 
tttgttgcct 
ctgttttagg 
cagaaagcaa 
tcttatcctt 
aagttgacag 
tcttaaccac 
ctcta tccac 
aaa tagtctt 
gagccttctc 
aaactgggta 
acgcaatcaa 
cgttagtcga 
acaaagatga 
gcgaaccgtc 
tgt tcataat 
tgctcataaa 
gattattaac 
tggatccaaa 



tgtgcaagat 
caagtaatca 
acaatccttt 
aactctgtct 
gtgtatcaac 
gaggatcaac 
tggtatacaa 
ggaagaaaat 
attcaatctt 
tcctttctgg 
agtgacagat 
taattgtact 
taggtgtcta 
ccttatatac 
tttatttgat 
caaaccccat 
agcattagga 
cattgccaga 
tgtatttgcc 
catcaattgg 
actctttagt 
ttcaaacaac 
ccaggcatca 
atcatatcct 
attgatggct 
aaagaatctt 
atttagttga 
agatttagga 
gtaatttgct 
ttctacttca 
agcacaaaga 
ggaaactgat 
aactgatgga 
atgcccacta 
tattccatgc 
cattcctcta 
aaatgttgta 
tgagaagatg 
ctgttggaaa 
actggaggga 
ttgggggatg 
ttggagtgca 
attactgtct 
attttctcca 
taaagaagac 
atttttatcc 
ctcaatacgc 
aaccactgaa 
atcgaataaa 
taatagtttt 
gatgttacga 
ggcagcatca 
gtttgttttt 
tacaccaggg 
caggcaatta 
cgtgatcatc 
gcgatcaaat 
cgcatcaact 
tttctcattc 
ctggcagttt 
tttacctaat 



195960 
196020 
196080 
196140 
196200 
196260 
196320 
196380 
196440 
196500 
196560 
196620 
196680 
196740 
196800 
196860 
196920 
196980 
197040 
197100 
197160 
197220 
197280 
197340 
197400 
197460 
197520 
197580 
197640 
197700 
197760 
197820 
197880 
197940 
198000 
198060 
198120 
198180 
198240 
198300 
198360 
198420 
198480 
198540 
198600 
198660 
198720 
198780 
198840 
198900 
198960 
199020 
199080 
199140 
199200 
199260 
199320 
199380 
199440 
199500 
199560 
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atgcacttca 
ttcttacagt 
ttaatagaga 
aaggtagttg 
gaaatattga 
tcaatttcct 
gtatcattac 
tcttggtcgc 
atcttagagt 
atttctcccc 
tcgtcttcgc 
atcacgatac 
tccccagcat 
ttacacaaat 
gcagtttttg 
ttctctccat 
aaaagataca 
cattttctta 
ttcttattac 
gaggaggagg 
attgataaaa 
ctgaatgctg 
tcgttctcat 
tcctgatctt 
ttcatataag 
gacgaagcta 
ttttctgcct 
cgtcctccta 
caggcagtgt 
gcatttccca 
gtaatacaaa 
ttagctaaaa 
gacaaaaaat 
aacattgaag 
tcgatcgcct 
aagaatatgc 
ttaaagat tt 
gaattatcca 
atggccagac 
atcctcttag 
tcgtcatcat 
tggtccttat 
aaaaacacgg 
ca tt tcttag 
gatttgttga 
caactattag 
act tcacggg 
tcagaggaa t 
aaagtgtcct 
gaaagtggag 
tggctattct 
cgctcgtaca 
actatctggt 
taggtgttga 
aatgaagatt 
ttaatgtttt 
tcttcatcat 
gaaacatatt 
ggtctctttc 
taacaaat tg 
gtcaggacct 



aggcagtttt 
gcctgtacac 
aaggcatggc 
aaacaacacc 
taggatcctt 
cacattctac 
aagcattttt 
atacttttag 
ttcctccgtc 
cactgccgcc 
tcgatgactt 
cctttttgcc 
tattgagagc 
cgacaagagt 
tggcaatatt 
ctcctgaagc 
cgacatccat 
gcgcatgaag 
tactactact 
aggaagaagc 
atatataatc 
tttttgatac 
cctcatcttc 
cagccataga 
atacgtgttc 
gagcagcaga 
ttaatttaca 
gaagataagt 
tttcagctaa 
ttttttcaac 
gacgtatatt 
atgagcattt 
cgggacagta 
gagt taatcg 
tcttgttatt 
gggcaaatac 
ttgttcctct 
agtagaaacc 
tagattcgtc 
catccattga 
catcttcatc 
tgtaattgat 
cagttttaag 
tttcttcttt 
aaaaatctcc 
tgatccaaga 
cgtaaacagc 
tattgtcttt 
tcagtagttg 
ggtttgaatc 
ttgaactatt 
atttaatcca 
tcttagggga 
tattggatat 
ggttagaaga 
ctaatttttc 
aacccatagg 
tcaccgatga 
caaacttctt 
gcataacttg 
ttgtttcagc 



tacttcggat 
agcatcggtt 
gcctctatgg 
agaagaacaa 
tatgcggaca 
cccttcatct 
cctattcttt 
ttttgattcg 
accgtacgcg 
aaaatcgaca 
ctttgagcca 
agcaaaggaa 
cgtaacgtta 
ggtctttaaa 
caaggcaacg 
aatggctgtg 
tctgggcatg 
ggctgtagtt 
actactacta 
cgtttcacaa 
ttcatcagca 
aggttcgtac 
ttcattttcg 
ccaaaagttt 
gattaaacaa 
aataaattcg 
ccttgtatac 
accatcaatc 
cctgaaggga 
ttcagccata 
tttggctctt 
gtatgcttgt 
tctatcgtgg 
ctcgctgaaa 
cttacccttt 
tcctcctgtg 
aggagcgaca 
cttatctgtt 
cagacttttc 
atcttcttgc 
gtctgcgcag 
gacacgaagc 
gatactgaca 
accaatgaaa 
tcctcgtctt 
cgtacagaaa 
cttagccaaa 
agatttagca 
gccatagtct 
ggtaacgcca 
gggatgaata 
caaaatgttt 
cataccattg 
tttttggtgg 
gcccaagata 
agtggaagta 
aacattcttt 
attactactg 
ttttgacttg 
agaccagtac 
aaaacctttc 



ttagagaatt 
tcgttaagcc 
agcatattat 
ttccaccagt 
ccattttcgt 
tcaatttcgc 
tcgtgtttga 
gtcattaaac 
tttcttgctt 
ttaccacttc 
aatgaacctt 
gccaaatttg 
atatttgtaa 
ttgtttccct 
gaacaaagag 
cacatacctc 
ccagtttttt 
gtagaattaa 
tgttttgaag 
tcgctagtgg 
ataaaaattg 
tttacaacgc 
tcatcatcat 
ttgttcatat 
agtcctctac 
gttgagaata 
atgtccgtga 
aaaaagtacc 
attgctgtac 
attttagaaa 
ttgagcatat 
tccgaaaaaa 
aacaactcgc 
catcccccgt 
gatgatgaaa 
cctttgttca 
aatttaccct 
cctgctaaaa 
ccattaatga 
agaaggcaag 
ccagtatcaa 
acgcgcttat 
atttcaattt 
cagtaatcat 
atatttgtcc 
ttaccctcgg 
ctagctgcag 
gagaaagtgc 
tccacactca 
catggagcat 
catccccagt 
agatttttaa 
ttacagttcc 
gtaaagtatt 
acgtacgcgt 
gttagagaat 
aaagagtctt 
ctattagagt 
tcctcatctc 
gagtcagaaa 
cccttgtact 



tgccttcata 
aaggagaatc 
acacaatatt 
gagct tttct 
catttttacc 
cacctctttt 
gttcagaaat 
ggttaatatt 
cattgatgtc 
tggaagcaga 
ggtggccaac 
cttgagtaga 
acctttcgaa 
ttgttcccac 
tttgaaccat 
caaagttcat 
ggtcaacaca 
catcatcctt 
aagaagagga 
ctttagcaag 
atggaggttt 
tagtattttc 
cctcattttc 
tactgacaga 
ccattgtagt 
aagaattcaa 
ccaaatttat 
tgttattatc 
cattactaat 
tgagagcagt 
tcttggcagg 
tattctgctt 
cttgttctga 
ttctaatttg 
caatacaggc 
ataagaaatc 
gcacagggga 
agttaaaatt 
cagaccgcat 
tttta tctg t 
tgccagcata 
ggccatcaga 
ctgatgaaac 
tgtcttgttg 
ctggtgtaaa 
gatctaaaca 
aaattttttc 
ggtgcagatt 
ggaaaccaga 
tattctttgt 
tacacaaaac 
acagcaattt 
acatcttacc 
gctgggagaa 
ctttagaatc 
catcataaag 
ttttctccaa 
tattgttaaa 
ccagggccat 
tatgattgga 
ggataaaagt 



ttcgataggt 
tgtgcagtct 
tttctcttta 
ggcaaagaaa 
gtcgtttctc 
acgttttttt 
aaaatcgtac 
actaaaaacg 
aacactattc 
ctcgtcctta 
ttcgtcaata 
tgaagaagaa 
ccaagtcctc 
aaatacaaca 
gtggtgggct 
taggccacgt 
tgagatgtaa 
ttgcttcctc 

ttcgtataaa 
tagagaagaa 
cgtgttttct 
atcggtttct 
gtcggtattt 
gccttccatg 
aataacgtgt 
cccttcaagg 
aacgctaata 
aaacgttcta 
tcctgcattt 
acacactgga 
gt tgtagtct 
ctcattgtca 
gagagtagac 
ccaatcagta 
gcaacacttc 
gt taaagaag 
atccttctta 
agaaattaac 
accaccttga 
tagaccagaa 
gt tggtactt 
aaagt tggga 
ccgataacaa 
caacactgaa 
tgaagacaaa 
gggagaagga 
cataagaggg 
atggtatact 
cat ttcaaag 
gctaatgtga 
tttaagcaaa 
caaact ttta 
agtctggtca 
gttaacaatc 
agacaataaa 
atcttgaatt 
gttcttccag 
attcttaatg 
gaaatagcgt 
tctcaaggga 



199620 
199680 
199740 
199800 
199860 
199920 
199980 
200040 
200100 
200160 
200220 
200280 
200340 
200400 
200460 
200520 
200580 
200640 
200700 
200760 
200820 
200880 
200940 
201000 
201060 
201120 
201180 
201240 
201300 
201360 
201420 
201480 
201540 
201600 
201660 
201720 
201780 
201840 
201900 
201960 
202020 
202080 
202140 
202200 
202260 
202320 
202380 
202440 
202500 
202560 
202620 
202680 
202740 
202800 
202860 
202920 
202980 
203040 
203100 
203160 
203220 
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cggccaacta 
ttttccgaag 
tcatgtttca 
gagagaaggt 
acagaagaca 
gtcaaatcgc 
ctttcaatgg 
tcttttttcg 
acaatgtcat 
ccatcgtacg 
tggctgaaac 
gaatcatcat 
cgtcccctat 
ttaatctctt 
tgaccggcag 
acagcagcga 
ctagtctctt 
acatcattaa 
tctgccgctt 
tcatacatat 
ttcttaacta 
gaaaatctat 
gacatgagaa 
ataacatatg 
ccttctttta 
cacttgctag 
aaaggagcag 
accattgcta 
atccactctg 
caaatttgat 
ttgttatttg 
tctttgagag 
ttcaaaagac 
gccctcgcgc 
tttgacacac 
ccccaatcaa 
agtctagaag 
tcagaatcat 
ttaaagcact 
tctaaacaca 
actttagttg 
tgattcgcac 
acaccatcac 
atgagtttgt 
agacattttc 
gacctaatat 
ctaaagtatc 
agatgtttag 
ttactactaa 
gggatattgt 
tctcgtacag 
atttctttca 
aaatttttct 
ctggatctaa 
ttattaatac 
cctagaaaac 
cccttggcag 
tcgttactgg 
cttagatatt 
aaaacagctt 
gcgcattcat 



ttctaccctc 
aattgacaac 
attggtcttc 
gacctgatga 
tgtaatcgcc 
ggcactcttt 
catccattcc 
agaaaggaac 
tatttatctg 
aaaacaagcc 
ccgtgcggga 
tgtaaaaacc 
tacttccacc 
ctagagaagg 
gaatgttgtg 
gttgggcagt 
gcagttcttg 
cgtccaaaat 
ttactaaaac 
cattgctttt 
gtgaaagggg 
aggtga taaa 
tctttcgggt 
cggcaaaacg 
tgctagcgtc 
tgattttaaa 
ttgcaat ttc 
gcacacaaga 
cccgactgcg 
gcctaacatc 
ttagcaagt t 
caaactcata 
a ttcccacgt 
taacgcccag 
ctgcctttaa 
taggaaggct 
ctggactatt 
tacttgtttt 
cgtatatgtt 
tctgttgacg 
gttttagagg 
acaatgccca 
ctcccgatga 
tatcttcact 
ctgccaattt 
taccattaac 
tcct ttcaca 
aactaggaa t 
ggggagaatc 
tgttgctgtt 
gcatagtaaa 
aaa ta tcacc 
ccat t tcctc 
agttgaatgt 
tactagcatt 
ca taacaatc 
ctccataat t 
ccacattgtt 
cttgtttttg 
ccttatcaga 
acattggtcc 



atggcctgta 
tttcttgtag 
agaaatttca 
ggatgatcca 
aggacatatg 
tcttatcttg 
agcacttaac 
acaaatctta 
attttgtgtg 
actttctctg 
tctcgagtcc 
acccgcgtta 
ctttcctgtc 
atcgataaat 
gaaagaaatg 
tgtacaggaa 
gcgagtcttg 
acaacaatcc 
aagccttgca 
ttcgctattt 
cctagaaaga 
ttgcaaccaa 
catttcttca 
gtaaaagtgg 
ggcctttttg 
gcatcgaatg 
tcgcacactt 
gagggacatt 
caaagtagac 
a ttgctaggg 
gcttgaattt 
ggccttggcg 
atcagcatcc 
aaattctgat 
atttagcatg 
gttattaaag 
ggttccagaa 
tcca ttcaaa 
aagttttgaa 
tttttcttta 
atcaacaact 
aataccagaa 
acatgccaat 
gatcacattt 
acactgtttt 
tttgtccttt 
tgccattgtg 
tgggagtagt 
gctcttgaag 
agcggttccg 
aggcactacc 
agattcagct 
cacaatgaca 
t ttgatgaaa 
gtcagatggt 
atcttcatca 
ctcctcatcg 
caggatctgg 
t tgatagagt 
cttgacgctg 
aaaagacttg 



tttgccaaaa 
aagttaaacc 
ttggtctggt 
cccccacacc 
atgttcatat 
ttagtttctt 
gcagtttcag 
aaaaagttct 
atcatgttct 
tcttcttcgt 
tcatcaacta 
ttaaaacctc 
atttgttttg 
gcttcttcgt 
ttagtgttga 
ttactaacca 
gcgcggaact 
aacaaatcat 
atacgtttca 
ttgtcggcca 
agcctgcttc 
ttactagtat 
atgacaatat 
agaggatctt 
ggagaatctc 
agtggtacca 
tctccctgca 
tctgtcttga 
ttgtttcgtg 
gtgggtttta 
atgaaatttt 
acctgcgctt 
aaatttgtgt 
agctcaaata 
tcactaccgt 
cactccctaa 
ccgtatcttg 
gctgtacgca 
acgttatggt 
agttccttct 
gtagaaccca 
aagaaaccat 
ctggctaaac 
tcatagatac 
atggcatccc 
ttcacaaatc 
ttgaaacgga 
tttatgcact 
ttattaagag 
tcttcactat 
caattcctca 
gcagaattgt 
gttgaaacat 
ttaacatccc 
acattttcat 
tcaccaccat 
tcactctctt 
cgagcagaaa 
ttctccctct 
gcaaaaagtc 
tcagattcat 



cactgccctt 
aaaacgagca 
tagcttcatt 
caccatttcc 
cataaacgct 
cctcaactgc 
tgacaccgta 
caatgtcaga 
taattcggtt 
ttctggcgtc 
cggatgatga 
ctccgccaga 
caccactatt 
tagcgtccat 
tgattctaga 
gattgtagtt 
cgtccaaaga 
ttgaagaagc 
ttaccttccc 
aagaggaaac 
gccgtcccat 
caaatccatt 
cattacttgc 
cttcaatatc 
ccttgcctgt 
ttttatccac 
tcatgatacg 
gaacctcttc 
gataatattc 
tcttcaattt 
caacagatct 
tccagtcctt 
ccagatcttt 
cggcgccaga 
cccaatcctt 
cagtctctag 
ttatcaaatt 
tgatacattc 
ccttgaaata 
cagaaagcga 
gcacaacaga 
tattaacact 
ttctaggtct 
ccaaaggtaa 
acctcttatt 
ctaaagctgc 
gagctacagt 
tttcgttaga 
aagcacaaga 
tgacattagg 
ttctcaaaga 
ccaatgttag 
acttttggcg 
taaacacatc 
catcactaga 
aataatcacc 
catcactttc 
cgtcgtcgtt 
tctccgactt 
gttggaaagt 
cataa tcaaa 



tccttgcttc 
atcacttcct 
attattatta 
tccctcgcca 
tccagtatca 
cttattccac 
caaagcatac 
aattactttc 
aagtacatag 
catgtgtatc 
tccccgacta 
tttctttgat 
atcctccttg 
tactttagct 
ggacttttct 
agtatcgatt 
tttaacactg 
acgcacagag 
aacacgtttt 
actaaccaaa 
taggacgtga 
aaaatcaaac 
aacttgtcca 
gtaaaatcta 
acagctgcac 
agatagacgc 
atacatagcg 
tgttgcagcc 
gtcttcaacc 
cttaattacg 
aatgtcgttg 
attgtcgtca 
accaaaaaat 
ttcgtccaca 
aactaaactt 
aattgtccac 
tgaaaagaac 
ttccatcat t 
atttgctgca 
ttgattaaac 
ctctaaatct 
cttatctgat 
aaacgtgtta 
tgctgaagat 
atctaaaata 
agtaacatca 
cagatctttc 
catggtattg 
acaaaaggga 
cttgacaat t 
tcttctatta 
tacatcacac 
atgacataaa 
atcttgtacg 
accaaattcg 
tgttgcacct 
ttcatcgctt 
gtcacatttt 
ct tgtctgcc 
ctccatgggg 
cgttttaaat 



203280 
203340 
203400 
203460 
203520 
203580 
203640 
203700 
203760 
203820 
203880 
203940 
204000 
204060 
204120 
204180 
204240 
204300 
204360 
204420 
204480 
204540 
204600 
204660 
204720 
204780 
204840 
204900 
204960 
205020 
205080 
205140 
205200 
205260 
205320. 
205380 
205440 
205500 
205560 
205620 
205680 
205740 
205800 
205860 
205920 
205980 
206040 
206100 
206160 
206220 
206280 
206340 
206400 
206460 
206520 
206580 
206640 
206700 
206760 
206820 
206880 
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gtatcgttgt 
ctgtttgtct 
cagacagatt 
gcgtcgcaca 
atttgctgat 
gcaagagtaa 
gttgacgtga 
tcagatgacg 
gatgatgttg 
ggaatactct 
cacatttctg 
tgtagaatgt 
tgaaggatgg 
ctgataatgt 
ttctggggaa 
aaaccgtgtt 
atgctattat 
ttgatgttgt 
ttatcttctt 
tcctttaata 
ttgtaacaca 
ggcatttctg 
atagaaccca 
cacatccact 
ttgctgatgg 
gcagtgtcgc 
atcttgaaac 
atacattcac 
tctgtgtctg 
ttgagcccca 
gctgttcttc 
ctagaagttt 
gttccaacgt 
ttcagcttgt 
cctttcttct 
tttccatcgt 
agattgttgg 
atttgtttga 
ccttcgttga 
agacagatgc 
tctcgcacat 
atattgacga 
tgaatgtagg 
atatcaatta 
atccagtgtc 
ctggtgccaa 
atactcctca 
cagctgccgc 
ctttatccgc 
tcaataacct 
ccacgaccac 
ctagagccct 
ttcgtgctgc 
tgttcaattc 
gtgatcgtgt 
aaaatctttt 
cagttcaaca 
caaacctcga 
aagcaggagg 
cctcagttga 
cgtcccttga 



tcaattcgta 
tgttcttgat 
gcttttcaag 
tgtgtctaaa 
agtactcttc 
cgactgtcca 
tgatggacaa 
aagtggatgt 
atgcagcagc 
ccaattcttc 
caaatttgcc 
ccgtatctct 
cgaaaggcat 
tctttgtttc 
gaagaaactc 
ttctcttgaa 
agacattggc 
tctcgtctcc 
cctcgtcgtc 
gggtagcatt 
tgccaattgt 
gcccaagttc 
tgaggttttc 
cttcggctct 
agagaatgtt 
tcactttaca 
aggcaacagt 
tcagtaattc 
aaacagctgc 
tcaaagaggc 
cacaacccag 
cttcctcata 
ggggtccttc 
ccagaaattg 
tggattttgc 
ttcctctgtt 
ggttgacact 
ccagagggtt 
gaaggagaag 
gtagagtcgt 
cactttcaag 
gtgcagtgtg 
ataaaataac 
cagctacaac 
ggactgtgag 
ctagtgattt 
gacagaacag 
tgctgctgcc 
catttctgaa 
taaccctgat 
atacgattca 
ccccaccggg 
gtcagaaatt 
gtggaatctt 
catcaagaga 
aaaggaattg 
gaaatcagca 
aaaattagta 
tctttttagt 
acatacttct 
tttagattca 



gaccattgtg 
cctctccatg 
cacacttttg 
cttctcaaaa 
tggatgagaa 
cgcgaatatg 
gagcttcttg 
tgcaccttct 
agcagcagca 
aattgtagga 
ctttgatagt 
gggcaatttc 
gattgtgtca 
tggtttcacg 
cattgccaga 
gaaacacaca 
agcaagggga 
agaaccgccg 
tccatctcca 
tgtcttggtc 
acagaaatct 
agaaaaaagg 
gagatataga 
atctgccttc 
cttcttttct 
atgtccggga 
tttcttgatc 
aagttgtacc 
aagacgttcc 
a ttggtgatg 
aaactctctg 
aagcctcatc 
ttcatccatg 
atcaccagta 
aacatccttt 
cactgtgtca 
gccgtctccg 
gccattct tg 
aggtagacgt 
tggttcctct 
aaacttgcct 
ctggttataa 
gtaacttttc 
tggtaagggg 
gcagagagct 
tatagaacag 
gaacaggcgg 
gctgctccta 
gatggaaaat 
gaaaaaatgg 
gaaaatgtga 
ggcacgtgtt 
atcgattcta 
attcccattt 
tcaatgatcc 
cgcaggagaa 
gtaaatactg 
gatttttgca 
gcgtcagcaa 
aatgatttct 
gcaacaaaca 



ctttcatcat 
gaattttcgt 
ggtacatata 
cctccttcaa 
gtgatgttgt 
gtgtcgttct 
atattttcca 
tctagactag 
gcttcttctt 
ggttttagtg 
gcatcccatg 
aggtttctgt 
aacttgatgg 
cgttcaacac 
agaggagagt 
atctggtcaa 
gggatttgtt 
ccagatttcc 
gaa tcgtcca 
aataacaagc 
tgatccataa 
atctttcttc 
cactttttct 
acgacaccca 
cccaccaact 
agattccagt 
atactttcaa 
tttaagattg 
tcttcaagga 
aaggcaacct 
aaggcaggag 
ataataataa 
cgcctatttt 
atttctggaa 
aaagttgcag 
tat tcaacaa 
ccgactctat 
gtcaagttat 
ctgttgtcgt 
ttcatctggt 
cctcctaggt 
gttttgtttt 
ttgtgtcgtt 
gttgtccgac 
tgtgagggcg 
aaatcatgga 
cggcgcaaca 
cgcagtacag 
tggagaggtc 
ctcaacgtgt 
accctggtag 
tagcccccaa 
ttgcatcaac 
tta tgagtaa 
gtaattgttt 
aggttaacgc 
cccttgcagc 
aattgaagta 
cagctcagtc 
tactcgatat 
catt tgatac 



acttaatttt 
atgctgcaac 
tgctaagttc 
attcaaaatc 
acagcgcaca 
tgtcagcacc 
aagaagctat 
aaaccatatc 
cactttgagt 
attctttgtg 
tgttattctt 
tctcatatct 
tgttaacgcc 
aattcttctt 
tgtcaatgtc 
gcttatcatc 
taggggctgg 
ctccagatcc 
atcctttaca 
cacctccttc 
tcaaactttt 
tccaatcgtc 
cgaaactttc 
gagaagtcac 
gcgcatggtg 
tatcactccc 
cacgttcgac 
actttaacaa 
tggttgggta 
taaacactgg 
tcacacatag 
gttccctgag 
cattctggca 
ccttttttac 
gttgactgta 
cattacatcc 
cttgtacagc 
cagtgactag 
actcgggcag 
tggcacattt 
tgagcacatt 
ttcttggtta 
gttcaccaag 
gaccgtctgt 
tatttataca 
ccagtaccca 
acaacaagca 
taacactgtt 
aatcgcagct 
ccaattccat 
ttctgttgtg 
ctacattgca 
tagtctatac 
tagcaaacat 
ttccaaacaa 
cgccaaggca 
atggaacgcg 
cagtcccgat 
acaatcaggt 
cctaaaacgt 
tgctctttct 



gcacaatttg 
tcccattctg 
tcggttaatt 
ttcacgaatt 
aaaccgctca 
cgtggcgaat 
ctggttttca 
tggtgttgtt 
aggtggagga 
ggtaaattgc 
gagtctttgt 
cacaacaatt 
agtcaaattc 
gctcatggct 
actcagaaag 
gtcattgatg 
aggaggcttg 
tccttcctca 
cgccatcttc 
tgccagaaac 
gtttctgtaa 
ggtattggcc 
cagtgttttg 
catgttttcg 
ataggggaac 
gccaatatca 
gattctgggc 
caccacattt 
gaatattacc 
cacgtctgtt 
tttctctgca 
aagctttcca 
agaacgctca 
tttgtacatg 
cattatcaat 
agcatacaga 
tagatgcccc 
aatttgttcc 
tttgttggag 
tattaagtga 
tgccttaaac 
gttatttttc 
gaatcttttg 
atcctacggt 
agaatagcag 
gaagtgaggg 
gcaacaacaa 
tctgcagaaa 
tcgtgctgga 
ccacttagtt 
tttttgaagc 
gtgcctactc 
caatgttcaa 
agtcaattcg 
aaaaatgtag 
ttttctcatg 
ggaagtgctg 
agaaaataca 
acttcttctt 
cacaagggaa 
agggttttta 



206940 
207000 
207060 
207120 
207180 
207240 
207300 
207360 
207420 
207480 
207540 
207600 
207660 
207720 
207780 
207840 
207900 
207960 
208020 
208080 
208140 
208200 
208260 
208320 
208380 
208440 
208500 
208560 
208620 
208680 
208740 
208800 
208860 
208920 
208980 
209040 
209100 
209160 
209220 
209280 
209340 
209400 
209460 
209520 
209580 
209640 
209700 
209760 
209820 
209880 
209940 
210000 
210060 
210120 
210180 
210240 
210300 
210360 
210420 
210480 
210540 
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ccgagtttaa 
ccgcgtcgga 
attcagtgcc 
tgatggcaga. 
cagattttga 
atctcaaaga 
ttaattcggt 
cacagctttt 
gtgtacagtt 
cagatcctga 
tcttgttggt 
ttttcggtag 
atgataagaa 
ccgtgttgag 
ccaccgccgc 
gaccccaggt 
atatctcttt 
accagagagc 
agaagggagg 
cagatcgttc 
tcgaacataa 
caaaggagga 
aatgttcctc 
tcaggtcagt 
gaagatt tgc 
cagaagtaat 
tcagtgctct 
taggtggaag 
cagagatgac 
ttagcgatgg 
tgtccaattt 
gtgccgcggg 
ttgcaaacct 
tcgctcacga 
aatccgtttt 
cagcggttgt 
gcactggcga 
■ acagtagtct 
gaaatacttc 
tccttcaaca 
cccccgatgc 
agaacaccaa 
catttttgtc 
ggctaggaga 
ctattgttaa 
atcgtccaat 
gttatgcctc 
aagctcagat 
acaggttgac 
tcacagaaaa 
ttaatgaaat 
ttaacgtggg 
ctcactcaaa 
tactggcagt 
ccattcctcc 
ataatatgaa 
ccgccatcga 
tttataacaa 
ccttgttcgc 
atttcgatat 
tatctacata 



ggaacaggct 
ccctattttc 
aaatatcggt 
cagagacgag 
agcattgaga 
atcactagcg 
cacaaatatt 
cgcgcaggca 
ccccttctca 
tactggaaag 
aggaaatttc 
acaagttacg 
tgaaacgtct 
agacgccacg 
aggtgaacca 
agtgaccagg 
gggggtctat 
tctcgcttca 
aaagtataca 
tgctaggatt 
ttctacccta 
tgctcttagc 
ttcaagggat 
taataggtca 
cgactttttg 
gagaggactt 
aaattctttc 
gcataggctt 
tctccagtca 
gtggggaaat 
cagcggaaac 
aagtcgaggt 
aatctctgat 
aatcacaaac 
cttgtctcct 
ttcccttacg 
tgacatttat 
ctcctctatt 
aaccagtaat 
gactgctaat 
tgccaatatg 
cgtggttgtt 
tcagatcaaa 
aggactagat 
gcgagaattg 
tactatcgac 
taaccgctac 
ggagcagtat 
catccctgga 
tggagaattt 
gcaacctctt 
ttcagttagt 
ggctagcccc 
cactgcagag 
ttcagtattt 
gacagattat 
aaggtgcaat 
ggatgttgca 
caataatctc 
ggccgtggta 
ttatcagggt 



agagcagccg 
tctattgttc 
atgttggctc 
tctgcagata 
ggagataata 
gattctattg 
cccacagata 
tttttggaaa 
gatgaaggtt 
gtttcagaaa 
caggtttctc 
cctaatacgc 
actattatta 
caaaacgtct 
aaaaagccaa 
gaaagtgatg 
tatgccatgg 
gccgagtcaa 
gaggaggaaa 
ttggctttgt 
gattcgttct 
cgtgcagaaa 
gaatattcca 
ggaggaggat 
agtgacaact 
tctaagcatg 
aagttccccc 
aacaatttga 
gccatcgatg 
actacagctc 
ggaaatgttg 
ggggaaactc 
aagagaattc 
tttatcgaga 
agattgtctg 
gatagcaaca 
agggactatt 
tcagtcaagg 
aagaatgtga 
gccttgtccg 
gccaacgtta 
tcagggctag 
acaatgaacg 
aagaaagaat 
ggggtaagtg 
ctgaacactg 
gctaaattat 
aatgcacaaa 
tccatcacgg 
agtaacgcag 
ttcacggacg 
aagctgattc 
tatgtcttta 
acttctctgg 
tcttcaggtg 
gtagtggaag 
gattccgatt 
tcaaccactg 
cagaaccctg 
ccaaaacttc 
attcataaga 



tagatgcagc 
gacacaatag 
cacgatcaaa 
tcgctgctaa 
ataagagaag 
aaaaaacttc 
cagaggagta 
ctatgggaag 
ttgcggctat 
tggacccttc 
ccttccacgt 
ctattttgag 
atttcaggga 
caacttccac 
tgctctctgg 
atatgattag 
gttcatcggc 
ttaattcacc 
agcgaatcaa 
tggggcaaac 
ggtcttcgaa 
tcttggcagt 
tggtagagag 
atgaaatgtt 
ctgctgccag 
agaaacgcgt 
ttgttccaga 
aggcggccaa 
gtactggaat 
agccgtctcg 
tgtcaattcc 
tcaagtgtgt 
tagatcaact 
cgattgcagg 
tcattctttt 
ttaaaatgcc 
tggccatcag 
ctat t tccga 
ccattaagac 
cctttaccaa 
tttccccaat 
atagga tcac 
agaacattga 
tggataattt 
gttccgca t t 
aacagcct t t 
tcaacaaaac 
tggctgccaa 
cagacactgc 
aaacacactt 
gattcaacgt 
agaatgggtt 
agcctctcgt 
ttgtctctag 
gtctt t tcaa 
taattagaca 
cagctgc tag 
cagctcccgg 
caaaggtatg 
atggcatttc 
tggaacttaa 



cgcagattca 
caggcgtgag 
atattctgtc 
gatagggaca 
agcagatact 
cattaaaaat 
tgaattttct 
tttattgagc 
cgaaaggatt 
ctctctttca 
ttcagatcct 
tatcattact 
taggcttctt 
ccct tctcaa 
ttgcctacct 
tggtcttgtg 
tgcagctatt 
catgatgaaa 
gaaggctatg 
tga tgcccag 
tgccgctatt 
taggaaacaa 
atatcttaga 
tgatcaaggt 
gaacgcttgg 
attcaacatt 
acaggggcgt 
taagatcatt 
ctctgatatt 
cgtcaaagct 
agttagtcga 
ggacattcct 
ttgcggagga 
taaagaacat 
gcgttatatt 
cctcaacaca 
aggcatggtt 
taggtacaat 
tcaaggtgaa 
caagggaggc 
tgctaatgcg 
tgagaccatc 
agagtatctt 
tgtgtatcca 
gtccagtaat 
gatcgtaaag 
aacaagaaca 
tactattcct 
catcaatgtc 
gggggttatg 
tgcgaacaag 
aaccgtatct 
gcaagatttc 
gtcccagaag 
aa t tgatagg 
gctatctaag 
gattgccaag 
aacttcttcc 
gtctatggga 
tcacgatcaa 
cagcgattgc 



gatcacctat 
ggtattctag 
gcagagtact 
aagattgcaa 
tcagtcgatg 
accggcgata 
ctccacatca 
tgtgctttcg 
atacgtaaaa 
gatcaatatc 
aaggatatcg 
agatctaaaa 
gttaacgata 
agaagagtcc 
atcatcaggg 
ggtgattggt 
gccgctggcc 
aaattctcaa 
cgtaggaatg 
tatgggtatg 
cgtgcaaagg 
ttggacggaa 
gactcatttt 
tttgatatgg 
caacagtacg 
gaaggtcttt 
aaaaagactg 
aacggcatca 
attggttcag 
cttaaaacat 
gcagtaaagt 
tcagtcatta 
gggatgaacc 
acaggaaagg 
tggttcaacg 
atgtccgagg 
aataattaca 
tgcggtagtg 
ttgttaactg 
gtgggtgcaa 
gatgtagtta 
aacttctttt 
aggagatata 
aatattgcag 
ctcgatactg 
gctagcaagg 
gcagcagaac 
caattagtaa 
gttaaagctt 
ggtaacgcga 
cgtttaacag 
ctcattcttg 
gctaagcttt 
agtttcttcc 
gaaatgttcg 
aatgctaccg 
tcaggtgaaa 
tccgctttaa 
gctctccccc 
atgttccgcc 
aaaccagaag 



210600 
210660 
210720 
210780 
210840 
210900 
210960 
211020 
211080 
211140 
211200 
211260 
211320 
211380 
211440 
211500 
211560 
211620 
211680 
211740 
211800 
211860 
211920 
211980 
212040 
212100 
212160 
212220 
212280 
212340 
212400 
212460 
212520 
212580 
212640 
212700 
212760 
212820 
212880 
212940 
213000 
213060 
213120 
213180 
213240 
213300 
213360 
213420 
213480 
213540 
213600 
213660 
213720 
213780 
213840 
213900 
213960 
214020 
214080 
214140 
214200 
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aatgggataa 
gcgataacaa 
tttgcgatct 
gatcgaattc 
aaaaatacga 
ctctggctaa 
tcacctctaa 
ctacatccta 
ttagaaggaa 
tcatgagtaa 
aagaatcgcc 
ttgctacccc 
tcattagaaa 
ctagtggtaa 
catcaatgac 
agtgcctaga 
tccaaatcac 
ctggcgttga 
ttgcaaagag 
tggatcaatt 
ctatctatag 
agaaaatgac 
ctggcctggt 
ttatcaaggc 
gtcaattgat 
gtcagcgtct 
gcctgttcgt 
aaatgaacga 
gtacagtttc 
aagaggataa 
tgaggaagaa 
aaaactttat 
tttccggccc 
cggttcacaa 
^9sg9gctgg 
tcttgaacga 
agggaggatt 
aagaagagtc 
a tgagtatga 
cgta tga tgc 
gcaggggaga 
ggagacgtcg 
acgacgatga 
caaccagaag 
ttaaccgtcc 
cgtacga tag 
cttcttcttc 
gccgaacaag 
ccagagtaac 
cagtaggatt 
aagaatacaa 
gagcatctaa 
atgttgcccg 
caaacgccac 
aacaactaga 
ctccttcact 
ccaaacagat 
gaactcttgc 
ataagaacca 
atactacttt 
acagcgcttt 



ttctcttcct 
ccgttcattc 
ggtgacgagg 
caacacggac 
cgagtctact 
gagcaaatct 
ggagattgat 
ttctgtttct 
tgttgtggat 
agaggaggat 
cgttatgcaa 
caaacagcat 
agttgcatcg 
caccgattac 
cccctcgtcc 
gactggccct 
tgcacctaaa 
gcggctgaga 
catcgaagct 
gcgtaaggcc 
tgatttcgga 
cgctagacaa 
acacaaacaa 
aaagtatgtc 
ggccgtggcc 
agttgcttcc 
ttcattcctt 
attgaacgat 
agatgcgcta 
gaaattagac 
gactgaacta 
gtcttacgaa 
taggccaaga 
atcgttcatg 
actcagggct 
tatgaggtct 
catgatgatg 
tgaatcatct 
ttttgatgat 
tctagaccgt 
aggtgaagac 
ctcatcaaag 
ctactttgat 
agctggccgt 
ggctaaagca 
attctttatg 
ctcttccgat 
cgaggaccaa 
aa tgatcaat 
ctatgaaaat 
aattgttaag 
acaagtgtct 
cacccaagaa 
tctagttaat 
ggcaattacc 
tttcacatca 
cgtttcagga 
ttcatgtatt 
tctcgaatgg 
ccatgcactc 
caccaacctc 



ggaaataggg 
aatttggcat 
gaaatggtaa 
gcgcttcaaa 
ctttctacta 
cctctatctt 
aggacatgga 
gaagacgctc 
gctactaaat 
gtgcgttctt 
gacattggca 
cgcagattct 
aatccccttc 
aaccttaaag 
atgtctgtct 
tcaatgtttg 
ctttacggtt 
gactctattt 
ctggaaacgt 
cagaacatgt 
aacattgact 
caatcagata 
caacctcaat 
actaatgacc 
ggtgttgccg 
gacgtaacaa 
cttaacaacg 
gaaacaaagt 
ggatctaggc 
aggagttttg 
tacagggaaa 
tcttccatgc 
aggtacagca 
gtttcttacc 
ctggctgata 
tcggcggtct 
attagtgatg 
actgatttct 
gataataatg 
ctgaattctg 
gatgaggaaa 
atggggaaga 
gacgaatttg 
attcaatatg 
cgcgctttct 
gaggatgacg 
tctccattct 
tgtgcctttg 
ggtcgagtta 
taccagaagg 
ggtgcttcag 
cccagggaac 
agtaacttga 
gacccatcta 
gagaggagga 
ttcggtgatg 
ggaaatttca 
aagggcacta 
atgaccacag 
gaagatacac 
gtcggatcgg 



ctagcaaatt 
tggatactct 
agaccagtaa 
agagccttca 
aagaaactga 
cttcctcatc 
acacccccgc 
tcaatgctcc 
ctctgtacga 
cgagtagaaa 
ttgaccgcat 
tacagacagt 
tctcttcaag 
ctgtatatga 
ctgacagatt 
ccgatgctgg 
ctagagtcaa 
cttctgcgac 
tcgtaaccga 
acaacaaact 
gcgctaaaat 
ctattcttag 
tggctactca 
tcaataatat 
attactataa 
tgttcctcgg 
tacttttctc 
ctcttttggt 
acgtgtcttc 
ttacatcact 
ctgatactat 
tcaagaggac 
cccttgagga 
cagagagggc 
acaggatgga 
cttccgaaat 
atgaggatga 
cctcatcaga 
gccagtctcc 
ccgctaagcc 
atgacctcta 
tccttagaga 
atggcgaacg 
gtccaggttt 
tgacacgagg 
attccctcct 
cctccttcag 
ttaagagagt 
gcatgatcac 
ccaacaagag 
ctaccttgcc 
tgaggaggtc 
atattatctt 
ctcttttggg 
ataggctaat 
caagtaaaga 
agtctgccgg 
atacagttga 
ccgctattgt 
taaaaatgac 
aacattctca 



ctt tggcctt 
tttggcttca 
cgatattgtg 
agttggtgcc 
cgtatattcc 
tttgtcgtct 
tcttctcggt 
tctttcagcc 
agttgcagct 
gattatggga 
tgctagtctt 
aaacgattac 
attgggagga 
tggtgttgtt 
ctggtcggga 
tcatggaggt 
cacctacgca 
tcaggaaaga 
tgtggtgggg 
gtcagata tt 
catgaagaat 
ctctctcttg 
atttgctctg 
ccacgagaag 
tgtgtcggca 
cggaaccatg 
ccaggtttct 
taaactgg ta 
aattagacgt 
ttattcagca 
taacaaactt 
ttccttggta 
tgtacttgag 
agctgcttct 
atctctttac 
gatgga ta ta 
tattgccttt 
tgaatattcc 
ttattcaact 
tcttactgcc 
tgaagaagaa 
tcttca tgag 
ttcaatgtca 
cctatctcat 
caagaaattc 
cttctctgac 
caaggggaga 
tgtacgtgct 
cccagtgact 
ggaaagggcg 
cgacgaatac 
tcttatcaag 
tgacgctctc 
tgatacac tt 
gaaagaccta 
cacccaaatg 
ctatctaggt 
tcgtcttttg 
ttttgcacgt 
ctccgct t tg 
gcgtctaaaa 



tcttcggtga 
cctgcagaga 
cataatattg 
tcagcagtag 
cttgtttctg 
gagggacatc 
accgctaaaa 
gtgttggact 
gtttgtagtg 
atggtggaac 
gttagtacag 
aaaaattatc 
atatccccta 
tcttcctcat 
gtattttctc 
agtaacatgt 
gctctgagct 
aagaatagga 
ggagatactt 
acttccaact 
gtgacgagca 
cacgaactcg 
gcgagccatg 
gaaacattca 
gctgccatgt 
ctccaacaag 
gataatatta 
ggattttgcg 
gtacagaacg 
tacagagatt 
ttcggacatc 
catgacgctg 
gctccttcca 
aggcgagtga 
ggggaagaag 
gagtatggtg 
attgattccg 
gattcatccg 
acatcttatt 
atctacgggt 
caagaaagga 
agtgatgatg 
gaaactattg 
tctaatattc 
aggccttctg 
gaatctacca 
aaatgcaagc 
tttgtgccca 
agtgaaaata 
cgtctgatcg 
gtagagggta 
gcagctgctt 
accacaacat 
ttgttcgcca 
actgaaatct 
atggccgatg 
gtccctctca 
gctaccaaaa 
tcattcaacg 
acagacatgt 
gtaaagagta 



214260 
214320 
214380 
214440 
214500 
214560 
214620 
214680 
214740 
214800 
214860 
214920 
214980 
215040 
215100 
215160 
215220 
215280 
215340 
215400 
215460 
215520 
215580 
215640 
215700 
215760 
215820 
215880 
215940 
216000 
216060 
216120 
216180 
216240 
216300 
216360 
216420 
216480 
216540 
216600 
216660 
216720 
216780 
216840 
216900 
216960 
217020 
217080 
217140 
217200 
217260 
217320 
217380 
217440 
217500 
217560 
217620 
217680 
217740 
217800 
217860 
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ctcttttaga 
tataccctac 
tgcaatcact 
atattggaga 
gaggagaaag 
cctctgacaa 
gagaggaaat 
tttctctgga 
ctgatattgg 
ggcttaggcg 
tgggtaattt 
cgttcgttct 
ttatcaaccg 
ttgccaccta 
tgcgtaactc 
tctcatcccg 
tagaaacatc 
tcgccgaaca 
agacacgcgc 
ccagcgagag 
ttgttcaaaa 
aaggttcatc 
accgtatgtc 
ttgtaccaat 
ggctgaagac 
tagctaaaac 
tacagtctct 
atattaagaa 
ctccttctag 
ctcccagttc 
ttgccaaaac 
cttctgacgc 
atgttgagca 
gtaacgtact 
ttggcatctt 
aggctaataa 
ctgaaaatgt 
gagatgataa 
catctattga 
tggccatcca 
cctctgctaa 
ccctcgaatt 
ttgcagccgt 
gatctgctcc 
actatttcaa 
cagaaactgt 
tatcatatgc 
accaactgat 
gggccatcaa 
ttaataactt 
ttttggacaa 
aaactacaaa 
tcgattacga 
ctgctgataa 
aatcagacag 
tcatcaatga 
ctagacaatc 
gagaacgtct 
tgaggaagag 
ctgttaatag 
cacaaatcgc 



ttctattttc 
agcgttcatc 
cgctcttaac 
cactgctggc 
gggaggtttg 
tcggctcaag 
cttctacaag 
aaatgccgta 
catgaccgat 
cgcagaagag 
gtacggaggt 
gctccaagat 
tatgaagtcg 
taagaacttc 
tattagcact 
cagtggttct 
tgctgttctt 
ctacaacaaa 
agtcgttgcc 
aggagcatct 
tgaactgaaa 
ccaactgcac 
cgcccaaaat 
gactaacgac 
cga tgaagaa 
catcctcgca 
ttgctttgga 
tatggacaaa 
ctcggccatt 
ttcccca tea 
tatcccctac 
atcatcctca 
caaaaaagtg 
t tc tg ttacc 
caacggaagg 
taatgccacc 
ttcctccacc 
taaggtacca 
aaacgcatca 
ggcagcttcc 
ccagactgcc 
at tggaaaga 
t t ctcttccc 
ctcttcatca 
ctatgct tac 
ccagaaaaac 
tcca ttcagc 
tgatgccatc 
ggaaacggac 
gcaacaaaac 
caaaaggaac 
gagccg ta tc 
aaacacatct 
ctcggggaca 
taaatttgtc 
gggacaaaat 
caacggtgct 
tcaggctgtc 
gttggcagat 
tggtatggtt 
tgaaagcact 



aacactagga 
aaccatgaaa 
attcttaggg 
ctgttgacct 
tctctataca 
ggagcagtct 
aggagcaacg 
ggtcctattg 
atcatttcta 
ggcgctggcg 
attgataccg 
tcagactcgt 
aggaaacata 
ccatcctcct 
atcgtgatgg 
actttggcca 
tcaaatacaa 
ctaaaacaat 
gtaattgctg 
gttagtgaac 
aacattgaag 
gaaaaaatgt 
aacctcgaat 
caaggtggta 
aataacacca 
attgtagagg 
aaatatgcca 
ctcattgaaa 
tcttcttcct 
tcatcatctt 
atgcctatcg 
tcaccatcat 
gctctccaac 
accaagcaca 
caacacgcag 
gtttccgcag 
agcatgcaat 
gtaactgtca 
gaatttaagg 
accacagaaa 
atcactgcca 
gttatcaagg 
gttaatggag 
tcctcctcct 
ggaaaattga 
atggtcgaac 
gaaatgatga 
aaaacaaaga 
aaggagcttg 
ttttcaaacg 
gaattagagc 
gaggctggtg 
gaaaatgaca 
gtttacaccc 
gacatgtaca 
actgtaaagg 
tcagagcctg 
gagatgttta 
tcggctgctc 
aacatcaaga 
gcactaaact 



tggctcacac 
tgccctctga 
gagttaattg 
ttattacatc 
gaatgtccat 
ctctagaggt 
atctggtcga 
ctaggtttgt 
gaacagtcaa 
cagcaggaaa 
ttgtgaacct 
tcaatacacc 
aggctctcaa 
ctgaagcaat 
acatttccaa 
agatttccac 
aggccaatct 
tcagccatat 
aatctttaac 
tattgacaga 
aggcaaaacg 
tgagcctgct 
gtaagaaatt 
cattcataaa 
gttctatcaa 
acaatagaaa 
tgaacgacat 
aactaggcga 
catcatctaa 
cctctttctc 
tcttccaaaa 
catcttcttc 
aacttcaaac 
gatttgcgtc 
agacagttgt 
gccaaggaat 
tgcctccatc 
ggcttaacca 
acttgaagga 
caaaggaaat 
tctctcaagc 
cagtcgaggt 
attctatggt 
catcctcatc 
agaacattga 
aagatgctgc 
gacgtgctat 
tcgtgtcaga 
atatggacaa 
aatcagacaa 
ttcagaacaa 
gaggagatgt 
acaatctctt 
ccactgacat 
acaaacagat 
tagacttttc 
tatcttcttc 
tggcaataat 
agtggaatac 
gtgaaaggct 
ccatcaacga 



tgaagcagtc 
ttacacacag 
tagccaattg 
acgtaaattt 
tgttgatgct 
aggaaagtgg 
tttttgttca 
tcccaatgga 
ggatgacgct 
attcattaca 
aactgaaaaa 
aacagaaatg 
aacaccattc 
tgtagttaga 
gtcaagggga 
atctgaattt 
gagaaccatt 
tagtaatgat 
ccccgtgtat 
caatactctc 
tcacgtgacc 
cgttgcctca 
gactgaagga 
gcacaaagaa 
ggacaatgat 
tgcaaccatc 
ttttgcactt 
agcactcgca 
cacaacatcc 
aatggactat 
caaacaatct 
ttcatctgcc 
acaagaatct 
tcataatcaa 
tgctatacca 
tcttacccgc 
atcatcatca 
gtacgccaac 
agcagaaagg 
ggtcaccgtg 
taaatccctt 
ttacacccca 
ttcttcttcc 
ctcttcctct 
tgaaaatact 
cgttcgcatc 
tgacaagttg 
cactaaacag 
agaacaggtg 
gataaagatg 
caaaactagg 
agcaaacttc 
ccagagcctg 
gagcaatgga 
tctcgaggga 
aaaggctttg 
agttgtggag 
gatggagcgc 
tgttaataat 
cacagaga tt 
cgagattgta 



atgggtctcg 
cgcagagaga 
ccacgaaagg 
gcaggttatg 
ctttcttgcc 
caggatatgg 
aagaacaata 
actaacatgg 
tcaatgatca 
gcctcagcca 
ctatacgact 
gccactgcta 
gggggagata 
gccaaggaaa 
atcaactcgt 
gaaaggatac 
gagaatagac 
ggactttccg 
gcggatgaca 
ctcaaatt ta 
gccgcaat tg 
gccgacatca 
aatagtaact 
acaggtatct 
cagcgtagag 
cgttctcgtc 
gatgatgccg 
gagaaggcat 
tcctcttctt 
tcaaacaacc 
aatgtcaatt 
aatattga ta 
aacgatttga 
gctgcaactg 
aatgcaaaca 
ttctcagccc 
tcatcaaatg 
tcaatcttat 
aaaatcga tc 
tctaagtgcc 
aagaaaagtg 
gattcatcta 
tcgggatcag 
aatgtgacag 
gaagaagggg 
cctct tctag 
aacgaatact 
gcttcctcat 
atttcaaaga 
gctattagtg 
agctttattg 
aaggagat ta 
aaagcattcg 
agagacacaa 
ggaatcaaac 
gaggctttcc 
aggagacagc 
accgagagtc 
gtagaagaaa 
aggaatcaag 
gagtctcctc 



217920 
217980 
218040 
218100 
218160 
218220 
218280 
218340 
218400 
218460 
218520 
218580 
218640 
218700 
218760 
218820 
218880 
218940 
219000 
219060 
219120 
219180 
219240 
219300 
219360 
219420 
219480 
219540 
219600 
219660 
219720 
219780 
219840 
219900 
219960 
220020 
220080 
220140 
220200 
220260 
220320 
220380 
220440 
220500 
220560 
220620 
220680 
220740 
220800 
220860 
220920 
220980 
221040 
221100 
221160 
221220 
221280 
221340 
221400 
221460 
221520 
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tcaccctctc 
gtatccaaca 
cagaacagag 
cctgcttcct 
agaacaaggg 
tgatgaccat 
aaatgctgta 
tgatgatcac 
acaaaagatc 
gtcctgccgt 
attttggatt 
gaggtacttt 
ttaactgccc 
agaatgtggc 
ctcccaacgc 
tcaccgacgc 
gtgatcaaaa 
agcagggcga 
agaacagtgg 
aattcatggt 
acaggctttc 
caaactctgc 
ataaggaatc 
aaaaatctgc 
agaatctggt 
ctgaatctac 
acgaaaacgt 
gctacaatga 
ggtcaaattg 
cctttggagc 
aagacagcac 
tggatgcttt 
tttctctccc 
cttcggacat 
tcctgtcaat 
gttcttcttc 
ttctggggac 
gaggaaaaat 
ctattctcca 
ttgagactag 
caggaggcgt 
tcatgtcacg 
tggaagctgt 
gtggaaagtc 
ctctcaccgc 
acaacacccg 
tagtaaaaca 
gcgcttctca 
acatcaaatt 
aattcttggg 
ccttccatct 
cattcgcccg 
agcttattag 
gatcttccct 
atggaacatt 
tcaacctcga 
gtcatcgcca 
ccgtcttcca 
caaggaacag 
ttttggaaca 
tcgctgaatc 



tttgggagca 
gcagcaacag 
aaaggaacaa 
gcaacctctt 
tcctctcagt 
cactgacttg 
cgaaaatgtt 
caacgtccac 
cctgaagcag 
gagacacgat 
cactcaccgc 
cagtggaaat 
aggagttgaa 
tgacgcaccc 
cgagcacctt 
catcatttct 
tctggtaaac 
aagaaggatt 
catcccttca 
tgctccaggc 
agacattaac 
atggagaatt 
ggctatgaaa 
cagtcacatt 
caaggaagct 
gttcgccact 
tacaaccgtt 
cgctttcaac 
tgaaaatcta 
ctttgaagaa 
caaatattcg 
gtccgctgtt 
tacttctatt 
gaaatctaac 
gggagacagc 
catccctgtc 
tattctcgac 
ggtcgccggg 
gatgaacgaa 
gcaactgagg 
gggagccatt 
tgtcgatgca 
ggatagtttc 
ccccgttccc 
ccgcgctcta 
cgacactcaa 
agctaaaaga 
cctcatgctt 
gcacatgact 
aagatatggg 
ccaccacact 
cccagaagtc 
tgacaaacat 
ccttaagaag 
tgaaccatca 
cgttattgtc 
gcacgtgtgc 
ggcaaccgat 
atacatccaa 
acttatcgcc 
tgtaccagaa 



cgagtcgacc 
caacagcagc 
caatacgctg 
cacgagacaa 
aacacacgtg 
tctaggtctg 
ccctcatcaa 
gaagccactc 
ctgacagaaa 
gtactaacaa 
caaaaggttg 
gttgcaccta 
aatgacaaat 
aagtcatctg 
tacttcccct 
ggtatgagcg 
attgatcaaa 
gtgcacactg 
tgtacaaagg 
tctcttctga 
aacgtgagac 
ggtgaggtgg 
aagatgcttc 
aacgatccta 
t tcccagacc 
caactcgcct 
tcaaatatct 
aacatctttg 
ctgaaatctg 
gcttcatctt 
tccaaccaac 
tctactgttg 
acttctatcg 
ctcatcaaga 
caagtagctg 
accaccagcc 
aagactgtgg 
agtcctgaag 
aa tgaactcg 
gatgctt tec 
tcttctggag 
tccagcggtc 
aa tactactc 
atggccaagg 
ctgagcgaat 
ccagaaaggg 
t tcgacggag 
t cttccatgg 
gatactcaat 
gacgaatacg 
tttactccct 
tccatggaag 
tacgtgatga 
gttagcgaat 
agactcgccc 
aaaccaaata 
accgccgacg 
ggaaacggta 
aagagcacaa 
gatcttggaa 
agcgtatatg 



agctcttgat 
aacagcagca 
cagatagggt 
ttaaacgtat 
gtgttcccac 
tactcgactc 
ttgttcctgg 
atacttctcc 
tgttgaacgc 
tgttagagtc 
cgtgtatcac 
atactgttat 
ttagatccat 
caagcatcgt 
tcaaggacca 
gcgaatcttc 
ctactggctt 
aaaacactat 
accgtcaaac 
atgctaacaa 
attatggcac 
tgagagccgc 
ttctaggatc 
ctgctttgtt 
ctgtttgttc 
accgccagcg 
gccctatgga 
gctctaaaat 
ccatgtctaa 
ctgtcaggaa 
ttgctgtaca 
tcggtcgcca 
caaccagtgg 
caatttcccg 
caggttcttc 
aggatggaaa 
agatcaacaa 
ctcgtgccat 
ctcgtgactt 
aggatttgaa 
caagtaccaa 
ttctgatgaa 
ctttgctcgt 
aaattaggag 
cttcccctct 
cagtcgacag 
ttgacccagc 
attcccatac 
gcttcttcaa 
ccatgtccca 
cagataacga 
aaattagagc 
acatgtccaa 
ggactgaaat 
tctccaactc 
atgcaagaag 
ccaagggaac 
acgaatctga 
tgaacgctca 
aggttatcgt 
aaaacaccaa 



caaggtagat 
gcttcccaaa 
tgtttacgat 
ttcttctgtc 
tagcgatgcc 
ttcttccact 
actctgccag 
tcattcattc 
tgccacttca 
caacaatggt 
ccctgttaat 
ccttcctact 
ggttaacagg 
ggagactctt 
gaggcgacac 
atctcaattg 
cccagtgttt 
ggaaggagct 
ttatatcgat 
ggaagaaact 
tgatgttcat 
ctcctcattc 
tgtatctgcc 
gagcaccaac 
ctctaattac 
cctgttccct 
tttgatggga 
gacatctact 
cgttcctgct 
taggctttct 
ggccatgacc 
gaatggcaga 
ccgtccatca 
catcaataga 
cttctttaac 
tgttgcagca 
gagattcgag 
ccagcgcaat 
gtgcgaaatt 
gaggtctatg 
caatgttccc 
caacaacagt 
taggcacatg 
catgctgacc 
tctcactgaa 
actactaact 
cttccctgcc 
aaagtcatct 
gaacattgaa 
caagcaaaat 
gcatctggta 
cacaccctat 
gatcgattct 
gagaatgaac 
tggcatgaca 
tgtactagga 
tgtcgcttca 
actgatccag 
aactgtcgtg 
gaacgaactg 
ggaaatgatt 



agagtaggaa 

ttgacagcta 

ccttcataca 

tataattcaa 

gacttgcaac 

tcctccaaga 

caatgcgcaa 

aatttcgaga 

tccagtgacg 

tacgtcaaag 

acacttctgg 

tctgagttgt 

acaaccgaca 

gctcgcacgt 

ttcaactcca 

aacactactt 

acaggaagaa 

cgcaaggaca- 

atgggcacaa 

ctccgtctaa 

gtggcaggcg 

cctgacggag 

atctctgctc 

acgtctatcc 

ttggggtctg 

aacggagatg 

agtacaaagc 

aacaaaaagg 

atcaacactg 

cccctttatg 

gatactgctg 

aatactcttc 

ctctcttatt 

gacgctagcc 

tctttccttc 

gcggaaattg 

atgcttggag 

acaatgtcct 

gaaaataaaa 

ctgatgactc 

ctttctcttc 

gccaatgtaa 

atgttggata 

caaccaagag 

atctgcctct 

tcagcctatc 

gccctcacct 

ttcatggaca 

cgatttgaga 

tgtaactgcc 

tcctctttcg 

caggccaaca 

agagtaacag 

tccaacttta 

acggcaggag 

atattggaat 

gccatgccag 

aatgctctgc 

tttgctaatg 

gccggcacca 

gatagactag 



221580 
221640 
221700 
221760 
221820 
221880 
221940 
222000 
222060 
222120 
222180 
222240 
222300 
222360 
222420 
222480 
222540 
222600 
222660 
222720 
222780 
222840 
222900 
222960 
223020 
223080 
223140 
223200 
223260 
223320 
223380 
223440 
223500 
223560 
223620 
223680 
223740 
223800 
223860 
223920 
223980 
224040 
224100 
224160 
224220 
224280 
224340 
224400 
224460 
224520 
224580 
224640 
224700 
224760 
224820 
224880 
224940 
225000 
225060 
225120 
225180 
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gctctgacga 
atagcgaaac 
tctatcacac 
gcgccagcgg 
agaacgccga 
ctactcttcg 
ctgaattcaa 
gtccaccaaa 
ttgtttcctt 
gcgcgttaca 
aaaacttttt 
ccattaattg 
accctattct 
tggaccccga 
acactgcttc 
tgggcttgta 
tcacagacac 
tagtgaaccc 
atgtcttgaa 
gcctcgtcac 
agtacgtcag 
gttctgatgc 
acatgaacgt 
ggctgaccaa 
gcaacaacaa 
ttatcaccaa 
catcatcatt 
ccttctttat 
agggaatttc 
ttggcacggg 
tcatgaaggg 
aactcactca 
ttattaacaa 
acgaggctgc 
ctgctctggc 
ccgcgccagt 
cagctatccg 
gaatcgtaca 
acaacactga 
ttatgcaatc 
aactgacgga 
aagctatcct 
tttaccttac 
cgtctctgac 
ccctgaaccg 
tgtaacaatt 
aagactttgt 
ggttgattgt 
ctcctagtgc 
gcatgaaagc 
tggtcatatc 
aacccaaaag 
agaaaagagt 
atttttgtcc 
aggtacctgc 
atgcaggtat 
aactacatac 
tgaaattgca 
gagaggatct 
accttttcta 
cccgtctgaa 



cctcttcaaa 
aacatccaac 
actaatttcc 
ccctctcgcc 
acaaggtgca 
taagcatttg 
ggatatcctc 
tactaaccaa 
cttaaccatt 
cgagaaagta 
ccaatacggt 
tagcaacaat 
tattatggcc 
tactgccatt 
aatggcgaac 
tctttcttcc 
cacagaaaac 
cttcaaggat 
tgataagtcg 
caatactgga 
gagagacaat 
ttcctctaat 
gagatttggt 
ggaagaggtt 
caacggaaat 
tgccacattc 
caaatgggga 
tggtaaccca 
cctcactgaa 
ggctctgcgt 
atggaacaac 
tcttttgaga 
gatgcacgac 
tctgctcgtt 
catggtttct 
tagaggtatt 
caagcgcctc 
caaatcagcc 
tcttcaagac 
tttgggactt 
ccttattact 
taaaatgctc 
aacaggtaaa 
agtcggagga 
cctttatttt 
ggtttatatt 
attatttacc 
agcatttcct 
tgatacaccc 
gaccatgctc 
agacatgatt 
tataaatttt 
cagtcttgct 
agaaatggat 
cgatctactg 
agaaatcgag 
aataaatcca 
atctgcacaa 
tgaaaggaga 
tcttgatttc 
aatcattgat 



tctaataata 
aatggtcccg 
ggcaaggcag 
tttgatttcc 
gcagctgccg 
gctcgagttt 
aacgatatcg 
aatgcctttg 
cttcgtaaga 
gccatctatt 
ctcagtaatt 
gtcaagaata 
ctcccagaaa 
gaaagcagac 
ggtgcaagag 
cacattcttc 
gtcaactatc 
tctgctaggt 
tgcaactact 
tgctcttctt 
acgcctgtga 
ctcttggacg 
ttcatgcccg 
attaagcaaa 
gggaagaaaa 
cccgacactc 
gatatcaacg 
accgccgccg 
gaaaaacgca 
gcagccgcca 
attgttcagc 
tcgggaggaa 
agcttcaaga 
gccaccagcg 
ccagtacgtc 
agccacttgt 
aacctaccta 
aagacacttt 
gttgtctcta 
aaggggaata 
gtaccagaat 
atttctaacc 
aatgctctcg 
agtggaggag 
cgggtctaga 
tgtatgtgat 
aaatttattg 
ccccctggag 
aaacctagga 
aatgcgatga 
tctggtatct 
attctatact 
ccagccacca 
aaagtttgtg 
tgcgttgcaa 
acaagagtgg 
aatgatgaag 
ggaaaaaaac 
ttaatcagtg 
tccctttacc 
gaagacacgt 



atggaggagt 
tcctcatctc 
ctcgcccgga 
ttctgtcaaa 
tatcctctac 
tcgaagccat 
aacgtaatat 
ctctagctat 
acattacacc 
tgacccttct 
cgtcctcagt 
ttgaagacac 
atttcacagc 
gctcccttac 
ccgctgtggg 
acgagtctac 
attcctctgt 
tgatcgttaa 
tgcaagtatc 
cttcttcctc 
atcttccccg 
tgttctccag 
agattattgc 
tggtttctca 
caaccgtcga 
gtcctctata 
acagaaaaat 
caacagctaa 
agaaaatcgc 
acacccgcct 
ttcaacaaac 
ttccacccag 
cattggagga 
atcttttgac 
cagaaatgac 
tgaaactggg 
catccaacgg 
tgcttgattc 
acgctaggga 
atgcagggga 
atggaaacaa 
ctgaaattct 
caccggtatc 
gaaaactgtc 
ctagtgtaat 
tttttcggac 
atttttttta 
atttcatagt 
ggaatccggt 
cgacaacggc 
gtttataggg 
gctcatctca 
ccgaggcagg 
ttatatcaaa 
cagaaccgga 
tggtgttttc 
gattttctgt 
ccattggtt t 
gaggaactgc 
ctaattattc 
acaatgg tgt 



agaatcaatg 
agaagccatg 
aaatgtacca 
gggagataca 
ctattcttcc 
ctctaagcaa 
ttcttctgac 
caagagagaa 
cgcattagtc 
ttcaaccaaa 
tgatcttagc 
attcatgtac 
tctcttgcaa 
caccttcctt 
tgcaggagga 
cgtcacaaca 
tacacaagat 
caacaacaat 
catgccatct 
atcttcgtct 
tgtcacacca 
ggcagatatt 
tgccgtctcc 
gaacaacatc 
tccagtcact 
cactgcagca 
gcacgccaag 
cggagtgcct 
aggcatctct 
ctcatccgac 
attcaagaaa 
aagccaagaa 
atgtcgtagg 
cggtggctac 
tggtcttatt 
aggtgtttct 
gaaaacacta 
agactctatt 
taacaacaat 
tttggtttat 
tcgcgatctt 
agaaaatgtt 
tgctcaggaa 
atcagacgac 
attgtatgtt 
aaataaaaga 
aacacctttt 
gggacatcca 
aagtgcagga 
caagccgacg 
tttcacccag 
ctcttccgac 
agctcctcgg 
tactagggag 
aatctctacc 
aaggtgcgtc 
ccaacttttc 
gtacatccaa 
atacctggat 
aatattcaat 
tgttttctct 



gattatgaag 
aagaatgccg 
ttcgcctcat 
ttcgaagaaa 
tcttctaaca 
gtaactgatg 
tatactaact 
ttcagcagaa 
gaccctaaag 
tcaaaactag 
catctaaaac 
agaaatgtcc 
caggaacaaa 
aatcatccca 
ggaaacccaa 
tcaaaccccg 
cctgttatgg 
actggaattg 
gaatcatctg 
gataccttca 
gccgttctct 
gtcctcgaaa 
aaattcaagg 
aacaacaaca 
ggggatattg 
aatggaggaa 
gctttcccca 
cttacatctg 
gaaggatcaa 
atggaacctg 
gcttcagata 
acaaacgcta 
gtgatccaag 
ggaggagatg 
ggtgcaatct 
gctgctaacg 
ccagaacatg 
agcaacctat 
ttgggaagaa 
tctgctagac 
accaagcgtc 
gcagatacca 
atggcttgtg 
aatgttcaat 
taatttattg 
attggaataa 
tatgccacgg 
aagagtgtaa 
gtaacagaaa 
atgatgtttc 
aagtgtttta 
ctcacaaagg 
acgctcctgt 
cgcacgttta 
aaggaagaag 
tcggtccagg 
aaggactacc 
ataaaggctg 
ccggcaacac 
gacatttcat 
aacagtgaag 



225240 
225300 
225360 
225420 
225480 
225540 
225500 
225660 
225720 
225780 
225840 
225900 
225960 
226020 
226080 
226140 
226200 
226260 
226320 
226380 
226440 
226500 
226560 
226620 
226680 
226740 
226800 
226860 
226920 
226980 
227040 
227100 
227160 
227220 
227280 
227340 
227400 
227460 
227520 
227580 
227640 
227700 
227760 
227820 
227880 
227940 
228000 
228060 
228120 
228180 
228240 
228300 
228360 
228420 
228480 
228540 
228600 
228660 
228720 
228780 
228840 
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aaaaagagaa 
ttgaagcagc 
atttcgggta 
gtcaatacgg 
tgaagcaacg 
ataaaaatgt 
agtgtgaaaa 
gggatataga 
aaatgcctga 
ctagaatact 
ttttgtctga 
tacacacgtt 
atgatgcttt 
aagccgcttg 
ggaataaagt 
aatgtagaca 
ataaaaaagt 
ttttccatag 
cttccccaat 
aaacagatgg 
ttcaaaggat 
gtagggcgga 
accgtat tct 
atatcatcag 
tacaagaaat 
ccgagtacaa 
ctgtcaacaa 
aattagaaga 
acaactgggg 
tcaatactgg 
atgactactt 
ggagattggt 
ttaccaaaca 
gcaaacgcca 
gtaaaacgta 
ttggtagcag 
ccaccaggaa 
gcagataccg 

gggggtgatg 

tatgaagaaa 
actttttcaa 
ttagtagaga 
ttcggaggaa 
gatgaagttg 
cctgcaactg 
ctgtacaacg 
aaatgcaact 
gacccatgtt 
gcaataaaca 
gaacaatagt 
aggaactaat 
tcttcacgga 
cagcacttga 
atacaaattc 
aagctccttt 
gagctgaaat 
acaaggggaa 
aactcttgtt 
acaggagcag 
atgatcatgc 
agcag ttgtt 



ggatgcacta 
cataaaaaaa 
cttacgtata 
gatagtaaat 
agtgcgatgt 
ggccgaagaa 
tttatgtttt 
aagagaaact 
aaatatgccc 
tttatgtttt 
gaatagatca 
attaaaaaaa 
aataatttta 
tgcgcatgtt 
agaacatgta 
ctggatttat 
tactgctagt 
aaaaatggaa 
aaattttgcc 
tacattttct 
ggatcttttt 
tttttatgca 
tcactacaaa 
tgatacaaga 
gtacgaagaa 
aagaaatgag 
accatataat 
caagttgagg 
caggtgtgca 
tatgtgtttt 
caacagaaat 
gcagtaatat 
gtctcccgtt 
cagagcattt 
caatcggatt 
atgtatactc 
agtcgggaac 
gaagatatat 
atgacgatga 
acgaccccag 
ttttggtcta 
cagatggacc 
gttctatgtt 
atggagttcc 
gccagaactt 
tcacgtgctt 
cattctttct 
tgcgggcaag 
ttgcctgttt 
cactcatgcc 
cagtgtgtta 
tttggtgcta 
tggaacttct 
cattgttaag 
tactgtaagg 
gtacgctgct 
acttgttgta 
gaccaataaa 
ccatacgggt 
agctcacttg 
agacgatgga 



gtgctgataa 
ataatgctaa 
ccaaaatcta 
gttgaaaaaa 
tccagatggt 
agagttggac 
agggacatag 
gaaagcagat 
tattttggac 
gtggatgaaa 
ctaagaaatg 
ggtgtgtcta 
aagagaatac 
aaccttgaac 
aagtcgcggg 
tttgataaag 
atggcatcta 
ttggaagatt 
aagttaaaat 
gttataagat 
tgtaggttta 
tgtaaaagag 
ttctccaacg 
ggacgtaagg 
gatggaagaa 
gacaaaacct 
attaaccatt 
agtggtttca 
actggaaagg 
ttgagaaggt 
atcaaagaac 
caagacggta 
tacgagaagt 
tatcaccgtc 
caccctcgac 
ttggcaagaa 
atttgtacgc 
gaaaaagaag 
cgacgatgat 
aaatgtattc 
tagagcaatg 
ctcatctcac 
gaacggttat 
tgaaagagag 
tgtatctgtt 
ctccaagcag 
catcatctaa 
tctacctacc 
gtcaaacact 
ggagaagtga 
ccagaagttg 
gtcaatagac 
gaccagcaaa 
ctagcatcta 
tttggtaatg 
gcctgccggg 
cttgaaggag 
aactcgcctc 
aaactcatca 
ttattttctg 
atccatgttg 



gggtgacttt 
ggaaagtgtt 
aactggacaa 
atatccctgg 
acctttacaa 
ctcgccagtt 
acctcagaaa 
atgtggtcgt 
caaagtgttc 
tttcttataa 
tttctattag 
ttcatgctag 
caaaaactta 
agtacgaatg 
tagtgactcg 
acactttaaa 
aaatatgtaa 
tgactgagag 
cttgcccgga 
tctttagaaa 
ttcccgactc 
gagaatctat 
ctccccatgc 
gtatacacat 
gatatgaagc 
tcaaatctct 
tgtatgagca 
tttcttatga 
gggcgtgcat 
gtcagaatgt 
ctttcagatg 
ttttcaccct 
cccaatacaa 
tttgcaaagg 
aacacaaagg 
gagaaaatgg 
tactctaatg 
tacgatgatc 
gatgatgacg 
gaggtcgaaa 
aaaaagtctc 
cctaaacggg 
ggtgcaggtg 
aggattacaa 
aaactggaac 
cgtggagtat 
ccatggctgg 
tgaaaaacat 
ccctagaaac 
ttgaaggttg 
tggatgtgat 
tggctgacaa 
tgttcagtcc 
aatgtatgat 
acaatgataa 
actgttacaa 
gtgacaggtg 
tttatggagg 
atgattattt 
caaatagatg 
tgatggatag 



ttctacgcat 
tttcaaggat 
atttactccc 
ttacatatgg 
caccgactcg 
agtgaaaaaa 
aaaggaagca 
aacactaacc 
agtggtgagg 
tgatgaagat 
acataaggaa 
atttaccctt 
ttttgaagat 
gctttgttct 
agcagttaag 
tttaaactac 
tgcaaaacac 
cgcatatttc 
tgttaaatat 
catgacaaag 
acacactat t 
gcatatgtgc 
ggccatcgaa 
agaatacgcg 
taaatacact 
tcttgctcct 
atatggaaat 
cacgtatgtt 
ctaggaccca 
gcacattaaa 
tgttatccat 
tcacagaagg 
catgtggttc 
acaaatatga 
agt tgacgcc 
tgtttgaagg 
aaga taaaag 
cagaaaataa 
acaacaacaa 
aggatgaaaa 
ctcctgtatg 
ccccgtcagc 
cagatgcact 
attttgctct 
atgatggatc 
aaaaggcggg 
tcgtgtagag 
ataccaacaa 
taggtacggt 
tactacagt t 
tctcattgac 
ggggaaaagg 
tattcataag 
ttgtaaaatt 
taatgttata 
aaaaattaac 
cggtaagagt 
agaatatatg 
aactaagaaa 
ggaagtttgt 
atattactac 



gaaaaggcaa 
ggagatcttg 
tattttcgga 
ggagaaatta 
gaatgggaat 
tatggtgcca 
aaggaaaaaa 
cataagcatg 
ttggatgaaa 
gtagacgaaa 
aatgtacctg 
aatggtttgg 
gaggaactac 
aataatagag 
cgtaggagaa 
aaatactttg 
gactgtttag 
aaggtagaac 
gtgcagaaaa 
ggtgatctta 
acacttttga 
acaaacaaac 
caaataacca 
atcgaaaatg 
ggaactttaa 
catttaacac 
tttgatgaag 
actgcaaaag 
taactcaaag 
aacatacaaa 
caaggcaact 
caagggaaat 
aagagaggcg 
gcggaaaaga 
caacagatac 
attttgtgtc 
ttttctacta 
gaccagtagt 
tgttgacgtg 
atatgcctgt 
tagagggtta 
atttaatcca 
agaagaagag 
caagagagga 
taaagcagac 
acgcatcagc 
ctcgtcactg 
gaaaatggag 
tgtggaactg 
tcttctatca 
gaagggcaat 
attgtgattg 
ctattgcctt 
gataccaaag 
tgtgtaggag 
aagaaaaaga 
acccaagcca 
tgctttcccg 
attgaactag 
agtaaaatta 
tcggggattg 



228900 
228960 
229020 
229080 
229140 
229200 
229260 
229320 
229380 
229440 
229500 
229560 
229620 
229680 
229740 
229800 
229860 
229920 
229980 
230040 
230100 
230160 
230220 
230280 
230340 
230400 
230460 
230520 
230580 
230640 
230700 
230760 
230820 
230880 
230940 
231000 
231060 
231120 
231180 
231240 
231300 
231360 
231420 
231480 
231540 
231600 
231660 
231720 
231780 
231840 
231900 
231960 
232020 
232080 
232140 
232200 
232260 
232320 
232380 
232440 
232500 
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ttttctcttt 
ttcctcagcc 
atacttttgg 
tcctagacct 
ccattgagga 
gaattgtaaa 
aagactttgc 
gtgtactatg 
tgcaagtcaa 
gaggattatc 
ttaggaccgt 
gctcttttgt 
accaacctgg 
aatacacccc 
ttcagtacca 
gtaaaaactt 
acaaccataa 
agtatcacag 
ctcggctctg 
tctgcaccca 
attgtaggat 
atataaaat t 
attctctaca 
gttcctgacc 
cctgttgtgt 
atggaactcg 
atttttgtca 
gttacacatc 
gatttttata 
aaaatatgaa 
cagaaacgat 
atctggtgca 
acccagaggt 
tcgagtgagc 
ttgtcaatgc 
atatctaggt 
gaaagggtta 
gtgtaatctg 
ccagcggtcc 
gagatggagt 
gcggctcact 
tggaccactc 
aacttgacat 
tttataaaat 
gctaactact 
ccagaaacga 
tatctggtgc 
gacccagcgg 
ctcgagtgac 
ccaacatata 
ccagatatct 
tctggaaagg 
cttcgtgcaa 
ctacctagcg 
atgagggaat 
acagaaagag 
caactcattt 
catttctggt 
caccctccga 
gaaaaaattt 
tacacacact 



agctagagga 
cgatcttgta 
tgtcgaaaga 
cgcaaataag 
ggtgcaaact 
ctgtttccat 
ccgacactga 
a taacaacaa 
aatacatctt 
accccatcaa 
gggagttttt 
cattgccaga 
aaaaaatgga 
agatagaatt 
tgtaatattt 
tattgtttct 
tactgagtat 
gaaaaatcgt 
ta tgtggagc 
tggcgttgct 
tctggatatc 
tgtgtgtttt 
agcactgtgt 
ctcatatctt 
atttggaggt 
tataccatga 
taccatctcg 
tttccactgg 
ccctcatgcg 
taaactgagc 
tcaaaccatt 
t tatttctgg 
ccaccctccg 
tgaaaaaatt 
atgctccctg 
cgggaagaat 
caatttattg 
acattgggtc 
accctctaaa 
gaaacttcct 
ctctagaaac 
ccatttctgg 
cgggcctacc 
ttt tgatgag 
aaccgagaga 
ttcaaaccat 
attatttctg 
tccaccctcc 
gcagaaaaat 
cacatgaaca 
gggtcggcca 
ggtacaattt 
tctgacattg 
gtccaccctc 
atagtactcc 
aactgatagc 
ctggacaagc 
ataatttctg 
acttgacatc 
ttgaaaaatt 
cgcccatgct 



gtggataccg 
ttgttgatgc 
tttgagacaa 
gacgaaaaga 
aaaattataa 
gatgtgtttc 
aaatggttat 
tgaactgatt 
ccatcacgat 
agagattgtt 
atcgcgcaaa 
ggacaaatat 
agaaatacag 
gtctgaccag 
ttgatatcaa 
ttcctctttc 
gggtcttgga 
ctcaagaaag 
aggaattttc 
gtctgtcttc 
gatgattatc 
atttgtttga 
ctttggacac 
cttcattttc 
gaacagggaa 
attgtactcc 
tatggtcgtc 
tgggagagga 
aatgcgatgt 
atacacatgt 
tctggaacat 
taccattttt 
aacttgacat 
tttataaaat 
tggaaggtgg 
ccagaaacga 
tatctggtgc 
gacccagagg 
ctcgagtgag 
aggtgtaagg 
gtctgctcca 
tgtaatctga 
cagcggtcca 
gaaaaacact 
gtgaggcaga 
ttctggaaca 
gtaccatttc 
gaacttgaca 
tttttaaaaa 
catgaggcgg 
gaacccagaa 
cttataactg 
ggtcgaccca 
taaactcgag 
gtagccaaca 
tgttccagat 
catttctgga 
gcaccttcgt 
gggcctacct 
tttgatgaag 
cgtctagctg 



ttgagtggtg 
ttttagatgt 
attccattca 
atgtatggat 
atattgtata 
tttcctagtg 
tttgtcttgg 
gtaagagttg 
gatccagaga 
gaacgactag 
caagtgaacc 
tgtaacctat 
cgtactagac 
ctagatggat 
attgaaagga 
ttttgtctgt 
tctttgttgg 
gcaatggata 
tgtggatacc 
tgctgcactg 
caggcattga 
taatacaatt 
ggggatcaat 
aaatacgagg 
ggagaatagt 
tccaatatct 
agtgctcagg 
agtattatgg 
ggaaaatttt 
cgtatccaga 
tcatttctgg 
ggcacctctg 
taggccgacc 
ttttgatgag 
tggatgagat 
ttctagccat 
attatttctg 
tccaccctcc 
cagaaaaatt 
tgcactacag 
gaaacacata 
cattgggtcg 
ccctctaaac 
accagtggag 
gtctgtccag 
atcatttctg 
tggcacctct 
tcaggcctag 
tttttgatga 
tctacaacag 
acgtttcaac 
gtatatcatt 
gcgatccacc 
tgacgcagaa 
tatacacatg 
atctgggtcg 
aaggggtaca 
gcaatctgac 
agcggtccac 
gaatatagta 
atagctgttc 



ctctgctagc 
tgaaaagtgt 
agaacgtgct 
taaggtagac 
taatattgtt 
attttcacat 
acgagtctct 
gtgggattta 
ggttctttta 
cagaagagga 
tccaacacgg 
tattacccca 
tcattcactc 
gtgttatatg 
aaatggttca 
cactttattc 
gacattttgg 
attataggct 
agcaatattc 
ctactgctgc 
aaaggagtaa 
ttcaccttgt 
atcttggatt 
agggatgttt 
tgattgttct 
tcaagataga 
tacaatctct 
ttcaatattg 
atatgtttcc 
tatctaggtc 
aaagggttac 
tgtaatctga 
cagcggtcca 
gaaaaataga 
aggagggaga 
ttctggaaca 
gtaccatttc 
gaacttgaca 
tt taaaaatt 
aggcttgtga 
aaaaagataa 
acccagcggt 
tcgagtgagc 
cgaccgtaca 
atatctaggt 
gaaagggtta 
gtgtaatctg 
ccagcggtcc 
gggaatatag 
aaagagaact 
tcatttctgg 
tctggtataa 
ctccgaactt 
aaatttttta 
aacacatgag 
gccagaaccc 
atttcttata 
attgggtcga 
cctctaaact 
gtatagtgcc 
cagatatctg 



gatgagggac 
tcaaataggg 
agagccctat 
gctcgcggca 
gaagaataaa 
gttacctaga 
tctggagaag 
tatgcagata 
tagtgtgttg 
tggggtattt 
gtgctacaaa 
gcaaatgaaa 
tagaacgtac 
ttaacacttt 
gaaattggta 
gtcagacata 
gtgtgttaaa 
cgtgtcggtc 
tacacctcag 
tacatcaact 
taaagatata 
accgaaaccc 
ttggatgaat 
tctcagtatt 
ttggtgttag 
tgcatgatgt 
tgtggtggtg 
atggaggatg 
gacaagggga 
gggaagaatc 
attttattgt 
cattgggtcg 
ccctctaaac 
agtaagatgc 
ggctatccag 
atcatttctg 
tggcacctct 
ttaggccgac 
ttttgagatg 
tcctttctgg 
agtacatttc 
ccaccctcgg 
tgaaaaaatt 
cactcctcta 
cgggaagaat 
caatttattg 
acattggcct 
accctctaaa 
tactccgtag 
gatagctgtt 
acaagccatt 
t ttctggcac 
gacatcgggc 
aaaatttttg 
gcggtctaca 
agaaacgttt 
actggtata t 
cccagcgatc 
cgagtgagct 
cgccaaagca 
ggtcggccag 



232560 
232620 
232680 
232740 
232800 
232860 
232920 
232980 
233040 
233100 
233160 
233220 
233280 
233340 
233400 
233460 
233520 
233580 
233640 
233700 
233760 
233820 
233880 
233940 
234000 
234060 
234120 
234180 
234240 
234300 
234360 
234420 
234480 
234540 
234600 
234660 
234720 
234780 
234840 
234900 
234960 
235020 
235080 
235140 
235200 
235260 
235320 
235380 
235440 
235500 
235560 
235620 
235680 
235740 
235800 
235860 
235920 
235980 
236040 
236100 
236160 
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aacccagaaa 
cttataactg 
ggtcgaccca 
taaactcgag 
ttcttgacga 
gcgtcataca 
acgtccagag 
tttttgtggg 
acaaaatctg 
caggcccagg 
agcttgtcgt 
agacgtacac 
actatgtctt 
aaaaatttcc 
ctctgtggaa 
ataacaacaa 
atgcttcagg 
tttgcaaaaa 
gagggagttt 
aagaaatgga 
aactagactc 
aagggtacat 
tcgtttgtag 
aagcgtttgc 
atcactacag 
aatgtgttgg 
gacatttcat 
ttaaatgttt 
ggaaaatggg 
ttgacacata 
ctaaacacaa 
ttgaagaggc 
aggccataaa 
tgaagaaaaa 
gacgactgaa 
acaaagagaa 
aggcgaaaga 
ttttgtttag 
tagccttcaa 
aaaaaggcag 
ctcaatccac 
taataatatt 
tgtcagtatt 
gtcaagagtg 
aaggagagaa 
ataatgattc 
atttagaaaa 
caatgatact 
gaagtttcag 
ctgtatttga 
atgaaattgt 
tgtacaactt 
tgtatagttt 
ctagggatca 
tattgtatgt 
tcttcatcaa 
gtaacacatc 
ctgcaaagta 
agaattggcc 
tcataatgat 
ggatattggg 



cgtttcaact 
gtatattatt 
gcggtccacc 
tgacgcagaa 
taagaggagg 
ttatatttcc 
acaagtttta 
ttctgtgtgt 
tccatgagac 
accagttcca 
ggaacttctt 
caaacctata 
atcaaacccc 
cttgctattc 
gaagttgtac 
gaactggact 
aaggagtgtt 
caacacacat 
tcttcacttc 
ctctcatgtg 
tttcatacct 
cccgctggaa 
gtatggagca 
aataagaaat 
tttcaacaaa 
atgtttatat 
gcatacattt 
ccaaaccttt 
tataaaccaa 
tttagatttt 
acctgaaaac 
cataaagaaa 
gaaaaaggaa 
ggaagaagaa 
tgaagaatgt 
agtgagaatt 
gaggttgagg 
accgaaacga 
gttagcccta 
tgaaagaaga 
tgctttggct 
gttttaatat 
tggtggagga 
tgataaaaag 
tattggtata 
tacctcaata 
tatcctaaaa 
ggaaactcat 
tacatgcgta 
aacagagaaa 
agctcacacc 
tctaactctc 
tctattgaaa 
atttgttgat 
tttgctggaa 
ttcatatggc 
agcagatgtg 
atggaggaaa 
aatatctttg 
agattttcca 
gtggaatggg 



catttctgga 
tctggtataa 
ctccgaactt 
aaatttttga 
cagtaggtga 
aagaattttg 
gtctgataga 
gtataaaaga 
ctcagtagag 
acccaagaac 
ctcccgtcct 
ccgatttttg 
aatgaagatg 
cactgcactt 
aacaaaagaa 
cctgttccag 
cttatgagtg 
tacttgtttg 
tttgcatcta 
ctcagtaaca 
gccaacggaa 
atagcaatta 
tcttgggcaa 
gatgcaaaag 
aatgtgacga 
gata ttgaag 
tgcttgtcta 
gatgacacaa 
acgactaacc 
att tgctcat 
aaaaaggtgg 
aaggaagaag 
gaagaactag 
gaactctcaa 
gtcaagctga 
aa tggtgtat 
aaaattttat 
gtagaagaac 
gaaaagaaaa 
gatggagaat 
agtgtgtgtg 
cattaatatt 
gattggacaa 
agagagcaag 
cattcgtatg 
gaggaac tag 
ctcaacaaag 
aaaattttaa 
cgct ttgctg 
gaagcgt tea 
aatgaccaaa 
caccctttca 
ggtaatggta 
act ttagtgt 
tcaataaaaa 
atcttgccag 
cagattttat 
gaatctccag 
tcggccaact 
gtggtgtcac 
ttggagaata 



caagtcattt 
tttcaggcac 
gacatcgggc 
aaaatttttg 
ggctgcttgt 
ctgacgtcaa 
tttcttaaaa 
taggtgcaaa 
aacgcaccat 
ttctttctac 
acagtagtga 
ggaacaagag 
tgagttacca 
ataagacgaa 
aattcaaact 
ctatcccgtt 
aaataatgtc 
atatgttaaa 
ggaagaattc 
tagcgaaatt 
aaataccatg 
tggaagacaa 
acacatacgg 
attgtctgga 
aggaagaatt 
acgagaaacg 
ataagtgttc 
tttttagaaa 
ataaggaaat 
ataacgtcaa 
aagaagaact 
aactagcaaa 
caaaaaggac 
aatataataa 
gagatatttc 
tactaaaaga 
tgctagaaga 
gtatattcct 
cggaggacat 
atactataac 
tgttaaacga 
ttaccacaat 
caacattccc 
actactcatt 
aacacacgtc 
aagtactgaa 
gagaaaaaat 
tgaaagggat 
taaataagaa 
attctataca 
ttaaaataat 
atgatggtaa 
tcgtacctca 
attttagaga 
ataagtaaaa 
ttttctgtat 
ccccttcccc 
taatggggtt 
tgtacaaaag 
aaatggggga 
acaccaggtc 



ctggaaaggg 
cttcgtgcaa 
ctacccagcg 
atgaagagat 
ttgatgtgtc 
tggaccataa 
aaaagagggt 
ggtagagaat 
ggttgcttca 
aaactttctt 
tgtagtttat 
tatagtttct 
aatggtttct 
tgaagaagat 
cctcaactca 
tgacagggag 
cacgtcaact 
tatggaacgt 
ttttactaac 
catatgcaat 
ccctgataaa 
ttaccctgca 
ggatcataat 
aattatagag 
tgttaaagag 
ttgttacaaa 
taaagctaac 
atgtccccca 
ggatttgttc 
attagacaaa 
agcaaaaagg 
aaggacagca 
agcagaaatt 
aataattgaa 
aactgcagcc 
ttccgatcag 
agaaacaaaa 
aactaaagat 
aattgcgaag 
ttctcatatt 
ataaataaaa 
gtctacttgt 
attcgacctc 
tttcattact 
aaagattatt 
tatatacaaa 
tatactgatg 
tcttcccaag 
caatgaacgg 
aaatctagta 
aaaagcgtgc 
tggaagaaca 
tttttcaccc 
acatggagat 
ttcattttga 
tgttatctca 
atccctcatg 
agttacattc 
ctccttggtc 
tttatcaaac 
gccgaggagg 



ggtacaattt 
tctgacattg 
gtccaccctc 
tgagtaaaat 
agccacatct 
aaggctttgt 
gggaggtttg 
catcatatgg 
actccgtgtc 
gaagctcaca 
tgtgactctg 
accattggag 
tccgtcttag 
aaaggtattc 
ttgttggttc 
aatatatgtg 
tttcagacaa 
ggcaaacaag 
tttgaaaatg 
gaaaaggaaa 
actaatgatg 
ttgctatatc 
gaatctctca 
tttataagtg 
aagactgtag 
ctcccatgtg 
tttagatgtg 
actatacaat 
aatcgtgcat 
aaatcaaaac 
acagcagaaa 
gaaattgaag 
gaagaggcca 
aagggaaaaa 
ataaacatgt 
gagttggctg 
cttgacagat 
gatgaaacgt 
aaaaacaacc 
gagaaactac 
gtataaaatg 
tcgaatttgt 
gtccatacac 
gaaacgtgta 
gacacgggta 
gctataaacc 
gatgtagaaa 
ggtaaaaatg 
cattactacc 
gattattata 
gcatatttca 
gctagattat 
ataacacacc 
ggacgacctt 
ggcatttatt 
gcaataatgc 
gtatttcctc 
aatacagcaa 
atgacttcac 
ggcatggcgg 
gcaaacttcc 



236220 
236280 
236340 
236400 
236460 
236520 
236580 
236640 
236700 
236760 
236820 
236880 
236940 
237000 
237060 
237120 
237180 
237240 
237300 
237360 
237420 
237480 
237540 
237600 
237660 
237720 
237780 
237840 
237900 
237960 
238020 
238080 
238140 
238200 
238260 
238320 
238380 
238440 
238500 
238560 
238620 
238680 
238740 
238800 
238860 
238920 
238980 
239040 
239100 
239160 
239220 
239280 
239340 
239400 
239460 
239520 
239580 
239640 
239700 
239760 
239820 
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cgtgcttgta 
ggacagggaa 
aagaagacag 
tgtacaatgg 
tgaacatgct 
tgcttataag 
cgtgtgttaa 
ccacagataa 
ttgtaagggg 
aaaaactccc 
atcatttcct 
gtccacccag 
atctggcatc 
atgaaaaatt 
actaacaagc 
cctaaaatcc 
aaatactgta 
aaaaagggcc 
aaaagaaggg 
gataaatacg 
taccaataat 
agaaaagtac 
caaaagggat 
ataaatttta 
tttctatatt 
aggaacagaa 
tcctcatcat 
ggatcgctgt 
attgagatag 
tcttccatgg 
ttagtcgtgg 
acctgttgtt 
tagacttact 
tcttcttcaa 
ctcattaaaa 
gggaccttct 
tgatattgta 
gttttccgat 
tggcgaagga 
tgaagacaga 
agatccagtc 
tttaacactc 
ggaactcttt 
gggtatggca 
cgacacatat 
agcgtctcca 
acgccaaacc 
tcccacgccc 
agacattaat 
tgaaatgccc 
cttatctctt 
tactgccgct 
taacaagaga 
gtatgcgaat 
agaagtgttt 
cttatttttt 
tagacacccc 
catcaacgac 
tcctctgggc 
ttcgactaca 
gaatactaca 



aagctccttc 
tcgcctgagt 
tacactctct 
agactcgcaa 
gtccgcgttg 
tgagtccatt 
tttctcggcg 
gaatgtggtt 
tgaataagga 
tcttgttcca 
tatgtgcaat 
aaagggggcg 
cagaaacgga 
ggacatcaac 
atttcctgcc 
actatcgaga 
gaggaagaag 
ctggacgcat 
atatctgcac 
tggatagccg 
caactgaaga 
aacaaatatt 
tatgaagcaa 
ttttataata 
tttactgcct 
gagcggaccc 
cgttatcgtt 
ccttctttgg 
caacgatcaa 
tgtaagaagg 
tggccatttt 
ttatcgtgta 
cagcgttagt 
cgccgggtta 
ggagatggaa 
tatagcgttt 
gtgtctattt 
aagttgagaa 
gaagaagatg 
gaagcacatc 
gctgtagatt 
aaggaattgt 
gacttcattt 
aacaggagta 
agtaaatgtg 
atttcattgg 
tttttaacta 
gacacaaagc 
attagccaga 
cgccaacaaa 
ctaccttcca 
gccaatattc 
ctgaaaactg 
ggaaaaataa 
aaccatcttt 
atcaaaatga 
aaacaatttc 
aacaacataa 
tattacaccc 
aacaagagac 
ggacaccttc 



cacaggggcg 
acgtgggaca 
ccaacatcgg 
tcggtgaaca 
aaaaagtcct 
gctgctgctt 
gaggactgtt 
agacttgaat 
ctagactcct 
agaattggct 
ggataattat 
aatacacata 
tgcaagcagt 
acgcattggt 
taacctgcag 
gagtatatga 
aaaggactgt 
gggaagcagc 
tcgttttcaa 
aaagggaaac 
accagatgtc 
acagaagaag 
gtcaattttg 
aaaggtgttt 
cctcttgggg 
agccagaagc 
ggcagtgtcg 
tcccatccat 
aaagagaccc 
gccagcggcc 
ttttctgttt 
cggtttttgt 
tgtcgcagtt 
acgacaatct 
ctgaatttat 
taggacctga 
tgcagacaaa 
aattgggtcc 
gagatgaaaa 
gacgaaaaat 
tacccacgtg 
gcgatttaat 
ttgaagaacc 
aattcaccaa 
tagtattttt 
tactagatga 
gggcaaaaaa 
ttttggctat 
tggcattaac 
aaaaagtaaa 
cctcctcctc 
ttcttccagt 
atgatgggga 
gccctataag 
tgttccctgt 
aatcctttgc 
tcaacgggcc 
actcttccac 
aagagagccc 
tacaaaacgt 
agaagagtgg 



tattcgtctg 
gcactttgaa 
tgacaatatt 
tggcgtcggt 
gctcggcaat 
gctgcttgct 
cttatataca 
ctatttgaaa 
ctaccagaat 
agtgggcggc 
aatttgtaat 
acatttctgg 
atctgaaact 
gtggtggata 
ccaagcaaca 
agaaagagtg 
ctcagcaatc 
gataaaggaa 
cgcagcagac 
gtcagaaaaa 
a tctctagtc 
tgccatactc 
ggtgtataca 
taattataat 
taagacataa 
atcatatccc 
tcatcatcat 
acattcatga 
atgcgagcca 
tgggctccgg 
gatgaagagg 
caggacacct 
ggtcatgttc 
tagatccacg 
aaccggagaa 
tccgtgcgag 
taatatacag 
atggattgat 
tgaagacggg 
gatgaagaaa 
gcgagaaaac 
agttgaatgt 
gtgggagatt 
ggaa tea tta 
tgaagcagtc 
tatatattgt 
ccca tct tta 
cgacgag tgc 
tgaaagggac 
caccttcctg 
ctcttcttcc 
gtacaggagt 
aaatgca tea 
gattatggta 
ctttgcctct 
aagtgca tct 
gtgcaaatgg 
gatgtggagt 
tcaaccctat 
gcagccatta 
gaaa ttccgt 



cttgtagttg 
gctcatcttt 
cttccctccc 
agggaggagc 
ggagaatatg 
gctgttaggc 
aattcgttaa 
ccatgacaag 
tcatgtgcgt 
tgcatgccag 
atttgaagtc 
gggtgtattt 
ctctctcact 
aaagggggca 
gcatattaca 
gagaatctag 
gattctgtcc 
cgagaaaacg 
gccaaaacac 
agaagaaagg 
aacacaacca 
aacatgcaat 
aacaatgcat 
ttttgtccct 
aaacggtggc 
tggtcctgtt 
cggtgtcctt 
cggccaggac 
tagacatgga 
tcctgccgaa 
tacgtcaatg 
tacgaaccta 
cgtcaattct 
gcgtccgctt 
ccaccttctc 
gacccagaaa 
gtaacaaaag 
aggagcggaa 
ggtggaaatg 
ttgtcctttg 
agtacagaat 
ggatgcatca 
aaagaggctg 
attgactggt 
aactggtact 
tgtgtctttt 
acagtggctt 
gtgcaacact 
tgcttcttcc 
gacacaatga 
aacaacaaga 
aacttttcta 
gcctgtattc 
agaaaatcaa 
aaagacactg 
ttactcctcc 
atgactctag 
tacacgctag 
cagacatgcg 
tacttttaaa 
ttaatctctt 



caatgaagta 
tcgtccgaag 
gacttgttct 
atggtttgca 
ttttccaatt 
gtccagaaag 
agggtgacgc 
gcattttctt 
cattgcaagc 
tctgtatgac 
gtcgggtccg 
agtgcattat 
tttgacatca 
accggctatg 
caatggagga 
aacaatggac 
tggaggaaca 
acctcgcagt 
gtaaagaatt 
aagcaacctc 
aaacactcaa 
acatcaataa 
aagttttcaa 
gatgttgtgt 
ggcggaaacg 
cttatatttg 
atcagtgtca 
gaggatacca 
gccttcaaga 
aggaagagtg 
taacgtaaag 
tttgtctttt 
gttcactata 
cggcagcggc 
ataaaatgag 
gggtatatgt 
aatgggaatt 
ttgagaataa 
ggggaagaat 
ttggaagaga 
ttgcacgtcg 
aatcaaaaga 
ctgacgttag 
tttttgagtt 
tgaaatctca 
cctacataag 
catccttttc 
ttttaaaatc 
ctcttttaac 
agagacctac 
gaaagagaaa 
cagcatccaa 
ttatcgaagg 
ctattattcc 
gtgcgaatat 
ctggactttt 
cagaaaacaa 
cagattattg 
gcaattttac 
cactcttttg 
gcttaatgtg 



239880 
239940 
240000 
240060 
240120 
240180 
240240 
240300 
240360 
240420 
240480 
240540 
240600 
240660 
240720 
240780 
240840 
240900 
240960 
241020 
241080 
241140 
241200 
241260 
241320 
241380 
241440 
241500 
241560 
241620 
241680 
241740 
241800 
241860 
241920 
241980 
242040 
242100 
242160 
242220 
242280 
242340 
242400 
242460 
242520 
242580 
242640 
242700 
242760 
242820 
242880 
242940 
243000 
243060 
243120 
243180 
243240 
243300 
243360 
243420 
243480 
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atgaataaca 
aaactcccat 
aacaacggag 
t tattgcaca 
aagggcaaag 
gcaaatggac 
attcaatact 
gggtttattt 
ctcctcgacc 
tggaaacaaa 
gaaaaaagag 
tcacgaggtt 
ct taataacc 
tt tcggtcgt 
ttaggtatca 
acatggatga 
tgactgatgt 
ctgatgcaca 
cactcgaagt 
acacatcaag 
aggcctttgt 
aagttggcac 
gaaatctttt 
a tcgtgcttt 
t tgaggaaat 
tttaggtgac 
aggcgcaa tt 
aatagacaaa 
aatcaaaatg 
tgaacacaac 
ctcatcagtt 
atcaatcatc 
cggattcact 
aaagggcaat 
tgaatccgta 
ttgaagcatc 
catgagctca 
ctcgtccatt 
atccatctcg 
tcttccctca 
gtccgtcata 
t tcgagatat 
gtgtgctggt 
ctcccccaag 
aaaaacattc 
cttgcataac 
accaaaccca 
aaacttggac 
cgacacatct 
cggagattct 
gggtttttct 
t tcagtcacg 
gacagacagt 
tggatgagag 
agacgcggca 
a tttaatgtt 
aaagttgtaa 
t tttcctccc 
agaatctgag 
ggctgaaggt 
cgattggtaa 



agtggagtac 
ggaccatgaa 
actactattc 
cgcctgccat 
accaccagaa 
aactctgatt 
gcactgatga 
agatctcgaa 
ccaactacat 
cccaaagatg 
taaaaggcga 
gtcatcacca 
aagcaacgtt 
gtcggccatc 
caacactgtg 
aaacctccgc 
gtcctttgac 
gatgaaggaa 
gactgtgggg 
aaagatcaac 
cggtagctcc 
ctttgtgtgt 
cgacatgtac 
tttatataga 
ttataccata 
tggaaaagtt 
tcatacgctc 
aatggaatgt 
gacaacatgt 
tcactgttaa 
ccccaacatt 
ttaggttgac 
aatccttgtt 
tggtttaaca 
atgggtttct 
ctaacaccgt 
gcggctggct 
ttacatttca 
acacaaaaac 
agagcgtgac 
aacttgataa 
aattcagatg 
acaggaatat 
agtgtagttt 
tttgccatgt 
ctgttccagt 
acaggccacc 
gctttttcct 
tgcacgtcag 
agaccggctg 
agcttgaccc 
gtatcacaag 
ggagtatagt 
aatctttttc 
ctaacagaag 
gcaaattgat 
ggtgtgctct 
acctctccat 
ggaggggatc 
atcttgaatt 
agtcccatca 



actcattcca 
aacaatgtac 
tacatgcagg 
aactaataag 
gctatatgac 
attattatta 
ttgggttcat 
atgcaaccac 
attctggcag 
acacatccgt 
cagctcgctt 
aaggtaacct 
cgataaagaa 
ctcgccatca 
accaagacca 
attcctgtga 
agcgacacct 
gaagatgcgg 
cagaatctca 
atcactggta 
aacacctcct 
ggtaccacct 
gtgcacgtca 
tagggaattt 
ttttattgac 
tagcaatatt 
tcggcctatt 
ttgtatagat 
ctattaacaa 
taaat tcagg 
taaaattgta 
cagttattcc 
gagacaaccg 
ttctctccct 
tgcatttaga 
acagagtatc 
tggaggaagt 
ggtctccatt 
cttcaacaca 
taaaaacagt 
cttttccagt 
gtccaatcat 
ttctgcggta 
tcttctccaa 
gatcgatgct 
tcaagagagg 
aagatgaagg 
ttgcgtcaat 
aaatgaggcc 
caatattggt 
acttgtcaat 
aagatttcaa 
ctctagcctt 
tgtactgggt 
gcaattgctg 
tttcatcaat 
cgtagtcctc 
tattcttctt 
ggcgagtagc 
gccctcctcc 
cggttgctgg 



ggtgtcaaaa 
gagattgttt 
cgaatggtaa 
tatccacgct 
atctctagac 
cggttacgaa 
tttatttatt 
ccaagagagc 
ctcaaccagc 
tgaaggaata 
gccaattgtc 
ttttttttgt 
aaaaactcgt 
ctgctgtgat 
tcgaaaccca 
ctgctgaggt 
tgggcaaaat 
atcttgtcat 
cctttgaggg 
tgcagatggt 
ccttcacccc 
ttggcgcacc 
cctactctgg 
taatattaca 
ctacttaacc 
atccttgaac 
ggtcttttcc 
agaattggca 
ataagccaac 
agctgtatga 
gtacttttta 
atcaattact 
agtgacaata 
catttttcca 
ttgtagacct 
cctcatccta 
agtcgagggg 
taatacatca 
catctttctc 
ttcaaataca 
ttcaagcatg 
tacgacttga 
actttcccat 
tataga ttca 
catggttttt 
gctagtgtct 
gggttgttta 
acagctctgt 
taattttgcg 
atcagcctct 
ggaagaaaag 
gtgttttggc 
caaattgaca 
gtcggcattg 
cttcttgata 
agtaataggc 
gcggagcctc 
tagtctcttc 
gtttttagag 
ttctcctcct 
cgtaagtt tt 



taagtgcagg 
cttcgcccaa 
tggaatatcc 
ccagaatggt 
aaatgtttga 
aattccccaa 
ttattaatca 
aaaacttctt 
sggggtccag 
gaccctggct 
ctgttacgta 
cctcgccgac 
catggatctt 
tgctgtattt 
cacagacaat 
tggatcaggc 
caagatccgc 
cactcccgtg 
aacattcaag 
gccaaagat t 
cgtctctatt 
aattgcagct 
cactgagacc 
acaataagaa 
ttcttgctat 
gggaaacatg 
tggtcataca 
gacaaatctg 
ccaaaagtca 
ggatggttac 
catggtacaa 
atattgtcac 
tttacaattt 
acgatagcta 
tcaggcaggc 
cacaagctgc 
ttaggttgct 
atcacagaaa 
acacctactg 
aattcacagt 
taatataccc 
gaagcatat t 
ttcttacatt 
ca tgctccag 
atccaccct t 
aaaacgga ta 
gtacggttgt 
tttatggtct 
gcctg tctaa 
gccactgcaa 
ttcctcttta 
cccttgacat 
gtagagaatt 
gcgcggtgct 
atg tctgtcg 
ctcttcttct 
aaagtcacag 
atgggcctaa 
caacaagatt 
tctccttctc 
gtccgttggt 



tatcatatcg 
taataataat 
tatcgggggt 
cacctgtaca 
tataatagaa 
agactaaaaa 
tcggaatttg 
ccccaacaat 
gttctggatc 
tactgtaaca 
ctctgtggtt 
aaaacgacat 
tctttcactc 
attgtgattt 
atcgagacaa 
tacttcaaga 
aatggaaagt 
gagggccgag 
gtgtggaaca 
aacccatcaa 
gatgaggatg 
accgccggtg 
gagtaaataa 
aataaaacaa 
acaatgaatg 
caccaattac 
ttttagatac 
cagttctctt 
tggcagtttc 
taaagaacct 
ttaaaccaaa 
tttttatgtc 
ctaccaaaac 
tggctgaaat 
ggccagtagc 
catgatcttg 
gtacaacatt 
tgcctgcaaa 
cgcctttcat 
acatttcctt 
cgactttaac 
ctggttggac 
gggcagtgac 
gggaaagctt 
taccattatt 
ggtttttact 
tagaccacaa 
cgtagttgga 
tcaaatggaa 
agtaacacgg 
ctacatttgt 
gggaacggag 
ttgcacaatt 
tcctcaacac 
catacgttgg 
tatttctatt 
gaggcggtgc 
gagaataagg 
tcacagctat 
ctggcgtttt 
ccccaccctc 



243540 
243600 
243660 
243720 
243780 
243840 
243900 
243960 
244020 
244080 
244140 
244200 
244260 
244320 
244380 
244440 
244500 
244560 
244620 
244680 
244740 
244800 
244860 
244920 
244980 
245040 
245100 
245160 
245220 
245280 
245340 
245400 
245460 
245520 
245580 
245640 
245700 
245760 
245820 
245880 
245940 
246000 
246060 
246120 
246180 
246240 
246300 
246360 
246420 
246480 
246540 
246600 
246660 
246720 
246780 
246840 
246900 
246960 
247020 
247080 
247140 
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cattttaacg 
cactcgtgga 
actcttcttt 
ctgggtattt 
ggaaatgagt 
gcctgaatag 
ttcaacatat 
aacaaatcac 
gttacagcgt 
tcaccatacc 
gaacctcctt 
cttcttagac 
ataaatcgac 
aacatcatac 
ttagcagcga 
gcatttgata 
tcacacatag 
actttaaaga 
tct tggtgct 
tcgtcatcat 
gtgttcttga 
ttatcagcac 
tgtttggaca 
gcatatttct 
tttgtccagt 
ggggatgtag 
agaatgtcct 
tatatcagcg 
actgcagtcc 
ggtataacgt 
ttaacagctt 
tcttctacct 
gaatttttct 
ttctggcggt 
actaaccgct 
ttgccaatga 
gaaggactcg 
gatact tgca 
atagga tggc 
ttcaacaaga 
cagggccatc 
tctgtcctcc 
gagaagaacg 
aagccacaaa 
acaagaagga 
ctattatagg 
acgaactcga 
ttt tcaggga 
ataacaccaa 
ttaacaaact 
aaattagagg 
ctgtcaatga 
cagaaatatc 
ttaaaggcaa 
aagttatagg 
caacggacga 
gtttgaaatt 
actacaagac 
tggtctttgc 
aatacacaaa 
gctacgcctt 



acgtttttta 
tagttccttt 
tctgtatata 
accagtccga 
gagtaaaaac 
aatcctcgaa 
ttgcacatcc 
agttcctact 
cctggatttt 
ctaaaacagc 
ctctccaaat 
agtttgaaat 
caaacacacc 
aaaatttgac 
tgggtaaaat 
tgtcagtttc 
gtattttcca 
tgtaatcgcc 
tcttttccct 
catcatcatc 
cactgttacc 
tctcattctt 
cgtaagcctt 
tacccgtaac 
gattatatac 
tt ttactccc 
tcaataaagg 
aacaagtttt 
tcagaaacgt 
tcttgtacca 
ggggtaaatt 
ggcggcattg 
gttcaactac 
catcttcaac 
cggtgaatgc 
ccaacataaa 
ataacccgaa 
ctgacgtgct 
gtcctcgggg 
aaaacaagag 
tcaagctcca 
tccacctaat 
gctaaagacg 
cgagcgtgtt 
taattacttc 
cgcacacgtg 
tataaaaaat 
ccatgagaac 
atgcgtgggg 
ggtggtaaaa 
cgctaccaga 
aaacaaggac 
gcacgtcttt 
cactttttat 
gtggacagaa 
caatttattc 
taaaaacgtc 
tcaagagacg 
agaaatcgac 
aagcactgtc 
ttatgctcta 



tactctgctc 
atatatttaa 
ttaggtacaa 
atataacagt 
ttgctgagac 
gcttacaact 
aaacgttaca 
atttttgtca 
cttggcccac 
acctctggca 
tctggtagat 
aatgtccaat 
cgcaccagaa 
atcagaagga 
ctgtacaggg 
catggccttc 
ttcagggtgg 
agtgattgtt 
ttctgatgct 
atcttcactg 
aattttcttg 
cattatgttc 
gacgatcttc 
aggtttatga 
agtcgtgtac 
atatttcgaa 
gtccaacgta 
catgtctttg 
atcacttttc 
gtaatcgagt 
agatacagtt 
tcttttgttt 
cagaactcta 
aactggagcc 
gtcttttaaa 
tatcggcaca 
acctatatac 
ggactcgttg 
ggattcttta 
aaaaataaac 
gatccagtcc 
cctctccctc 
tcaaaaatac 
gatgcgttga 
caggttttaa 
aaacaggtgg 
aagtgctctt 
cttatactac 
gcagaaatat 
aaactcccca 
aatttactgg 
attcctcctt 
tccactctac 
caacgaaaac 
agtgaagcat 
gtgctacccc 
ctctataaaa 
ctaataaatc 
aagtgtttgg 
atacaatata 
aataaggcaa 



gggctctggg 
ccaatttttc 
acacacaata 
caaaaggctt 
atatactgtc 
acattaaatt 
gttttcatgg 
gccgcattcc 
agagctttaa 
gtaatggggc 
agattgtccg 
tctttactct 
gaaagggtct 
ggtttcccgc 
ttcacctttg 
attaatttgg 
atgtggggtg 
gtgactggag 
tcatcacaaa 
tccgacattt 
tacatttttt 
ttcagtgaaa 
ctccatgtcc 
gtcagcgagg 
aaggcctcga 
tccatatgta 
ctcttattta 
tatgacacac 
ccacttcttt 
atagattctg 
ctgtcctctc 
gtgtttttgt 
taaagatttt 
atgtctgtac 
gccattacgg 
gcatacattg 
tttgatcctt 
gacgtgtcgc 
caggaataga 
caactcaagc 
cagacccagt 
ctcctcctcc 
gtttaaacaa 
aagagaacca 
agtgctcgat 
ccaaaagtag 
tagtgtacaa 
aaattgccgt 
gtgttaaagg 
atgcaccatc 
agaataattt 
cagaacgagc 
acagactgga 
caacattcga 
caaaacaaac 
attctttcaa 
atagtaccgc 
cccagattga 
atgttctttt 
gaggaaagtt 
aacattctcg 



tcagaggggg 
aaacaggggt 
caggtcaggc 
attttccttg 
tggccgcccc 
tccccatacc 
atactacacc 
tttgcctctt 
tagatctctc 
agtaattaaa 
cttctttatt 
tgtgggaaaa 
cgttagaaga 
tcttacaaaa 
aagctctaac 
ggatattcct 
aaagaacaag 
tgttgattgg 
catccatgcc 
ggtgggcact 
caatagttaa 
gatccctctc 
ttataaagtg 
cttggtagag 
tggtttcctt 
gtagttcttg 
aaattttgtg 
aacatctcac 
ctttcttcgt 
cattggctgc 
cttcaaaaat 
ttccagagag 
ccactttttg 
cggagccaac 
tccatgccat 
aattgcctaa 
tatctaaaaa 
tagaggacta 
tgaccttttc 
accagaaaca 
tcctaaaaca 
tcctcctcct 
agctcttagt 
agcattaaat 
aacaccttct 
cgaaatcgaa 
cgaaaatgag 
ccagttattc 
caacgaaaaa 
ctcatcttca 
caacaaggga 
caacctggac 
cactaaaagg 
taataaattc 
cactaaatcg 
taatttggca 
acatcccggc 
ttcggcgaaa 
ggccataggg 
tagaaggtat 
cgcagtatcc 



ctatagaaat 
tattaaaact 
ctgcaaagga 
atctagagct 
ttccttatta 
gttctgggtt 
ctcgtctcct 
gacagattct 
ggacaattgt 
acagacaaca 
caacaataaa 
gggtcctcca 
tggttcatta 
atcccagaag 
ttctattaca 
attaacaaaa 
ctctagaagc 
gtatagagat 
ttcacagtca 
attattatta 
aacattagca 
cacgttgt tg 
tatgactgat 
ct tgaaggcg 
tgccagctca 
tttgttgtcg 
ttcgaattca 
aagacaga tg 
gtcaagaaga 
ga tggtttct 
cactttgtct 
agtaattgca 
gaagagagat 
tttgttcaca 
atctcaaagg 
tactataaag 
atataaaccg 
tagaaaaagt 
aagacagtga 
gaaccaaaac 
ccaaccaatt 
aagccctcaa 
ga ta t tgttg 
acagaatatg 
gtaccaacag 
ctggccgtga 
tcgttaaaat 
tctaggcacg 
aacaagtt tg 
actgtgctag 
gaaaa taaca 
acgaccaagg 
aagcttttct 
aggtggacag 
ctagacaagc 
gaccacttac 
aaacgaaatt 
gagtacaaga 
aagaatgaca 
ttaatattct 
cctctaccat 



247200 
247260 
247320 
247380 
247440 
247500 
247560 
247620 
247680 
247740 
247800 
247860 
247920 
247980 
248040 
248100 
248160 
248220 
248280 
248340 
248400 
248460 
248520 
248580 
248640 
248700 
248760 
248820 
248880 
248940 
249000 
249060 
249120 
249180 
249240 
249300 
249360 
249420 
249480 
249540 
249600 
249660 
249720 
249780 
249840 
249900 
249960 
250020 
250080 
250140 
250200 
250260 
250320 
250380 
250440 
250500 
250560 
250620 
250680 
250740 
250800 
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ttaatttctt 
gttttttgtc 
ccccatccgt 
agggggtggg 
tggtgtcttt 
gtgtagaagt 
ggtgtgattc 
cgaaaaaagt 
caaataaggt 
aagaagaaga 
tacaaaaagc 
ctcaatacaa 
ttccagccga 
ctattgacaa 
gcatggaaca 
cattctacga 
atagattgaa 
tgtctatact 
ttctggccgt 
ctaaaggcaa 
tagtgaattt 
acacctttca 
aaaacacatc 
gcacccgaac 
cctatacaag 
atcttgggca 
acgaacaaga 
aagaattggg 
tggctgctgc 
tggacgactc 
aggtctcggc 
ctaatgcacc 
gggggttctt 
tactatgtcc 
ccttcgaaag 
aagacgaaat 
ggaataatca 
aatatgatat 
cggttattgc 
gtttagaaga 
ttgcctactg 
ctgaggatta 
gaaactgcgg 
aaccgcgtgg 
agccgaccgg 
acgtccctta 
gcagcttttg 
tttgatgtct 
acatgcccgt 
cccatgtctc 
attgaactca 
atgtacaagg 
cacacatcta 
aaacacacat 
tccaccacta 

aaaacagacc 
ctggccgctt 
gatgatgatg 
caaaaaacaa 
atgaatgtac 



taaccttttc 
cacattgacg 
ctcagccaag 
tgtgagggat 
cttaggtttt 
gcgtgtatct 
agtcatcttt 
atcactcgaa 
gagggccgcc 
ggcgggcctg 
cctcggatta 
tttaggggat 
aggagaaacc 
atactacggc 
aaacacaccg 
agaagaaaag 
tagagacgtg 
cgataaaact 
gtggtgcaaa 
ctacgaatgg 
tggaatatta 
cagagacagt 
cctatctgat 
ccacgtagga 
aggtgataaa 
cacgaattca 
gaatggattg 
aagaggacgt 
agccattgca 
ttctaaagtg 
cgccagagaa 
agcagctggt 
ttctgaccca 
caatacagga 
tgaaacctct 
ggtatgccct 
agacgctgta 
ggttgaagac 
agcccgtatc 
ccctgaattg 
acgagatcta 
ataataa taa 
tgaaact tgt 
agcgcct tct 
ctctttatga 
ttgagggtat 
atgtatcaga 
cttcaatgac 
catgcactgt 
ttcttgcctt 
agaaattgta 
acagctccca 
aaaaggacaa 
caagattgca 
acaactctgc 
ttttatattt 
ttcaatcatt 
cttcttcctc 
atgaagacga 
caacatcaac 
ttaaaaaagg 



tccttcatgt 
ttcgtctatc 
cggctcatgg 
tttggttcac 
gctgaaatgg 
ccagctctcc 
atatatttca 
tcggcgcttc 
aagagatgcc 
acactctctt 
cctgtcccta 
tttgtaggcg 
gcaggatttc 
gaaggagcct 
tatgacacat 
gatgttccac 
gaggaatatt 
aaccaaaaag 
aggaacgtac 
cttggtgcta 
ggtgacttga 
gatagattac 
cgaaaacaat 
agagtgactg 
gacatgtttg 
gctaattttg 
tgggggtata 
ttagggggcc 
atttcttctg 
agaaaaatag 
aaggcgagag 
tea tccagaa 
gacaagcgcc 
tttcttcatg 
a tagtgagat 
tcaacagctc 
ctgacgttgg 
gaggaagcgc 
agtaaagtgt 
caaagtgtgg 
agattttgta 
caacaactgt 
acgatggtat 
aggatcgttc 
agaagataag 
acttttggaa 
aaagctggtg 
catatggatt 
cgagaaaatt 
ctttaacaat 
cctctcctta 
acaatcttcc 
gactcgttta 
gcctaggtgc 
a tcttacgcg 
taaatacaca 
agt taataac 
t ttagaagac 
aaaaactaag 
act tccacct 
agctgttaat 



attgtcatgg 
aacacatgtt 
atatcgattc 
cttcaaaaac 
ctatgggaac 
aacaaaggat 
cctttgtttt 
gcctcatcat 
gaatagaagc 
atgcccatct 
agataaaccc 
tggaacaact 
tcggcatgtt 
tttcagacgt 
cttcagcgtt 
agcaggaaga 
taatggcttc 
aacgtttcat 
tgaaaaagga 
aaatgtgcaa 
aaataaccaa 
cctcagttgc 
tggcccttgt 
caacatcatg 
ccgccctatc 
ttccatattt 
ctcgcagaac 
tgaataaggt 
ccttagatat 
cctccacctg 
aagctagtat 
acagtaacag 
agaagcttat 
ctgctgttcc 
tacgtttgag 
ccgaagctaa 
atgatgaaga 
gtgaaagatt 
gtgagcggaa 
atgaacaatt 
atatattatg 
tcaaaaatga 
acagagtacg 
ggaggaggcg 

^^gggsg^ta . 

agggctctaa 
tattgtagtt 
gatggcaata 
agtggaggtg 
ctggtagaga 
ctaacgggct 
ttcaacggct 
gaggctgaag 
gtctgttccg 
tacaaggcaa 
atctttgaag 
gagcctgttt 
gacgatgatt 
aagaaacaac 
atcagcaaaa 
ggaaaacgga 



tccgtttctc 
tttccccatg 
cgccctaatg 
aagtctccat 
aatgacggct 
atctaaatcc 
attccacaga 
ggggcagacg 
agcggaaatg 
attgggtctt 
tctcatgaca 
tctaaaggct 
tgataatcta 
ggttgaaaat 
gatgacacct 
aaattctaca 
tcctatgaag 
gtctgttggg 
ttggaacgaa 
ccatttactt 
taaacttgac 
agatcagaaa 
tcactcgtgc 
ggctgtcgat 
ttcatcgctg 
tagtagaaat 
ctctgaaaaa 

aggggtggct 

gggggaagta 
cttaaatgtt 
taaacgtctt 
gtttctcctc 
aaagggtgaa 
tgattttgtt 
actgattaaa 
taagaagagg 
taacatcgtt 
acgccaccag 
aaatccaaag 
gatacgggaa 
tgtcttgcag 
gagacgatac 
attgttgttg 
tggacgccac 
aatgcatacc 
ctaaacccga 
gtaataacac 
atagtaaaaa 
ccgaatctat 
aagaagcctt 
cggcagccgg 
cttggacgtc 
ttttagtcag 
atctgttata 
gaaatttgtg 
agaatggacc 
ctgagacaaa 
gttgtgatga 
ccaagaaaca 
ccaatcacga 
aaatgatgga 



cattccgcca 
ggcacagccg 
aaaggaggaa 
acaagaacat 
ctcttatctg 
ctagaaagat 
ttcagtggtg 
cacgcccaca 
gaaggtgtgg 
ccttactcta 
gcatcttctt 
aagagagagt 
gtgaaagatt 
gtaaaacaag 
atccctaaag 
caacaaagat 
atggtgtttg 
gatattgccc 
tacgctatcg 
ttagctgatt 
acaaataccg 
aaatttataa 
gttaacgtga 
gcgcttcgta 
gatatgtacc 
tacctatgta 
ttggccaaag 
aaaacagaac 
gaagctgtaa 
aatgcagcca 
cttctggcca 
aaagatttgt 
gcagtttctg 
attgagtatt 
cctgaaaaac 
aaattagtaa 
aaatacaaca 
gacaaacaat 
aaaaaacgtc 
ctggctgcca 
acctaatatc 
ttttaaccaa 
cccattggtt 
gtctgtgcga 
ctttaggata 
tttagctgct 
tcaaggcaat 
gtatgaagtt 
tcacaagaaa 
cgccgaaaga 
aggaggaggt 
gttgttgttc 
taacaaga ta 
tgccctatgc 
tgttattgaa 
tttcgactcc 
ttcatcggca 
tgatgacgat 
aaccaagaaa 
caacatgttg 
ttctttgtca 



250860 
250920 
250980 
251040 
251100 
251160 
251220 
251280 
251340 
251400 
251460 
251520 
251580 
251640 
251700 
251760 
251820 
251880 
251940 
252000 
252060 
252120 
252180 
252240 
252300 
252360 
252420 
252480 
252540 
252600 
252660 
252720 
252780 
252840 
252900 
252960 
253020 
253080 
253140 
253200 
253260 
253320 
253380 
253440 
253500 
253560 
253620 
253680 
253740 
253800 
253860 
253920 
253980 
254040 
254100 
254160 
254220 
254280 
254340 
254400 
254460 
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ggaaaaaagg 
tcatccgacg 
actaacaata 
attttttcac 
gttgttatta 
ctgcctcacg 
tttcgcattt 
caatctgact 
tcatacataa 
gattttgtcg 
agagaacggc 
tcaccgaccc 
gacaatcaga 
ctaaacgcat 
cacgggcaat 
gaatattttt 
cagctttgcg 
acatcatcaa 
agcaagtgga 
tagctgctca 
gcgttaatgt 
ccactggttc 
acggtgtaat 
ccttgattaa 
tcctccctcc 
gaagtagtac 
accaccaaat 

^ggtgtcttt 

cttctattct 
cagcagcagt 
acaagtacag 
agaaatggag 
tagtgaaact 
agatggctgc 
actatgttcc 
tggctacact 
ccctagaaca 
gaaattcccc 
tcttgaaggc 
aatggacaga 
cacccatttc 
tgacttatac 
agccagaaaa 
gaaaaaatat 
ttcccataat 
tttcaacacc 
ttctggcagt 
attttaggga 
cgtcatcttc 
ctaatctctc 
tgtcactagc 
ggataattat 
tggtggtggt 
aggagcaccg 
tgttttctgg 
cgttctgggg 
aactcgttct 
atgtgcgagt 
gtgggacgga 
tgtgtggaca 
ttcttccttt 



gccaacactc 
ttgttgcagg 
gggatagaaa 
aacacaggat 
atggaaattg 
aaaatattct 
tgactggcgt 
ctgtggtgtt 
gagaaaactc 
tgaaatcaca 
aattatgctc 

gggggtcatt 

tgtgggaata 
aggtgtggaa 
ggaaaaggaa 
agacaccggt 
cttaattaga 
acaat tgccc 
taaaacaaga 
agtaatgcac 
tgtcatgtca 
gggatggaaa 
agatagagta 
aaggggcatg 
agcttcttct 
tactacgggc 
caagctatcg 
tacggtagaa 
tcagtggtta 
agtagaagaa 
tgatactgat 
agcgtgcaaa 
ccacaaatat 
ctttgttgcc 
tcctcaaacc 
ctttactgac 
gctagaaaat 
tgatggtaat 
gaagcagtac 
taaggccccg 
aaagagacct 
acctaggtgc 
aataacagaa 
taccaaccaa 
tgtcgcctct 
ttctgccacg 
tgaagaggtc 
caataagcgc 
tgaaggtggg 
taaaaagcgc 
catgtaacgt 
cataatagaa 
ggtgtatcat 
gcagtcccac 
gtcgaaggaa 
aagaagattc 
attcaatggc 
catgaacatc 
tcaagaaaaa 
tctttatgga 
acttctacgc 



taaaaaattg 
agaaaatgag 
agactatgtg 
gaacaataac 
tttaaacttg 
tttttgccca 
ttcttgcttt 
gagtaataat 
attgaagcga 
ggacactaat 
agcttcaacc 
ttacccgttt 
atgacactgg 
gaaagactag 
ctaaataata 
tcggcaatct 
tcccccgctt 
accataacca 
gagaagatag 
ctcacagacg 
gagaccaatg 
attatgccta 
gaaacattag 
gatttagtgg 
acttccaata 
attggatcct 
gaatatagga 
ccttcaatag 
cctcctctag 
gaaaacgggg 
ttttggtcta 
ataaacgatc 
gattggacaa 
ctcatgacct 
ccttctcagg 
gtgtacggtt 
gaaaccggga 
tctgtcaggg 
tttacagaaa 
tcatctgtaa 
agaaaagata 
atataccaca 
cacgtatctt 
taccaaacaa 
atgtcatcaa 
accaagtttg 
agaaact tta 
aacacaactc 
cgtt tttca t 
gcactggaag 
gtctcgatafc 
aattaggtaa 
tgttattggg 
ctttttcatc 
atcgtgtttt 
cttattatgc 
cagaaacttc 
aacaccgtac 
ctaccacca t 
gggataacga 
agaaagaagt 



aaaacctccg 
gaagagaaca 
cttccatgcc 
aagttggcag 
tttgttactc 
cctttagtcc 
tttgatagaa 
gctgcccatt 
agtgtgagga 
ataggcatac 
ctgagtatgc 
ccaaggatgt 
accctcacga 
ctcaatacaa 
gtagaaatat 
tcgaagacaa 
tgaattctgc 
aatctctaag 
tggtggattc 
gagagatgac 
cttcgtccat 
acaatttcaa 
tggagtttgc 
atatgcaaag 
atactcctag 
tgtccattct 
ctggattatc 
acggcgttca 
ttaaaaggcc 
acaataaacc 
atgtccccgt 
gggcaatgat 
ataaaactac 
ttagaaaatt 
gaagtgaata 
ttgaatcgaa 
ttgaaagcat 
tcgtcagact 
tggccatacc 
aggagtacaa 
acaacgacgg 
ctgaacgttg 
tcaacaagga 
actacaaaag 
caaaaacaat 
ttcaggatcc 
aactcaaatc 
tttgttcaca 
gtccatcaga 
aaattgcccc 
atgacaattt 
aatgggagta 
tgttgttaca 
gtcttcctat 
aagtggaacc 
taattccata 
ccct ttgggc 
tcatgaaaca 
aattctacct 
ccagaagtgt 
ccaaggggaa 



ctgctgctgg 
acccctcpttc 
ctcaaataga 
aaagtgtagt 
aacacagaaa 
agcatgtagg 
ttgaaattgt 
cggctattct 
ctgcttcggt 
ctttaagcaa 
tagctggttt 
tgtactcttt 
tttggatata 
tactctacct 
gaaagagtca 
catgttcaac 
tgtattttca 
gagaagtcaa 
tttcagcata 
gtacgtcccc 
ctacttgatc 
taaaacactt 
gtgcaagtgc 
aactataagg 
agtagcgata 
tgcagaagat 
cattactgaa 
agcagagcat 
agaagtggta 
ttctgataaa 
cacacctcta 
tagtagttgg 
aaaggttgat 
ccaagacata 
cgcagtgacc 
tggaaataag 
atacgtccta 
ggaaaaggaa 
tcctattaat 
gtatttttgt 
cggtgtggag 
tttagtccat 
tttgaacata 
catattcgaa 
gactgtaaac 
gccaaaaaca 
tgttcttgtt 
aataactgaa 
gtcacttatt 
agagactgag 
aaacaacctc 
cagaagaata 
cttctaggaa 
tcctttactc 
aagaaggaca 
ttcagacatg 
ttgaacctta 
acagaacctg 
aaaaaatggt 
gggtgcgggc 
tggcttgctg 



tggtggtgct 
agtgagtcct 
agaagtcact 
caaacattct 
aaagtatatc 
atttaacaaa 
tttttccgac 
aaggctattg 
gaaaggaatc 
caaggaaata 
gggaaaatga 
ggatcacgag 
aaaatcacat 
atggatttta 
atattcacgg 
ggaggaggtt 
agcaagaact 
gctagagata 
cttagtgcta 
gatgggcact 
ataaacgacc 
gaaatgagag 
gtcgcatcat 
tctatggatt 
atgacatctg 
ggatcaacac 
aataatagag 
cctctatccc 
gcagcagcag 
gataacgaag 
attacaccca 
aaaaataatc 
tattttgata 
ctagcggata 
atgtctaacg 
ccattgtttg 
aatatcatag 
atgagtttcc 
gaaaaatgca 
gatctaacag 
cattctgcgt 
ctttactctg 
ttagaaattg 
attgtggacg 
aactacataa 
gggaaacaac 
cctcctcctt 
caaaattgcc 
ctcaagtact 
actagcattt 
ttaataaaga 
tcctggtcgg 
cagtaacaga 
ctgaatcgag 
ccttaatcaa 
actgttctga 
ttttctgttc 
acgatttatt 
ggtctgatgt 
aggcatttgt 
ctcacactaa 



254520 
254580 
254640 
254700 
254760 
254820 
254880 
254940 
255000 
255060 
255120 
255180 
255240 
255300 
255360 
255420 
255480 
255540 
255600 
255660 
255720 
255780 
255840 
255900 
255960 
256020 
256080 
256140 
256200 
256260 
256320 
256380 
256440 
256500 
256560 
256620 
256680 
256740 
256800 
256860 
256920 
256980 
257040 
257100 
257160 
257220 
257280 
257340 
257400 
257460 
257520 
257580 
257640 
257700 
257760 
257820 
257880 
257940 
258000 
258060 
258120 
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cggaaagaca tctgagggtg atactaattc 
tacactcaag cctatcatca ctgacgtcac 
aggcacaccc atgaacccta aggatatgac 
aggaagtact cctcagtgtc ttgtatcgaa 
gatagctgtt tggggtgttg cagactctaa 
gcgggaatat gggagcgaag gaagaagaag 
gagagtggaa gaagaagaag aagtagaagt 
acttatcgga cctcgtagaa gacctttgac 
tactaatcct attgttagag aggttgtgga 
agaaatcttt ggcagtaact caaaacttcc 
gagacttggt atcatgagca gagtttcttc 
gtcatcaaaa gccttatttt gtttggcagt 
cagacttagt gtatggaaga actggaagcc 
tgtaggttat tcctctgatg ttttctggca 
gtcaagggac ctgatagaaa caaaagtgac 
aaaacaaact gtagttgcta tgtttatttc 
ttctatattt tcaattgtaa tatattacaa 
tttattcatt ccttattcct actactagtg 
cttcccttgt tccttaaaca atatgtttga 
atccattact tcattacctt ctcctgatat 
agtaaacatg ctctttctga tatccaatgt 
ttcagtcatg gcaatgatta tagcaccata 
ggcaaagttg ggagagaata tctttgtggg 
cagtgctcct ccagttctct ttacatgcaa 
ttgtccacag aacaggacat cttcagacat 
tgtctttctt ggtgccaaaa gactaatcgt 
aatacttttg tccttaatgg ggaattcact 
gaacattaac atacaatata cattagcttc 
atctacatgg atattttctc ccttctcttg 
atgtttcgat agaccacaag tttccgtagg 
gtgctgcata ctcttacata caacatccat 
gtcactaact tttttcaaaa taccttcaat 
aggaagacgc aatttgacat ctctttctcc 
ctcgtaaccc agatttggta catgagtaga 
ctgattcagg cacggattgg gagatggcaa 
tttttcatac actaaaaata tataatggac 
atggggtgca atatcaaaca acaacatttc 
aaaaactggg tgtattttga tatagtcttt 
acacaacttt gaagagggct gttttacaat 
acgaacgata gcagtgtata gctcaatcat 
agccacatct ccattattaa caccaaagaa 
tagcccatgt tgctgaataa ccttagcagt 
agattttccc gatctactta ggatgaattc 
gtcataggca gaatacattg ttactcctcc 
ggaagattct gccacctctc tagagacact 
attaaaatca gcaatgcctc caataggcat 
tttaatgtta cacaagttac cccgttcaac 
aggattacta ttgcgggtta cagcaacata 
aacaatagtg tctctaaaga aagtgtgccc 
agattcaaat ggaagatatt gaacgtatac 
tggattaccg ataatttgtc gtccttcttt 
cacgtacaca aatactgtca attcaccatt 
gtccttagtc acgaaccttt cctgagtgcc 
cacattttgg ccctgataaa gagtaataga 
caaaagaatc gtagttaatg ccctaggaga 
aaacttgtca aaaagtggtg tactgcgagt 
cttgtgagag atggttttct cgtggctaat 
ttttacttcc actcttaatg cgggcacaga 
agtagaagag gaggatgagg aattgttatt 
atccactaaa ttcgagactg cattctttaa 
catcatcttg tcagttttgg aagaaattgc 



agcctacctt ttcataagcc ttcaacgaac 258180 
agaagataat atgatgatgg gaagaatgtc 258240 
atattttgtc aacgattttt cagacgatat 258300 
ttcggacatc ctgaacaaga gggaagaatg 258360 
agacctccta actaaacatc aactggggga 258420 
aaatcccggt gttgaggagg aagaagaaga 258480 
tgcgctacct tacattaaga aaagtggaaa 25854 0 
aacaacaaca accactacta ctactactac 258600 
agattttgat tacgagtctt ttaatgaacc 258660 
cttcattaga tttcttgatc aaaaaaattg 258720 
ttccatcgcc aactttaaaa ttgaacaaga 258780 
ctgggttggg gatgaacata cccctaaatt 258840 
ttttacttct gcacctatta ttgtccagaa 258900 
tgaaactctt agaagcaaaa ttgttgatcg 258960 
aaagaaaatt ggggaagatt gggctaataa 259020 
aggtattgtt tgtataacag taacagttat 259080 
aataaaaatg cctaaattct aatacaatgg 259140 
tatagcacat gaaacaaaga aagagacacg 259200 
tacatttttc acaataaatt tgtcaacttt 259260 
cttttccaaa tccaccacag ttccaacacc 259320 
tgaatacttg gcaaattcat gcccttgcat 259380 
cattttacat tttttgaata gaggatatgt 259440 
aagaccttct tcttcctcaa aagtactttt 259500 
agaaaattgt acccttttca ttggatgaac 259560 
tgaaaagtgg agattgttag taggtgaaat 259620 
atcagatggt gtttctgggt tactttcaag 259680 
aacaccagaa gctgctgcta tttttgacat 259740 
gtctgacata cacagccatc ccttttctgc 259800 
agctgtgcat gtggacgtta caatagaccc 259860 
acgacaaaat ttaccagtcc tacgcaagtt 259920 
gtatttatta accttatttt gattagatgt 259980 
gttatttgaa aactcatccg cgcttgttat 260040 
cccgtccttg caatgggcaa aatctttaga 260100 
attggtggta attgattttg ggcgtacata 260160 
aaaggcgaat gaagaattca aaacgttcaa 260220 
ataaaattgg aacacaattg tagccttgtt 260280 
aatagtggac tcctttggtg ccctattcac 260340 
cttgttcttt tgtttcgtct catattcttc 260400 
accactggct atagagtagt gtgtgattga 260460 
tttcttgtta cataatttct cataatgttt 260520 
aaagtctcta atgtttttaa gttctgtttc 260580 
gttggccatc caagacgcat taaacgataa 260640 
ttcagcggca gctaccacat ctttagaagg 260700 
atctccatta tagtcatcat ccattgcagc 260760 
aactgtatcg tggttaataa agtcgctccc 260820 
agtataagcc ttgtctttac tcatggcgca 260880 
agaatgattt tgaataatga acaagttttg 260940 
cgcatcctgg gtagatgcac cagaaaggtc 261000 
ttggcaagaa tatatcgtca tggcttgact 261060 
attgttccca aatcctccat tcttgatatt 261120 
aatttctagg catttttgtc ctagccgttc 261180 
cttgtacatt aaattagtaa ccactccagt 261240 
atggatcatt ctattagatg tagtgaaaat 261300 
tttagcccta gttaccttct ttttctcatc 261360 
ttcggctgcc ctcaaaattt tcctgtctgt 261420 
tttaatggcg ttaattaatt gttgccggta 261480 
gtcagacaga tccatattca aaactacaaa 261540 
attttcaact gccgcagccg ctgcaagtga 261600 
gttcttttta gtttctttga .tgatagaagt 261660 
aacgtgtttt ttagtgagtg tgattgcacc 261720 
atccttttct tgtttggctt tagagttggc 261780 
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catgaggcgt tcacacatat ccttaggctg cccttgttct tttgttataa actcttcaat 261840 
cttcttcctc ctctcatttc cctttgaaat actagtcacc atttgcatgg ccaacactat 261900 
agatttttca gagtatttaa gtgagttatt gtgcctattg ttaatgttgt tattttcagt 261960 
gtgagcagtt tgaagcactc tggaaatctt taaactaccc ttaacaccct ccatcatttg 262020 
accattattg aaaacatctt cctttctgta cataagaaat tcgttaacca ttgaagggaa 262080 
gatgttatca aaatattgca ctgggtcaaa gttgtccatg aaaacactcc catttctgga 262140 
tgatctaccg gccattcgcg tatctaccaa gtcttgaagg ttggcagtca tatcaaacaa 262200 
tgaaataccg tacattaacg aataagcgtg cttaataatg tctattctct ttaacgacga 262260 
catgatatat gttctcgaaa gggcagaaag ataggaagaa tatgcaggta ataacatact 262320 
cttccctaca tctcctcctg ttgatatggg catagataat accctcaacc aatattcgtc 262380 
atggagaggc ttatctgccc ctgctcctcc tcctcctcct actcctactc ctgtgcttgt 262440 
ggcccattct gagggcgagt ggaacaagcg tccagacagt gcagatttat caagagaagg 262500 
gagttttcca ttatcaaatt taacctgttg cattctcact agtgtataca aataccacac 262560 
acgggctcgt accaaatctg aaggcttatt ttcagtgagc cattttttcc acacaacatg 262620 
aaagtcaact aaatgaccta cacgatctag agtagatgcg tcacggtgca atttatctgg 262680 
tatatcagtt ctatatatca tttctccact ttcttcatca tcgtcatcac ctcccccacc 262740 
actacttttc ataccagaac ctccacttcc tccgtcttcc aaatccattg cttcttcttc 262800 
atcgaaataa caatagtcct cgtcgtcctc ataataagct gctgctgcag ctgtacgtga 262860 
tgatgcacta ttattattac cagtttcttt cttcaagaga atagatctta tattttcaga 262920 
tttcgctagt agttctttgt atgttaaagt ttcattctct aaaaatgaag caactgtacg 262980 
caaattatct ttaaagtaat tagcgatagg agtttcatcc acaaagtctt ttccatctct 263040 
aaagaaggcg ttcctcacac acgctgccac agcagacgca cggttatgtg cagaattgaa 263100 
tggtactgca tctatacact tgggatcaga accgtgcaca tacttgacgt tcaatatctc 263160 
ttcatccatt aaggaggtca gatgtgtatc cttaaatctc tctgataggc tagaaacaat 263220 
atacaat ttt tctcggccac tgtaatcaat ttctctcatg gctttggcca tcatgtcgaa 2 63280 
cacaatactg catacatttt cagtgtactt ttctagattt gcattgctca ttttcttgag 263340 
ttcagatatg tgagctagct ctgaacatat tccttcttct attagagcgg aaagggggag 2634 00 
agatttcatc atatccaaag aagacttctt gtctgataca gaagttgttc cacttcttat 2634 60 
cttttcgttt gcatcagaaa tgctcttgca gcgtaaagag aatccattca aagctcccga 263520 
tgctctgtcc ttttcaagtg catttctttc catcagtttc aaaaatgctt cattgttaaa 263580 
gaaatcgtca tcttcttcgt catcaaattc gtcacaattt tcttctgccg tggattgggc 2 6364 0 
cggggaagaa cttggtgccg tcacttcctt tttattagaa gcattaccac agtcttcaac 263700 
tttaatagca cttgagacta aattgttgtt aactgacttg acgttattac tcttcattaa 2 63760 
aattgcttca tggcgtttat tgacaatatt ccgcaggtta cttgggttgc caaaaaagtc 263820 
tctagagcaa aacattttca gtatctttat aaaaacttca gtgtatagtt ttgcattatc 263880 
ttcttggcca aaattatctt gttcgactga cccttggaag aaggtactct tttcttcctc 263940 
tttcttcgtc tcagcttcat cgtcactttc ttcttcttct tcctcctcct ccccctcttc 264000 
ttcttcgttt tcgtctccct cttcttcttc accacctttc tctttcttct ctttcttctt 264060 
cttcttgttc ttgttgttct tgttgttctt ctgaggtttt tcttcctcac tcgggttctt 2 64 120 
tatt ttct tc tggatatggt tagtgttatc ttccactact gcagaatttt gcctgttatc 264180 
tccaagtagt ataacaaaca ctcctccgaa aggtatatct ggcctgtgtt tcgctatacg 264240 
taaaacagca tcaataaaca ccaattccct gcaactggcc attgtatact catcgataac 264300 
aaggaacctt aaatttgcca acacttgtat cactttccat tttagtttat cggccacaaa 264360 
ttggattttt tctagagtta tttcatcttt cctaaatata actggaatct caaatccttt 264420 
gttgagggta gaaaacacac tttctgggta catcaaggaa gaacctcctg gtttcttgca 264480 
tgcacacatg agactctcct ttgcagaatt agatttgcaa cacaggagca tgtctgcaga 264540 
tccagaatca atcaagccgt ggtcgttaag aactgcaatt agttccttga ccattgtagt 264600 
tttcccagct ccaggaagac cacccagatg gaacacagat attcctcctc cagcataact 264660 
gacagaaatc gagttgcaca gatacctcac tgccagagac tgttgcaagg aaacagtgct 264720 
caataccttt ctggcctcct tgcacgtatc tgggagcttg accgcagaag cagaggagga 264780 
ggaggaagaa gaagacccag aaggggatgc cgctgcagct gcaactgcac ttctgagcac 264840 
ctgggttctg gtagccagaa aggggtatga tttgatgttg ataactggaa ctggatttga 264900 
attattgttg ttgttcttct tttgtttttt ggtaactggc tcttggccgc tggagcacgg 264960 
acgtttactc cccatgatgc actctcggtg aatgtcggac acgggggtaa aaccaatcct 265020 
tttatactag ttctacctcc acgaggttgc cagatggtac gatactttat taccaaaaat 265080 
ggatcgtcct cttttgctta agattttgtg tgtaaccctt atcgttacga ttgttacaat 265140 
atgtcttatg gcatttttta tacatagcaa ttcctcatct tcttcaggcg ccactgccgc 265200 
cgctaccgtt gcagcggcag gagggatatc ttcatctgct ttatttggta cctacatagg 265260 
gtccttggtg cagtcagaat ttttttcaga agatatcgat ttaccagaaa aacaaaagaa 265320 
aaaggctgat aaggaaaagg catcaaaaac cgataacaat aacaacaaca acaagcgtca 265380 
acgccgccag acaaggaagc agagcccgcg cgccagaaac aggactttaa gaaggaagag 265440 
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gaaattatcc 
tctcagagga 
tggaaaaagg 
atcatgttga 
aagaagaata 
tctggaggag 
tctagggtac 
tatgcgacat 
gacgtggagg 
aacggtgatg 
ccatattata 
tctactgaaa 
gatgaaaggt 
gaaaataggt 
tcagtcatgc 
gttaaaatta 
taaaatacat 
tatcctcttg 
gttagttttc 
aaaggctttc 
gccggccagt 
agcaataaca 
tcctaaagac 
gttattatcc 
aaacatataa 
tgttgattct 
gcatatggat 
ggggttttgc 
ttgaatacaa 
aaaaattcaa 
cgttaatgac 
ttaaaacaat 
cgatatttac 
gagctatcca 
tcctccactc 
acctaaagat 
agctattatc 
tacatggtat 
aaaatatgta 
taatgaaaat 
tacagcagat 
gtggaagatc 
ttttttccca 
tacaatatct 
agaaggagat 
aattgccact 
tatatcaatt 
cttgttcggt 
ctcagttacc 
atttattagt 
tccgtagggt 
atctgacgac 
tagttctgca 
atttgaaaca 
cat tatcctt 
aacagaccac 
aacttccata 
tcatcgagcc 
cagtat tcct 
tccactacct 
atcgcacact 



aaaagaaaaa 
ggaagaggat 
acatttggat 
ttatattagc 
actggttctc 
taggaggatt 
aatattccac 
tagtaaggaa 
atagccagt t 
aaaataaaga 
atggagggtt 
agccgtatat 
tcttctgcct 
actttttacg 
ccactattga 
agcctgtaac 
tttaaagtat 
aatatggaac 
ttggccccac 
ttgaatgcac 
gctgtgcccg 
atgacaatta 
gccaccaatg 
ccttccatgg 
attttggctc 
gaattcaggg 
agtacaaatt 
tgggaacaat 
gtagttatgg 
gtcaactttc 
gtagataatt 
tttataatga 
tgatggtcct 
ttttttttca 
cctcctcctc 
cttcttccac 
agaaaaatca 
tggaaaattt 
accatgcagt 
tttgcattac 
gaacctcttt 
tttacaaagc 
ttatctactt 
tgggccttag 
aaattgtagc 
gtaaaatcac 
tcaacgctag 
acaaaaaaag 
caggcactaa 
gctctaattt 
ttaatatgtc 
ctcgtgtcca 
agtctcgcct 
aaaggtaa ta 
atcacacaca 
taccatcatt 
catcattctt 
tgtaccccaa 
ggagagaaat 
ttacttcccc 
cgccaa tggc 



tacacgtcaa 
taaagcgagc 
tgtttttcct 
cccattaata 
tggggccatt 
atgggataat 
atcaacgaaa 
cggaatgtcc 
ctatttacat 
ccaccttgaa 
tttatcatac 
taacacggag 
cgagaaggaa 
ccacatggca 
aattagtgat 
tggtacttct 
tatgtttatt 
gacgggtggg 
catctggtgt 
cgcccactcc 
ccaggagaga 
ttgttattaa 
gcgaagccat 
ttgttgcaca 
aaggcacctt 
ttgtgatttt 
gtcttgaatt 
gtcctatatg 
aaatggagat 
tcgtcatcca 
gttgcattca 
tttttaaagg 
gctactacct 
tccgtaaaag 
ctcctcctcc 
cattattcga 
ccaggatcaa 
tccacccttt 
acaataaaag 
gcgtattctt 
tttcaggctt 
aagatttatc 
caaaattctc 
ccagtgttct 
tggtttcata 
tcatgtgaag 
ctggcgtttt 
ttaatgccga 
tgttgtcttc 
tccctctgtt 
ctcca tacaa 
aaacatttgt 
ccattattat 
acaaaataaa 
tattttttga 
cattacaaca 
ctttacatga 
ccatgggttt 
gaaggggtct 
gagtccaatt 
agccaacaag 



gagaaagaag 
atctagaggc 
ccccttttag 
tttgttgggg 
agtaatgaca 
gtttctgatt 
aatttagtca 
acaaatcaaa 
tttttccacg 
agggatgaaa 
aatatcaaca 
ataacttcca 
tacgtcgaag 
agtaattatg 
ctgactggca 
ataatttatg 
gttttatttc 
gggttttttc 
ttttggtcct 
tggtattgat 
tgctaagaga 
aagtaaacca 
cttggctatc 
taataatagt 
caaggctgtg 
tggtcgcacg 
cctcatcttc 
ttcccttttt 
gcgaaccact 
ctcttctcat 
aagcactgat 
gataactata 
ttaaagtcat 
tatcgtcaac 
cccagtcttc 
aaatacaaaa 
aacaggggca 
aatacataaa 
gtgatcaact 
tgtaactata 
aattgcgata 
cgtttgatta 
aattgttaga 
gattagcccg 
cctcccttgg 
attttcaatc 
tagtacgtcc 
tttcggatca 
agacattggt 
gccctgcatt 
atctacccca 
caccctagag 
tatttttaat 
atacataaga 
cggacaagta 
acagaaaatg 
atcaagaaag 
acaaatagac 
ttcccgtcca 
agattgactg 
gaatcgggaa 



actctaatta 
aaaggactgc 
aaaggatgtt 
tgaatttctt 
ctactattaa 
cagagtcagg 
atatagccat 
ggccatatga 
ttagaaattt 
gttttgtgtt 
acccaaatcc 
tcgtcagcac 
atggtgaaga 
ttgtaaaggc 
catacaataa 
aaaaattaca 
ttctaaaatt 
ttacccttct 
cctttcccct 
cccagcgcag 
ggagccttga 
aaaagtatat 
ttattgttat 
gtgtattata 
aaaatattac 
tttactgacg 
tttgtcctct 
tatatagtaa 
aaaagcttgt 
ttttagggta 
gtgcttttca 
aagagaccct 
gattgtgaac 
cctccgcttc 
tgccttttat 
aaacctactg 
ttcgtattag 
aatttggtta 
ggtttttcat 
tcggaatata 
tccttttcag 
ttgaacaaaa 
gaaaaatccc 
tgatttattt 
tcatcacatt 
aattgaggaa 
catgcaaacg 
gggaaaacat 
aaagttgctg 
ttattattga 
gccatggaag 
gaaatttccc 
gtgtacaata 
aagattcaaa 
cattaacatt 
gggaagtttt 
tattccaatc 
tattaaattt 
ggtcatttgg 
tcgtcttttc 
gatcactcct 



atcggcgcag 
ttagagggat 
tcttttaatt 
cgtatacata 
ctccacaaat 
gacgtttcca 
ggatggagat 
aacctacaaa 
taaaccttta 
gatcgaatca 
cat ttacaat 
cactggtaca 
aggagttaca 
tagtttcaaa 
taagaatagt 
aggaacaaaa 
ttattttagg 
ttggtttttg 
ttccttttcc 
aagcgatcgt 
ttattattac 
tttttcctcc 
tattgttatt 
acattaaaag 
atgcaga taa 
tccaacatag 
gtaccatcct 
ttattcgggt 
atatttatca 
atatctctcc 
tgatattcgt 
tcttctaata 
cctctcaaaa 
cgcttacccc 
tatcgcttcc 
caacaagagc 
acattcttgc 
caaatagtta 
ttctatatcg 
cttcatccca 
attccatttc 
aatcgagccg 
tcttccagac 
ttcctccca t 
ttactctaaa 
ttctcttccc 
cattgaggct 
ccgcagccac 
ctgctttttc 
atacagtctc 
cagt tgcagt 
catctt taaa 
aaaacaatta 
aagtt ttat t 
a tcaacattc 
gttgcacaag 
atcccattgc 
atcccatgaa 
tccaaca t t t 
ttttgtctcg 
cattttcttt 



265500 
265560 
265620 
265680 
265740 
265800 
265860 
265920 
265980 
266040 
266100 
266160 
266220 
266280 
266340 
266400 
266460 
266520 
266580 
266640 
266700 
266760 
266820 
266880 
266940 
267000 
267060 
267120 
267180 
267240 
267300 
267360 
267420 
267480 
267540 
267600 
267660 
267720 
267780 
267840 
267900 
267960 
268020 
268080 
268140 
268200 
268260 
268320 
268380 
268440 
268500 
268560 
268620 
268680 
268740 
268800 
268860 
268920 
268980, 
269040 
269100 
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ggcggtgaag 
agtttatttc 
aactcttcac 
tccaccatag 
cctctcccca 
ggcttgtccg 
catacagcaa 
ttcccataca 
tctgttgggg 
tcaattttct 
tctctggtca 
ttgcatctca 
catcttttgc 
gtctcatatt 
aagtgttcta 
aagcacctca 
tcaagaatag 
taatactttg 
acttcacagg 
ccttctggca 
aagtcgataa 
tcttctcgaa 
ctgttggagg 
catgatacca 
tccagtgata 
attttgccag 
tgttcaggtg 
atgtgctttc 
tcagaaagga 
taaatagccc 
atcatcattt 
cgagacaact 
gtacaaccct 
aaacacgcgt 
tgggcccttt 
tgttgttgtt 
tatgcacaat 
actcttccaa 
gcgtcgccat 
ttcaattctt 
ttcttgaaga 
cacgatcttc 
ttgatcacat 
aggatgcgta 
ggtttttgtc 
gagttggcac 
acaaaaatga 
ggaatcatgt 
ccgcctccgc 
gaacagtctg 
tagtaaacta 
gaacaactat 
gagatactta 
ttagtccatc 
ttaaaaaaag 
cttgaacgga 
taggaaacga 
ttcgtgtgac 
ttttcaaaaa 
taagacagga 
agcggtccac 



aaggagaagt 
cttcggcaat 
cctctcccag 
ccaaagaagg 
ttgcgacgag 
gaccaagagt 
ctggaatatc 
aaatccccac 
gaagattgag 
tggcattcaa 
atttattgtc 
agcacaattt 
atatatattt 
tctccccaat 
gtttattggt 
aaaatactac 
agggcaaaat 
tgaatgccat 
aggcagccca 
gactcagacg 
acatcatttc 
gcacttcctt 
tttttgacgt 
atttcttgat 
ataatcctcc 
cttcttccaa 
tgagatctgt 
cttcttttag 
tggtagagac 
caatttcatc 
tgaagagttt 
tggtttttct 
tacaatcaca 
tcatgtgcag 
gagtcactac 
gttgctgctg 
tcaagaacac 
aatctaaagg 
act tttcaaa 
cttggttgat 
gggaaataga 
tgtcaaagga 
cttcttggca 
gccaacaatc 
cccaagtctt 
ataattttcc 
ggatggggaa 
tgttaaacac 
acacttcggt 
gtgcacgcac 
tagcattttg 
cagccacagc 
gaccagtagg 
catcttctaa 
atgcagattt 
tacggcccaa 
tctcgacaac 
gtcaattgac 
cgtttatata 
tgtttttttg 
ccttcgaact 



gacctgaaga 
tgcagtgttc 
ttcttcatac 
tgctccacat 
tttgttggcg 
agacatcatg 
aatattgccc 
agccttcgct 
agctccaatg 
aagattgagg 
ccagtgatag 
ttcatcatta 
caag tcct tc 
ctttgagtcg 
aaagttcttg 
accggaaaga 
aatatccaaa 
ttttgcaata 
agaggcggct 
tacatcatac 
tgccacatct 
ctggtcatct 
ctccaaaagc 
gaaatcaaag 
tcctgtatat 
ggcatcattg 
attgttaatc 
gaaggcatat 
cctcgagcac 
aggcacagca 
ttcagcaaac 
tccttga tea 
gaggtttgta 
tgccagagta 
agccagtttg 
tggattaaag 
gctacgcttg 
tgccatattt 
tcctgcaggc 
ttccctcaag 
ttgcctcata 
ttcaaaaagt 
caat tcactc 
attctccact 
catgatgaat 
agtgtcacta 
tttttcatcc 
gtacatcagg 
cttgcgcata 
aacaggtcca 
aatgcctgga 
cgatttcaca 
ttttgcagtt 
aagt tcaatc 
cacagtagaa 
cattgtagta 
tgta tgctga 
gctcgtacgt 
atggtgcaat 
gccaatacgt 
tgacatcagg 



ggcgctggtc 
ttgagtttca 
atcacatcac 
tcaatcaaat 
tacctgcacc 
tacttggatt 
tcattcaata 
gcagctgaaa 
atatcactag 
gttgtgggtg 
accattggtt 
tcatattcca 
tctgataggt 
ccagcctttg 
tcaactccta 
ttattctcga 
tcagtctcgg 
gagaggatca 
gcacacacag 
aaagt tccca 
tctcgtcctc 
gatccacaaa 
acatcttcaa 
cactgtccaa 
gaattggaga 
ggcacataga 
acttggagtg 
tctgccttgt 
actcctgcat 
ctgcaagcgt 
atcttggccc 
gatgtagctt 
tggtcaattc 
gaggcggcat 
ctgataccac 
gtttgcttag 
tggttgttat 
aaaaccctag 
tctgatcttc 
atggggaagg 
ataaaggttg 
tgccatctgt 
tggttactgc 
agcgccttcc 
tcttcttctg 
ttggcaatac 
ctggtaatgc 
gaagaatgaa 
tgcttcacaa 
ctttcaatgg 
gcaatcacat 
ct tcccattg 
ggagtatctg 
ttggaggaag 
ctagaactct 
ta tctttatg 
ggtgcctccg 
ataaacaggg 
ttcctattac 
ttctggcgca 
ccgacccagc 



tcttattatt 
agaaatagtc 
cagatgtcct 
ccatcagcag 
agggacattt 
tgcagcaatc 
gcttgtagag 
cagcaatttc 
ccatctcctt 
atgcagtttt 
tgcccatcct 
tctttagaga 
caagtcgacg 
tcaagattct 
tcttattggc 
tctctccatc 
aactgctatt 
tgtttgaagt 
atttgattgc 
atttttgaat 
caaatagatc 
agactgggat 
catttctagt 
actcgaacct 
aactaaatga 
accacataaa 
catcatagct 
ttccgcattc 
tattgcctgt 
acatcattga 
agtgtacgcg 
caacacttgc 
caaaatcgtc 
ttcctccccc 
caccaccgct 
agcaacaaaa 
atcttcctcc 
acttgaaaga 
cttcactttc 
cactaaattg 
cgatattctt 
tgttgtcgta 
cactacatct 
tcatgtcaag 
gtgcagtttc 
gcacatcctc 
catgcatttt 
gagcttcact 
aatctccccc 
agtgtttgtc 
tgagtgtagc 
aaccactaat 
tatcagcagc 
gtgcattttc 
tgctcatggt 
gtagaagttg 
caggccgctt 
tctattttta 
tcaaactagt 
atctgacatt 
ggtccacccc 



attttgttgc 
gataatgtaa 
gatggcttct 
tttcttccct 
tctgttctgc 
acattctaca 
aagacaagtc 
attgtcttta 
tactttgtca 
ggcacgagga 
gtagtttccc 
ttttacacca 
tacagggaac 
cagagccaaa 
ccatgtaatt 
ggcagttcct 
ctgctcttgg 
gttggagggt 
tgcataacct 
cacagtctcc 
gctggccaat 
ggcaaaatat 
aatctttaga 
cttgtattcg 
ctggtggaag 
catggaggat 
agcagatttc 
agttcccaga 
aacaaaggca 
ttgttccttg 
agcatacttg 
catcaatttt 
caacgactgg 
tgtaattgac 
agaagactgt 
ggcactatca 
tccctgtaca 
ttcacgttga 
aataaaaaca 
atcattcagg 
gcgcacattg 
tccagaaaaa 
agagaaattg 
aatagtcaag 
cgaatcgtac 
atcggtgaac 
cttgaatgca 
gccaccgccg 
ttcccactct 
gaaatcattg 
catgatgcca 
atccatcaac 
agcagctttc 
catcaaagag 
ggaagaagcg 
atgttgacgg 
ttatacccct 
gagatgacat 
catatccggc 
ggcccgaccc 
ctaaactgga 



269160 
269220 
269280 
269340 
269400 
269460 
269520 
269580 
269640 
269700 
269760 
269820 
269880 
269940 
270000 
270060 
270120 
270180 
270240 
270300 
270360 
270420 
270480 
270540 
270600 
270660 
270720 
270780 
270840 
270900 
270960 
271020 
271080 
271140 
271200 
271260 
271320 
271380 
271440 
271500 
271560 
271620 
271680 
271740 
271800 
271860 
271920 
271980 
272040 
272100 
272160 
272220 
272280 
272340 
272400 
272460 
272520 
272580 
272640 
272700 
272760 
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gtgagctgaa 
tactatacac 
gggtcggaca 
gttacaattt 
tctgacattg 
gtccaccccc 
gagggtgaaa 
ccagaaatgg 
catttctgga 
gtatcttcgt 
aggccgaccc 
tttgaatcga 
gttagatcga 
acgattctaa 
gtacaataac 
gcggtccacc 
tgagctgaaa 
gaaaggcacg 
aatctgggtc 
aaagagttac 
tgcaatctga 
cagcggtcca 
agaaagaggg 
cagaaatggc 
ctggaacagc 
atggtatagt 
acttgacatc 
ttcaaaagtt 

atcaccccat 
ggaagagttc 
attttcttcg 
atctccagtt 
agacctttgt 
actacaacgg 
ccttccacgt 
atctagatat 
atattcaaga 
gtttctctaa 
attgtgattt 
aacaatatat 
cccccctctc 
gttcctcttc 
tatatgcttt 
gattctttcc 
caccagaaaa 
attctggttc 
atggataata 
ttggcccgta 
aataatttga 
cattgaatct 
ttttgtctcc 
catccaattt 
ggtatacctc 
cggccgaatg 
atgtggggta 
attggtcatg 
gatgtgctat 
acagtttctt 
caattatgta 
atagagggga 



aaatttttca 
tcagagggtg 
gattccagaa 
cttataactg 
ggtcgaccca 
taaactggag 
acacctgtta 
ctgtccagaa 
aagggttaca 
gcaatctgac 
agcggtccac 
ggatgagggt 
gccggtccag 
ccatttctgg 
ttaccataca 
cttcgaactt 
aaatttttca 
atctggatgg 
ggccagattc 
aatttcttag 
cattgggtcg 
ccctctaaac 
tgaaaacctg 
tgttccagaa 
catttctgga 
acacctctct 
aggccgaccc 
tttgagacgg 
ctcgcacacc 
ctctaaataa 
atggatgtcc 
tatgaacact 
aggtttcaga 
cgactcgtgg 
ccaagtcatg 
gtttacctat 
ggatactgca 
aataaaagtg 
cactaagaga 
gttgtgccct 
aaaattcagt 
tagtgagaga 
ttcctcctct 
aattatcctc 
ctcaccacca 
aattctaata 
tggtgctaac 
ttgcacattg 
ggcagacatg 
acacatgaat 
gtttgtgagg 
tattcttcct 
gttggtttcc 
gtttggcata 
tgtaacaact 
catattactt 
acaactaccc 
atcgtgtaaa 
ttcgactttg 
cttgcagttg 
gaaagtgaaa 



aaagtttttg 
gcagatcggc 
acgtttctaa 
gtgcattatt 
gcggtccacc 
tgagctcaaa 
gaaagtggtt 
atcagattcc 
atttcttaga 
attgggccga 
cctctaaact 
gaaaacctgg 
aaatggctgt 
aacagtcatt 
tttctggtat 
gacatcaggc 
aaagtttttg 
tgtcgtccgt 
cagaaacgtt 
atttggtaca 
acccagcggt 
tggagtgagc 
gtaggagaag 
atctaggtcg 
aagtgttata 
gcaatctgac 
agcggtccac 
aggaagaggg 
ttagacacac 
tccatcatgt 
aagttgatag 
tctgactact 
gaaaccagag 
aagaaagagc 
gagatctaca 
acagatgaag 
accaagtgga 
acccctagag 
aacttgtata 
tttgcaaggc 
tgtattttta 
tcccaatcac 
tcttcctctt 
aatacattta 
acatcgaccc 
gacggatttt 
tgatcatcgc 
aattgatcat 
atcaaacggg 
ctgattcctc 
aaaaagtctc 
atttcgtttc 
tctagttttt 
tggatgacct 
gcattcaaaa 
gaaaaaatct 
actataggat 
ggtaccattg 
ttgttggtta 
tgaattgctg 
gacttgtcat 



agatgaagaa 
cctgttccag 
ccatttctgg 
tctagcatat 
cttcgaactt 
aaatttttga 
gctggtcggt 
agaaacgtct 
tgtggtacaa 
cccagcggtc 
ggagtgagct 
taacagaagg 
ccagaaatct 
tctggaaaga 
cttcgtgcaa 
cgacccagcg 
aatcgagtaa 
tcactgctgt 
tctaaccatt 
ataacttacc 
ccaccct teg 
tcaaaaaa tt 
gtacgcacgc 
accagaatcc 
ttt tctcgca 
actggg tcga 
cctctaaact 
tgaaaaccct 
aacctcatca 
atatcttcgt 
atacaggatc 
acaactggcc 
tggtggatgg 
aaggaaaaga 
tcgcagaa tg 
ctgtcaagag 
tggctgaaat 
tag tc t tcaa 
act t tggaac 
ataggat tgt 
ttgctcaaat 
tttctactgt 
catct tcatc 
aaa tggtgcg 
tacgtcca tc 
ttatagaacg 
caataaaaga 
a taaagtat t 
gcctgaaacc 
tttctgtacc 
gtgcagataa 
catat tcttc 
cttctaggta 
tat tacgt tc 
ataatgccat 
tccctaacat 
t ttcta tgcg 
attcgccccc 
tcccacaacc 
cattgaatag 
ggggtaaagc 



gagggtgaaa 

aaatggctgt 
aacagtcatt 
attctagccc 
gacatcaggc 
aaagtttttg 
agcatccgtt 
ataaccattt 
taacttacca 
cacccttcga 
caaaaaattt 
catgtctacg 
gggtcggcca 
gttacaattt 
tctgacattg 
gtccaccccc 
gagggtgaaa 
tccagaaatg 
tctggaacag 
atacatttct 
aacttgacat 
tttcaaaagt 
atactggcgg 
agaaacgttt 
tcttgtacat 
cccagaggtc 
cgagcgaccc 
tgcaacagag 
ccctccgtcc 
cgaaggttcc 
atgtggaatg 
tgccgaaatc 
aatgtttgaa 
gagtttgaag 
gttgagacaa 
tggattcttg 
tattagagag 
tggcaatgtt 
aaactataac 
acatttctta 
aagttactac 
cactgttgct 
ttctttagtt 
ctcttctcgg 
ctggtatact 
tacaacaatg 
taaaacaact 
gaagatgata 
accctcatct 
agaatatagg 
tgcaaacacg 
ccttgcttct 
tttcctcctt 
atgcttaccg 
gcattgatca 
actcactgga 
gaggcctttg 
taaaatacta 
catcaataaa 
tctatcagtg 
ttctcttttc 



agtgggtacg 
ccagaaatct 
tctggaaagg 
ctttgtgcaa 
cgacccagcg 
aaacgaggat 
cactgctgtt 
ctggaacagt 
tacatttctg 
acttgacatc 
ttcaaaagtt 
acggcttctg 
gattccagaa 
cttagatttg 
ggtcgaccca 
taaactggag 
acaccctctt 
gctgtccaga 
tcatttctgg 
ggcaccttcg 
caggccgacc 
ttttgaatcg 
cacatgcctc 
atagccgttt 
taacttgatt 
caccctcgga 
taaaaaattt 
ggggtataaa 
aatcaacatc 
cccctcacag 
tctttcctca 
gggacagaac 
cctgtcctaa 
gaatatctgg 
aggccactag 
aacgaggagg 
aagaggggca 
tgtagtgcat 
aatgttgtac 
taaaaaataa 
acccaattct 
gctgtttgtt 
cgttcttcat 
ttaaacttct 
gtttcaccat 
tccactaatg 
ccagaaacac 
gggtaagatg 
ctcattgtac 
ggtgtgtctt 
agacacgtac 
tcatcgatag 
gcggcgtcaa 
tccatttttt 
aaatttttaa 
acaactaagc 
taggataatc 
cccatactta 
tcagcagtta 
agaattctat 
actatattgt 



272820 
272880 
272940 
273000 
273060 
273120 
273180 
273240 
273300 
273360 
273420 
273480 
273540 
273600 
273660 
273720 
273780 
273840 
273900 
273960 
274020 
274080 
274140 
274200 
274260 
274320 
274380 
274440 
274500 
274560 
274620 
274680 
274740 
274800 
274860 
274920 
274980 
275040 
275100 
275160 
275220 
275280 
275340 
275400 
275460 
275520 
275580 
275640 
275700 
275760 
275820 
275880 
275940 
276000 
276060 
276120 
276180 
276240 
276300 
276360 
276420 
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tgatattcct 
tccttgtata 
agcaggggtt 
gagtattttt 
ttttgataga 
caaacccgga 
acttataagt 
tttttatcct 
gtcaatggaa 
ggattcacac 
aacttccaac 
ggcaagtaaa 
atctaggaag 
tgctggtgac 
atgtgagccc 
attcttcttc 
atactaaaaa 
ctgtgaataa 
caatgggtat 
atggcaaaaa 
taaaaaaaag 
gagtaaaagg 
ggggacaata 
attgaaaata 
aaatatgata 
gttaaatacc 
gctgaaattg 
ataaagctca 
tcagttggtg 
gtcagtagcc 
gatgcaaaaa 
acgaatcctt 
gtcggctgat 
tatagaa tta 
aacaactaaa 
atattggttt 
tcttggaacg 
ccgcatcact 
acaccagtaa 
aagacccaga 
ctgtttacac 
gtaatgtcac 
tggtcaaagg 
atctagaaga 
tg ttgaggaa 
aagatgtt tg 
ataatcccat 
taaaaacttg 
tatctggatg 
ggataat tgg 
aagaaggaac 
agaagaatgg 
at tattacaa 
t tgtcccaaa 
gaat ttgtaa 
atatactt ta 
gtccaaactg 
ccagagaagt 
gaggagatga 
atccgggaag 
tgagaaaaca 



caaatcttct 
gaaattcctc 
gcctgctgag 
atgcaaagag 
ctcgataatg 
cgccatcttt 
gtgtgtgtta 
caacttctga 
atgccgattg 
tttgtctgtc 
aagaatgggc 
cgactacggg 
aggaggaaga 
tgtcatgact 
acagagctac 
cacaaccacc 
agttgacgac 
tggaggtgtt 
caatacagat 
ataaaatatg 
agtaactccg 
tactgtgcca 
aagattatga 
aaactgaatt 
aatgtttcgt 
tcaaagagca 
acaagagtaa 
ttgatgaaga 
ttataaatat 
acggaaaaca 
aaatatgata 
caacaaaaag 
attgatagga 
tccattgata 
taactcttta 
gcaaataaat 
gaagttatga 
tctaactgag 
ggaaaatgat 
accgtctgtc 
ttctgtaaag 
ctcaaccaaa 
ttcactgttt 
cgaaatcgat 
gggtttagac 
gaattccccc 
aaagagacat 
cttcgaaaca 
gttagctgaa 
atgtacctac 
gaaaaatttc 
tacaatttcc 
gtttctacaa 
acacaaacta 
agatttggga 
caagcaagga 
gaactttcac 
ggaagaagta 
ggataggaga 
aactaatgtc 
cagaatgact 



ggtagtcttt 
atttgtgaat 
aggcgtattt 
acgatatata 
gacggtattc 
catgactcat 
acgcacgtaa 
gaattgctca 
acgtcatgtt 
ctgagcagta 
tctgctaact 
tctaccaaca 
attgcatgtg 
ttgttgattg 
tcgccgccgc 
actacttcca 
gactatgatg 
ttcatcaatg 
aatggaggat 
gtatattaaa 
tctacatagt 
ttacatgtgg 
aacaagactg 
atgtgaaaaa 
agaggaggag 
gggaatccct 
caaagagtta 
gttggaaagt 
atccctgcaa 
cacttgaaaa 
aattatttga 
agaagggaaa 
gtatgaaaga 
cattattgga 
tcgggcaagt 
aatccacaca 
ctcacataga 
acatcgttgg 
acaatggaaa 
agtaaaaggt 
tgttaccttt 
gaaaggtttg 
ttgaggttag 
aaatggccag 
aagaagaaat 
atcgtccaag 
ctcatggatt 
aacctacgca 
aacgggttcg 
aagagtgatt 
gtgactcttc 
tattcacctg 
acagagtatc 
aagatacact 
gtgtggaagg 
tggtacctat 
tctatccaga 
gagactgttt 
aattacccgc 
gtttcctgtt 
gcgaaacaat 



tgagccaatc 
gatgttgaag 
tattcacaat 
caccttcagc 
ttctatagtt 
tttggagggt 
ataccagcca 
acgtactcat 
aggggtataa 
gagaaggacc 
tgaacgaccg 
agaaacaacc 
tcatcgcatc 
tggtactggg 
cgcagccgcc 
ctactactac 
acgacgtcaa 
gaagaaaact 
ttgtttggaa 
tatgtgttgt 
tttttttaaa 
agatcttgta 
attcaaagaa 
atcaatgagc 
acggaaaaat 
ctagacccag 
gatgcccttc 
atgaaataaa 
ggagatacgg 
atacaccaga 
agatgataaa 
ccctctagac 
gattgatgat 
tgaatgtgaa 
gactgtattt 
aaacgtgttg 
gtgtataaaa 
aagaggaaca 
aaaagactga 
ccagaaataa 
cttccataat 
aggagaggtg 
tagtggacga 
atatgacgaa 
tgaaagaagg 
aaacattcct 
tcaataccat 
cccatttccg 
ataaaaagta 
gggtggacag 
ataggaaaca 
aagagaaata 
cacaaaacaa 
attgtacgtt 
atatggaaga 
tattcgacgt 
cggacggcga 
ccaagaaaag 
ccactaatgc 
cggtatttga 
actatcaaga 



agattcagat 
aaataatgac 
ttcatccacc 
gagttcgctt 
tttaacgatt 
gagatcatgc 
ttgacgtcat 
ctcttttcct 
aacgactggc 
cagtgaagaa 
aactctactc 
agtgtgctgt 
tgttctgggc 
aactgcacca 
ggtgcagttt 
tactacacca 
tattggaggg 
ctcaaaagaa 
gaa tggtgtt 
tttattgttt 
ttggaagata 
t tagcaacat 
ttaatgattt 
agatgaaaaa 
tccgcaagat 
aagaaagacg 
ttgaggaaaa 
ataaaagagt 
tctcaaaaaa 
atagatgagc 
aggttccgag 
ccagaagaaa 
tgtctcgagg 
aacttgcatt 
gtacgtatgc 
tttttctata 
ggaattggat 
gagaagagag 
gacggctgca 
agaacccaaa 
caagagtgaa 
taaatccgta 
gtgtctgaga 
ggataacttt 
atctacacat 
atcccagcaa 
cacctacgcc 
gacacgacaa 
tacgaaactc 
tggtgatttg 
tttatgtgtt 
tccgataccc 
gaaaatacag 
tgaccaaacg 
acgacacaaa 
taaaaagatt 
aggcgtctct 
taagaaaaat 
caagtacgta 
tacccgtcaa 
atcttggatg 



tcagcacctt 
gttttttcca 
tcatgttttg 
ctccgtatac 
cccttgatag 
aatgaaaaaa 
atccgcctag 
ggtacatgac 
tcgagccctc 
aatactcctc 
gtggaggaag 
cacatttgct 
tgcgtcctcc 
gttaattgtg 
catccttacc 
actccaccag 
caatcagtta 
gaagaaaaag 
ttttctcaat 
ctttaaaaaa 
taaatagaat 
tagagaaatg 
ggaatcagaa 
tacacaacta 
ggaggaaaga 
tacaatgttg 
tgaacgtata 
aggagacttt 
tccatgactt 
agacaggaga 
aaatagagga 
gacttgtatt 
aaataaacca 
atggtcttca 
aattaggtcc 
tcctataaat 
gttcctatgt 
gttaacaaca 
acaacagaaa 
acaacttcta 
agtagtagaa 
agcaagatga 
cgttacaacc 
tacgtccaat 
cctgttgtag 
ggagaaggaa 
gccaaacaac 
cagagggcca 
gtacaacact 
gaaagggtaa 
attagtgata 
tcaatactaa 
aaaatgatag 
acca ttcaag 
caatcagaag 
aagaaattgc 
gtactatttt 
aaaaaaccta 
gtgggtgtag 
aaaagggtag 
acagatagaa 



276480 
276540 
276600 
276660 
276720 
276780 
276840 
276900 
276960 
277020 
277080 
277140 
277200 
277260 
277320 
277380 
277440 
277500 
277560 
277620 
277680 
277740 
277800 
277860 
277920 
277980 
278040 
278100 
278160 
278220 
278280 
278340 
278400 
278460 
278520 
278580 
278640 
278700 
278760 
278820 
278880 
278940 
279000 
279060 
279120 
279180 
279240 
279300 
279360 
279420 
279480 
279540 
279600 
279660 
279720 
279780 
279840 
279900 
279960 
280020 
280080 
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gaaaggcaaa 
taactaggta 
ccactaaaaa 
acgaaaaggg 
agtgcatatc 
acattgcttt 
ctgccaggat 
tcgtggacga 
tggagaagga 
caacaacaac 
tcagagaaaa 
caaaccgaaa 
cctcttcctt 
gggacttcaa 
taatggaaaa 
aggatgtaaa 
atgctatttt 
tcaacaacat 
ccaaaaaggt 
tgcct tcacc 
ggctgttata 
ttcacgggca 
tatctgcaaa 
tctcggacgg 
ataaattagg 
ttttatgtct 
tgatgaggcg 
atcgactctg 
tccttcagtg 
tgctgctgct 
tgatgttgta 
gatcctgcag 
ccaactagta 
tttttggata 
caggaggaga 
gaatagtgaa 
gtctcataaa 
gatcgaagag 
caaaaacgaa 
agaagaaaga 
catcgaggaa 
aaaggagg t t 
aaaaataca t 
ttttatttgt 
aaggggttat 
ggagcaactg 
gtgtattacc 
ctccaaaact 
ccatgtcatg 
gcaccaaaag 
cctcatcctc 
ggagggagag 
gcttgttact 
cctttacttc 
ctgcagatga 
ctcgtgggta 
tagtttgcga 
cgcagataat 
aaacgaattc 
aaaatttgtt 
ttgtaaagta 



cgaaacgtac 
cgataatggc 
attcgaagct 
aaagaaaaag 
taactttata 
tggggatgcg 
tttcgccaag 
gaaatatacg 
ttgtttttca 
aacaacagaa 
tagagatacc 
agtatcgaca 
t tgtactcct 
tgcaagcacc 
gaaagacaag 
aaactcgtgt 
cgaccttacg 
aaaaatggta 
gacccttcaa 
tt ttgcacaa 
ggtccaatct 
aggaaagcaa 
tcactgaacg 
aggcggttag 
cgagaggcta 
cggaccatgg 
aggatctgca 
gggagcttac 
aggtcctccg 
gcagcttttg 
tatgaacttc 
gccttatata 
cc tttgtcac 
ca tt t tctgg 
gcctgtgtca 
tctcgttcta 
ac tctttact 
t tacatcaca 
gagcgaaaga 
eg tgcaatat 
agtgaacgtc 
gaagaggaac 
tatatgattt 
tttt taccaa 
atcttccttc 
gt taactgta 
ccacctagaa 
aaaat tgaca 
ttcccttagg 
attttccatt 
ctcttcctca 
gtga tgtgtt 
tgacgagtcG 
t tctggctcc 
agatgttgtt 
gaacaagtag 
aatacgggca 
gttagatttg 
gat ttaaaaa 
aatgctacaa 
catggttata 



aagaagaaca 
gaagaaatta 
tacttgaagg 
tacaggaaaa 
gatgaattaa 
aaatttgcct 
aagataaagg 
tcaaaagtat 
ccgaataaga 
gaagacgaag 
agaagatgct 
ggagatatct 
acttccatta 
aatattgttc 
atgccgttga 
cactagtgaa 
gtcattgtgg 
caatattcta 
gttcacttga 
cagactctat 
tcatatcgtt 
tatcagtcag 
tgcggtctgt 
tcatgttttc 
tttctttcct 
cttcggcaag 
catcaccatc 
tcggcgggtg 
cctttctctt 
cttccataga 
ttagtgttca 
caatttcaag 
ctacaccaga 
gtccagttgt 
ttctgtgtgt 
gtggtattga 
tgtcaaatag 
aaacaaataa 
agcatcagca 
tggctgaaat 
taggtttact 
atcttttgaa 
tttggattaa 
caaggatcat 
actttctgta 
agagatccaa 
acggcttcac 
g tattttgta 
gggtttttca 
ttctctgcag 
tattcctctt 
tctttgtgtt 
tctaccagtt 
ctcttcctct 
gatgttctag 
tagtagtatt 
atgttttatt 
atgaaaatcc 
tttgcgaggt 
atagattaat 
taaacccacg 



ataaagagta 
taaatgatgg 
tagtgaacga 
atgccatgaa 
tccctaaaag 
gcacgggaag 
aaagagtcgg 
gccatcgctg 
aaagaaaacc 
aaaatggaaa 
cgtccgaaaa 
ctatggaaac 
catgtgtctt 
ataaatttct 
agtatcactt 
atagtggcgt 
tatatcaact 
ggtgtgtttt 
tacactaact 
gaatttatta 
gtagaatctc 
tatagggaat 
aatggtcatt 
cagttgaggc 
cgcttccgtt 
tctttctctc 
aacattttca 
tgtttctact 
actggtcgtg 
cctggtggaa 
aagagactga 
atggggaagt 
aatccacaca 
gtgggggggg 
atgtcttgta 
aattgtacat 
aactgaaaga 
gcaatttgaa 
acagcaacaa 
cgataaacac 
tgtagatgca 
acaaaaagaa 
taaaaacaag 
caatcaaact 
ctacagcctg 
tctccatgtc 
ggttgatcag 
cctcaaattc 
gtttgatgtt 
ngtcgtcgtc 
cttcaccatc 
cttcctcctt 
cttcttgttc 
tggtggggag 
aagagaccat 
agactcttca 
atttatagat 
tggatctata 
taatatagca 
atacataaaa 
gaaa ttatta 



caaagaggcg 
taacggtgat 
gcactacagg 
agtatactct 
ggataaaatt 
aggtgagcaa 
aggtgataag 
caatcaacct 
gccgacaata 
atggaagaag 
gacgcaattc 
gccagtacct 
gggaggaaaa 
agggttttgg 
tattcgagtt 
agccctgaag 
cttgcccgat 
actattcttt 
tcaacaaatg 
cacatcaaat 
gaaccctcga 
tcaaaagtga 
tcgttgtgcc 
aaaagatata 
gcggcttttt 
tccattctgg 
tcaatataga 
gattcttctt 
ggagaaactt 
accattttga 
acaagtgtgt 
ggggggagaa 
tgcagaaggt 
ggtggtataa 
ctgaacatta 
aaaaatggtg 
catgcccaga 
caggctcaaa 
ataataattc 
atgaaagaga 
gaaatcaata 
gactaaacaa 
tggaataatt 
atctattgta 
tacaaaatta 
gccaaactct 
gaacccaata 
atgctcatag 
tttggtagta 
cactctgtct 
gtcttcttct 
ctcttcctta 
catttttact 
aagagtagag 
tttgggggtt 
ttgtttgact 
aatgtcacat 
aaaaagaaat 
aataatagta 
cctccatctt 
ttaga tgcaa 



ttagaggaaa 
acttctacac 
ttactgtgga 
agaaaacaaa 
gaagattacc 
tacgcatcac 
aggtttactt 
ttaaatatgc 
gtaacaacca 
gctacacctc 
ggttacagtt 
tcttccactt 
ttcgtcgaca 
gataaaaagt 
gcctgagaat 
gatgagaagt 
aaagggttat 
gcttgaaatc 
gggatatttc 
tcaatttgta 
cacgtgattt 
ctttgttgaa 
tgagaagtga 
ccatttcttt 
cttcctcga t 
cgacttcgat 
ct tcatcttc 
cttcttctgt 
cttttgcttt 
ggttcttgga 
gctcaggtgg 
aataaggttg 
ttga tgctag 
atatagccca 
ctgaaatggg 
caccaaaaag 
tacaaaagca 
aggtcctcga 
ctctagaccc 
ttgatggttt 
acttggaaga 
taaaacatct 
tt taggtata 
acttcaatag 
tcacact tgt 
tttggtgggg 
tcaga tagta 
tgtgaaactg 
ttgtccagtt 
tcatattctt 
tctttacatg 
a t tc tggggc 
tccactt tea 
gtggctgcta 
atttttggaa 
tctttgttac 
atggcaacta 
caaaacactt 
gattaccaca 
tacacga tgt 
aagggaceca 



280140 
280200 
280260 
280320 
280380 
280440 
280500 
280560 
280620 
280680 
280740 
280800 
280860 
280920 
280980 
281040 
281100 
281160 
281220 
281280 
281340 
281400 
281460 
281520 
281580 
281640 
281700 
281760 
281820 
281880 
281940 
282000 
282060 
282120 
282180 
282240 
282300 
282360 
282420 
282480 
282540 
282600 
282660 
282720 
282780 
282840 
282900 
282960 
283020 
283080 
283140 
283200 
283260 
283320 
283380 
283440 
283500 
283560 
283620 
283680 
283740 
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ggaaatcgtt 
cgatggttta 
gttagaaatg 
gaatacttta 
taagtacagg 
tttttgtccc 
gccttgttca 
cctttactgc 
ccatctccac 
cgttctcgcc 
tctttgtatt 
tttccattat 
agttcccttt 
agtctcgtct 
gttatggaca 
gaagagacat 
ctgcatagca 
tagcgggatg 
tcagagacgg 
ttctctgcaa 
gcgtatatga 
gggtgtgtac 
ttggagtaac 
catatatcat 
atgttacgaa 
gtggttatta 
agcctgtaga 
attttgttga 
tcacacccat 
ccgacttgaa 
tgaagtactt 
gtaaccctat 
ccagaaatag 
ttaaatgtgt 
tgacacagag 
gaaatcggca 
gtacacaaaa 
tggacgtcat 
agaccaataa 
ataatcttct 
ttaccaacac 
ttgtacacat 
acgtcaataa 
acatcttctt 
acacatgctt 
tctggaaact 
aacattccat 
atggtccatg 
gactgtttgc 
ggagatcctt 
gagctctctc 
gcattaataa 
aacatgtcct 
ctcaaggctc 
ggaagaaacc 
gaaagaaatg 
ccaagaatag 
tgttattttg 
ttacagggcg 
agagctcccc 
gttcttcccc 



tatatgttcc 
atgaatactt 
tacttagata 
aggaatctca 
gctaattata 
aaacccattt 
aaaggactac 
tggtgctgcc 
gttcttcatt 
gcaaacgctt 
ccttagatgt 
ccatgaaaat 
ctcctggagg 
tgaagtggtt 
tgatcaaacg 
acc tgattcc 
aggagaaatc 
tcgcaatcga 
cacaacgttt 
actttatttc 
cggttgcatt 
tgttgtagaa 
tgcctgtgat 
acactgatat 
gtgttctagg 
tcatatagtc 
gaggagaaaa 
tattttgcat 
catcttgcaa 
gagtattgtc 
ccttatcttt 
caacagattt 
tgaatccatt 
catactcggg 
tgggccgata 
aaaaattacg 
tgtggcactt 
att taactgg 
aacaacaaaa 
cagcaa taaa 
ggtacacgta 
ttctggggat 
accagttatg 
ccctgtaaac 
tgaattcttc 
gttctgtgtc 
ccacgagcct 
ggaagatggg 
cctttccggc 
cgacgaatac 
tcttttatag 
tatataattt 
cctcctcgtc 
tggctcaccc 
agaagtgccc 
gaggaaatct 
tgggaagtct 
tcaaggatct 
ttctggacaa 
tgatcccaat 
ccagatctaa 



gctcattttt 
tcggtttact 
aagacccagg 
gagaaattga 
gaggtttcag 
aaaataaatg 
atattcattc 
gtttccatct 
atacatggaa 
ccgttcgtct 
cggatcaatc 
tttgatgtct 
gactgcaaag 
tcgtacactg 
gggcctacgt 
ttcttctgtt 
tcggtcagaa 
tgattttttg 
cctcaattcc 
tcttctgaaa 
gaaaagagat 
aatcaaacct 
tgggtttgca 
gcgggaagaa 
gcaatggctc 
acattttttc 
tttaaagttt 
atcaatgggg 
taaactccct 
gcctgtcaat 
tagacggact 
gaa tgataat 
ttttaccttt 
tgtcacagat 
cgcaaattgc 
tctttagaat 
ttcaagccac 
aa tatgtgta 
aacattttgt 
cttacatttc 
attct tactc 
aact tcaatc 
tgcatcattc 
catcaca tgg 
tccatacatc 
gtgcctccac 
gggacgttcc 
taaatag tec 
agt tctcatg 
aa tcatgttg 
aggcaaaaat 
ttatattttg 
gtcgtcgttc 
agatctggtg 
catccagttt 
gttccctttg 
tcacgggaga 
gtataatgga 
gaaaa tatct 
aactgatg tg 
agcgtaccga 



tggactactt 
agaatttgtg 
tgaaatattg 
aaaagagcca 
ttaataaagg 
catggttagc 
ccttcatttt 
ccatcttctg 
attgcccttg 
tcgggcaagt 
acattgattt 
gttcctgtta 
gttatactat 
tccctaatac 
tctcccacat 
gcagaataaa 
aaagcaaaca 
tgcacatccc 
ttcagttctt 
ttatccatct 
gacatataat 
agtttatcca 
atgtaggccc 
aagccacatt 
ggcgtgtagc 
acagctgcat 
ttgtcactaa 
agtgattcgt 
aacctgggga 
tgtctcagtc 
atatatagtc 
agaatatgtt 
aaagtcttga 
aataaaccta 
cacgttgctt 
gtctcggaat 
cacctctaca 
ataatatagt 
aatttattta 
atatcacagt 
gtcgagaaac 
tttacaaaca 
aattcttgaa 
aaaactggct 
tccaaatact 
a tgcctgtga 
tggtaatcgg 
ctataattgg 
ttatccaccc 
aagggtaaga 
gtagttgcca 
tatcattcaa 
tcgttccgca 
gacaagatta 
ctggccgaca 
cacccgttca 
acattggaca 
gtattttcct 
ggatctggat 
gatttgctgt 
gtcatcactg 



gtatatcacg 
acacaaaatt 
tatcatgcag 
attagggatt 
actgttttgt 
aaacttttat 
cttctctttt 
aagtttctcc 
acgtattgaa 
tttcttccct 
cggcaccatt 
cctctctgac 
tgccactggt 
atataggggc 
ctctcttttt 
aggaaccatt 
ctaagcaggc 
tcaaagaaat 
cttccttcat 
gtaaattatc 
ttccaatact 
ttggaacaat 
ctttgtatgt 
gttccaccaa 
cacaccctac 
tagaaagtct 
gaagggcctt 
tccacttttc 
gattgtgagt 
tatgcactcc 
cgtcccttaa 
cgtatttgtc 
attcttccat 
cagaccttgc 
gaattactta 
gtcatatttc 
tcccagtgag 
ccaattctta 
ttcaactaaa 
gtacaatact 
atgaactaca 
acgctctctc 
taattggttc 
tgtatttgac 
ccctcaatcc 
aaaagttcaa 
atgcacgcaa 
tgtacatgaa 
atgaagtttt 
cttgtctaac 
gccaccacct 
ccgtagtcca 
tctccaccta 
cccagaaatg 
tctcgcacct 
agaaccaacc 
atgacattga 
atgtgaacgg 
ctggagaatc 
acattttcgg 
aagctgt tct 



gatgggatgt 
tcattaacca 
aatatcttat 
taggttggat 
ttttttttat 
gtctttacaa 
ccttttctct 
tccacttcct 
aatcatgcgt 
tttgagtaac 
ttgatatact 
catttccatt 
gcgggagaaa 
atccgatcca 
atattcaggt 
tccgttggtt 
tccaccatca 
cctttgttct 
atttacttca 
cctttcattt 
tttagctgat 
taggcacatg 
taacctatca 
taattgttcc 
taaaaagtca 
ttcgtgtaaa 
ttctttaacg 
cccataatcc 
atactttaat 
gttttttatg 
ttcacttttt 
agggttttca 
tttgatatac 
cattgaactg 
atctgtagat 
tttgtgcgta 
gcggtggagg 
aaaaatcaat 
atctcatcaa 
attcgtgcat 
aatattccct 
ctccttgaga 
cattttcttg 
ccactcggcg 
aatttttgat 
tgatgatcca 
atgctcttga 
gttgaggaaa 
cccagttagg 
tctctcgagc 
taaaattcgt 
tctaagagtc 
ccagaccttt 
tgacgagaca 
gatccaagga 
acatctggaa 
agaatcatac 
cgtcaaggag 
ctcctcttct 
tactttggta 
agcactcccc 



283800 
283860 
283920 
283980 
284040 
284100 
284160 
284220 
284280 
284340 
284400 
284460 
284520 
284580 
284640 
284700 
284760 
284820 
284880 
284940 
285000 
285060 
285120 
285180 
285240 
285300 
285360 
285420 
285480 
285540 
285600 
285660 
285720 
285780 
285840 
285900 
285960 
286020 
286080 
286140 
286200 
286260 
286320 
286380 
286440 
286500 
286560 
286620 
286680 
286740 
286800 
286860 
286920 
286980 
287040 
287100 
287160 
287220 
287280 
287340 
287400 



wo 01/38351 



80 



PCT/USOO/28888 



ttcaatgaat tcagtaacaa ctggcctcct 
gatttcagga tgtttaatct gttggccggt 
gaaagtgaat gggaatccat acacgcatct 
aacaaagctg agaagaaacc tccatcaaca 
ccagttaatg atgcagtgac aaagatccct 
agactcgccg gtattctcca gaaagtgtac 
ctgtcttcaa attctataga catcaagcaa 
tctgcaagaa aattgggatt ctatttactc 
tcacaagatg ttgccgactg catagtttca 
tctgaaaaaa tacgacagtg cattgaggag 
ttttgtgaaa ttgacaagac gcccgatttt 
ttcttccccg tccacgcatt ctcctcatca 
tgttccgaat gtatattccc gtacatcatt 
ggtgtaaaag ttgtagattt gttccagtgc 
aagggaagat gttacgagtt ttcaaatttg 
actcattgtt cttctctctt tgatgctacc 
ctagagtttc tccagtatga ttttgaaact 
attccccatg tttacaataa ggtagtaaag 
ttgtactgtt acaaatgcgt atctcctgtc 
tatgaaatgg tggatgtaac tccccctcta 
gagtatgatg atggtcccgg aaatcatatg 
gcgggttcca gtggagaaac acccaccatt 
ggaagagcac cgaggaagaa aatggcagga 
attatagcgc tagccaactg gagaaagagt 
ctagaagggg taaatgccgt actttataga 
tacccctgtg ttattggtag atccatgagc 
aataaaattg ctttaataaa ggccttcttc 
aaaaagttac ttgaatgggc agaactatta 
cagacgccag aatgtgtcat acaccgcgcc 
actgacgaat tggttcacat gcgtccagat 
ctaaatgcgg ccagacagaa tgcggctgct 
cctgaatttg tcccctgcaa agaaaggacg 
gatgttagag ttgtaaattg tccttcatgt 
gtgaatgtgt tttgtgaatg tggaacaaac 
cctgctgatt ctaatcattg tgtggagaaa 
gttaaatatg ccttagaaag tatgtacggg 
gaaggagtta aaaattatta tgtaatggag 
gttgctaaga aataataaaa cacttgtgaa 
ttcaacaaac tttaatttct ttgcatgcct 
tctcctccct ctttctcttt gttcctccac 
gttcctcgtt gtacgtctca attgccttaa 
gcagacgctt cctttcctcc tctggtaatg 
ctttagaaga ggggtcaatc acttttattt 
ccatgaagat tttgatgtct gatccagtaa 
cattcggagg atcgatgaat gatatttcat 
taaaatggtt tcgtatacta tctctgatgc 
tgaccaatct gggcctacgt tcatccacgt 
atctgattcc ttcttctgtg gcagagtaaa 
aggagaaatc tcggtcagaa aaggcaaaga 
atgcaattga ggacttttta tgcacatctc 
catatagctt tttggattct ttaagttttt 
ggttgacttc attttttgca aatctgattt 
tcttttctag cgtgtataca actgcagcat 
ttttagctga cgggtgtgtg ctgttgtaga 
ttaggcacat atttgaaaaa ctgccaacta 
ttaacctatc acagatatca tacactgata 
acacttgttc catattctta agagtacggg 
ctaagaagtc tgttgtgatt atcatataat 
gttcatgtat gattctgtat aggggtgaaa 
tttcttccac aattttgttc atttcctgga 
ttccataatt agggtcttca tcttctaaga 



acaaatatca aaggagcata cgtgtctaga 287460 
ttagatcata tagaaggaga agttggggga 287520 
gtcgtcaagc gaatggtcac cattatgcgc 287580 
tctagaattt ttagagtgta tgtggctgaa 287640 
atacgtgtac tcagtaaatt attcggttca 287700 
tcttattcaa tgctaaatct tccatatctc 287760 
ggagtaaaag gaattacatt atctatacca 287820 
caaaaggata caacattaca atcttcccta 287880 
atcaacgctg gtattattgg tgatgatttc 287940 
aaaaacaagc cagaaaactg ttgtatgtgc 288000 
tcttatagtg aacatgtggc aaggcacaat 288060 
catgatgaca agtgttgtgg agcaaagatt 288120 
tccctgtatg agaaaatgac tggtgtggca 288180 
cctggttgta aaagtggcat gctcaatcta 288240 
tgtaagagaa tgatactacc atacacatcg 288300 
ataaatcgcg cagaggcttg tttttactcc 288360 
gcgaggagaa ttgcgcatgg agctaaagac 288420 
aatgtaaaag atttggatag actgtgtgcc 288480 
gtatgtgatg agccgaatga aagtacggac 288540 
attaatctta ccgagattgt tgattcggaa 288600 
tggccagcaa aatttacctg taactttatt 288660 
agtacgtgca gagatgctgt aacttttcta 288720 
tgggatgatc aatcggcagt ggggcaagcc 288780 
ggggaattgc ccaaaaatat gtttgattta 288840 
ggcgacagtt tcttgttacg tgcgataaac 288900 
cctagtctgg aactcgttaa aagaaaggtg 288960 
cacgagaaaa gggtgcgtcc agacgcatct 289020 
gtcaaaagtt atctcatgga agttttactt 289080 
cattcatttg taggcaaaac tctcctcatt 289140 
gatgccacaa gaaacgccta tatccagaac 289200 
gcagcctcat tttctgggtc gctccccaaa 289260 
attgaatgga tgtatgaaaa ggacaatgat 289320 
aaaaaggcta tccagaaata tggaggttgt 289380 
atgtgctgga tatgtgaaga gaaggtttct 289440 
cacaggattg tttatagtaa ctgtgttagg 289500 
tttgagattt gtaccatgaa aaatgtagaa 289560 
aatggatttt tctttgatgt acaagaaatg 289620 
atgttataac ttgttttatt gtccattata 289680 
tgttcaaaag ggcaacataa tcgccttctt 289740 
tgcttgtggc tacttcttct tcttccccac 289800 
ctgtgttgaa aatcatgcga cgctctcggc 289860 
tttcttcctt ttttattatg tccttgcctt 289920 
ctacaccatc ttgatatact tttccattat 289980 
cctctctgac catctccatc aacaaccttt 290040 
taccctttcc ggatgcaatg aaatggttat 290100 
atataggtgc atctgagcca gttatggaca 290160 
ccctctttct gtagtcgtcc gaagagacgt 290220 
aggaaccatt tccattggtt ctgcacaaca 290280 
ccagacaggc accgccgtca tatctggatg 290340 
gaagagaaat cctttgttct tctgaggcag 290400 
cttccagaac acttctttcg ctaactagaa 290460 
ccttttcgaa attatctaat ttttgattac 290520 
tgaaaaggca tgacatgtaa tttccaatgc 290580 
aaagtaatcc aagtttatcc attggtacaa 290640 
tggggtttgc gatacttaag ccattgtacg 290700 
cacgagaaga ggtaccaccg tgttgcacta 290760 
tacaatttct aggagaaaac ccacacccta 290820 
cacatttctt aacggccgca tttgaaagac 290880 
atttaaagtt cttgtcatta actagagcct 290940 
atgggcttgg tagtgattcg ttccattttt 291000 
aactaccttg ttttgggaga ctggatgtgt 291060 
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attttagtcc aatctgaagc gtaggatctc 
tcttgatgtg gagtacttcc ttgtctctaa 
cacttctcct gacactcttg actgatttga 
tattttcccc agacaaacaa ataccatttt 
ttataccgtg taaccctaac aatggtgtgc 
gatctgggaa gtgtactcgt gatttctgta 
ttacaggaaa agatgataaa atgaagacgt 
atatcaaatt tgggcaaatt acatattttt 
ttttggagtg aaattactaa aaaaggctat 
gacattaaat tggtttgctc acccccccaa 
tatataaaac cagcctgcac cccctcggaa 
tgacatctcc agctccatca ccctcttcca 
accgatgtgg tttcctcctt gacaacaaca 
aattcaagtg tgaacccaaa aatctggaac 
atgttgttac ccagatatcc cccgaccaga 
actggtctaa atctggtcat gaacgcttca 
ttacacgccc ttcagaagtg attgaaagtg 
tgaagtacac acgttccttg atggaccaca 
aattgagcaa aattagtagt tggtgtacaa 
tctagaaaac tttgtcaagg aagaacatga 
gactggattc aacatattta atagttcccc 
cgccctaatt tttatggcgg ctttcttgaa 
caatatggac ttgtacacgt ttgattatgc 
ccacgagaag agtttttctg tcgtccgagc 
gagaaaaatc atgcaagccc tggacaataa 
tgttggtgga gcacctgcgt tcacatttat 
cctagaagct agtttcgatg tgattggggg 
aacaccatgc ccccattctt cagccatgca 
atgtattgcc tgcttcgcat caatgttctt 
cttcatatca acggatgcct ctaaaattgg 
acgaaacaat gaaaatggag gagaagaaaa 
ggctgatatt acccaaaagg ctcatgaaac 
caatggcagg cagtggaatt ggcctactca 
tacccattct ctagtaaaca cattgaaaaa 
ttttgatatc ttgtacaact tgcttaatcc 
aaattgtcac attttaactg gacataaaaa 
ttctgggaag tggtggacta ttaattttgt 
aaaatgtaaa gttgaaaagg atagaaaaat 
tcgtctacct aatccaggaa gcactaccgt 
tagaggggaa aatctaggga gtgtaggtga 
gaatttttgt ctcatggttg aaaataggaa 
ctcttcagaa tcgatagtgt ctgatccctt 
taatcgtgcc caagatacta ttaataatgg 
aaagtcacgt gaaggtgctc gtgtatggtt 
aaacgttaac cacgatacgg ccatctctac 
caagatactg tactatgata ttgaaacaac 
catcacatct attgggttct gtttgtgtac 
aggagtattt ggactggttg cacctggatc 
aaattcgtac gatcctgaag aaaaggaaga 
aattttcacc aacgagtttg aaatgttgct 
gcctcacgtg attagtgggt ggaacaatgt 
tatcgtcaaa catttgagtg atcacaccaa 
tacagcagaa tctgtccttc ctagagcaac 
tagattgagc acccctcaag aaagaataca 
gggaaaattt gtagacaaga aaactggcat 
ggccggggca gaaagtcagg ccaataccaa 
taataaagga tcagcaggat ggttccagaa 
gttggatctc atgaaagtgt gcgaaaaggc 
gaacgccgtg ctcgccaaag tgagtagtgt 
agtagaccta cactttcatc tattgggatt 
aaaagtacac gtctattgtt gcaaggatgc 



ctgttaattg tctcaatctt tcgttaccgt 291120 
gacgaactat aaaatgtcct tcctttagct 291180 
atgttaatag tacccgttca taattttcgg 291240 
ttacttttaa ttgtttaaag tcttccattt 291300 
cagtataata aatctacgga ccatgctctt 2 91360 
gaaattgcta tttcctattc attatttcca 291420 
gcgatgagat atgtgttttt tcaatatcgt 291480 
tagcgtcaac actcatcttc atgcaagact 291540 
atttcatgta caacataaaa cacatgtgca 291600 
catcgttttc atcaactctt tcctcacctc 291660 
cccacagtcg acttgtaccc tccgtcgaga 291720 
cccccaaatc cagttgtacc actattgtaa 291780 
aggaagtggt catctacgac accaattcca 291840 
taattggtgt actttctgga gtctctgata 291900 
tatttgtggg aacatatatg gtcaaatata 291960 
gtgacatgag taacaactgt ctggacaata 2 92020 
tgataaagaa aacgtccagc gactttaaaa 292080 
ccgagaaata ctatttttct ggtgaccaaa 292140 
cccctatacg acagtgggta tgcaactccg 292200 
aactgtcgtg gtgcacaacc cttctggaat 292260 
cgtgtatttt gaagtgcaca atgagatgga 292320 
gcacaatagt ttatggggag aaattaacgc 292380 
gggtgctttt ctggacgaaa gatggtgcca 292440 
acaacttatc aactcgtatt acaagtgcag 292500 
ctacaacaac aagaataaga agaggaagaa 292560 
gagcggggac ggagagggag gaaaggaagc 292620 
aacaagagga ggaagatttg gtgttgattc 292680 
actaaaactg gacaatgaag gaaactatgg 292740 
tgtattggag aacccaggtg atgaatcttc 292800 
acaagcgcaa gcatggatag atgaacgact 292860 
taatgtcttt aaaaagacct tccatatgct 292920 
tgcctattcc aataccatcc cacttggacc 292980 
cactgtggaa cctattgccc atgaatttgt 293040 
tctaggggat agaaaacttc cccgattcaa 293100 
atttggaaaa atgttgctag tgtttattca 293160 
caatgaaaat gtggtgcctc gaggttctgc 293220 
gggtgtgaac atgtggactt ttcaagtaac 293280 
atccgatttg gcctgtatgg aaactctccc 293340 
cgatgacaga atagttttta agggattctg 293400 
agtcgtatcc gacattacac agagtgtcaa 293460 
atttagtgtg gataaagaaa ctggtttcat 293520 
cttttcacta gaagtgactg gctgtagatc 293580 
ccgagttagt gctcgtgtaa tgaggatcct 293640 
ggccaaggat gaaaatgcca tcatctttga 293700 
ggacgctatg gagcgagcta tagggcagca 293760 
agataaagat ttcaccgaca aaaaatcagt 293820 
gggaggcgat atgacacatg gaggagagag 293880 
cgacgtggaa aaggtgaaag agactataat 293940 
cattatgaaa cagtgccctc aagtgattga 294000 
tggttttgga aagtatatag ataaagtgaa 294060 
agcttttgac gacccctttg tctttactcg 294120 
agacatgtct tattgtgtag cagatgcatc 294180 
agaaggagga ggaggaggag aaactccata 294240 
actagcaagc actggtattt tcaataaatt 294300 
gttgaaacct gaaatgactg cagatttatt 294360 
gtttaaggaa cgcaacaagt tatcctccag 294420 
aattattggc ggtatgtgca gtgctattcg 294480 
ctataaagaa tccctctctg aatttaattt 294540 
cggcgacaag gttaaaaatg taaaagatga 294600 
cttgaagctg aagaaggccc aggatcaggc 294660 
ctacttgact ggtatagttt ctacctccat 294720 



wo 01/38351 



82 



PCTAJSOO/28888 



caacaaggaa 

gacagccaac 

agaagaaagg 

aaagggaggt 

gacaagtttg 

ccatcgacaa 

caccgactct 

gtacagaccc 

tccaattact 

gaagacaaat 

aggtatggat 

gaatgatgat 

aatgttaatc 

ggaggatatt 

aagaacccat 

ccaaatggct 

tgaaaaattc 

tgattcggcc 

gttggttagg 

agccacttac 

gggagacatg 

gaattccatg 

tggatctgcc 

aggcacaagg 

caatattggt 

agattctgta 

tgaacaaact 

aattattccc 

cgctggtgtg 

tgccaagaaa 

gataaaattg 

ccacgcagat 

acccgctggt 

tatggaggaa 

caacttgtat 

ttcccgcccc 

tttcacccct 

aaagggtctc 

ttatttggat 

tatgatgatt 

taatcctcta 

gcagaaa ttt 

agtccctgct 

aagtgttcct 

atacaagatg 

agtgttgtgt 

taatacaggc 

taaacaaatg 

cgacgaagaa 

ctgtgtcacg 

taaaacaagc 

ggaggaaatt 

tgaccgttta 

cttacaacaa 

ccaaagagta 

ctccaacaag 

tgctgcagt t 

ccattggaaa 

tgtgcctttt 

cgacatggct 

tgagaacgaa 



ggggagattt 

ctggccactc 

gcagatagaa 

atggtgagtc 

tacccgatga 

attatgcaac 

ttattgttgt 

attgatattg 

cgcatagagg 

aataaaacgt 

tacttccccg 

atgcatatag 

gacagaccgt 

ctctcagaac 

tggacattta 

cgtcacatta 

gagaggttgg 

aatgatccag 

acatggaacg 

agagccgatc 

aaaagagctg 

tatggtcatt 

aatactgcct 

cactcggtga 

tgtggattac 

ttctgcgtgc 

ggcaaatatt 

atcctagtca 

ggcatgatga 

acataccata 

aaatcaaccg 

gggtatgttg 

aagaaattga 

tggtttacct 

gcctcacaaa 

atagaagcgg 

tacaaaaagg 

caatcattga 

gtgatgaaga 

actagttccc 

gctctagcga 

aagacggtga 

ggttatttta 

gcattttcag 

gttgagagcg 

tctacatctg 

gtcattgtta 

ttattgtttg 

gcagaagaag 

gaaaaaaaga 

gagaaaacgg 

gctagttcgt 

ctaaaagcaa 

tatagcaatt 

gcagaccaaa 

aagaggcaag 

aaggttaagt 

aagtgccccg 

gaagttgcac 

tgttgtcaat 

agat tggccc 



ttaggctgtg 

ctctatgtat 

gaggtgtggg 

aacctatcgt 

ccatgtgtca 

tgagggatag 

tggacgttat 

cagtcgcatc 

aaagtttggg 

ggtgcaccaa 

agattgtgtg 

ccccagcaag 

acattggcgc 

tcgaaaagga 

agggtcaaaa 

ttgaatctac 

ttagcttatc 

ctgttagact 

taaaaactga 

gagttgtgat 

gtctaaacaa 

tggccctaag 

caagtatttc 

cggccaatca 

agatggctct 

ataatattgt 

attatgtgat 

actcgttaac 

atatcgccca 

tgcttcactt 

ataacaataa 

ttccccataa 

aaagcttttt 

cttcacctac 

ttgtaggggt 

gtacagaaat 

gagcctttgt 

ttgcaagatt 

atcatgtgga 

gagtcaacaa 

taaataacca 

catcagtttc 

acgctggtag 

tcaagaattt 

ataagacggc 

ccaatactgg 

caaaggacgt 

ttggaggggg 

aggacgaaga 

tccctggacg 

agggagaaag 

tgagtaaatg 

cacattcttc 

gtacattatc 

tggaaaattt 

aaaagggagg 

ttttcccagt 

tgtccatccc 

tcaattctct 

cattgtattt 

ggcaaattgg 



tatggactct 

aggagaagga 

agtaagaaga 

caatcatgtt 

gaataatctg 

attggtactt 

tgacgagtgc 

atggaagaat 

tctaagattc 

tacatctccc 

tgacattaat 

tttagagtat 

acacataaca 

cttttctgtt 

acaatacgat 

gggaagaaat 

agacagaatt 

gtggtcttct 

cattcttaag 

gcagaacaag 

agttggacaa 

agcacgttcg 

caacatgtca 

gattacagaa 

tcctggtact 

aggtgatgga 

ggatattgct 

aaagggcatc 

tgaacgtcta 

taatgaaaat 

taagtttgca 

tccttcattg 

ggaagaggaa 

gtggatggcc 

ggagaagggt 

gatggaggcg 

gaagaaggga 

tttaccaaaa 

gaattttgca 

gtttgatacg 

cctgaaccct 

ttcttggagt 

cgtgcgttgg 

atctgttgtg 

aataaaatcc 

attttctctg 

ggctatggct 

aaaggattac 

aaatggtgaa 

aagcactaac 

aaagggctct 

tgggaagaaa 

ttgcaccaac 

ttcctatata 

aatatctcaa 

gccttttaag 

tttagatgcg 

agaaacgcgt 

aataggaaaa 

tgtcctcttg 

cctagatgac 



gctttaaccg 
gcaatctgta 
cactctattg 
ccctatcaaa 
tgcaccacta 
gaaaaaatga 
aatcagattg 
agcaactcta 
atagaaaatt 
aatatgaatg 
atgcagtttg 
atgcttcaag 
gctggaaaat 
gaaaaagatg 
ttctgtcata 
atccgtgatt 
tatcgccgcg 
cgtctaatca 
ggaatcatcc 
gccaaggagt 
aatattatga 
agccgtaaag 
gccaccggag 
aatgctcgat 
aagcagacgt 
ggaatgatac 
ctaaaaaata 
cagtttgtag 
gctgtcgctg 
agtgcagcgt 
tcctttatca 
attcttagag 
ggaattcatg 
atggatgctt 
aactggattg 
gtgacgcaag 
attacaccca 
atagaggaaa 
tcccatataa 
tcaaaagaac 
tcttcagaaa 
ctttcggcag 
gatgccacca 
cccaacgcca 
atgattgcta 
agaagaggag 
tgtatacgat 
ggtgaagacg 
aacgaagaga 
aaggatgttg 
aagacggcaa 
gatgcgagag 
aatgaagaga 
acttcagtca 
ttggatcaaa 
tctgaattgg 
tctagaaaat 
gaagaaaaac 
cacaagtgca 
tacacgctag 
tctgtagatt 



aggcggtcgt 

gaaatatggg 

ccacagacac 

cgattgacat 

cctttgtgac 

aaaacaaaac 

tgttgtccga 

atagacaaac 

tggatgccga 

tcactgccgc 

cggccaaggt 

tattgcccat 

gtcgtacatt 

aggaaattat 

gtcctgtgac 

atgaaggcaa 

ttggcgcatt: 

atgttggaat 

ctcaaatgca 

ttgccaagat 

agctcggtat 

agtttgcgtc 

gaattggagg 

gtgtatttgg 

acggggatac 

cagaatatga 

aaatggctgc 

agcgccgaga 

gtcttttgtt 

tcaatgacat 

agagacctag 

cggccgaagg 

acgagaagag 

ctgttatcaa 

acgccatgac 

cgaatgcagc 

ccaccaaact 

agaaaagctg 

caaatcctgc 

agagtagacc 

tttcattggg 

aggaagggga 

acatgaaggg 

tcacatctgt 

aaaatgtaga 

cattgtcatt 

ctctaaataa 

acgacgacga 

acaaaggtga 

gtgaagaaac 

agggaaagac 

atgtcattct 

gaaccagagt 

tgaaat tgga 

tacgtaa tct 

acgccatggt 

tgactcaaga 

cattaatggg 

cagatacatg 

ctttaaaatt 

tgatggctga 



294780 

294840 

294900 

294960 

295020 

295080 

295140 

295200 

295260 

295320 

295380 

295440 

295500 

295560 

295620 

295680 

295740 

295800 

295860 

295920 

295980 

296040 

296100 

296160 

296220 

296280 

296340 

296400 

296460 

296520 

296580 

296640 

296700 

296760 

296820 

296880 

296940 

297000 

297060 

297120 

297180 

297240 

297300 

297360 

297420 

297480 

297540 

297600 

297660 

297720 

297780 

297840 

297900 

297960 

298020 

298080 

298140 

298200 

298260 

298320 

298380 
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gatgttgttc 
tcaagataga 
aattatattt 
tagtgaaata 
gaattatact 
acgtgtcaat 
agaattggct 
taatttcatt 
gcagcaatag 
tcattttcta 
gcaatgattg 
acattctggc 
ttcgggaagt 
tcccacaaca 
caattctgta 
catgtatttt 
ctctacttct 
gagaggttgt 
actgttaata 
atgttcaaat 
ttccatggca 
gttgtatatg 
gataatgcat 
ggtagcgttc 
agagatattg 
gtcaggaaga 
tgttgcataa 
ttcagaacaa 
gtaagacgat 
atttctcacg 
attgaaacta 
tctgtaaaca 
agtgggtacc 
cgattttgct 
ttcacctccc 
tattttacta 
tgatgcagca 
tggcatcatc 
cagacatatt 
ttagtgttac 
aagatttgcc 
taggggaaga 
caaaaaacat 
taaaagtaag 
cagttggccc 
gagcagttgg 
atggagcagt 
gagatggagc 
gagcagttgg 
ttggccctgc 
cagttggccc 
gagcagttgg 
atggcgtgga 
gagatggagc 
gaaaggatgg 
acggttctat 
gggacggcac 
gaccaccagg 
caggacccgt 
caattggccc 
gtgtggatgg 



ggaggggata 
aagttggtga 
ttgtttgagt 
aaagacgctg 
gatgaaagat 
ggaaattgta 
gctgcatgtc 
ctggttttta 
cagcattatt 
cagcccatgg 
caatgatcaa 
taaaataaaa 
tttgcagcat 
tcacgcgtca 
ttgattatag 
acaatcgaat 
ctcccacgtg 
aaaatagagt 
tcaatagaaa 
cttactacat 
tcctctcctc 
ttcattgagc 
gcaccgacag 
tct tcatcat 
gaagccggtt 
gtctcgaata 
gatagccgcc 
agatctccta 
acatgaagtc 
tcaactcgtc 
aaatctcttt 
ggagaccaat 
gctctgcggt 
ccttcatttt 
attttattta 
cactgaaaaa 
acaatacctt 
ttctcttcaa 
tgttgtgata 
aaaaaatgtg 
tgataaggtg 
tgaggaggaa 
aagacttaaa 
gaaat taca t 
tgcaggccct 
ccctgcaggc 
tggccctcaa 
agttggccct 
ccctcaagga 
aggccctcaa 
tgcaggccct 
ccctgcaggc 
cggttctgtg 
agttggccct 
tgcagttggt 
agggcctcaa 
tgcagcagaa 
aaaggatggt 
tggacctagg 
tgcaggccct 
ttctatcggc 



aactattggc 
aatctttatt 
ctttaaggtt 
ttagaggtct 
tttttggacc 
aattgtcctt 
tatcttgtgt 
ttttatatcc 
ttcagcgaca 
tcgtttaagg 
tattaatagt 
aagtttatct 
gacacactat 
tatttaaaac 
ggtttacaac 
atgagttgtc 
tattgagcga 
tatatttagg 
ctgagctata 
aacccaagaa 
ttggctcttt 
tatgatttcc 
aggtgtagaa 
agtaaccgag 
cagtagcgga 
aaataccgac 
ctgaactact 
taaaattgtg 
ctggtccagt 
tcaccttcac 
cgtggggatc 
ctgaatagta 
gcattcggta 
cccagagcgc 
ttttcctcta 
a tggcgtaca 
cttccaccct 
ccttcctctt 
act tgtgttg 
gtaaaacttt 
tacaaattgt 
gaaagggagg 
agaaagtgga 
ggcccgagag 
caaggagaaa 
cctcaaggag 
ggccctccag 
caaggagaaa 
gaaagaggag 
ggagaaagag 
ccaggagaaa 
cctccaggag 
ggccctcaag 
gcaggccctc 
cctgcaggcc 
ggaatacaag 
agaggagaaa 
g tggatggtt 
ggtgaaccag 
ccaggagaaa 
ccccaaggaa 



ccaggaagtg 
gcctttaaat 
tgctcagaaa 
ggccttttct 
attgtataac 
tataactggt 
tttgtaagaa 
tcaagagttg 
gcttcttcat 
aatattcttt 
agcgaattaa 
ctcaatgaaa 
tactccaaaa 
atgcaccttt 
ctttaggcct 
ttgtgcacca 
gagatcatta 
tttaattagg 
tgctgacgac 
attataatct 
tagtacaatt 
cacaacatca 
taatcctcca 
cttgattttc 
aaaaatagac 
aggggaatta 
tgtattataa 
attagggcag 
tggcacgaaa 
tggtactttt 
acttccatcc 
ccttaaagct 
cacctccaaa 
atcagaatcg 
tatgataaaa 
ttgaccaagg 
ctaataggct 
ctaataaacc 
ttttaattgc 
ctaaagaaca 
ttgaaaattt 
agagggagga 
aagaaatgac 
gagaaagagg 
gaggagcaat 
aaagaggagc 
gagaaagagg 
gaggagcaat 
caattggacc 
gagaaaatgg 
gaggagcaat 
aaagaggagc 
gagaaagagg 
aaggaagaag 
ctcaaggaga 
gcccaagagg 
gaggcttccc 
ctgagggccc 
gtctagctgg 
gaggagcaac 
gaagaggagc 



ttaaaaaggg 
tataaccatg 
cctgtagctg 
accactacag 
atggatgaac 
atttatcata 
atgataataa 
tagtagtagc 
ttctattgat 
gaagcagaaa 
ttaaaataag 
aatttttatt 
ctatgctcac 
aaagcctgtt 
tcctttagag 
gacgggagcc 
agcacaactt 
aaacatttgt 
atggcttgca 
gcttcttcac 
tcttttggtt 
tcaaagaaaa 
tatctaatga 
acatgactca 
acttgttgga 
tcttcaacac 
attgttatat 
accaacatcc 
atggcagccg 
ccaaatcgtc 
tttaaagaga 
gttctatttt 
tccgtttcct 
tacctgtcgt 
accacaccta 
ggcgtttgga 
ctccaaacaa 
taggagtaca 
tgcttttata 
aaccgagaaa 
aaaggatgga 
aggggaggaa 
tgaacaaggg 
agaaactggt 
tggaccggca 
aattggaccg 
agaaaatgga 
tggaccggca 
ggcaggaaag 
acgcccagga 
tggaccggca 
aacaggtata 
agaaattgga 
aggagcaaca 
aaaaggagaa 
agagactgga 
aggaccacca 
tcaagggaaa 
cctcccagga 
tggtctacca 
aacaggccgc 



taaaagatgc 
acacaaatac 
gtatgagtgt 
gtactgtgtg 
tttgtaacga 
cggcagcagt 
aaaacatgta 
agcagcagca 
tgatatctgg 
cacataccca 
cattttgtcc 
gaccaatcac 
tcgtatctag 
ttttcttata 
atactatttt 
tgtcatgaaa 
ctgaagaaac 
ggaaacttga 
ttttcaaatc 
catcatcttc 
ggttagtcat 
ccccttctgg 
caccgactgc 
tgggtccgat 
acattttgtc 
tattaaagtt 
cggatgctat 
cgtctccagt 
ttcgtgaagg 
gtggatctac 
agagaggcac 
tttcttctag 
ttacttgaat 
cgaacgaatc 
aaatgtctaa 
gccaaatctg 
caacctccac 
tcaagagtaa 
agcaactcat 
atactagaaa 
acatttggta 
gagcttgaaa 
gatcaaggaa 
ccagcaggag 
ggaaaggatg 
gcaggaaagg 
cgcccaggaa 
ggaaaggatg 
gatggagcag 
agagatggag 
ggaagagatg 
ccaggaaggg 
cgcccaggaa 
ggacgcgcag 
gctggtaagg 
ccaccgggaa 
ggcgaaactg 
agaggagaaa 
agagatggag 
ggaaggaatg 
gcaggaaagg 



298440 
298500 
298560 
298620 
298680 
298740 
298800 
298860 
298920 
298980 
299040 
299100 
299160 
299220 
299280 
299340 
299400 
299460 
299520 
299580 
299640 
299700 
299760 
299820 
299880 
299940 
300000 
300060 
300120 
300180 
300240 
300300 
300360 
300420 
300480 
300540 
300600 
300660 
300720 
300780 
300840 
300900 
300960 
301020 
301080 
301140 
301200 
301260 
301320 
301380 
301440 
301500 
301560 
301620 
301680 
301740 
301800 
301860 
301920 
301980 
302040 
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atggggcagt 
gggatggtgt 
gaagggacgg 
caggaagaga 
aaaatggacg 
gtgcaatggg 
caggccgcgc 
aaactggacc 
gattaactgg 
aaggagaaaa 
ctgctggccc 
gccctatagg 
atggagcaac 
tggacggttc 
gagcagttgg 
atggagcaac 
tgccaggaag 
gaaaggatgg 
taggtcctgc 
ctgttggccc 
gccctgtagg 
atggaaagga 
aaagaggaga 
gcccccaagg 
agggacggcc 
taatgggtcc 
gacctgaagg 
gaggattaaa 
aggacggaat 
gattaagagg 
aaggcgatag 
ctcaaggtct 
gccgttcagg 
aaggaagaac 
tacaaggaaa 
gagaccgtgg 
gaggaattcg 
ctagaggggt 
gaccaacagg 
ctggtcccat 
gaaagggaaa 
aatattcaat 
catacattgg 
ggatgacaaa 
taacaactag 
ttttagttac 
cgaggtcaga 
ggccgtctga 
cagacgcaag 
caacgtctgt 
ctgatagagg 
tgcaagc 



tggccctgca 
ggacggttct 
ttcagttggc 
tggggcaact 
cccaggaaga 
aaagaatggc 
aggaaaggat 
agcaggaagg 
cagcccagga 
aggagaaaat 
tcaaggagaa 
tcctgctggc 
tggccctata 
tacgggtcct 
ccctgctggc 
tggccctata 
agatggagca 
ggcaactggc 
tggccctcaa 
tcaaggaaaa 
tcctgcaggc 
cggttccgtg 
aactggacga 
actaagagga 
aggaaaagat 
taggggtctt 
aagagatggt 
aggtatccag 
agaaggccct 
ttttcaagga 
agggctaaca 
tagaggagaa 
acctcaagga 
aggaatacaa 
ggaaggaaga 
agaacaaggc 
aggccctatt 
gagaggtgtt 
tcctcaaggg 
gggccctcct 
aaatggccct 
cacccatgct 
aagatcttat 
tgaggacaaa 
aggaagaaca 
tttactgatg 
gagggtgaat 
ccaatctata 
catgagttac 
gcttttatta 
aactgtaggg 



ggccctccag 
gtgggccctc 
cctgctggcc 
ggccctatag 
gatggagcaa 
gtggacggtt 
ggggcagttg 
gacggttcag 
agagatggag 
ggacgcccag 
aaaggagaaa 
cctcaaggag 
ggtcctagag 
caaggaagaa 
cctcaaggag 
ggtcctgctg 
attggtcctc 
cctatgggtc 
ggagcaactg 
agaggattaa 
cctaaaggag 
ggtcctcaag 
ccaggaaggg 
gctacgggag 
ggtgaaacag 
cgaggagaaa 
gcacctggtc 
ggacgaccag 
agaggtcaag 
agaacaggag 
ggccctcaag 
aggggagcac 
agtaacggcg 
ggcctcactg 
atggggaaaa 
atcgctggag 
ggtgctcctg 
cctggctatc 
ccagcaggtc 
ggagcagtgg 
aaaggagcgg 
cgtgcagaga 
ggaactgata 
aaaaacgaa t 
tcgggtgttt 
ccagaaagtg 
gctgttagag 
ggtactcatt 
tcagttgata 
gcagaaacca 
aggttgatgg 



gagaaagagg 
caggagaaag 
ctcatggaga 
gtcctgctgg 
ctggccctat 
ctatgggtcc 
gccctgctgg 
ttggccctgc 
caactggccc 
gaagagatgg 
atggacgccc 
aaacaggatt 
gagaaactgg 
gaggagcaac 
aaagaggaga 
gccctcaagg 
aaggagaaaa 
ctccaggaga 
gtcttccagg 
tagggcgcac 
aaacaggatt 
gagcaattgg 
acggtgagga 
ctccaggacc 
gtcctccagg 
aaggagcacc 
ccgctggccc 
gaagagacgg 
atggaacaac 
aaactggtgc 
gaagagacgg 
c tggccctag 
tgcaaggacc 
gcatagaagg 
ttggacatcg 
cagacgggga 
gtaagcctgg 
ctggcgcaca 
ctcaagggcc 
gaccaagagg 
acggaaaaga 
taatgtggga 
caatccctgt 
at tgtataca 
ttgtggtaag 
t ttcctgtag 
aaagagaaag 
cacgttcaaa 
cattagacgc 
tacacgggga 
tacctcccac 



agcaacaggt 
aggagaaact 
aagaggagaa 
tcctcaagga 
aggccctaga 
tcaaggaaga 
ccctccagga 
tggccctcaa 
tataggtcct 
agcaactggc 
aggaagagat 
aactggacgc 
tgcaatggga 
aggccgcgca 
aaatggacgc 
agaaacagga 
gggagaaaat 
aaggggagag 
aagggatggt 
aggaagggat 
agctggcctg 
acctataggc 
tggttccaca 
gcaaggagaa 
gcgacaagga 
tggtaatgat 
tattggacct 
agaaatggga 
tggcgctaaa 
acaaggatct 
tccacccggt 
aggtcctaga 
tcgaggtccc 
tcctcgaggt 
aggagaaaag 
aaaaggtcca 
cacggaaggg 
aggggaatta 
gatggggcgt 
agagaaagga 
tgccgtaaat 
aggaaatgaa 
gatgatagaa 
agtaatgaca 
caataagaca 
aacagatgtc 
caaatcgtac 
aattgccgtg 
tgatgtggcg 
aaaagataga 
tgaagaagag 



ataccaggaa 
ggaccagcag 
aatggacgcc 
gaaaaaggag 
ggagaaactg 
agaggagcaa 
gaaagaggag 
ggagaaacag 
gctggccctc 
cctataggtc 
ggagcaactg 
ccaggaagag 
aagaatggtg 
ggaaaggatg 
ccaggaagag 
ttagctgggc 
ggacgcccag 
actggtccta 
gtggatggtt 
ggggcaattg 
ccagggatag 
ccacgaggag 
ggccctatgg 
agaggattaa 
agggatggaa 
ggtctagagg 
caaggaataa 
ccagccggca 
ggacctagag 
agaggagaaa 
gaagaaggtc 
ggtattcgtg 
cgaggaacaa 
cctagaggta 
ggtgataaag 
agaggtttac 
gttagaggtc 
ggtccccaag 
acaggagata 
ggtagaggaa 
atcatacaaa 
atcggagaag 
aatagaatag 
atgcactcaa 
gattatatcc 
agtacaaatg 
agatttatta 
gtaatgtatc 
cgaagagaaa 
ggtttctatg 
ttattggtat 



302100 
302160 
302220 
302280 
302340 
302400 
302460 
302520 
302580 
302640 
302700 
302760 
302820 
302880 
302940 
303000 
303060 
303120 
303180 
303240 
303300 
303360 
303420 
303480 
303540 
303600 
303660 
303720 
303780 
303840 
303900 
303960 
304020 
304080 
304140 
304200 
304260 
304320 
304380 
304440 
304500 
304560 
304620 
304680 
304740 
304800 
304860 
304920 
304980 
305040 
305100 
305107 



<210> 2 
<211> 855 
<212> DNA 
<213> SHRIMP 



<400> 2 

atggctgctg caaaaatgga cgctatcctt gcggatatta atgggaatga tacagattta 60 
tccaagctaa tcacagacgt gattcaaaag agagccaagg ctgtcatgga tagaaatagg 120 
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gctaaaatgg 
aaacaaaaag 
gtccctccaa 
catattgaac 
aacgtacaag 
gccatggtag 
gctcctacct 
tttacaaagg 
ggtaaaagta 
tgtagcgtct 
cgtgaatatg 
tttattccaa 
tcccgtatta 



acatgaatag 
cattagtggt 
cattatcggg 
ttgtagagag 
acatagacga 
tggtaaaagt 
ttggtgtgac 
ctgataaatc 
gacaactgac 
ttgcagagtt 
acaaacctaa 
tacaacccgc 
tatag 



aagagtagat 
atttgataaa 
atccgattac 
tgttgaggga 
ccaaatcggt 
cgactgtgaa 
acaaaataat 
tgtgcgtatt 
gtatgcggta 
tgaatctgac 
aaatgaattc 
agggacgaaa 



gaggctattc 
ctcgtggaag 
gacgcgtggg 
gattctttgt 
gatgaga tag 
actggggata 
aaaatataca 
tttaaacttg 
aaaaataaca 
agtgacacta 
gaatatgagg 
ttattgttat 



aggaagccgt 
aaactgacag 
tagacagagc 
atgataaact 
atacaccaat 
tcgaagaaga 
gagatgaaag 
ctaaattgga 
atgaatatac 
ctaaatcagg 
agcgagagat 
actttttggt 



agcggccaag 
cggacaaagt 
catgccttcg 
ccctcctttt 
atcttacctt 
gtacaatctt 
agaccagatt 
tagtatatca 
agagtttgtc 
tataggtatt 
ctttacattt 
ggacgtcagg 



180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
855 



<210> 3 
<211> 284 
<212> PRT 
<213> SHRIMP 



<400> 3 

Met Ala Ala Ala 
1 

Asp Thr Asp Leu 
20 

Lys Ala Val Met 
35 

Val Asp Glu Ala 
50 

Leu Val Val Phe 
65 

Val Pro Pro Thr 

Ala Met Pro Ser 
100 

Leu Tyr Asp Lys 
115 

lie Gly Asp Glu 
130 

Val Lys Val Asp 
145 

Ala Pro Thr Phe 

Arg Asp Gin lie 
180 

Leu Ala Lys Leu 
195 

Ala Val Lys Asn 
210 

Ala Glu Phe Glu 
225 

Arg Glu Tyr Asp 

lie Phe Thr Phe 
260 

Leu Tyr Phe Leu 
275 



Lys Met Asp Ala 
5 

Ser Lys Leu lie 

Asp Arg Asn Arg 
40 

lie Gin Glu Ala 
55 

Asp Lys Leu Val 
70 

Leu Ser Gly Ser 
85 

His lie Glu Leu 

Leu Pro Pro Phe 
120 

lie Asp Thr Pro 
135 

Cys Glu Thr Gly 
150 

Gly Val Thr Gin 
165 

Phe Thr Lys Ala 

Asp Ser lie Ser 
200 

Asn Asn Glu Tyr 
215 

Ser Asp Ser Asp 
230 

Lys Pro Lys Asn 
245 

Phe lie Pro lie 

Val Asp Val Arg 
280 



lie Leu Ala Asp 
10 

Thr Asp Val lie 
25 

Ala Lys Met Asp 

Val Ala Ala Lys 
60 

Glu Glu Thr Asp 
75 

Asp Tyr Asp Ala 
90 

Val Glu Ser Val 
105 

Asn Val Gin Asp 

lie Ser Tyr Leu 
140 

Asp lie Glu Glu 
155 

Asn Asn Lys lie 
170 

Asp Lys Ser Val 
185 

Gly Lys Ser Arg 

Thr Glu Phe Val 
220 

Thr Thr Lys Ser 

235 

Glu Phe Glu Tyr 
250 

Gin Pro Ala Gly 
265 

Ser Arg He He 



He 


Asn 


Gly 


Asn 






15 




Gin 


Lys 


Arg 


Ala 




30 






Met 


Asn 


Arg 


Arg 


45 








Lys 


Gin 


Lys 


Ala 


Ser 


Gly 


Gin 


Ser 








80 


Trp 


Val 


Asp 


Arg 






95 




Glu 


Gly 


Asp 


Ser 




110 






He 


Asp 


Asp 


Gin 


125 








Ala 


Met 


Val 


Val 


Glu 


Tyr 


Asn 


Leu 








160 


Tyr 


Arg 


Asp 


Glu 






175 




Arg 


He 


Phe 


Lys 




190 






Gin 


Leu 


Thr 


Tyr 


205 








Cys 


Ser 


Val 


Phe 


Gly 


He 


Gly 


He 








240 


Glu 


Glu 


Arg 


Glu 






255 




Thr 


Lys 


Leu 


Leu 




270 







<210> 4 
<2H> 1461 
<212> DNA 
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<213> SHRIMP 



<40Q> A 

atgttcgtca 

ataacaattt 

aggaagagat 

aacggaggag 

ggaacaacaa 

ggttctggaa 

ggaggaggaa 

gacacagacg 

tcttcaaaac 

caattatctc 

gacgcctcag 

tcatcggcct 

aatgaggaag 

gaatacgatg 

cttgt tacta 

gctttccctc 

ggaactatta 

gaaggatcaa 

gtagtaaata 

tgctttgtaa 

aaaggtga tg 

cat ttttcaa 

ttctcagaaa 

gacgccaacg 

aatttattgt 



taagcatagc 
tagatggtgc 
atcgtaccag 
gaggaggtgg 
acggaggagg 
caacaaacgg 
atggaaatgg 
attttgagcc 
ctaaagaata 
gagatgattc 
acactctaca 
ttttagaagg 
aaattgcaca 
gaaaatatgc 
aatacacaaa 
atgagattga 
cttcaaacac 
caacagatcc 
aggaacaaca 
ttattcctcg 
aaattaaaca 
tcatcacaga 
gaattctccc 
tactgtcagc 
ccagccaata 



tacgtcattg 
aaaaacaatc 
tgagagtggt 
aggagaagga 
aggtggagga 

aggaggaaat 
tacgccagcc 
ctatgaagca 
tacccaaact 
aggaaaacca 
taccactatc 
aacaatatta 
cacacccgag 
ccatgaagtg 
gtttgagaga 
tatgcagtac 
aggggaagtt 
ttctctagaa 
tttatccgca 
agaaacctat 
caaggatgaa 
cactttgagt 
ctacggaaag 
a 



gtgttattct 
gactctcaac 
gacggcatag 
ggtggtggtg 
gaaggaggtg 
ggaggagaag 
ggaaatggaa 
cttctaaaag 
ttcgtatctg 
ataataattg 
agggattatt 
cgcaaagcgg 
agtcaactaa 
gaaagagctg 
ggtcttttag 
tgcataattc 
aggtccaacg 
gtccctgatg 
atatctgcaa 
ataggaaaaa 
ctgtttgtgg 
tctgttggaa 
gaccctgcaa 
cgcctaggtg 



tcttccttct 
cttttaggaa 
acggaggaac 
gaacaaatgg 
gtggtggaac 
gaggtggtgg 
atggaaatgg 
aacgtctact 
cagaggtaga 
atgacgacca 
tgttcaagct 
aagaccgtgc 
gagaaaaaca 
atttttccaa 
taggagaaac 
tagtagagga 
cttacaaaat 
attgtttggt 
ctaacagatg 
atgctaccat 
ccaataagaa 
tagaattaaa 
ccgttccaag 
ttggtgcctt 



tttcgtctca 
aaggaggaag 
tggaacaaca 
aaatggaact 
aaatggaaat 
tggaacaaat 
aggggataca 
taattccata 
aactgcctta 
attagaattg 
cgcaggtgtt 
ccgtaatata 
catcaacgat 
tagtcttaat 
tattgaaaag 
ttttaatagt 
cagagtagta 
ttttgccgta 
ccaagacata 
ggtaataagg 
tgacactact 
catgctcatt 
acctttgact 
tacagacaaa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1461 



<210> 5 

<211> 482 

<212> PRT 

<213> SHRIMP 



<400> 5 



Met 


Phe 


Val 


He 


Ser 


He 


Ala 


Thr 


Ser 


Leu 


Val 


Leu 


Phe 


Phe 


Phe 


Leu 


1 








5 










10 










15 




Leu 


Phe 


Val 


Ser 
20 


He 


Thr 


He 


Leu 


Asp 
25 


Gly 


Ala 


Lys 


Thr 


He 
30 


Asp 


Ser 


Gin 


Pro 


Phe 
35 


Arg 


Lys 


Arg 


Arg 


Lys 
40 


Arg 


Lys 


Arg 


Tyr 


Arg 
45 


Thr 


Ser 


Glu 


Ser 


Gly 
50 


Asp 


Gly 


He 


Asp 


Gly 
55 


Gly 


Thr 


Gly 


Thr 


Thr 
60 


Asn 


Gly 


Gly 


Gly 


Gly 


Gly 


Gly 


Gly 


Glu 


Gly 


Gly 


Gly 


Gly 


Gly 


Thr 


Asn 


Gly 


Asn 


Gly 


Thr 


65 










70 










75 










80 


Gly 


Thr 


Thr 


Asn 


Gly 
85 


Gly 


Gly 


Gly 


Gly 


Gly 
90 


Glu 


Gly 


Gly 


Gly 


Gly 
95 


Gly 


Thr 


Asn 


Gly 


Asn 
100 


Gly 


Ser 


Gly 


Thr 


Thr 
105 


Asn 


Gly 


Gly 


Gly 


Gly 
110 


Gly 


Gly 


Glu 


Gly 


Gly 
115 


Gly 


Gly 


Gly 


Thr 


Asn 
120 


Gly 


Gly 


Gly 


Asn 


Gly 
125 


Asn 


Gly 


Gly 


Gly 


Asn 
130 


Gly 


Asn 


Gly 


Asn 


Gly 
135 


Asn 


Gly 


Gly 


Asp 


Thr 
140 


Asp 


Thr 


Asp 


Asp 


Phe 


Glu 


Pro 


Thr 


Pro 


Ala 


Leu 


Leu 


Lys 


Glu 


Arg 


Leu 


Leu 


Asn 


Ser 


He 


145 










150 










155 










160 


Ser 


Ser 


Lys 


Pro 


Lys 
165 


Glu 


Tyr 


Tyr 


Glu 


Ala 
170 


Phe 


Val 


Ser 


Ala 


Glu 
175 


Val 


Glu 


Thr 


Ala 


Leu 
180 


Gin 


Leu 


Ser 


Arg 


Asp 
185 


Asp 


Ser 


Thr 


Gin 


Thr 
190 


He 


He 


lie 


Asp 


Asp 
195 


Asp 


Gin 


Leu 


Glu 


Leu 
200 


Asp 


Ala 


Ser 


Asp 


Thr 
205 


Leu 


Gin 


Gly 
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Lys 


Pro 


Arg 


Asp 


Tyr 


Leu 


Phe 


Lys 


Leu 


Ala 


Gly 


Val 


Ser 


Ser 


Ala 


Phe 




210 










215 










220 










Leu 


Glu 


Gly 


Thr 


Thr 


He 


Arg 


Lys 


Ala 


Glu 


Asp 


Arg 


Ala 


Arg 


Asn 


He 


225 










230 










235 










240 


Asn 


Glu 


Glu 


Glu 


He 


Ala 


Gin 


Thr 


He 


Leu 


Ser 


Gin 


Leu 


Arg 


Glu 


Lys 










245 










250 










255 




His 


He 


Asn 


Asp 


Glu 


Tyr 


Asp 


Gly 


Lys 


Tyr 


Ala 


Thr 


Pro 


Glu 


Glu 


Arg 








260 










265 










270 






Ala 


Asp 


Phe 


Ser 


Asn 


Ser 


Leu 


Asn 


Leu 


Val 


Thr 


Lys 


Tyr 


Thr 


Asn 


His 






275 










280 










285 








Glu 


Val 


Gly 


Leu 


Leu 


Val 


Gly 


Glu 


Thr 


He 


Glu 


Lys 


Ala 


Phe 


Phe 


He 




290 










295 










300 










Glu 


Phe 


Glu 


Arg 


Cys 


He 


He 


Leu 


Val 


Glu 


Asp 


Phe 


Asn 


Ser 


Gly 


Thr 


305 










310 










315 










320 


lie 


Thr 


Ser 


Asn 


Thr 


Met 


Gin 


Tyr 


Arg 


Ser 


Asn 


Ala 


Tyr 


Lys 


He 


Arg 










325 










330 










335 




Val 


Val 


Glu 


Gly 


Ser 


Thr 


Thr 


Asp 


Pro 


Gly 


Glu 


Val 


Val 


Pro 


Asp 


Asp 








340 










345 










350 






Cys 


Leu 


Val 


Phe 


Ala 


Val 


Val 


Val 


Asn 


Lys 


Glu 


Gin 


His 


Ser 


Leu 


Glu 






355 










360 










365 








lie 


Ser 


Ala 


Thr 


Asn 


Arg 


Cys 


Gin 


Asp 


He 


Cys 


Phe 


Val 


He 


He 


Pro 




370 










375 










380 










Arg 


Leu 


Ser 


Ala 


He 


Gly 


Lys 


Asn 


Ala 


Thr 


Met 


Val 


He 


Arg 


Lys 


Gly 


385 










390 










395 










400 


Asp 


Glu 


He 


Lys 


Gin 


Glu 


Thr 


Tyr 


Leu 


Phe 


Val 


Ala 


Asn 


Lys 


Asn 


Asp 










405 










410 










415 




Thr 


Thr 


His 


Phe 


Ser 


He 


He 


Thr 


Asp 


Lys 


Asp 


Glu 


Ser 


Val 


Gly 


He 








420 










425 










430 






Glu 


Leu 


Asn 


Met 


Leu 


He 


Phe 


Ser 


He 


Leu 


Pro 


Thr 


Leu 


Ser 


Asp 


Pro 






435 










440 










445 








Ala 


Thr 


Val 


Pro 


Arg 


Pro 


Leu 


Thr 


Asp 


Ala 


Asn 


Val 


Leu 


Ser 


Ala 


Tyr 




450 










455 










460 










Gly 


Lys 


Arg 


Leu 


Gly 


Val 


Gly 


Ala 


Phe 


Thr 


Asp 


Lys 


Asn 


Leu 


Leu 


Ser 


465 










470 










475 










480 



Ser Gin 



<210> 6 
<211> 903 
<212> DNA 
<213> SHRIMP 



<400> 6 

atgggagata 

gattggctat 

gagacagccc 

aaggttgaag 

gctgctgata 

acagtggcaa 

tggacattag 

cgtttatcat 

tttttaagaa 

tccttgtcag 

aaggttgaca 

atttgtatgt 

aaatttcatt 

aatcgatcgc 

agttctgctt 

tag 



agcaaaaggt 
cggctaatgt 
aagttgttaa 
aaatgagacc 
tgataagaga 
gaactatcaa 
cctacaaacc 
ataaagacct 
taaaaaagat 
tg tcttatcc 
tacacatcag 
tagatactga 
ttgaagcagg 
atatag taaa 
ctacaacaga 



ggaacagctt 
tgatccaatt 
acaggccgtt 
ggatataatc 
ggtagacact 
taatttaaga 
atggagaaga 
gactgtccat 
cgcctatatt 
caacaagtcc 
aagaaatgat 
tttttcagga 
gaacaggact 
ctctaaaata 
actggataat 



ttaagggaat 
gttgagagat 
gatgaaaaat 
aatgaagcat 
agtagtgtta 
gataaaagga 
tatgtacaag 
gccaattccg 
aataacgacc 
gagtggggaa 
ttacaggaaa 
tatgataaag 
atgtttttac 
tgcactatag 
gtctattaca 



taaaggcaga 
ttgtcacaac 
acgatgaatt 
ccgaaacata 
ttgctcctgc 
aagaatatga 
caattacagt 
atacatattt 
gcgccagccc 
atgataacgg 
aggatcttta 
ctgtggaagt 
ccaaaacctc 
tgttccctcc 
ggatcacatg 



ggctaatga t 
taaatctga t 
attagaagat 
tgataaactt 
aatagctggc 
aaagaggcta 
gatggaattt 
aacattccct 
cgtgaactgc 
agttggacg t 
cctgtctgtt 
tgatgcccat 
taatctattt 
cgcttctgcc 
cacatgctcc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

903 



wo 01/38351 



88 



PCT/USOO/28888 



<210> 7 
<211> 298 
<212> PRT 
<213> SHRIMP 



<400> 7 



Met 


Gly 


Asp 


Lys 


Gin 


Lys 


Val 


Glu 


1 








5 








Glu 


Ala 


Asn 


Asp 


Asp 


Trp 


Leu 


Ser 








20 










Arg 


Phe 


Val 


Thr 


Thr 


Lys 


Ser 


Asp 






35 










40 


Ala 


Val 


Asp 


Glu 


Lys 


Tyr 


Asp 


Glu 




50 










55 




Met 


Arg 


Pro 


Asp 


lie 


He 


Asn 


Glu 


65 










70 






Ala 


Ala 


Asp 


Met 


He 


Arg 


Glu 


Val 










85 








Ala 


lie 


Ala 


Gly 


Thr 


Val 


Ala 


Arg 








100 










Arg 


Lys 


Glu 


Tyr 


Glu 


Lys 


Arg 


Leu 






115 










120 


Arg 


Arg 


Tyr 


Val 


Gin 


Ala 


He 


Thr 




130 










135 




Lys 


Asp 


Leu 


Thr 


Val 


His 


Ala 


Asn 


145 










150 






Phe 


Leu 


Arg 


lie 


Lys 


Lys 


He 


Ala 










165 








Asn 


Cys 


Ser 


Leu 


Ser 


Val 


Ser 


Tyr 








180 










Asp 


Asn 


Gly 


Val 


Gly 


Arg 


Lys 


Val 






195 










200 


Leu 


Gin 


Glu 


Lys 


Asp 


Leu 


Tyr 


Leu 




210 










215 




Asp 


Phe 


Ser 


Gly 


Tyr 


Asp 


Lys 


Ala 


225 










230 






His 


Phe 


Glu 


Ala 


Gly 


Asn 


Arg 


Thr 










245 








Leu 


Phe 


Asn 


Arg 


Ser 


His 


He 


Val 








260 










Phe 


Pro 


Pro 


Ala 


Ser 


Ala 


Ser 


Ser 






275 










280 


Val 


Tyr 


Tyr 


Arg 


He 


Thr 


Cys 


Thr 




290 










295 





Gin 


Leu 


Leu 


Arg 


Glu 


Leu 


Lys 


Ala 




10 










15 




Ala 


Asn 


Val 


Asp 


Pro 


He 


Val 


Glu 


25 










30 






Glu 


Thr 


Ala 


Gin 


Val 


Val 


Lys 


Gin 










45 








Leu 


Leu 


Glu 


Asp 


Lys 


Val 


Glu 


Glu 








60 










Ala 


Ser 


Glu 


Thr 


Tyr 


Asp 


Lys 


Leu 






75 










80 


Asp 


Thr 


Ser 


Ser 


Val 


He 


Ala 


Pro 




90 










95 




Thr 


He 


Asn 


Asn 


Leu 


Arg 


Asp 


Lys 


105 










110 






Trp 


Thr 


Leu 


Ala 


Tyr 


Lys 


Pro 


Trp 










125 








Val 


Met 


Glu 


Phe 


Arg 


Leu 


Ser 


Tyr 








140 










Ser 


Asp 


Thr 


Tyr 


Leu 


Thr 


Phe 


Pro 






155 










160 


Tyr 


He 


Asn 


Asn 


Asp 


Arg 


Asp 


Val 




170 










175 




Pro 


Asn 


Lys 


Ser 


Glu 


Trp 


Gly 


Asn 


185 










190 






Asp 


He 


His 


He 


Arg 


Arg 


Asn 


Asp 










205 








Ser 


Val 


He 


Cys 


Met 


Leu 


Asp 


Thr 








220 










Val 


Glu 


Val 


Asp 


Ala 


His 


Lys 


Phe 






235 










240 


Met 


Phe 


Leu 


Pro 


Lys 


Thr 


Ser 


Asn 




250 










255 




Asn 


Ser 


Lys 


He 


Cys 


Thr 


He 


Val 


265 










270 






Ala 


Ser 


Thr 


Thr 


Glu 


Leu 


Asp 


Asn 










285 









Cys Ser 



<210> 8 
<211> 2376 
<212> DNA 
<213> SHRIMP 



<400> 8 

atggaggaag 

tcccagatcc 

tactctgttt 

ataactgcgt 

gaaaacgtaa 

atagctgtgg 

ctagaaaaat 

aggctagttc 

gaagaggaca 



aaagtcaacg 
tgaaagacac 
cagttataaa 
ggtcaatcag 
aacctgctgc 
aggataccag 
atgaaaaggg 
acgcctccag 
gtgataaatt 



agtacagagg 
caag ttaaga 
aagtgctctc 
ggaggcttat 
aaaa tcatta 
tcctagaagt 
gaattttagg 
ttctagagtg 
agggggtatt 



aggatcggag 
gtatcatatc 
cagaaaggat 
ctctattacc 
aatacaaata 
atggtggctt 
caacccagtt 
tgctcccaca 
tattatgcta 



tcctcccaga 
tgggcgtggg 
gcaggaggaa 
atcttggatt 
tggtaaacag 
ctaatgagtg 
acttgatgga 
tgagagcttt 
attttaatga 



agaggctgcc 
acattgggga 
cgatgaggat 
gaattaca tt 
aattaaaatc 
tgtgagaact 
tgctgccatg 
gtgttgcaag 
acttgaaact 



60 

120 

180 

240 

300 

360 

420 

480 

540 
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cagtgtgtta 
attgaaagtg 
gcttaccatg 
ggaccattca 
cagcacgtaa 
aattggagaa 
acatctattg 
attcagaaaa 
gctacatttg 
ctggacaaac 
gtttctggag 
cgtctagatt 
ttgaagcgag 
ataaaggctt 
ataaagaaaa 
gctactacta 
ccaagtggaa 
gaggatttag 
acagttttaa 
tggggttctc 
tatttgttgc 
atagtaaagg 
aatgccatgt 
ataacagatg 
aggcaattaa 
tcaaacattc 
gcaaaggatc 
aagacttcct 
tggctatttg 
aatatgggaa 

ggggtcgtga 



gtgcagttaa 
taaaattggg 
tgttaagata 
agcgaaaaga 
aaacagaccc 
gggatttttt 
tggaattaat 
gggatttaaa 
attggataga 
atacgaacaa 
gagacaagcg 
ctccccctcc 
aagctgctag 
ctagttttac 
gaaaaattga 
gtagttctat 
gcaataaagt 
atagagtact 
tgaaggatgg 
atcagaattt 
catgccctga 
atgaaatttc 
gttatttcat 
tgcgcctttc 
tacacatttt 
tcgtaggtga 
cgaggacagc 
ttaaggtaaa 
atacatcaaa 
ttggtcccac 
atgacctgct 



tttttctccc 
gaagaaaagt 
ttatggggat 
gtttgagcag 
tgaattacgt 
ctgctcaaaa 
tatagccatg 
acgtttcaat 
aggacatgtt 
aaacacacat 
ctggtcccca 
cccagaagtg 
tcattgcgta 
tgctaatatt 
gattaaggat 
tacttctact 
agactctata 
agaagtacac 
aaataaagtt 
tgttcaagta 
tagcggacca 
gagtacagct 
ctctgggtgt 
acatatggga 
aatttttaga 
tggaggaaac 
tctcgaaaat 
caaggtaaca 
aagtgataaa 
tactttggat 
ttatgacaac 



atagaaagaa 
gtacagttat 
aaagtgaaag 
ttttgggggt 
tgttatttta 
gggttcttta 
tgcataggtg 
aaaggagaag 
aaacggatgc 
ggagtatctt 
ggggtgtggc 
ggccaatttc 
acgaggaata 
aattctgaac 
gataa tacta 
ccacctccta 
caattgaata 
aaccaaaact 
gtgtttaagg 
ctaaaggatg 
tacagaggat 
gctaggatag 
gtgactagac 
ccaaataaac 
ttgttgactg 
ttgttctctg 
agaaagataa 
aaggaagata 
ataatgaatg 
a ttgtaaaga 
aaataa 



ttaaacacgt 
taaatttaag 
atacaaataa 
tatgctttaa 
atgagttgac 
gagaagaaag 
accgtaaaca 
aagggagaaa 
ctcaaatgcc 
ttgccctaga 
ttcattctta 
tggatcaggc 
tttgtactac 
cgatggaaat 
ctgctactgt 
caaagaaaca 
atctaccaac 
ctaaaaaggg 
agatgagaaa 
aagatgtgtg 
tatacagatg 
aaaaagtgaa 
aagaaggaat 
aatatgtgta 
gtgtgagtga 
tggacgagaa 
aggaactaca 
tagatagctg 
gtgtatgtaa 
ataattgtac 



cttcagggag 
aagtgtagca 
gaaacatagt 
atttgttact 
atatgccata 
tctattcctt 
gtttgccaag 
agaagaggct 
tgtatgggtt 
gagtagtatg 
tacaaagatg 
ttttaataca 
tacaggattt 
taaggaagaa 
tactgttagt 
aaaaacaact 
tttaaatatg 
tgtagctgct 
aagctttggt 
taaattggac 
ctatttcaaa 
atggggagaa 
tgggaaaatt 
tgataactat 
taccaacaca 
ttatgtgggc 
attgctactg 
tcttccatct 
tattggtaaa 
ttgtatttta 



600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2376 
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<400> 9 



Met 


Glu 


Glu 


Glu 


Ser 


Gin 


Arg 


Val 


Gin 


Arg 


Arg 


He 


Gly 


Val 


Leu 


Pro 


1 








5 










10 










15 




Glu 


Glu 


Ala 


Ala 


Ser 


Gin 


He 


Leu 


Lys 


Asp 


Thr 


Lys 


Leu 


Arg 


Val 


Ser 








20 










25 










30 






Tyr 


Leu 


Gly 


Val 


Gly 


His 


Trp 


Gly 


Tyr 


Ser 


Val 


Ser 


Val 


He 


Lys 


Ser 






35 










40 










45 








Ala 


Leu 


Gin 


Lys 


Gly 


Cys 


Arg 


Arg 


Asn 


Asp 


Glu 


Asp 


He 


Thr 


Ala 


Trp 




50 










55 










60 










Ser 


He 


Arg 


Glu 


Ala 


Tyr 


Leu 


Tyr 


Tyr 


His 


Leu 


Gin 


Tyr 


He 


Glu 


Asn 


65 










70 










75 










80 


Val 


Lys 


Pro 


Ala 


Ala 


Lys 


Ser 


Leu 


Asn 


Thr 


Asn 


Met 


Val 


Asn 


Arg 


He 










85 










90 










95 




Lys 


He 


He 


Ala 


Val 


Glu 


Asp 


Thr 


Ser 


Pro 


Arg 


Ser 


Met 


Val 


Asn 


Glu 








100 










105 










110 






Cys 


Val 


Arg 


Thr 


Leu 


Glu 


Lys 


Tyr 


Glu 


Lys 


Gly 


Asn 


Phe 


Arg 


Gin 


Pro 






115 










120 










125 








Ser 


Tyr 


Leu 


Met 


Asp 


Ala 


Ala 


Met 


Arg 


Leu 


Val 


His 


Ala 


Ser 


Ser 


Ser 




130 










135 










140 










Arg 


Val 


Cys 


Ser 


His 


Met 


Arg 


Ala 


Leu 


Cys 


Cys 


Lys 


Glu 


Glu 


Asp 


Ser 


145 










150 










155 










160 


Asp 


Lys 


Leu 


Gly 


Gly 


He 


Tyr 


Tyr 


Ala 


Asn 


Phe 


Asn 


Glu 


Leu 


Glu 


Thr 










165 










170 










175 




Gin 


Cys 


Val 


Ser 


Ala 


Val 


Asn 


Phe 


Ser 


Pro 


He 


Glu 


Arg 


He 


Lys 


His 
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180 



Val 


Phe 


Arg 


Glu 


He 


Glu 


Ser 


Val 






195 










200 


Leu 


Leu 


Asn 


Leu 


Arg 


Ser 


Val 


Ala 




210 










215 




Gly 


Asp 


Lys 


Val 


Lys 


Asp 


Thr 


Asn 


225 










230 






Arg 


Lys 


Glu 


Phe 


Glu 


Gin 


Phe 


Trp 










245 








Gin 


His 


Val 


Lys 


Thr 


Asp 


Pro 


Glu 








260 










Thr 


Tyr 


Ala 


He 


Asn 


Trp 


Arg 


Arg 






275 










280 


Phe 


Arg 


Glu 


Glu 


Ser 


Leu 


Phe 


Leu 




290 










295 




Ala 


Met 


Cys 


He 


Gly 


Asp 


Arg 


Lys 


305 










310 






Asp 


Leu 


Lys 


Arg 


Phe 


Asn 


Lys 


Gly 










325 








Ala 


Thr 


Phe 


Asp 


Trp 


He 


Glu 


Gly 








340 










Pro 


Val 


Trp 


Val 


Leu 


Asp 


Lys 


His 






355 










360 


Ser 


Phe 


Ala 


Leu 


Glu 


Ser 


Ser 


Met 




370 










375 




Ser 


Pro 


Gly 


Val 


Trp 


Leu 


His 


Ser 


385 










390 






Pro 


Pro 


Pro 


Pro 


Glu 


Val 


Gly 


Gin 










405 








Leu 


Lys 


Arg 


Glu 


Ala 


Ala 


Ser 


His 








420 










Thr 


Thr 


Gly 


Phe 


He 


Lys 


Ala 


Ser 






435 










440 


Glu 


Pro 


Met 


Glu 


He 


Lys 


Glu 


Glu 




450 










455 




Lys 


Asp 


Asp 


Asn 


Thr 


Thr 


Ala 


Thr 


465 










470 






Ser 


Ser 


lie 


Thr 


Ser 


Thr 


Pro 


Pro 










485 








Pro 


Ser 


Gly 


Ser 


Asn 


Lys 


Val 


Asp 








500 










Thr 


Leu 


Asn 


Met 


Glu 


Asp 


Leu 


Asp 






515 










520 


Asn 


Ser 


Lys 


Lys 


Gly 


Val 


Ala 


Ala 




530 










535 




Lys 


Val 


Val 


Phe 


Lys 


Glu 


Met 


Arg 


545 










550 






Gin 


Asn 


Phe 


Val 


Gin 


Val 


Leu 


Lys 










565 








Tyr 


Leu 


Leu 


Pro 


Cys 


Pro 


Asp 


Ser 








580 










Phe 


Lys 


He 


Val 


Lys 


Asp 


Glu 


He 






595 










600 


Lys 


Val 


Lys 


Trp 


Gly 


Glu 


Asn 


Ala 




610 










615 




Val 


Thr 


Arg 


Gin 


Glu 


Gly 


He 


Gly 


625 










630 






Ser 


His 


Met 


Gly 


Pro 


Asn 


Lys 


Gin 










645 








Leu 


lie 


His 


He 


Leu 


He 


Phe 


Arg 



660 



185 










190 






Lys 


Leu 


Gly 


Lys 


Lys 


Ser 


Val 


Gin 










205 








Ala 


Tyr 


His 


Val 


Leu 


Arg 


Tyr 


Tyr 








220 










Lys 


Lys 


His 


Ser 


Gly 


Pro 


Phe 


Lys 






235 










240 


Gly 


Leu 


Cys 


Phe 


Lys 


Phe 


Val 


Thr 




250 










255 




Leu 


Arg 


Cys 


Tyr 


Phe 


Asn 


Glu 


Leu 


265 










270 






Asp 


Phe 


Phe 


Cys 


Ser 


Lys 


Gly 


Phe 










285 








Thr 


Ser 


He 


Val 


Glu 


Leu 


He 


He 








300 










Gin 


Phe 


Ala 


Lys 


He 


Gin 


Lys 


Arg 






315 










320 


Glu 


Glu 


Gly 


Arg 


Lys 


Glu 


Glu 


Ala 




330 










335 




His 


Val 


Lys 


Arg 


Met 


Pro 


Gin 


Met 


345 










350 






Thr 


Asn 


Lys 


Asn 


Thr 


His 


Gly 


Val 










365 








Val 


Ser 


Gly 


Gly 


Asp 


Lys 


Arg 


Trp 








380 










Tyr 


Thr 


Lys 


Met 


Arg 


Leu 


Asp 


Ser 






395 










400 


Phe 


Leu 


Asp 


Gin 


Ala 


Phe 


Asn 


Thr 




410 










415 




Cys 


Val 


Thr 


Arg 


Asn 


He 


Cys 


Thr 


425 










430 






Ser 


Phe 


Thr 


Ala 


Asn 


He 


Asn 


Ser 










445 








He 


Lys 


Lys 


Arg 


Lys 


He 


Glu 


He 








460 










Val 


Thr 


Val 


Ser 


Ala 


Thr 


Thr 


Ser 






475 










480 


Pro 


Thr 


Lys 


Lys 


Gin 


Lys 


Thr 


Thr 




490 










495 




Ser 


He 


Gin 


Leu 


Asn 


Asn 


Leu 


Pro 


505 










510 






Arg 


Val 


Leu 


Glu 


Val 


His 


Asn 


Gin 










525 








Thr 


Val 


Leu 


Met 


Lys 


Asp 


Gly 


Asn 








540 










Lys 


Ser 


Phe 


Gly 


Trp 


Gly 


Ser 


His 






555 










560 


Asp 


Glu 


Asp 


Val 


Cys 


Lys 


Leu 


Asp 




570 










575 




Gly 


Pro 


Tyr 


Arg 


Gly 


Arg 


Cys 


Tyr 


585 










590 






Ser 


Ser 


Thr 


Ala 


Ala 


Arg 


He 


Glu 










605 








Met 


Cys 


Tyr 


Phe 


He 


Ser 


Gly 


Cys 








620 










Lys 


He 


He 


Thr 


Asp 


Val 


Arg 


Leu 






635 










640 


Tyr 


Val 


Tyr 


Asp 


Asn 


Tyr 


Arg 


Gin 




650 










655 




Leu 


Leu 


Thr 


Gly 


Val 


Ser 


Asp 


Thr 


665 










670 
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Asn 


Thr 


Ser 
675 


Asn 


He 


Leu 


Val 


Gly 
680 


Asp 


Gly 


Gly 


Asn 


Leu 
685 


Phe 


Ser 


Val 


Asp 


Glu 
690 


Asn 


Tyr 


Val 


Gly 


Ala 
695 


Lys 


Asp 


Pro 


Arg 


Thr 
700 


Ala 


Leu 


Glu 


Asn 


Arg 


Lys 


He 


Lys 


Glu 


Leu 


Gin 


Leu 


Leu 


Leu 


Lys 


Thr 


Ser 


Phe 


Lys 


Val 


705 










710 










715 










720 


Asn 


Lys 


Val 


Thr 


Lys 
725 


Glu 


Asp 


He 


Asp 


Ser 
730 


Cys 


Leu 


Pro 


Ser 


Trp 
735 


Leu 


Phe 


Asp 


Thr 


Ser 
740 


Lys 


Ser 


Asp 


Lys 


He 

745 


Met 


Asn 


Gly 


Val 


Cys 
750 


Asn 


He 


Gly 


Lys 


Asn 
755 


Met 


Gly 


He 


Gly 


Pro 
760 


Thr 


Thr 


Leu 


Asp 


He 
765 


Val 


Lys 


Asn 


Asn 


Cys 
770 


Thr 


Cys 


He 


Leu 


Gly 
775 


Val 


Val 


Asn 


Asp 


Leu 
780 


Leu 


Tyr 


Asp 


Asn 


Lys 
































785 
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<400> 10 

atggctttaa 

gacatggctc 

ctttatagag 

aacacaaaca 

gacagaactg 

tctatactcg 

agtgggaaaa 

tcatacaaag 

aagagtacgt 

atggattcga 

gccttgtaca 

ctgactcata 

tggaggaaga 

ctgaagaaca 

atgtacaaga 

tcggacagtg 

gatgaagcat 

gtcacaacaa 

cacatccacc 

ttaatgaagg 

gtgaaccctg 

aaacctcctt 

ctttcttctg 

aaagaattgg 

aatctatcta 

attactgcca 

gctctgtggg 

gcggctgaca 

aaaactgtaa 

aatgtagttg 

tatatgtctc 



aagacgcatt 
cagttgttga 
ttctggtagt 
aaagacaatg 
tatctaattt 
attactggta 
gtgatacgtt 
acatgaaaac 
tggacccttt 
aatatgggag 
cgactgtata 
aacctgttac 
tcgtcaaggc 
taatgaagaa 
aaattggtaa 
aagatgatga 
cagaaaggga 
tcactggcga 
ctgaatggaa 
cca tggaaac 
tacagatttt 
ctgatgtcaa 
gtgcgggtgt 
aca ttatttc 
ccagaatttg 
gaggtgctgt 
ccaagaaaat 
aaaatagtag 
cgtttggatg 
taagcttcga 
agcaagtttt 



caccgagcgg 
agatgaccgc 
tgaacagaaa 
ccgccaggta 
accccaagct 
caagaacgtt 
tctgaggact 
ttgttcgctg 
attgaaggac 
taaaactaca 
taatcactgg 
gggtaagaaa 
ttacgtgtcc 
tgagagtgct 
cagtgtcaag 
tgatgatgat 
aaagaagcac 
ttacatcttt 
aatacctatg 
tgacatatca 
acccatcgct 
attttgtatg 
gtttggtcga 
aaactgtcta 
gagagaagga 
tttagggtat 
ccaggacgct 
gaactgtgat 
tgcaaatatg 
ggattctatt 
tactcactca 



ttagttgtga 
cagaaatctc 
aattctgcaa 
gaagaagaca 
gttaaagaaa 
atacctcttc 
gcagtcatct 
atatatgaat 
attctcgaca 
tcccctgagc 
acaaacgcct 
tatgcatcag 
aaacacaaca 
gataatgcta 
aacactaata 
gacgatgact 
caagaatctc 
aaagtgcttc 
tgtgattttg 
aatgctgtaa 
gctaacttct 
atgtttaatg 
tttattggag 
agaagtttat 
ggttctgttg 
ggtgaacaat 
gtaacagaat 
ttgttaggag 
ttgaaaaccc 
caggctaata 
tttccagctc 



acaaagttgg 
tcttccaaaa 
ttactctctc 
aagtgatttt 
ccatcgcagc 
ttgacacgaa 
gtcttgtgag 
tcgaacacaa 
acaaacaaga 
tggcaaagga 
tcaagctcta 
tcatacactt 
acgtggagag 
atgttttaac 
ataatagtgc 
gtgaaggcat 
tatacccaat 
tagagcttgt 
ttaataggaa 
tagaagttag 
gggatttttg 
aaccatcttc 
gacccttttc 
tgttgaataa 
tctgttttaa 
tgtccgagag 
ctgtcaagag 
acgagggtgt 
agaacggtat 
aggaaggggc 
tagatcaagg 



ctccggtaca 
agtggaaaat 
tggaaacaaa 
tgaaggagag 
caatgcagaa 
gaaagaaaga 
atgttgtgtg 
aattttaaat 
actactacat 
aaccatcgag 
ccaagcctcg 
tataaggaca 
ggatctt tea 
ta t tgaaaaa 
ccaccaaatg 
ggatgtttgt 
caacactcca 
tctttcaccc 
tatccccaaa 
agct tcaaag 
taagagcggg 
taacgagacc 
ccacaagagt 
agaagcggac 
ttactgcccc 
atctattaaa 
gcaaaggaat 
agtatccatg 
ggggaaattt 
ggccagacag 
aaaataa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1857 
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<400> 11 



Met 


Ala 


Leu 


Lys 


Asp 


Ala 


Phe 


Thr 


1 








5 








Gly 


Ser 


Gly 


Thr 


Asp 


Met 


Ala 


Pro 








20 










Ser 


Leu 


Phe 


Gin 


Lys 


Val 


Glu 


Asn 






35 










40 


Gin 


Lys 


Asn 


Ser 


Ala 


He 


Thr 


Leu 




50 










55 




Arg 


Gin 


Cys 


Arg 


Gin 


Val 


Glu 


Glu 


65 










70 






Asp 


Arg 


Thr 


Val 


Ser 


Asn 


Leu 


Pro 










85 








Ala 


Asn 


Ala 


Glu 


Ser 


He 


Leu 


Asp 








100 










Leu 


Leu 


Asp 


Thr 


Lys 


Lys 


Glu 


Arg 






115 










120 


Arg 


Thr 


Ala 


Val 


He 


Cys 


Leu 


Val 




130 










135 




Met 


Lys 


Thr 


Cys 


Ser 


Leu 


He 


Tyr 


145 










150 






Lys 


Ser 


Thr 


Leu 


Asp 


Pro 


Leu 


Leu 










165 








Glu 


Leu 


Leu 


His 


Met 


Asp 


Ser 


Lys 








180 










Glu 


Leu 


Ala 


Lys 


Glu 


Thr 


He 


Glu 






195 










200 


His 


Trp 


Thr 


Asn 


Ala 


Phe 


Lys 


Leu 




210 










215 




Pro 


Val 


Thr 


Gly 


Lys 


Lys 


Tyr 


Ala 


225 










230 






Trp 


Arg 


Lys 


lie 


Val 


Lys 


Ala 


Tyr 










245 








Arg 


Asp 


Leu 


Ser 


Leu 


Lys 


Asn 


He 








260 










Ala 


Asn 


Val 


Leu 


Thr 


He 


Glu 


Lys 






275 










280 


Val 


Lys 


Asn 


Thr 


Asn 


Asn 


Asn 


Ser 




290 










295 




Asp 


Asp 


Asp 


Asp 


Asp 


Asp 


Asp 


Asp 


305 










310 






Asp 


Glu 


Ala 


Ser 


Glu 


Lys 


Lys 


His 










325 








Thr 


Pro 


Val 


Thr 


Thr 


He 


Thr 


Gly 








340 










Glu 


Leu 


Val 


Leu 


Ser 


Pro 


His 


He 






355 










360 


Cys 


Asp 


Phe 


Val 


Asn 


Arg 


Asn 


He 




370 










375 




lie 


Ser 


Asn 


Ala 


Val 


He 


Glu 


Val 


385 










390 






Gin 


He 


Leu 


Pro 


He 


Ala 


Ala 


Asn 










405 








Lys 


Pro 


Pro 


Ser 


Asp 


Val 


Lys 


Phe 








420 










Ser 


Asn 


Glu 


Thr 


Leu 


Ser 


Ser 


Gly 






435 










440 


Gly 


Gly 


Pro 


Phe 


Ser 


His 


Lys 


Ser 




450 










455 




Cys 


Leu 


Arg 


Ser 


Leu 


Leu 


Leu 


Asn 


4 65 










470 







Glu 


Arg 


Leu 


Val 


Val 


Asn 


Lys 


Val 




10 










15 




Val 


Val 


Glu 


Asp 


Asp 


Arg 


Gin 


Lys 


25 










30 






Leu 


Tyr 


Arg 


Val 


Leu 


Val 


Val 


Glu 










45 








Ser 


Gly 


Asn 


Lys 


Asn 


Thr 


Asn 


Lys 








60 










Asp 


Lys 


Val 


He 


Phe 


Glu 


Gly 


Glu 






75 










80 


Gin 


Ala 


Val 


Lys 


Glu 


Thr 


He 


Ala 




90 










95 




Tyr 


Trp 


Tyr 


Lys 


Asn 


Val 


He 


Pro 


105 










110 






Ser 


Gly 


Lys 


Ser 


Asp 


Thr 


Phe 


Leu 










125 








Arg 


Cys 


Cys 


Val 


Ser 


Tyr 


Lys 


Asp 








140 










Glu 


Phe 


Glu 


His 


Lys 


He 


Leu 


Asn 






155 










160 


Lys 


Asp 


He 


Leu 


Asp 


Asn 


Lys 


Gin 




170 










175 




Tyr 


Gly 


Ser 


Lys 


Thr 


Thr 


Ser 


Pro 


185 










190 






Ala 


Leu 


Tyr 


Thr 


Thr 


Val 


Tyr 


Asn 










205 








Tyr 


Gin 


Ala 


Ser 


Leu 


Thr 


His 


Lys 








220 










Ser 


Val 


He 


His 


Phe 


He 


Arg 


Thr 






235 










240 


Val 


Ser 


Lys 


His 


Asn 


Asn 


Val 


Glu 




250 










255 




Met 


Lys 


Asn 


Glu 


Ser 


Ala 


Asp 


Asn 


265 










270 






Met 


Tyr 


Lys 


Lys 


He 


Gly 


Asn 


Ser 










285 








Ala 


His 


Gin 


Met 


Ser 


Asp 


Ser 


Glu 








300 










Asp 


Cys 


Glu 


Gly 


Met 


Asp 


Val 


Cys 






315 










320 


Gin 


Glu 


Ser 


Leu 


Tyr 


Pro 


He 


Asn 




330 










335 




Asp 


Tyr 


He 


Phe 


Lys 


Val 


Leu 


Leu 


345 










350 






His 


Pro 


Glu 


Trp 


Lys 


He 


Pro 


Met 










365 








Pro 


Lys 


Leu 


Met 


Lys 


Ala 


Met 


Asp 








380 










Arg 


Ala 


Ser 


Lys 


Val 


Asn 


Pro 


Val 






395 










400 


Phe 


Trp 


Asp 


Phe 


Cys 


Lys 


Ser 


Gly 




410 










415 




Cys 


Met 


Met 


Phe 


Asn 


Glu 


Pro 


Ser 


425 










430 






Ala 


Gly 


Val 


Phe 


Gly 


Arg 


Phe 


He 










445 








Lys 


Glu 


Leu 


Asp 


He 


He 


Ser 


Asn 








460 










Lys 


Glu 


Ala 


Asp 


Asn 


Leu 


Ser 


Thr 






475 










480 
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Arg 


He 


Trp 


Arg 


Glu 
485 


Gly 


Gly 


Ser 


He 


Thr 


Arg 


Ala 
500 


Val 


Leu 


Gly 


Tyr 


Ala 


Leu 


Trp 
515 


Ala 


Lys 


Lys 


He 


Gin 
520 


Arg 


Gin 
530 


Arg 


Asn 


Ala 


Ala 


Asp 
535 


Lys 


Gly 


Asp 


Glu 


Gly 


Val 


Val 


Ser 


Met 


545 










550 






Asn 


Met 


Leu 


Lys 


Thr 
565 


Gin 


Asn 


Gly 


Ser 


Phe 


Glu 


Asp 
580 


Ser 


He 


Gin 


Ala 


Tyr 


Met 


Ser 
595 


Gin 


Gin 


Val 


Phe 


Thr 
600 



Gly Lys 
610 



Val 


Val 
490 


Cys 


Phe 


Asn 


Tyr 


Cys 
495 


Pro 


Gly 


Glu 


Gin 


Leu 


Ser 


Ser 


He 


Lys 


505 










510 






Asp 


Ala 


Val 


Thr 


Glu 
525 


Ser 


Val 


Lys 


Asn 


Ser 


Arg 


Asn 
540 


Cys 


Asp 


Leu 


Leu 


Lys 


Thr 


Val 
555 


Thr 


Phe 


Gly 


Cys 


Ala 
560 


Met 


Gly 

570 


Lys 


Phe 


Asn 


Val 


Val 
575 


Val 


Asn 


Lys 


Glu 


Gly Ala 


Ala 


Arg 


Gin 


585 










590 






His 


Ser 


Phe 


Pro 


Ala 
605 


Leu 


Asp 


Gin 



<210> 12 
<211> 5811 
<212> DNA 
<213> SHRIMP 



<400> 12 

atggggagta 

aacaacaaca 

gctaccagaa 

tcttcttctt 

gccagaaagg 

aactcgattt 

cctggagctg 

ttgattgatt 

agtctcatgt 

ttttctaccc 

ctagaaaaaa 

aatttaaggt 

attgatgctg 

gttatactac 

atccagaaga 

aacaagaaca 

gacgaaaacg 

gctgagacga 

aattttggcc 

atgttttgct 

cgccatgaag 

tcaagtgcta 

ccaagttctt 

gaagatgacg 

gaaaaggaca 

gatgcaaacg 

gatatgatga 

gctcacatat 

gtatgcagta 

cgagaaaaat 

tccttaatgg 

atagatgcag 

aggaacgcct 

tttaaaga ta 

gaactactag 



aacgtccgtg 
ataattcaaa 
cccaggtgct 
cctcctcctc 
tattgagcac 
ctgtcagtta 
ggaaaactac 
ctggatctgc 
gtgcatgcaa 
tcaacaaagg 
tccaatttgt 
tccttgttat 
ttttacgtat 
ttggagataa 
aaataaagaa 
agaagaagaa 
aagaagaaga 
agaaagagga 
aagaagataa 
ctagagactt 
caattttaat 
ttaaagttga 
ccccggccca 
atttctttaa 
gagcatcggg 
aaaagataag 
aatctctccc 
ctgaactcaa 
ttgtgttcga 
tgtatattgt 
atgaagagat 
taccattcaa 
tctttagaga 
atttgcgtac 
cgaaatctga 



ctccagcggc 
tccagttcca 
cagaagtgca 
ctctgcttct 
tgtttccttg 
tgctggagga 
aatggtcaag 
agacatgctc 
gaaaccagga 
atttgagatt 
ggccgataaa 
cgatgagtat 
agcgaaacac 
caggcaaaat 
cccgagtgag 
gaaagagaag 
gggggaggag 
agaaaagagt 
tgcaaaacta 
ttttggcaac 
gaagagtaat 
agactgtggt 
atccacggca 
caatgaagca 
agctttgaat 
aagtggaaca 
cctttccgct 
gaaaatgagc 
catgatggcc 
ttctagccta 
attgaacgtc 
ttctgcacat 
tggaaaagac 
agttgcttca 
aaatataaga 



caagagccag 
gttatcaaca 
gttgcagctg 
gcggtcaagc 
caacagtctc 
ggaatatctg 
gaactaattg 
ctgtgttgca 
ggttcttcct 
ccagttatat 
ctaaaatgga 
acaatggcca 
aggccagata 
tctgcagtag 
gaagaaaaac 
aaagagaaag 
gaggaagaag 
accttct tec 
tacactgaag 
ccaagtaacc 
aacgtcaagt 
aa tgcttcta 
gaagaaaatt 
tut t tgaaac 
ggat tctctt 
acttctgtat 
ctaatagaag 
aatgcaaatc 
aaagccatga 
tcagagaga t 
aagtatgtgc 
aaccgtgcgt 
tttgtgga tg 
tttt tagaga 
tctattctct 



ttaccaaaaa 
tcaaatcata 
cagcggcatc 
tcccagatac 
tggcagtgag 
tgttccatct 
cagttcttaa 
aatctaattc 
tgatgtaccc 
ttaggaaaga 
aagtgataca 
gttgcaggga 
tacctttcgg 
tggaagataa 
ctcagaagaa 
gtggtgaaga 
aagaagaaag 
aagggtcagt 
tttttataaa 
tgcggaatat 
cagttaacaa 
ataaaaagga 
gtgacgaatt 
tgatggaaag 
tacgctgcaa 
cagacaagaa 
aaggaatatg 
tagaaaagta 
gagaaattga 
ttaaggatac 
acggttctga 
ctgctgtggc 
aaactcctat 
atgaaacttt 
tgaagaaaga 



acaaaagaag 
cccctttctg 
cccttctggg 
gtgcaaggag 
gtatctgtgc 
gggtggtctt 
cgaccacggc 
tgcaaaggag 
agaaagtgtg 
tgaaataact 
agtgttggca 
attggtgttt 
aggagtgttt 
cactaaccat 
caacaagaac 
agaagaggga 
tgacgatgaa 
cgaacaagat 
gatactgaaa 
tgtcaataaa 
caatttagtc 
agtgacggca 
tgatgacgaa 
aaatgcactt 
gagcatttct 
gtcttctttg 
ttcagagcta 
cactgaaaat 
ttacagtggc 
acatctgacc 
tcccaagtgt 
agcgtgtgtg 
cgctaattac 
aacatacaaa 
aactggtaat 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 
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aataatagtg 
tattgttatt 
ggtatgaaaa 
tatagaactg 
catttagttg 
ttggtacgag 
gataatggaa 
ccctcagaat 
gataagcctc 
ggagatgtag 
agaacatata 
atgtacggta 
atggccggta 
tattttgata 
gttttcaata 
cttcaaactg 
tactctgaaa 
gagaggagga 
tgtgaacgcc 
actgacaaga 
tcgaatttag 
tcctcctctt 
agagtggaag 
accatctctc 
ctttttgaca 
actacgattc 
cagggccaaa 
ttcgtgacta 
gtatttgtgt 
attatcggta 
cttccatttg 
agagacacta 
cgcaatagta 
ttgtgtaaca 
attgctgatt 
gtggcagaat 
tat tctgcct 
agatcgggaa 
cagcaacatg 
aatggagatg 
actgctatcg 
tcttcaaagt 
atacacccag 
gata t tgcac 
ttagtgtatg 
ccgtgcctga 
aatctgggtt 
aaattgcgtc 
aaaaaagtta 
aagagtatgc 
ggtctatcga 
aatatccatg 
tgtatgttaa 
aaggacaaaa 
gcaccaagaa 
ctgttctgtg 
actggaggag 
tctcccaact 
a ttgccatga 
aagagcatgt 
aatgaagtaa 



catcatcacg 
tcgatgaaga 
gtagtggtgg 
atataccaga 
actttcatgt 
cccgtgtgtg 
aactcccttc 
gggccacaag 
tccatgacga 
ggaagagtat 
tcatgtcgtc 
tttcattgtt 
gatcatccag 
acatcttccc 
atggtcaaat 
ctcacactga 
aatctatagt 
agaagattga 
tcatggccaa 
tgatgggtgc 
tggatacttc 
ctacttcact 
taaaatttgt 
acaagtaccg 
agtttacaga 
ttttggatga 
atgtgatttt 
aggacactgg 
acgtggaacg 
atccaaatat 
aatctagtca 
ttgttgacct 
atcctcaaaa 
ttaaatgcgc 
ttaatgggag 
cttccgctgc 
atgacccttc 
aatctttatc 
ggctagaaac 
tggctaaaca 
ttcgttcaat 
tgtgtgaaga 
tttttgtgaa 
cccataacaa 
aaaaattgaa 
atcagtatgt 
acgagtctaa 
ttcctataac 
gtgacacatc 
agcacaactt 
aacatgggtc 
tagatgcaga 
tgttcatgtc 
gtattcttga 
agacaatttc 
gacaagttca 
cactgaaaag 
ttgccacata 
ctgaaatgca 
ttactggtgt 
tggataaagt 



tacagctgca 
agaagcaatg 
gggaggtgat 
taaattgcac 
tgtgtggaaa 
gtatttgtat 
tcttgataaa 
cacaggagta 
atattggt tg 
gttattacct 
gttaaagaga 
tgatatgact 
aaatgggagt 
ttcaatggtt 
gatggagggt 
aaataacaac 
gttggccatg 
agagtttata 
ctctaaagcc 
aatcacactc 
tatcatcaaa 
tgcagcggct 
agttttgaat 
gcaacaatta 
caggaaaatt 
gaaaaagaag 
cactacatct 
agtggttact 
gctaggacaa 
caagaatgga 
agccatgacg 
ttctggtgca 
cttgttcatt 
catgagtaaa 
cgactttatt 
aatggatgat 
taaagatgtg 
gtttaatgcg 
agaacttaaa 
ttatgagaaa 
cacacactac 
atatgagacg 
tagggcacca 
ggctacaatt 
cgttttgaat 
acgcccaaaa 
agattttgcc 
aagcgcggat 
taatcaaaat 
gcgtaggact 
tattgtaacg 
aaagggatgg 
aaaaatagca 
aagtaaccca 
acctactaac 
tccaatgaaa 
tacttttgag 
tcctctattc 
agggcatgaa 
tggaactgtg 
tgacaaattt 



gcagcagctt 
gatttggaag 
gacgatgatg 
cgtgacgcat 
aaatggctca 
acactagtga 
tctgcactgt 
ggagtaggag 
agggtattat 
gcatattctt 
atagacatta 
gccaaccttc 
gttttcatgg 
aacgaatttc 
gttaagggta 
attaacaata 
caaatggtga 
acaaaagaac 
aaacaagaaa 
actaaaaaac 
gaaactaaaa 
gcggcagttg 
atggatctgt 
attaacgcca 
ttgagggcag 
gtaactaggg 
aatagaatga 
aatttaatgt 
aaatgcctag 
ggatttggga 
atatattctt 
tctacccagg 
attcaaaatc 
gacaaggctt 
aaccacgata 
gactataatg 
gtagctgccg 
tcttggatgg 
aacattagag 
ttatgtaaca 
tctatagcca 
aaacaaaaga 
aaggagtcca 
gtgttccaat 
tcttcattcg 
tcaattacca 
cattgcaagg 
gagttttcaa 
aaggttaata 
ggtaaatttt 
tccacatgca 
ctgtgtatgt 
gcagcttctg 
gaaacaccat 
aatctccact 
agggtacaat 
gaagaagaag 
aaaaaatgta 
tttgccaagt 
gtggatttgg 
attgtgaaaa 



attatgagga 
acggaggaag 
aagaaagtgg 
ctactctaga 
ctgaaaataa 
gaatgcaaca 
ctggacgctt 
gaggaggagg 
ctatgcccat 
cctatctttc 
ttaagcacgc 
aagacttggt 
acaactttga 
ttatgtacag 
gtt taaagat 
ggcacaataa 
ctagtatttc 
aagggcagcc 
aggatgcaat 
acgttttaaa 
agaacaataa 
aaaattctgt 
ctgacattag 
ttaaaactcg 
ccgaatctcc 
ctaaatctat 
tccatggcac 
acaagaatgg 
aaattaaaga 
acaatgtata 
gccaagggca 
atgcgtatgt 
attctgttga 
atactatgcc 
cagttagtgt 
gagatggagg 
ctgaagaatt 
ccaacactgc 
actttttctt 
agaaaatgat 
gtggtattgt 
acaagaaaga 
ctattgaaat 
tttatgtcca 
cctt tttgcc 
ccaattctac 
acgggggaga 
ataacattga 
aatacatgga 
gtcgtcctac 
cagctcaaga 
cagacgaagc 
gtgttagtga 
ctgatacgat 
t ttcaatgtc 
tttctttgca 
gtcttcccac 
aaatgtatgg 
attcaacatt 
aaaagatatc 
atgtatcaaa 



cgacgaggac 
tggaggttct 
agaaatgata 
tcgtgtaggt 
gccttcagat 
ggttaaattt 
gttccactcg 
agcaggggca 
atcaacagga 
tgccctttcg 
ttattcgtta 
agatacgcga 
cccagtgcaa 
aaaggaagat 
ttccagagtg 
ctcacttaaa 
aaagggaaat 
taaggatatg 
ttcttccaaa 
gaatgcagtc 
caattcctca 
gcccgcatta 
ccacgagaaa 
cagtacacca 
tagggcatta 
tactctttat 
tcaggaaagg 
tgaattgaca 
aggacgacaa 
cgttcaatat 
cactttcttt 
tgctgtaacc 
acggggtaac 
tattggaggc 
ctctagagag 
agtaacaatg 
catcctaagt 
taaggttatt 
tggtgttaat 
tgagctatac 
aaaacagccc 
ctatatcaaa 
gttgttgttt 
ttatatattt 
atctcccaat 
tcatgtacca 
aagagatgtc 
aggtattttg 
tgttgtatgt 
ggaaacttgt 
gaagggagaa 
taatgtatat 
attccccatt 
tagtcttttg 
tgaagatgtc 
tgtaaagaga 
aaagatattc 
tgctataatc 
ggatatcaga 
aggagaaggt 
catattgttt 



2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 
5400 
5460 
5520 
5580 
5640 
5700 
5760 
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aaggaacaag ggaagcgtgt ctctttcttt gtttcatgtg 



ctatacacta g 



5811 



<2X0> 13 
<211> 1926 
<212> PRT 
<213> SHRIMP 

<400> 13 

Met Gly Ser Lys Arg Pro Cys Ser Ser Gly Gin Glu Pro Val Thr Lys 

IS 10 15 

Lys Gin Lys Lys Asn Asn Asn Asn Asn Ser Asn Pro Val Pro Val He 

20 25 30 

Asn He Lys Ser Tyr Pro Phe Leu Ala Thr Arg Thr Gin Val Leu Arg 

35 40 45 

Ser Ala Val Ala Ala Ala Ala Asp Ser Gly Ser Ser Ser Ser Ser Ser 

50 55 60 

Ser Ser Ala Ser Ala Val Lys Leu Pro Asp Thr Cys Lys Glu Ala Arg 
65 70 75 80 

Lys Val Leu Ser Thr Val Ser Leu Gin Gin Ser Leu Ala Val Arg Tyr 

85 90 95 

Leu Cys Asn Ser He Ser Val Ser Tyr Ala Gly Gly Gly He Ser Val 

100 105 110 

Phe His Leu Gly Gly Leu Pro Gly Ala Gly Lys Thr Thr Met Val Lys 

115 120 125 

Glu Leu He Ala Val Leu Asn Asp His Gly Leu He Asp Ser Gly Ser 

130 135 140 

Ala Asp Met Leu Leu Cys Cys Lys Ser Asn Ser Ala Lys Glu Ser Leu 
145 150 155 160 

Met Cys Ala Cys Lys Lys Pro Gly Gly Ser Ser Leu Met Tyr Pro Glu 

165 170 175 

Ser Val Phe Ser Thr Leu Asn Lys Gly Phe Glu He Pro Val He Phe 

180 185 190 

Arg Lys Asp Glu He Thr Leu Glu Lys He Gin Phe Val Ala Asp Lys 

195 200 205 

Leu Lys Trp Lys Val He Gin Val Leu Ala Asn Leu Arg Phe Leu Val 

210 215 220 

He Asp Glu Tyr Thr Met Ala Ser Cys Arg Glu Leu Val Phe He Asp 
225 230 235 240 

Ala Val Leu Arg He Ala Lys His Arg Pro Asp He Pro Phe Gly Gly 

245 250 255 

Val Phe Val He Leu Leu Gly Asp Asn Arg Gin Asn Ser Ala Val Val 

260 265 270 

Glu Asp Asn Thr Asn His He Gin Lys Lys He Lys Asn Pro Ser Glu 

275 280 285 

Glu Glu Lys Pro Gin Lys Asn Asn Lys Asn Asn Lys Asn Lys Lys Lys 

290 295 300 

Lys Lys Glu Lys Lys Glu Lys Gly Gly Glu Glu Glu Glu Gly Asp Glu 
305 310 315 320 

Asn Glu Glu Glu Glu Gly Glu Glu Glu Glu Glu Glu Glu Glu Ser Asp 

325 330 335 

Asp Glu Ala Glu Thr Lys Lys Glu Glu Glu Lys Ser Thr Phe Phe Gin 

340 345 350 

Gly Ser Val Glu Gin Asp Asn Phe Gly Gin Glu Asp Asn Ala Lys Leu 

355 360 365 

Tyr Thr Glu Val Phe He Lys He Leu Lys Met Phe Cys Ser Arg Asp 

370 375 380 

Phe Phe Gly Asn Pro Ser Asn Leu Arg Asn He Val Asn Lys Arg His 
385 390 395 400 

Glu Ala He Leu Met Lys Ser Asn Asn Val Lys Ser Val Asn Asn Asn 

405 410 415 

Leu Val Ser Ser Ala He Lys Val Glu Asp Cys Gly Asn Asn Lys Lys 
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420 

Glu Val Thr Ala 
435 

Asn Cys Asp Glu 
450 

Glu Ala Phe Leu 

465 • 

Ala Ser Gly Ala 

Asp Ala Asn Glu 
500 

Lys Ser Ser Leu 
515 

Glu Glu Gly lie 
530 

Asn Ala Asn Leu 
545 

Asp Met Met Ala 

Lys Leu Tyr lie 
580 

Ser Leu Met Asp 
595 

Asp Pro Lys Cys 
610 

Ala Ser Ala Val 
625 

Lys Asp Phe Val 

Leu Arg Thr Val 
660 

Glu Leu Leu Ala 
675 

Glu Thr Gly Asn 
690 

Ala Tyr Tyr Glu 
705 

Ala Met Asp Leu 

Ser Gly Gly Gly 
740 

Tyr Arg Thr Asp 
755 

Asp Arg Val Gly 
770 

Leu Thr Glu Asn 
785 

Leu Tyr Thr Leu 

Leu Pro Ser Leu 
820 

Pro Ser Glu Trp 
835 

Gly Ala Gly Ala 
850 

Leu Ser Met Pro 
865 

Leu Pro Ala Tyr 

Met Ser Ser Leu 
900 



Pro Ser Ser Ser 
440 

Phe Asp Asp Glu 
455 

Lys Leu Met Glu 
470 

Leu Asn Gly Phe 
485 

Lys lie Arg Ser 

Asp Met Met Lys 
520 

Cys Ser Glu Leu 
535 

Glu Lys Tyr Thr 
550 

Lys Ala Met Arg 
565 

Val Ser Ser Leu 

Glu Glu lie Leu 
600 

He Asp Ala Val 
615 

Ala Ala Cys Val 
630 

Asp Glu Thr Pro 
645 

Ala Ser Phe Leu 

Lys Ser Glu Asn 
680 

Asn Asn Ser Ala 
695 

Asp Asp Glu Asp 
710 

Glu Asp Gly Gly 
725 

Gly Asp Asp Asp 

He Pro Asp Lys 
760 

His Leu Val Asp 
775 

Lys Pro Ser Asp 
790 

Val Arg Met Gin 
805 

Asp Lys Ser Ala 

Ala Thr Ser Thr 
840 

Asp Lys Pro Leu 
855 

He Ser Thr Gly 
870 

Ser Ser Tyr Leu 
885 

Lys Arg He Asp 



425 



Pro 


Ala 


Gin 


Ser 


Glu 


Asp 


Asp 


Asp 








460 


Arg 


Asn 


Ala 


Leu 






475 




Ser 


Leu 


Arg 


Cys 




490 






Gly 


Thr 


Thr 


Ser 


505 








Ser 


Leu 


Pro 


Leu 


Ala 


His 


Glu 


Leu 








540 


Glu 


Asn 


Val 


Cys 






555 




Glu 


He 


Asp 


Tyr 




570 






Ser 


Phe 


Lys 


Asp 


585 








Asn 


Val 


Lys 


Tyr 


Pro 


Phe 


Asn 


Ser 








620 


Arg 


Asn 


Ala 


Phe 






635 




He 


Ala 


Asn 


Tyr 




650 






Glu 


Asn 


Glu 


Thr 


665 








He 


Arg 


Ser 


He 


Ser 


Ser 


Arg 


Thr 








700 


Tyr 


Cys 


Tyr 


Phe 






715 




Ser 


Gly 


Gly 


Ser 




730 






Asp 


Glu 


Glu 


Ser 


745 








Leu 


His 


Arg 


Asp 


Phe 


His 


Val 


Val 








780 


Leu 


Val 


Arg 


Ala 






795 




Gin 


Val 


Lys 


Phe 




810 






Leu 


Ser 


Gly 


Arg 


825 








Gly 


Val 


Gly 


Val 


His 


Asp 


Glu 


Tyr 








860 


Gly 


Asp 


Val 


Gly 






875 




Ser 


Ala 


Leu 


Ser 




890 






He 


He 


Lys 


His 



905 





430 






Thr 


Ala 


Glu 


Glu 


445 








Phe 


Phe 


Asn 


Asn 


Glu 


Lys 


Asp 


Arg 








480 


Lys 


Ser 


He 


Ser 






4 95 




Val 


Ser 


Asp 


Lys 




510 






Ser 


Ala 


Leu 


He 


525 








Lys 


Lys 


Met 


Ser 


Ser 


He 


Val 


Phe 








560 


Ser 


Gly 


Arg 


Glu 






575 




Thr 


His 


Leu 


Thr 




590 






Val 


His 


Gly 


Ser 


605 








Ala 


His 


Asn 


Arg 


Phe 


Arg 


Asp 


Gly 








640 


Phe 


Lys 


Asp 


Asn 






655 




Leu 


Thr 


Tyr 


Lys 




670 






Leu 


Leu 


Lys 


Lys 


685 








Ala 


Ala 


Ala 


Ala 


Asp 


Glu 


Glu 


Glu 








720 


Gly 


Met 


Lys 


Ser 






735 




Gly 


Glu 


Met 


He 




750 






Ala 


Ser 


Thr 


Leu 


765 








Trp 


Lys 


Lys 


Trp 


Arg 


Val 


Trp 


Tyr 








800 


Asp 


Asn 


Gly 


Lys 






815 




Leu 


Phe 


His 


Ser 




830 






Gly 


Gly 


Gly 


Gly 


845 








Trp 


Leu 


Arg 


Val 


Lys 


Ser 


Met 


Leu 








880 


Arg 


Thr 


Tyr 


He 






895 




Ala 


Tyr 


Ser 


Leu 




910 
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Met 


Tyr 


Gly 
915 


He 


Ser 


Leu 


Phe 


Asp 
920 


Met 


Thr 


Ala 


Asn 


Leu 
925 


Gin 


Asp 


Leu 


Val 


Asp 
930 


Thr 


Arg 


Met 


Ala 


Gly 
935 


Arg 


Ser 


Ser 


Arg 


Asn 
940 


Gly 


Ser 


Val 


Phe 


Met 


Asp 


Asn 


Phe 


Asp 


Pro 


Val 


Gin 


Tyr 


Phe 


Asp 


Asn 


He 


Phe 


Pro 


Ser 


945 










950 










955 










960 


Met 


Val 


Asn 


Glu 


Phe 
965 


Leu 


Met 


Tyr 


Arg 


Lys 
970 


Glu 


Asp 


Val 


Phe 


Asn 
975 


Asn 


Gly 


Gin 


Met 


Met 
980 


Glu 


Gly 


Val 


Lys 


Gly 
985 


Ser 


Leu 


Lys 


He 


Ser 
990 


Arg 


Val 


Leu 


Gin 


Thr 


Ala 


His 


Thr 


Glu 


Asn 


Asn 


Asn 


He 


Asn 


Asn 


Arg 


His 


Asn 






995 










1000 








1005 






Asn 


Ser 


Leu 


Lys 


Tyr 


Ser 


Glu 


Lys 


Ser 


He 


Val 


Leu 


Ala 


Met 


Gin 


Met 




1010 








1015 








1020 








Val 


Thr 


Ser 


He 


Ser 


Lys 


Gly 


Asn 


Glu 


Arg 


Arg 


Lys 


Lys 


He 


Glu 


Glu 


1025 








1030 








1035 








104i 


Phe 


lie 


Thr 


Lys 


Glu 


Gin 


Gly 


Gin 


Pro 


Lys 


Asp 


Met 


Cys 


Glu 


Arg 


Leu 










1045 








1050 








1055 


Met 


Ala 


Asn 


Ser 


Lys 


Ala 


Lys 


Gin 


Glu 


Lys 


Asp 


Ala 


He 


Ser 


Ser 


Lys 








1060 








1065 








1070 




Thr 


Asp 


Lys 


Met 


Met 


Gly 


Ala 


He 


Thr 


Leu 


Thr 


Lys 


Lys 


His 


Val 


Leu 






1075 








1080 








1085 






Lys 


Asn 


Ala 


Val 


Ser 


Asn 


Leu 


Val 


Asp 


Thr 


Ser 


He 


He 


Lys 


Glu 


Thr 




1090 








1095 








1100 








Lys 


Lys 


Asn 


Asn 


Asn 


Asn 


Ser 


Ser 


Ser 


Ser 


Ser 


Ser 


Thr 


Ser 


Leu 


Ala 


1105 








1110 








1115 








1121 


Ala 


Ala 


Ala 


Ala 


Val 


Glu 


Asn 


Ser 


Val 


Pro 


Ala 


Leu 


Arg 


Val 


Glu 


Val 










1125 








1130 








1135 


Lys 


Phe 


Val 


Val 


Leu 


Asn 


Met 


Asp 


Leu 


Ser 


Asp 


He 


Ser 


His 


Glu 


Lys 








1140 








1145 








1150 




Thr 


He 


Ser 


His 


Lys 


Tyr 


Arg 


Gin 


Gin 


Leu 


He 


Asn 


Ala 


He 


Lys 


Thr 






1155 








1160 








1165 






Arg 


Ser 


Thr 


Pro 


Leu 


Phe 


Asp 


Lys 


Phe 


Thr 


Asp 


Arg 


Lys 


He 


Leu 


Arg 




1170 








1175 








1180 








Ala 


Ala 


Glu 


Ser 


Pro 


Arg 


Ala 


Leu 


Thr 


Thr 


He 


Leu 


Leu 


Asp 


Glu 


Lys 


1185 








1190 








1195 








1201 


Lys 


Lys 


Val 


Thr 


Arg 


Ala 


Lys 


Ser 


He 


Thr 


Leu 


Tyr 


Gin 


Gly 


Gin 


Asn 










1205 








1210 








1215 


Val 


He 


Phe 


Thr 


Thr 


Ser 


Asn 


Arg 


Met 


He 


His 


Gly 


Thr 


Gin 


Glu 


Arg 








1220 








1225 








1230 




Phe 


Val 


Thr 


Lys 


Asp 


Thr 


Gly 


Val 


Val 


Thr 


Asn 


Leu 


Met 


Tyr 


Lys 


Asn 






1235 








1240 








1245 






Gly 


Glu 


Leu 


Thr 


Val 


Phe 


Val 


Tyr 


Val 


Glu 


Arg 


Leu 


Gly Gin 


Lys 


Cys 




1250 








1255 








1260 








Leu 


Glu 


He 


Lys 


Glu 


Gly 


Arg 


Gin 


He 


He 


Gly 


Asn 


Pro 


Asn 


He 


Lys 


1265 








1270 








1275 








128( 


Asn 


Gly 


Gly 


Phe 


Gly 


Asn 


Asn 


Val 


Tyr 


Val 


Gin 


Tyr 


Leu 


Pro 


Phe 


Glu 










1285 








1290 








1295 


Ser 


Ser 


Gin 


Ala 


Met 


Thr 


He 


Tyr 


Ser 


Cys 


Gin 


Gly 


His 


Thr 


Phe 


Phe 








1300 








1305 








1310 




Arg 


Asp 


Thr 


He 


Val 


Asp 


Leu 


Ser 


Gly 


Ala 


Ser 


Thr 


Gin 


Asp 


Ala 


Tyr 






1315 








1320 








1325 






Val 


Ala 


Val 


Thr 


Arg 


Asn 


Ser 


Asn 


Pro 


Gin 


Asn 


Leu 


Phe 


He 


He 


Gin 



1330 1335 1340 



Asn His Ser Val Glu Arg Gly Asn Leu Cys Asn He Lys Cys Ala Met 

1345 1350 1355 1360 

Ser Lys Asp Lys Ala Tyr Thr Met Pro He Gly Gly He Ala Asp Phe 

1365 1370 1375 

Asn Gly Ser Asp Phe He Asn His Asp Thr Val Ser Val Ser Arg Glu 

1380 1385 1390 

Val Ala Glu Ser Ser Ala Ala Met Asp Asp Asp Tyr Asn Gly Asp Gly 
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1395 1400 1405 

Gly Val Thr Met Tyr Ser Ala Tyr Asp Pro Ser Lys Asp Val Val Ala 

1410 1415 1420 

Ala Ala Glu Glu Phe lie Leu Ser Arg Ser Gly Lys Ser Leu Ser Phe 
1425 1430 1435 1440 

Asn Ala Ser Trp Met Ala Asn Thr Ala Lys Val lie Gin Gin His Gly 

1445 1450 1455 

Leu Glu Thr Glu Leu Lys Asn lie Arg Asp Phe Phe Phe Gly Val Asn 

1460 1465 1470 

Asn Gly Asp Val Ala Lys His Tyr Glu Lys Leu Cys Asn Lys Lys Met 

1475 1480 1485 

lie Glu Leu Tyr Thr Ala lie Val Arg Ser lie Thr His Tyr Ser He 

1490 1495 1500 

Ala Ser Gly He Val Lys Gin Pro Ser Ser Lys Leu Cys Glu Glu Tyr 
1505 1510 1515 1520 

Glu Thr Lys Gin Lys Asn Lys Lys Asp Tyr lie Lys He His Pro Val 

1525 1530 1535 

Phe Val Asn Arg Ala Pro Lys Glu Ser Thr He Glu Met Leu Leu Phe 

1540 1545 1550 

Asp He Ala Pro His Asn Lys Ala Thr He Val Phe Gin Phe Tyr Val 

1555 1560 1565 

His Tyr He Phe Leu Val Tyr Glu Lys Leu Asn Val Leu Asn Ser Ser 

1570 1575 1580 

Phe Ala Phe Leu Pro Ser Pro Asn Pro Cys Leu Asn Gin Tyr Val Arg 
1585 1590 1595 1600 

Pro Lys Ser He Thr Thr Asn Ser Thr HiS Val Pro Asn Leu Gly Tyr 

1605 1610 1615 

Glu Ser Lys Asp Phe Ala His Cys Lys Asp Gly Gly Glu Arg Asp Val 

1620 1625 1630 

Lys Leu Arg Leu Pro He Thr Ser Ala Asp Glu Phe Ser Asn Asn He 

1635 1640 1645 

Glu Gly He Leu Lys Lys Val Ser Asp Thr Ser Asn Gin Asn Lys Val 

1650 1655 1660 

Asn Lys Tyr Met Asp Val Val Cys Lys Ser Met Gin His Asn Leu Arg 
1665 1670 1675 1680 

Arg Thr Gly Lys Phe Cys Arg Pro Thr Glu Thr Cys Gly Leu Ser Lys 

1685 1690 1695 

His Gly Ser He Val Thr Ser Thr Cys Thr Ala Gin Glu Lys Gly Glu 

1700 1705 1710 

Asn He His Val Asp Ala Glu Lys Gly Trp Leu Cys Met Ser Asp Glu 

1715 1720 1725 

Ala Asn Val Tyr Cys Met Leu Met Phe Met Ser Lys He Ala Ala Ala 

1730 1735 1740 

Ser Gly Val Ser Glu Phe Pro He Lys Asp Lys Ser He Ser Asn Pro 
1745 1750 1755 1760 

Glu Thr Pro Ser Asp Thr He Ser Leu Leu Ala Pro Arg Lys Thr He 

1765 1770 1775 

Ser Pro Thr Asn Asn Leu His Phe Ser Met Ser Glu Asp Val Leu Phe 

1780 1785 1790 

Cys Gly Gin Val His Pro Met Lys Arg Val Gin Phe Ser Leu His Val 

1795 1800 1805 

Lys Arg Thr Gly Gly Ala Leu Lys Ser Thr Phe Glu Glu Glu Glu Gly 

1810 1815 1820 

Leu Pro Thr Lys He Phe Ser Pro Asn Phe Ala Thr Tyr Pro Leu Phe 
1825 1830 1835 1840 

Lys Lys Cys Lys Met Tyr Gly Ala He He He Ala Met Thr Glu Met 

1845 1850 1855 

Gin Gly His Glu Phe Ala Lys Tyr Ser Thr Leu Asp He Arg Lys Ser 

1860 1865 1870 

Met Phe Thr Gly Val Gly Thr Val Val Asp Leu Glu Lys He Ser Gly 
1875 1880 1885 
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Glu Gly Asn Glu Val Met Asp Lys Vai Asp Lys Phe lie Val Lys Asn 

1890 1895 1900 

Val Ser Asn He Leu Phe Lys Glu Gin Gly Lys Arg Val Ser Phe Phe 
1905 1910 1915 1920 

Val Ser Cys Ala He His 
1925 



<210> 14 
<211> 768 
<212> DNA 
<213> SHRIMP 



<400> 14 

atggctgggg 

aataaaa tgc 

actttaccaa 

ttccctgatc 

gcatgggacg 

caattgattg 

gatgaccaag 

aatcacgggc 

t tttctctaa 

ttcaataatc 

aaggatatcg 

tccgatatag 

aaaccagt tg 



tagatttgta 
agggcaacag 
tgtctgaaga 
cgaaatcggc 
tactaaaaac 
aaaatcttca 
ggaggtatga 
taatcagaac 
caattgagaa 
aaacggataa 
caattaagcc 
ttacaaagaa 
atcacctttt 



tggaggacat 
agggaaaatt 
caacattagt 
attaactttt 
gccagctagc 
catgagtgat 
aaccagctac 
actggctaag 
ttttgaagta 
atcttgcttt 
tgaaaaaaga 
tacgcgtaat 
attgtactgc 



attaaaccct 
agagcactaa 
gcctgggtaa 
tttgtaccga 
gttgaaattg 
tttacagtgg 
aatttatctc 
gcccaagata 
gataatggga 
gtaaagatct 
ggttcatctg 
gcaaaatttt 
atggttacat 



acggagagac 
taaatgaaaa 
ctgaggtggc 
acaagagcct 
atatagggaa 
caatttttag 
cttctatggg 
ttgtagtctg 
aaaaacggct 
tccacgaaa t 
ctgtatggga 
cattacgata 
atttttaa 



tgtattcaat 
agcagcagca 
tgcggatgtt 
caatgcgttt 
gagaattcct 
agtaaaatgt 
aggaaaaata 
gaagagggat 
cgattttttg 
ggaatctgaa 
tgaagtatat 
tagaaatgaa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

768 



<210> 15 
<211> 255 
<212> PRT 
<213> SHRIMP 



<400> 15 



Met 


Ala 


Gly 


Val 


Asp 


Leu 


Tyr 


Gly 


Gly 


His 


He 


Lys 


Pro 


Tyr 


Gly 


Glu 


1 








5 










10 










15 




Thr 


Val 


Phe 


Asn 


Asn 


Lys 


Met 


Gin 


Gly 


Asn 


Arg 


Gly 


Lys 


He 


Arg 


Ala 








20 










25 










30 






Leu 


He 


Asn 


Glu 


Lys 


Ala 


Ala 


Ala 


Thr 


Leu 


Pro 


Met 


Ser 


Glu 


Asp 


Asn 






35 










40 










45 








He 


Ser 


Ala 


Trp 


Val 


Thr 


Glu 


Val 


Ala 


Ala 


Asp 


Val 


Phe 


Pro 


Asp 


Pro 




50 










55 










60 










Lys 


Ser 


Ala 


Leu 


Thr 


Phe 


Phe 


Val 


Pro 


Asn 


Lys 


Ser 


Leu 


Asn 


Ala 


Phe 


65 










70 










75 










80 


Ala 


Trp 


Asp 


Val 


Leu 


Lys 


Thr 


Pro 


Ala 


Ser 


Val 


Glu 


He 


Asp 


He 


Gly 










85 










90 










95 




Lys 


Arg 


He 


Pro 


Gin 


Leu 


He 


Glu 


Asn 


Leu 


His 


Met 


Ser 


Asp 


Phe 


Thr 








100 










105 










110 






Val 


Ala 


He 


Phe 


Arg 


Val 


Lys 


Cys 


Asp 


Asp 


Gin 


Gly 


Arg 


Tyr 


Glu 


Thr 






115 










120 










125 








Ser 


Tyr 


Asn 


Leu 


Ser 


Pro 


Ser 


Met 


Gly 


Gly 


Lys 


He 


Asn 


His 


Gly 


Leu 




130 










135 










140 










He 


Arg 


Thr 


Leu 


Ala 


Lys 


Ala 


Gin 


Asp 


He 


Val 


Val 


Trp 


Lys 


Arg 


Asp 


145 










150 










155 










160 


Phe 


Ser 


Leu 


Thr 


He 


Glu 


Asn 


Phe 


Glu 


Val 


Asp 


Asn 


Gly 


Lys 


Lys 


Arg 










165 










170 










175 




Leu 


Asp 


Phe 


Leu 


Phe 


Asn 


Asn 


Gin 


Thr 


Asp 


Lys 


Ser 


Cys 


Phe 


Val 


Lys 








180 










185 










190 






He 


Phe 


His 


Glu 


Met 


Glu 


Ser 


Glu 


Lys 


Asp 


He 


Ala 


He 


Lys 


Pro 


Glu 






195 










200 










205 








Lys 


Arg 


Gly 


Ser 


Ser 


Ala 


Val 


Trp 


Asp 


Glu 


Val 


Tyr 


Ser 


Asp 


He 


Val 
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210 215 220 

Thr Lys Asn Thr Arg Asn Ala Lys Phe Ser Leu Arg Tyr Arg Asn Glu 
225 230 235 240 

Lys Pro Val Asp His Leu Leu Leu Tyr Cys Met Val Thr Tyr Phe 
245 250 255 



<210> 16 
<211> 348 
<212> DNA 
<213> SHRIMP 



<400> 16 

atggcttcgc 

ttaataacaa 

tctctcctgg 

ggagtgggcg 

acaccagatg 

cccacccgtc 



cattggtggc gtctttagga ggaggaaaaa atatactttt tggtttactt 60 

taattgtcat tgttattgct gtaataataa tcaaggctcc tctcttagca 120 

cgggcacagc actggccggc acgatcgctt ctgcgctggg atcaatacca 180 

gtgcattcaa gaaagccttt ggaaaaggaa aggggaaagg aggaccaaaa 24 0 

gtggggccaa gaaaactaac caaaaaccaa agaagggtaa gaaaaaaccc 300 

gttccatatt caagaggata cctaaaataa aattttag 348 



<210> 17 
<211> 111 
<212> PRT 
<213> SHRIMP 



<400> 17 



Met 


Asp 


Leu 


Val 


Ala 


Ser 


Leu 


Gly 


Gly 


Gly 


Lys 


Asn 


He 


Leu 


Phe 


Gly 


1 








5 










10 










15 




Leu 


Leu 


Leu 


He 
20 


Thr 


He 


He 


Val 


He 
25 


Val 


He 


Ala 


Val 


He 
30 


He 


He 


Lys 


Ala 


Pro 

35 


Leu 


Leu 


Ala 


Ser 


Leu 
40 


Leu 


Ala 


Gly 


Thr 


Ala 
45 


Gly 


Thr 


He 


Ala 


Ser 
50 


Ala 


Leu 


Gly 


Ser 


He 
55 


Pro 


Gly 


Val 


Gly 


Gly 
60 


Ala 


Phe 


Lys 


Lys 


Ala 


Phe 


Gly 


Lys 


Gly 


Lys 


Gly 


Lys 


Gly 


Gly 


Pro 


Lys 


Thr 


Pro 


Asp 


Gly 


65 










70 










75 










80 


Gly 


Ala 


Lys 


Lys 


Thr 

85 


Asn 


Gin 


Lys 


Pro 


Lys 
90 


Lys 


Gly 


Lys 


Lys 


Lys 
95 


Pro 


Pro 


Thr 


Arg 


Arg 
100 


Ser 


He 


Phe 


Lys 


Arg 
105 


He 


Pro 


Lys 


He 


Lys 
110 


Phe 





<210> 18 
<211> 687 
<212> DNA 
<213> SHRIMP 



<400> 18 

atggtctctt 

ctccccacca 

gaacaagaag 

aaggaggaag 

gatggtgaag 

gacgacgaca 

aacatcaaac 

gaatttgagg 

ctgatcaacc 

gacatggaga 

caggctgtag 

agtttgattg 



ctagaacatc 
agaggaagag 
aactggtaga 
aacacaaaga 
aagaggaata 
ctgcagagaa 
tgaaaaaccc 
tacaaaatac 
gtgaagccgt 
ttggatctct 
tacagaaagt 
atgatccttg 



aacaacatct 
ggagccagaa 
ggactcgtca 
aacacatcac 
tgaggaagag 
aatggaaaat 
cctaagggaa 
tgtcaatttt 
ttctaggtgg 
tacagttaac 
gaaggaagat 
ttggtaa 



tcatctgcag 
gaagtaaagg 
agtaacaagc 
ctctccctcc 
gaggatgagg 
cttttggtgc 
catgacatgg 
agttttggag 
ggtaatacac 
cagttgctcc 
ataacccctt 



tagcagccac 
tgaaagtgga 
gccccagaat 
catgtaaaga 
aagaatatga 
aactggacaa 
cagtttcaca 
tactatctga 
ccccaccaaa 
acaagtgtga 
ctattgaagt 



ctctactctt 
agtaaaaatg 
taaggaagag 
agaagaagac 
agacagagtg 
tactaccaaa 
eta tgagca t 
tattgggttc 
agagtttggc 
taattttg ta 
tacaatagat 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

687 
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<210> 19 
<211> 228 
<212> PRT 
<213> SHRIMP 



<400> 19 



Met 


Val 


Ser 


Ser 


Arg 


Thr 


Ser 


Thr 


Thr 


Ser 


Ser 


Ser 


Ala 


Val 


Ala 


Ala 


1 








5 










10 










15 




Thr 


Ser 


Thr 


Leu 


Leu 


Pro 


Thr 


Lys 


Arg 


Lys 


Arg 


Glu 


Pro 


Glu 


Glu 


Val 








20 










25 










30 






Lys 


Val 


Lys 


Val 


Glu 


Val 


Lys 


Met 


Glu 


Gin 


Glu 


Glu 


Leu 


Val 


Glu 


Asp 






35 










40 










45 








Ser 


Ser 


Ser 


Asn 


Lys 


Arg 


Pro 


Arg 


He 


Lys 


Glu 


Glu 


Lys 


Glu 


Glu 


Glu 




50 










55 










60 










His 


Lys 


Glu 


Thr 


His 


His 


Leu 


Ser 


Leu 


Pro 


Cys 


Lys 


Glu 


Glu 


Glu 


Asp 


65 










70 










75 










80 


Asp 


Gly 


Glu 


Glu 


Glu 


Glu 


Tyr 


Glu 


Glu 


Glu 


Glu 


Asp 


Glu 


Glu 


Glu 


Tyr 










85 










90 










95 




Glu 


Asp 


Arg 


Val 


Asp 


Asp 


Asp 


Thr 


Ala 


Glu 


Lys 


Met 


Glu 


Asn 


Leu 


Leu 








100 










105 










110 






Val 


Gin 


Leu 


Asp 


Asn 


Thr 


Thr 


Lys 


Asn 


He 


Lys 


Leu 


Lys 


Asn 


Pro 


Leu 






115 










120 










125 








Arg 


Glu 


His 


Asp 


Met 


Ala 


Val 


Ser 


His 


Tyr 


Glu 


His 


Glu 


Phe 


Glu 


Val 




130 










135 










140 










Gin 


Asn 


Thr 


Val 


Asn 


Phe 


Ser 


Phe 


Gly 


Val 


Leu 


Ser 


Asp 


He 


Gly 


Phe 


145 










150 










155 










160 


Leu 


He 


Asn 


Arg 


Glu 


Ala 


Val 


Ser 


Arg 


Trp 


Gly 


Asn 


Thr 


Pro 


Pro 


Pro 










165 










170 










175 




Lys 


Glu 


Phe 


Gly 


Asp 


Met 


Glu 


He 




Ser 


Leu 


Thr 


Val 


Asn 


Gin 


Leu 








180 










185 










190 






Leu 


His 


Lys 


Cys 


Asp 


Asn 


Phe 


Val 


Gin 


Ala 


Val 


Val 


Gin 


Lys 


Val 


Lys 






195 










200 










205 








Glu 


Asp 


He 


Thr 


Pro 


Ser 


He 


Glu 


Val 


Thr 


He 


Asp 


Ser 


Leu 


He 


Asp 




210 










215 










220 










Asp 


Pro 


Cys 


Trp 



























225 



<210> 20 
<211> 1698 
<212> DNA 
<213> SHRIMP 



<400> 20 

atggcaaggt 

aaaatggaag 

cctgacaaat 

gaattaaggg 

aacggagtgc 

tatactcaca 

ggggaaaagt 

gaaaaggccc 

gaaagacttt 

ttagtagggt 

ttattggtgg 

ttaacataca 

ctaattgttc 

aaaagtattg 

agggataatt 

aatatgaagg 

aggatttctt 



ctgtaggttt 
aattcaagac 
acgaacatat 
acggactata 
atagactgag 
atctccccag 
ggaacgaatc 
ttcttagtga 
ctaatgcagc 
gtggctacac 
aacaatgtgg 
aaggggccta 
caatgga taa 
gaaattatat 
tacagatgga 
aagaagaact 
tgagggatgt 



attatctgtg 
tttaaaggta 
tctat tatca 
tatag tccgt 
acaattgaca 
gttagggagt 
actccccatt 
caaaaacttt 
tgtgaaaaaa 
gccgagccat 
cttttcttcc 
cattgcaaac 
actaggtttg 
gtcatctctt 
taatttcaga 
gaaggaattg 
gcacaaaaaa 



acacccgagt 
aaaaatggat 
ttcaaatctg 
ctaaaagata 
ggcgacaata 
ttattgcaag 
gatatgcaaa 
aaattttctc 
tgtgactata 
tgccctagaa 
cgcatatcag 
ccaatcacag 
attttctaca 
ttcaatgcaa 
agagaaataa 
aggaaacgtt 
tcatcgattg 



atgacacatt 
tcactatttc 
ttgatagggt 
aggaagtact 
ctcttcaagt 
atgatgggtg 
atatcaacaa 
ctctctacag 
tgataataac 
cacttcgtaa 
tgtatgatat 
gcagttactc 
acagtacaca 
ccgtcatata 
agtttgcaga 
gtgccgtctc 
cgacatcccg 



taagtatatc 


60 


tggtgaaaac 


120 


tacaaaaagt 


180 


tcacataaaa 


240 


cggattaaag 


300 


tgaggattat 


360 


aatcgttaaa 


420 


gcttttacac 


480 


cactgacttt 


540 


catggaacaa 


600 


atgtgatagg 


660 


caacatgtgc 


720 


cccatcagct 


780 


cgcaaatgaa 


840 


gaatgaagta 


900 


tgaagaacaa 


960 


ctatgatggt 


1020 
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ggagcctgct 
ggaaatggtt 
tataaaaaga 
gatgccccta 
cgcaccagtg 
atggtcagag 
caaaatggtg 
aaaagggaag 
attttcaata 
ggaggagaaa 
aggaaaagag 
gacataaaag 



tagtgtttgc 
ccttttattc 
gagatgtggg 
tatgtattag 
gcaatagtat 
aggtaacagg 
ccgaaatcaa 
aaaacttgcc 
cgtcaagggc 
cttcagaaga 
aagaaaatga 
tttgctaa 



tttttctgac 
tgcaacagaa 
agaacgtagg 
ggacagtgta 
aacctttgca 
aacagacatc 
tgtgattgat 
cgaagacgaa 
aatttccatg 
tggagatgga 
agggaatgaa 



cgagatttct 
gaaggaatca 
ccccgtttga 
cgaaaccact 
gtccctccag 
aaaattttca 
ccgacatcta 
cggaagcgtt 
tataatgaag 
aacggcagca 
tatgtagtcc 



ccttgctatg 
ggtatgtctc 
tcatgtccat 
tcaagacgag 
gagaaaggga 
tggataatgg 
aggaatacaa 
tgcggcgaga 
aacgtggaga 
ccagcagtaa 
tt ttgaacaa 



cagaaccaac 
ttcacctgaa 
aactggatcg 
acttttctcc 
actaatggaa 
aaaagtatat 
agagttactc 
acgacgcatg 
tggaggaagt 
aggagagaaa 
ggcttgtaaa 



1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1698 



<210> 21 
<211> 563 
<212> PRT 
<213> SHRIMP 



<400> 21 



Met 


Ala 


Arg 


Ser 


Val 


Gly 


Leu 


Leu 


Ser 


Val 


Thr 


Pro 


Glu 


Tyr 


Asp 


Thr 


1 








5 










10 










15 




Phe 


Lys 


Tyr 


He 


Lys 


Met 


Glu 


Glu 


Phe 


Lys 


Thr 


Leu 


Lys 


Val 


Lys 


Asn 








20 










25 










30 






Gly 


Phe 


Thr 


He 


Ser 


Gly 


Glu 


Asn 


Pro 


Asp 


Lys 


Tyr 


Glu 


His 


He 


Leu 






35 










40 










45 








Leu 


Ser 


Phe 


Lys 


Ser 


Val 


Asp 


Arg 


Val 


Thr 


Lys 


Ser 


Glu 


Leu 


Arg 


Asp 




50 










55 










60 










Gly 


lie 


Val 


Arg 


Leu 


Lys 


Asp 


Lys 


Glu 


Val 


Leu 


His 


He 


Lys 


Asn 


Gly 


65 










70 










75 










80 


Val 


His 


Arg 


Leu 


Arg 


Gin 


Leu 


Thr 


Gly 


Asp 


Asn 


Thr 


Leu 


Gin 


Val 


Gly 










85 










90 










95 




Leu 


Lys 


Tyr 


Thr 


His 


Asn 


Leu 


Pro 


Arg 


Leu 


Gly 


Ser 


Leu 


Leu 


Gin 


Asp 








100 










105 










110 






Asp 


Gly 


Cys 


Glu 


Asp 


Tyr 


Gly 


Glu 


Lys 


Trp 


Asn 


Glu 


Ser 


Leu 


Pro 


He 






115 










120 










125 








Asp 


Met 


Gin 


Asn 


He 


Asn 


Lys 


He 


Val 


Lys 


Glu 


Lys 


Ala 


Leu 


Leu 


Ser 




130 










135 










140 










Asp 


Lys 


Asn 


Phe 


Lys 


Phe 


Ser 


Pro 


Leu 


Tyr 


Arg 


Leu 


Leu 


His 


Glu 


Arg 


145 










150 










155 










160 


Leu 


Ser 


Asn 


Ala 


Ala 


Val 


Lys 


Lys 


Cys 


Asp 


Tyr 


Met 


He 


He 


Thr 


Thr 










165 










170 










175 




Asp 


Phe 


Leu 


Val 


Gly 


Cys 


Gly 


Tyr 


Thr 


Pro 


Ser 


His 


Cys 


Pro 


Arg 


Thr 








180 










185 










190 






Leu 


Arg 


Asn 


Met 


Glu 


Gin 


Leu 


Leu 


Val 


Glu 


Gin 


Cys 


Gly 


Phe 


Ser 


Ser 






195 










200 










205 








Arg 


lie 


Ser 


Val 


Tyr 


Asp 


He 


Cys 


Asp 


Arg 


Leu 


Thr 


Tyr 


Lys 


Gly 


Ala 




210 










215 










220 










Tyr 


He 


Ala 


Asn 


Pro 


He 


Thr 


Gly 


Ser 


Tyr 


Ser 


Asn 


Met 


Cys 


Leu 


He 


225 










230 










235 










240 


Val 


Pro 


Met 


Asp 


Lys 


Leu 


Gly 


Leu 


He 


Phe 


Tyr 


Asn 


Ser 


Thr 


His 


Pro 










245 










250 










255 




Ser 


Ala 


Lys 


Ser 


He 


Gly 


Asn 


Tyr 


Met 


Ser 


Ser 


Leu 


Phe 


Asn 


Ala 


Thr 








260 










265 










270 






Val 


He 


Tyr 


Ala 


Asn 


Glu 


Arg 


Asp 


Asn 


Leu 


Gin 


Met 


Asp 


Asn 


Phe 


Arg 






275 










280 










285 








Arg 


Glu 


He 


Lys 


Phe 


Ala 


Glu 


Asn 


Glu 


Val 


Asn 


Met 


Lys 


Glu 


Glu 


Glu 




290 










295 










300 










Leu 


Lys 


Glu 


Leu 


Arg 


Lys 


Arg 


Cys 


Ala 


Val 


Ser 


Glu 


Glu 


Gin 


Arg 


He 


305 










310 










315 










320 


Ser 


Leu 


Arg 


Asp 


Val 


His 


Lys 


Lys 


Ser 


Ser 


He 


Ala 


Thr 


Ser 


Arg 


Tyr 










325 










330 










335 
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Asp 


Gly 


Gly 


Ala 


Cys 


Leu 


Val 


Phe 


Ala 


Phe 


Ser 


Asp 


Arg 


Asp 


Phe 


Ser 








340 










345 










350 






Leu 


Leu 


Cys 


Arg 


Thr 


Asn 


Gly 


Asn 


Gly 


Ser 


Phe 


Tyr 


Ser 


Ala 


Thr 


Glu 






355 










360 










365 








Glu 


Gly 


lie 


Arg 


Tyr 


Val 


Ser 


Ser 


Pro 


Glu 


Tyr 


Lys 


Lys 


Arg 


Asp 


Val 




370 










375 










380 










Gly 


Glu 


Arg 


Arg 


Pro 


Arg 


Leu 


He 


Met 


Ser 


He 


Thr 


Gly 


Ser 


Asp 


Ala 


385 










390 










395 










400 


Pro 


lie 


Cys 


He 


Arg 


Asp 


Ser 


Val 


Arg 


Asn 


His 


Phe 


Lys 


Thr 


Arg 


Leu 










405 










410 










415 




Phe 


Ser 


Arg 


Thr 


Ser 


Gly 


Asn 


Ser 


He 


Thr 


Phe 


Ala 


Val 


Pro 


Pro 


Gly 








420 










425 










430 






Glu 


Arg 


Glu 


Leu 


Met 


Glu 


Met 


Val 


Arg 


Glu 


Val 


Thr 


Gly 


Thr 


Asp 


He 






435 










440 










445 








Lys 


lie 


Phe 


Met 


Asp 


Asn 


Gly 


Lys 


Val 


Tyr 


Gin 


Asn 


Gly 


Ala 


Glu 


He 




450 










455 










460 










Asn 


Val 


lie 


Asp 


Pro 


Thr 


Ser 


Lys 


Glu 


Tyr 


Lys 


Glu 


Leu 


Leu 


Lys 


Arg 


465 










470 










475 










480 


Glu 


Glu 


Asn 


Leu 


Pro 


Glu 


Asp 


Glu 


Arg 


Lys 


Arg 


Leu 


Arg 


Arg 


Glu 


Arg 










485 










490 










495 




Arg 


Met 


He 


Phe 


Asn 


Thr 


Ser 


Arg 


Ala 


He 


Ser 


Met 


Tyr 


Asn 


Glu 


Glu 








500 










505 










510 






Arg 


Gly 


Asp 


Gly 


Gly 


Ser 


Gly 


Gly 


Glu 


Thr 


Ser 


Glu 


Asp 


Gly 


Asp 


Gly 






515 










520 










525 








Asn 


Gly 


Ser 


Thr 


Ser 


Ser 


Lys 


Gly 


Glu 


Lys 


Arg 


Lys 


Arg 


Glu 


Glu 


Asn 




530 










535 










540 










Glu 


Gly 


Asn 


Glu 


Tyr 


Val 


Val 


Leu 


Leu 


Asn 


Lys 


Ala 


Cys 


Lys 


Asp 


He 


545 










550 










555 










560 



Lys Val Cys 



<210> 22 
<211> 630 
<212> DNA 
<213> SHRIMP 



<400> 22 

atga ttgtat 

agaactgccg 

tatt taccca 

cgtcccaggc 

aggcacgaca 

tatggagaag 

gtgatggttt 

gatgcacata 

gaagt tatcc 

aagaattacg 

tgtaagttta 



tcgtcgaagg 
gaaagggcaa 
tcttcccatg 
tcgtggatgg 
cagaacagtt 
aattcaaagc 
acagggaaga 
actggtttat 
ccagaaatgt 
tgtaccgtgt 
ttgctgagaa 



<210> 23 
<2H> 209 
<212> PRT 
<213> SHRIMP 



atctccccta 
acagtctttc 
gaccattcaa 
aatgtttgga 
tccagaatca 
atgtgtcgcc 
agatgtcaag 
tgacgttctc 
gtacaaaggg 
tggtaaatgc 
gattatttga 



actgggaaaa 
ctcaacttca 
gagcatttgc 
tcatcattga 
aaaattggat 
gagtgggtca 
aaaatggaac 
aaggaggaga 
aatatttgta 
acgaatagta 



cttcatgggt 
tgtacaccaa 
gtgcatccga 
actttttcac 
tgagggag ta 
aatacaagcc 
caattattca 
gagcgttgtt 
gttcatgttt 
t tgtacactg 



ggataacatg 
ttatagggac 
ttaccaggaa 
aggcatgtgg 
tttggagatg 
agttttccat 
agaattgaat 
tgtaaagatt 
ctcgacgagt 
tgatatgaaa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

630 



<400> 23 

Met He Val Phe Val Glu Gly Ser Pro Leu Thr Gly Lys Thr Ser Trp 

15 10 15 

Val Asp Asn Met Arg Thr Ala Gly Lys Gly Lys Gin Ser Phe Leu Asn 

20 25 30 

Phe Met Tyr Thr Asn Tyr Arg Asp Tyr Leu Pro He Phe Pro Trp Thr 
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35 40 45 



lie 


Gin 


Glu 


His 


Leu 


Arg 


Ala 


Ser 


Asp 


Tyr 


Gin 


Glu 


Arg 


Pro 


Arg 


Leu 




50 










55 










60 










Val 


Asp 


Gly 


Met 


Phe 


Gly 


Ser 


Ser 


Leu 


Asn 


Phe 


Phe 


Thr 


Gly 


Met 


Trp 


65 










70 










75 










80 


Arg 


His 


Asp 


Thr 


Glu 


Gin 


Phe 


Pro 


Glu 


Ser 


Lys 


He 


Gly 


Leu 


Arg 


Glu 










85 










90 










95 




Tyr 


Leu 


Glu 


Met 


Tyr 


Gly 


Glu 


Glu 


Phe 


Lys 


Ala 


Cys 


Val 


Ala 


Glu 


Trp 








100 










105 










110 






Val 


Lys 


Tyr 


Lys 


Pro 


Val 


Phe 


His 


Val 


Met 


Val 


Tyr 


Arg 


Glu 


Glu 


Asp 






115 










120 










125 








Val 


Lys 


Lys 


Met 


Glu 


Pro 


He 


He 


Gin 


Glu 


Leu 


Asn 


Asp 


Ala 


His 


Asn 




130 










135 










140 










Trp 


Phe 


He 


Asp 


Val 


Leu 


Lys 


Glu 


Glu 


Arg 


Ala 


Leu 


Phe 


Val 


Lys 


He 


145 










150 










155 










160 


Glu 


Val 


He 


Pro 


Arg 


Asn 


Val 


Tyr 


Lys 


Gly 


Asn 


He 


Cys 


Ser 


Ser 


Cys 










165 










170 










175 




Phe 


Ser 


Thr 


Ser 


Lys 


Asn 


Tyr 


Val 


Tyr 


Arg 


Val 


Gly 


Lys 


Cys 


Thr 


Asn 








180 










185 










190 






Ser 


He 


Val 


His 


Cys 


Asp 


Met 


Lys 


Cys 


Lys 


Phe 


He 


Ala 


Glu 


Lys 


He 



195 200 205 

He 



<210> 24 
<211> 1347 
<212> DNA 
<213> SHRIMP 



<400> 24 

atgggaggtg 

ggagcaaaat 

gcggtaccca 

cctgtttaca 

tttagtttca 

gacgtgagaa 

gtatcgtctt 

ctttgttctg 

tcttatgcaa 

actcttcctg 

tccaatatct 

gagaacgcta 

gcatgcatta 

aacatataca 

gatgccatgg 

agatttgaac 

gatattaaca 

ttacaacctc 

agagaagtag 

gtaaaataca 

aatacagaat 

gatgttgtgg 

aaacttcccg 



aagattcgtt 
cgattcaagt 
ctctagaaga 
gagtgcctct 
a tgtagatcc 
atccttcacg 
acactggaga 
aaatagcatc 
caaactttaa 
acgacaaaat 
ctatcggacc 
ccgcagtcgg 
tcccagaagg 
acatgactaa 
aagaagatga 
atga tttgaa 
gt tcaagttt 
tcgtttcttc 
agtt tcatga 
tgaaaatagt 
tgta taagaa 
gactaga tac 
aagtgattgg 



cgacgacagg 
aaaggaaacg 
aaaaaataga 
cttctcttta 
acgacgattt 
aacggctgcc 
cgggatgttg 
cgatataaca 
tagtgttgaa 
gttccaacaa 
catgagtcat 
tgtcattaga 
ggttttcttt 
ccaaccaaaa 
tggtgaagaa 
aatgcaagcc 
ccacaaatgt 
agaagttgtg 
caggctcccg 
atctctaaag 
aaaacaggct 
gagtgagcat 
tcaataa 



tacgattctg 
gatttggagg 
acagctttaa 
aaggatggaa 
ggaaaagtac 
attttcgtgc 
gtctgcccta 
atttataata 
gataattccc 
gtgtctattt 
gtgaaaatca 
tatggaggat 
gatgatgttg 
gaaattgtac 
gcagattata 
atgtcgtcag 
ttcctaatta 
cttaatgatc 
tctggtgcac 
gaaggcctaa 
ttaaaggtgc 
agttttggag 



atgcgctctg 
tgtaccgaat 
ggtactattc 
gtgatcccca 
cagtgaaggt 
caactggacc 
atcacaattt 
caagtagttc 
ctgtcggtat 
tttccgctac 
agctcggt ta 
tat tctacac 
tgggaaatca 
taaaagagcc 
atttcttggg 
catatagctc 
aacctaaata 
tctcgctcaa 
aagacaactc 
aggttgtaaa 
atgttttaaa 
taatagtgtg 



ggaaaatgaa 
gcaccgcaga 
agat tggtct 
cgaaagagat 
gagacgagt t 
aggacttcat 
tataggagat 
agggcggcta 
tttattcgag 
tgaaccggct 
ctatgatgaa 
ctctgtcggt 
tagctcaatg 
aagaggagag 
ttatgtagta 
agtttctatt 
taactctatt 
tacacgtggg 
atat tcgatt 
ccctataatc 
tatgacgcgt 
tea tgctgca 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1347 



<210> 25 
<211> 444 
<212> PRT 
<213> SHRIMP 



<400> 25 
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Met 


Gly 


Gly 


Glu 


Asp 


Ser 


Phe 


Asp 


Asp 


1 








5 










Trp 


Glu 


Asn 


Glu 


Gly 


Ala 


Lys 


Ser 


He 








20 










25 


Glu 


Val 


Tyr 


Arg 


Met 


His 


Arg 


Arg 


Ala 






35 










40 




Asn 


Arg 


Thr 


Ala 


Leu 


Arg 


Tyr 


Tyr 


Ser 




50 










55 






Val 


Pro 


Leu 


Phe 


Ser 


Leu 


Lys 


Asp 


Gly 


65 










70 








Phe 


Asn 


Val 


Asp 


Pro 


Arg 


Arg 


Phe 


Gly 










85 










Arg 


Val 


Asp 


Val 


Arg 


Asn 


Pro 


Ser 


Arg 








100 










105 


Thr 


Gly 


Pro 


Gly 


Leu 


His 


Val 


Ser 


Ser 






115 










120 




Val 


Cys 


Pro 


Asn 


His 


Asn 


Phe 


He 


Gly 




130 










135 






Ser 


Asp 


He 


Thr 


He 


Tyr 


Asn 


Thr 


Ser 


145 










150 








Ala 


Thr 


Asn 


Phe 


Asn 


Ser 


Val 


Glu 


Asp 










165 










Phe 


Glu 


Thr 


Leu 


Pro 


Asp 


Asp 


Lys 


Met 








180 










185 


Ser 


Ala 


Thr 


Glu 


Pro 


Asn 


He 


Ser 


He 






195 










200 




lie 


Lys 


Leu 


Gly 


Tyr 


Tyr 


Asp 


Glu 


Glu 




210 










215 






He 


Arg 


Tyr 


Gly 


Gly 


Leu 


Phe 


Tyr 


Thr 


225 










230 








Pro 


Glu 


Gly 


Val 


Phe 


Phe 


Asp 


Asp 


Val 










245 










Asn 


He 


Tyr 


Asn 


Met 


Thr 


Asn 


Gin 


Pro 








260 










265 


Pro 


Arg 


Gly 


Glu 


Asp 


Ala 


Met 


Glu 


Glu 






275 










280, 




Tyr 


Asn 


Phe 


Leu 


Gly 


Tyr 


Val 


Val 


Arg 




290 










295 






Gin 


Ala 


Met 


Ser 


Ser 


Ala 


Tyr 


Ser 


Ser 


305 










310 








Ser 


Ser 


Phe 


His 


Lys 


Cys 


Phe 


Leu 


He 










325 










Leu 


Gin 


Pro 


Leu 


Val 


Ser 


Ser 


Glu 


Val 








340 










345 


Asn 


Thr 


Arg 


Gly 


Arg 


Glu 


Val 


Glu 


Phe 






355 










360 




Ala 


Gin 


Asp 


Asn 


Ser 


Tyr 


Ser 


He 


Val 




370 










375 






Leu 


Lys 


Glu 


Gly 


Leu 


Lys 


Val 


Val 


Asn 


385 










390 








Tyr 


Lys 


Lys 


Lys 


Gin 


Ala 


Leu 


Lys 


Val 










405 










Asp 


Val 


Val 


Gly 


Leu 


Asp 


Thr 


Ser 


Glu 








420 










425 


Cys 


His 


Ala 


Ala 


Lys 


Leu 


Pro 


Glu 


Val 






435 










440 





Arg 


Tyr 


Asp 


Ser 


Asp 


Ala 


Leu 


10 










15 




Gin 


Val 


Lys 


Glu 


Thr 


Asp 


Leu 










30 






Val 


Pro 


Thr 


Leu 


Glu 


Glu 


Lys 








45 








Asp 


Trp 


Ser 


Pro 


Val 


Tyr 


Arg 






60 










Ser 


Asp 


Phe 


Arg 


Asp 


Phe 


Ser 




75 










80 


Lys 


Val 


Pro 


Val 


Lys 


Val 


Arg 


90 










95 




Thr 


Ala 


Ala 


He 


Phe 


Val 


Pro 










110 






Tyr 


Thr 


Gly 


Asp 


Gly 


Met 


Leu 








125 








Asp 


Leu 


Cys 


Ser 


Glu 


He 


Ala 






140 










Ser 


Ser 


Gly 


Arg 


Leu 


Ser 


Tyr 




155 










160 


Asn 


Ser 


Pro 


Val 


Gly 


He 


Leu 


170 










175 




Phe 


Gin 


Gin 


Val 


Ser 


He 


Phe 










190 






Gly 


Pro 


Met 


Ser 


His 


Val 


Lys 








205 








Asn 


Ala 


Thr 


Ala 


Val 


Gly 


Val 






220 










Ser 


Val 


Gly 


Ala 


Cys 


He 


lie 




235 










240 


Val 


Gly 


Asn 


His 


Ser 


Ser 


Met 


250 










255 




Lys 


Glu 


He 


Val 


Leu 


Lys 


Glu 










270 






Asp 


Asp 


Gly 


Glu 


Glu 


Ala 


Asp 








285 








Phe 


Glu 


His 


Asp 


Leu 


Lys 


Met 






300 










Val 


Ser 


He 


Asp 


He 


Asn 


Ser 




315 










320 


Lys 


Pro 


Lys 


Tyr 


Asn 


Ser 


He 


330 










335 




Val 


Leu 


Asn 


Asp 


Leu 


Ser 


Leu 










350 






His 


Asp 


Arg 


Leu 


Pro 


Ser 


Gly 








365 








Lys 


Tyr 


Met 


Lys 


He 


Val 


Ser 






380 










Pro 


He 


He 


Asn 


Thr 


Glu 


Leu 




395 










400 


His 


Val 


Leu 


Asn 


Met 


Thr 


Arg 


410 










415 




His 


Ser 


Phe 


Gly 


Val 


He 


Val 










430 






He 


Gly 


Gin 











<210> 26 
<211> 2352 
<212> DNA 
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<213> SHRIMP 



<400> 26 

atggcagcag 

caactcaccc 

aagatctttg 

aaccaaggtg 

aattgcatat 

aaaaagtata 

gccgtgatcc 

gagattattg 

ggaatgatga 

tcaactaaac 

gcagctgaag 

tttcctggtt 

ggagatgatg 

actcgcggtc 

tgtctcagtg 

ggagaatttg 

actaaagggt 

tgtttgagga 

tttgatgaga 

gcaggtgctg 

ttcagaaaat 

aatcttgata 

aagaaaaacg 

aacaaggagg 

agagaaaagc 

gtcgcagcca 

gacatggctg 

cgtgcagtcg 

aggatcaagc 

gaagacgcga 

aaggaggaag 

gaaaagaaca 

atgaggattc 

gaagaagaga 

gatatggtga 

aaggatgaag 

aaagaaattg 

aactattctg 

aacgatgcat 

ttcaagacat 



cagcagtctc 
cggacggaga 
gtgacaaaag 
agagttacca 
ggaatggaga 
gacttctcaa 
ctatctacgg 
acaaggaaac 
acgccaagca 
cgtttgaaaa 
ctatttatga 
tggaatctgc 
acaatatgtt 
gggtaaatag 
ttttaaagtt 
aagaaaaggc 
ctgaacaaga 
atatttgtat 
gtattttgag 
gaaatgaaga 
tgttgtcccc 
tgttgtgccc 
aagatggaaa 
aagaagacgc 
agaacaagga 
tcaacgaaaa 
tcgcagccat 
acatggcggt 
gtgcagtcga 
ggatcaagcg 
aagacgcgag 
agaaggaaga 
aacgtatagt 
aaaggacaaa 
acgaagtaca 
ccaagaatac 
taccttgttt 
ccaacaatac 
ctagattctc 
aa 



aggagagggg 
cgtgataaga 
tgtgatagag 
aatcgcatct 
gtcgggagga 
ctcacttttg 
gtcagtgtgt 
tcttcagacc 
tggcacattg 
tttccaatat 
tggattcaga 
tgatgtgtac 
gttgaacctc 
atccttgtta 
tgttgaaaaa 
cgaaacatgt 
atcttacaag 
agtatcacaa 
aaagtgtaca 
agaaatatgt 
tgtctcaatt 
ctacagtgac 
cagagtgagg 
gaggataaag 
ggaagaagac 
gaacaaggag 
caacgaaaa t 
tgcagccatc 
catggctgtc 
tgcagtcgac 
gatcaagcgt 
agacgcgagg 
cgacatggca 
aagggaacaa 
gaagaaactt 
tagtaatgtt 
aggaaataat 
taagaataat 
caatattgta 



agaatctctg 
tacgactctg 
actattggac 
tctgtgttgg 
atggctctat 
gtccacaaga 
gacagagaag 
atatgtaaga 
ggaggtaatt 
gaagcaatgg 
gaccatggct 
ggaaacaatc 
atctgcaact 
gattttttaa 
cactttaaaa 
gttaattgtc 
ctttcttgtg 
cacctaagat 
cctaacttga 
ttcatgagaa 
cctcat ttct 
cacacgataa 
gtcaaccaca 
cgtgtagccg 
gcgaggatca 
gaagaagacg 
aacaaggagg 
aacgaaaaga 
gcagccatca 
atggcggttg 
gcagtcgaca 
a tcaagcgta 
attgcagctg 
gagttaaggg 
gaagacatgg 
gttagtagca 
aataatgctg 
gtatttggtt 
gaaactccca 



cagatctact 
agcagtacac 
attttctcat 
aaaaattccc 
ggaaggcatt 
taaaaaattg 
aaaggcccat 
gtgatatacg 
ttttacactt 
gagctaatgc 
taaacccatc 
cagtggaaat 
atggtgtatc 
aaatgaacac 
ttgagtcaaa 
ttgatagaaa 
gacactttct 
gtgaaaaatg 
attggtggtt 
ataagaaact 
tcaaaaatag 
taccaaataa 
cagccatcag 
tcaggacatt 
agcgtgcagt 
cgaggatcaa 
aagaagacgc 
acaaggagga 
acgaaaataa 
cagccatcaa 
tggcggttgc 
taattgactt 
ccactaaaaa 
ctgatctgag 
aactagaaaa 
gtagtgttgt 
tcattggtat 
cacctcataa 
aaatgtcttt 



cctgttggaa 
caaacctagg 
ccacaaccac 
cgctctactc 
gtacagggct 
gccttcggtt 
catcatgagt 
ttctcttctt 
ctatgcccgc 
agtgctaatg 
agaatatact 
tgcaatatca 
ttatgaaaaa 
agcttcaaag 
tacaccaaaa 
taatgtgttg 
tcatgtaaaa 
tctaaaaaga 
gactatgccg 
ggttgatgat 
tagacagcga 
agaagatcca 
tgaaaagcag 
tacagccatc 
cgacatggct 
gcgtgcagtc 
gaggatcaag 
agaagacgcg 
caaggaggaa 
cgaaaataac 
agccaccaac 
gactgttgat 
ggacaagaaa 
aagggcaatg 
ggggtgtaat 
tgcctattct 
gactagcacc 
attttccttc 
caatttctcg 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2352 



<210> 27 
<211> 781 
<212> PRT 
<213> SHRIMP 



<400> 27 

Met Ala Ala Ala Ala Val Ser Gly Glu Gly Arg lie Ser Ala Asp Leu 

15 10 15 

Leu Leu Leu Glu Gin Leu Thr Pro Asp Gly Asp Val lie Arg Tyr Asp 

20 25 30 

Ser Glu Gin Tyr Thr Lys Pro Arg Lys He Phe Gly Asp Lys Ser Val 

35 40 45 

He Glu Thr He Gly His Phe Leu He His Asn His Asn Gin Gly Glu 

50 55 60 

Ser Tyr Gin He Ala Ser Ser Val Leu Glu Lys Phe Pro Ala Leu Leu 
65 70 75 80 

Asn Cys He Trp Asn Gly Glu Ser Gly Gly Met Ala Leu Trp Lys Ala 
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85 










90 










95 




Leu 


Tyr 


Arg 


Ala 


Lys 


Lys 


Tyr 


Arg 


Leu 


Leu 


Asn 


Ser 


Leu 


Leu 


Val 


His 








100 










105 










110 






Lys 


lie 


Lys 


Asn 


Trp 


Pro 


Ser 


Val 


Ala 


Val 


He 


Pro 


He 


Tyr 


Gly 


Ser 






115 










120 










125 








Val 


Cys 


Asp 


Arg 


Glu 


Glu 


Arg 


Pro 


He 


He 


Met 


Ser 


Glu 


He 


He 


Asp 




130 










135 










140 










Lys 


Glu 


Thr 


Leu 


Gin 


Thr 


He 


Cys 


Lys 


Ser 


Asp 


He 


Arg 


Ser 


Leu 


Leu 


145 










150 










155 










160 


Gly 


Met 


Met 


Asn 


Ala 


Lys 


His 


Gly 


Thr 


Leu 


Gly 


Gly 


Asn 


Phe 


Leu 


His 










165 










170 










175 




Phe 


Tyr 


Ala 


Arg 


Ser 


Thr 


Lys 


Pro 


Phe 


Glu 


Asn 


Phe 


Gin 


Tyr 


Glu 


Ala 








180 










185 










190 






Met 


Gly 


Ala 


Asn 


Ala 


Val 


Leu 


Met 


Ala 


Ala 


Glu 


Ala 


He 


Tyr 


Asp 


Gly 






195 










200 










205 






Phe 


Arg 


Asp 


His 


Gin 


Pro 


Ser 


Glu 


Tyr 


Thr 


Phe 


Pro 


Gly 


Leu 


Glu 


Ser 




210 










215 










220 










Ala 


Asp 


Val 


Tyr 


Gly 


Asn 


Asn 


Pro 


Val 


Glu 


He 


Ala 


He 


Ser 


Gly 


Asp 


225 










230 










235 










240 


Asp 


Asp 


Asn 


Met 


Leu 


Leu 


Asn 


Leu 


He 


Cys 


Asn 


Tyr 


Gly 


Val 


Ser 


Tyr 










245 










250 










255 


Glu 


Lys 


Thr 


Arg 


Gly 


Arg 


Val 


Asn 


Arg 


Ser 


Leu 


Leu 


Asp 


Phe 


Leu 


Lys 








260 










265 










270 






Met 


Asn 


Thr 


Ala 


Ser 


Lys 


Cys 


Leu 


Ser 


Val 


Leu 


Lys 


Phe 


Val 


Glu 


Lys 






275 










280 










285 






His 


Phe 


Lys 


He 


Glu 


Ser 


Asn 


Thr 


Pro 


Lys 


Gly 


Glu 


Phe 


Glu 


Glu 


Lys 




290 










295 










300 








Ala 


Glu 


Thr 


Cys 


Val 


Asn 


Cys 


Leu 


Asp 


Arg 


Asn 


Asn 


Val 


Leu 


Thr 


Lys 


305 










310 










315 










320 


Gly 


Ser 


Glu 


Gin 


Glu 


Ser 


Tyr 


Lys 


Leu 


Ser 


Cys 


Gly 


His 


Phe 


Leu 


His 










325 










330 










335 




Val 


Lys 


Cys 


Leu 


Arg 


Asn 


He 


Cys 


He 


Val 


Ser 


Gin 


His 


Leu 


Arg 


Cys 








340 










345 










350 






Glu 


Lys 


Cys 


Leu 


Lys 


Arg 


Phe 


Asp 


Glu 


Ser 


He 


Leu 


Arg 


Lys 


Cys 


Thr 






355 










360 










365 








Pro 


Asn 


Leu 


Asn 


Trp 


Trp 


Leu 


Thr 


Met 


Pro 


Ala 


Gly 


Ala 


Gly 


Asn 


Glu 




370 










375 










380 








Glu 


Glu 


lie 


Cys 


Phe 


Met 


Arg 


Asn 


Lys 


Lys 


Leu 


Vai 


Asp 


Asp 


Phe 


Arg 


385 










390 










395 










400 


Lys 


Leu 


Leu 


Ser 


Pro 


Val 


Ser 


He 


Pro 


His 


Phe 


Phe 


Lys 


Asn 


Ser 


Arg 










405 










410 










415 


Gin 


Arg 


Asn 


Leu 


Asp 


Met 


Leu 


Cys 


Pro 


Tyr 


Ser 


Asp 


His 


Thr 


He 


He 








420 










425 










430 






Pro 


Asn 


Lys 


Glu 


Asp 


Pro 


Lys 


Lys 


Asn 


Glu 


Asp 


Gly 


Asn 


Arg 


Val 


Arg 






435 










440 










445 








Val 


Asn 


His 


Thr 


Ala 


He 


Ser 


Glu 


Lys 


Gin 


Asn 


Lys 


Glu 


Glu 


Glu 


Asp 














A R R 










4 60 








Ala 


Arg 


He 


Lys 


Arg 


Val 


Ala 


Val 


Arg 


Thr 


Phe 


Thr 


Ala 


He 


Arg 


Glu 


465 










470 










475 








480 


Lys 


Gin 


Asn 


Lys 


Glu 


Glu 


Glu 


Asp 


Ala 


Arg 


He 


Lys 


Arg 


Ala 


Val 


Asp 










485 










490 










495 




Met 


Ala 


Val 


Ala 


Ala 


He 


Asn 


Glu 


Lys 


Asn 


Lys 


Glu 


Glu 


Glu 


Asp 


Ala 








500 










505 










510 




Arg 


lie 


Lys 


Arg 


Ala 


Val 


Asp 


Met 


Ala 


Val 


Ala 


Ala 


He 


Asn 


Glu 


Asn 






515 










520 










525 








Asn 


Lys 


Glu 


Glu 


Glu 


Asp 


Ala 


Arg 


He 


Lys 


Arg 


Ala 


Val 


Asp 


Met 


Ala 




530 










535 










540 










Val 


Ala 


Ala 


He 


Asn 


Glu 


Lys 


Asn 


Lys 


Glu 


Glu 


Glu 


Asp 


Ala 


Arg 


He 


545 










550 










555 








560 


Lys 


Arg 


Ala 


Val 


Asp 


Met 


Ala 


Val 


Ala 


Ala 


He 


Asn 


Glu 


Asn 


Asn 


Lys 



565 570 575 
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Glu 


Glu 


Glu 


Asp 
580 


Ala 


Arg 


He 


Lys 


Arg 
585 


Ala 


Val 


Asp 


Met 


Ala 
590 


Val 


Ala 


Ala 


He 


Asn 
595 


Glu 


Asn 


Asn 


Lys 


Glu 
600 


Glu 


Glu 


Asp 


Ala 


Arg 
605 


He 


Lys 


Arg 


Ala 


Val 
610 


Asp 


Met 


Ala 


Val 


Ala 
615 


Ala 


Thr 


Asn 


Glu 


Lys 
620 


Asn 


Lys 


Lys 


Glu 


Glu 


Asp 


Ala 


Arg 


He 


Lys 


Arg 


He 


He 


Asp 


Leu 


Thr 


Val 


Asp 


Met 


Arg 


625 










630 










635 










640 


lie 


Gin 


Arg 


He 


Val 
645 


Asp 


Met 


Ala 


He 


Ala 
650 


Ala 


Ala 


Thr 


Lys 


Lys 
655 


Asp 


Lys 


Lys 


Glu 


Glu 
660 


Glu 


Lys 


Arg 


Thr 


Lys 
665 


Arg 


Glu 


Gin 


Glu 


Leu 
670 


Arg 


Ala 


Asp 


Leu 


Arg 

675 


Arg 


Ala 


Met 


Asp 


Met 
680 


Val 


Asn 


Glu 


Val 


Gin 
685 


Lys 


Lys 


Leu 


Glu 


Asp 
690 


Met 


Glu 


Leu 


Glu 


Lys 
695 


Gly 


Cys 


Asn 


Lys 


Asp 
700 


Glu 


Ala 


Lys 


Asn 


Thr 


Ser 


Asn 


Val 


Val 


Ser 


Ser 


Ser 


Ser 


Val 


Val 


Ala 


Tyr 


Ser 


Lys 


Glu 


705 










710 










715 










720 


He 


Val 


Pro 


Cys 


Leu 
725 


Gly 


Asn 


Asn 


Asn 


Asn 
730 


Ala 


Val 


He 


Gly 


Met 
735 


Thr 


Ser 


Thr 


Asn 


Tyr 
740 


Ser 


Ala 


Asn 


Asn 


Thr 
745 


Lys 


Asn 


Asn 


Val 


Phe 
750 


Gly 


Ser 


Pro 


His 


Lys 
755 


Phe 


Ser 


Phe 


Asn 


Asp 
760 


Ala 


Ser 


Arg 


Phe 


Ser 
765 


Asn 


He 


Val 


Glu 


Thr 
770 


Pro 


Lys 


Met 


Ser 


Phe 
775 


Asn 


Phe 


Ser 


Phe 


Lys 
780 


Thr 









<210> 28 
<211> 1506 
<212> DNA 
<213> SHRIMP 



<400> 28 

atggtatata 

agaccaacaa 

gatacgtatg 

tctgctcgta 

aaattccatc 

acaattgaac 

gagggaga ta 

catgatattt 

aatctacgca 

gatgtgtgcc 

ttctcaaaaa 

gqggaatgca 

tattttttct 

atggaaat tg 

agtagaaact 

attaatagag 

atcaaggaaa 

gtgataactg 

aacaggcgca 

gatctaatcg 

ttggggatgt 

gttgtatgtg 

attgcagac t 

gaagaaaatg 

tcttctcctc 

tgttaa 



aaggctttgt 
catcccctct 
aggatatctt 
aaagtgaaat 
ttgcaaaggg 
ctattatttc 
tatcctcaag 
atgaattgga 
acgtttctaa 
acagaaaatt 
atgtgtgtat 
gggatttatt 
accctgaaaa 
cagatgagga 
ggtctggtgt 
gagatagagg 
agagatgtaa 
ggtgttatac 
aatttttgaa 
aattcgcgat 
tccagaaccg 
gtgagcatga 
ataatgactt 
aagatgaaga 
ctgcacataa 



atctccatca 
ttcgttctct 
acttgaacaa 
agaggataaa 
catagaagag 
tccaacgaca 
tcccctatat 
ccttattgtt 
actgatgaag 
tttcaacaat 
tattcctcta 
caacggtatt 
tactactact 
agaacaatcc 
ctgtttgatt 
aggttctttt 
aatcacagat 
agaattgtac 
aatgaatggg 
gagtgtttct 
taaaatgata 
tatacaaaaa 
tgtaagtaag 
tgcagacgct 
aaaatcacgt 



tttatcacta 
gaaggcttct 
ttcaactgtt 
actttaatct 
ctccgtgaga 
ttcaatgaca 
actcagataa 
ggcactgatt 
aaaatatcgt 
aggattatag 
ttttctgcag 
tgtgatgacg 
actactacta 
ccaaaaacta 
tttgaagtat 
gaaaaggctg 
attaatggta 
ttcaaagatg 
aattactttt 
ggtgccgggg 
cctgtaattg 
gaagatgcta 
aacaagagag 
agcagcagca 
cttccggatg 



ttcgtacgat 
cgctttcagg 
tcaagacatc 
ttcaacttaa 
ttctagacga 
gaaacgaat t 
tgaagcatat 
tgctttttgg 
atggtacttt 
ttaatcccat 
ctgaagaatt 
tagagagata 
ctgctccttc 
taaagagaaa 
ttaaaaacac 
tgaagagtgc 
ataaacctcg 
cacttaaaca 
ctctgattga 
agaggatttt 
atcctctcac 
ttctttctgt 
ggaagaaacg 
gcagcagcag 
aaggcgaaaa 



gacgtccaac 
ggataagtat 
ttctccttct 
agaaggagaa 
taa t tctgca 
actaaaccac 
ttcaccagag 
tctaggtgtg 
aaatgtagtt 
ttcttcgtca 
t tcgtctctg 
tatcaactct 
gtcgcccgaa 
tgacaacgca 
gtactacatt 
aatttcttct 
a ttggttatg 
gattggagaa 
tgaacaagca 
tgttaacggt 
atatgaaaat 
aaggagagct 
cagcgcagaa 
tcctcctcct 
atgtacactc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1506 
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<210> 29 
<211> 499 
<212> PRT 
<213> SHRIMP 



<400> 29 



Met 


Val 


Tyr 


Lys 


Gly 


Phe 


Val 


Ser 


Pro 


Ser 


Phe 


He 


Thr 


He 


Arg 


Thr 


1 








5 










10 










15 




Met 


Thr 


Ser 


Asn 


Arg 


Pro 


Thr 


Thr 


Ser 


Pro 


Leu 


Ser 


Phe 


Ser 


Glu 


Gly 








20 










25 










30 




Phe 


Ser 


Leu 


Ser 


Gly 


Asp 


Lys 


Tyr 


Asp 


Thr 


Tyr 


Glu 


Asp 


He 


Leu 


Leu 






35 










40 










45 








Glu 


Gin 


Phe 


Asn 


Cys 


Phe 


Lys 


Thr 


Ser 


Ser 


Pro 


Ser 


Ser 


Ala 


Arg 


Lys 




50 










55 










60 










Ser 


Glu 


He 


Glu 


Asp 


Lys 


Thr 


Leu 


He 


Phe 


Gin 


Leu 


Lys 


Glu 


Gly 


Glu 


65 










70 










75 










80 


Lys 


Phe 


His 


Leu 


Ala 


Lys 


Gly 


He 


Glu 


Glu 


Leu 


Arg 


Glu 


He 


Leu 


Asp 










85 










90 










95 




Asp 


Asn 


Ser 


Ala 


Thr 


He 


Glu 


Pro 


He 


He 


Ser 


Pro 


Thr 


Thr 


Phe 


Asn 








100 










105 










110 






Asp 


Arg 


Asn 


Glu 


Leu 


Leu 


Asn 


His 


Glu 


Gly 


Asp 


He 


Ser 


Ser 


Ser 


Pro 






115 










120 










125 








Leu 


Tyr 


Thr 


Gin 


He 


Met 


Lys 


His 


Pro 


Glu 


His 


Asp 


He 


Tyr 


Glu 


Leu 




130 










135 










140 










Asp 


Leu 


He 


Val 


Gly 


Thr 


Asp 


Leu 


Leu 


Phe 


Gly 


Leu 


Gly 


Val 


Asn 


Leu 


145 










150 










155 










160 


Arg 


Asn 


Val 


Ser 


Lys 


Leu 


Met 


Lys 


Lys 


He 


Ser 


Tyr 


Gly 


Thr 


Leu 


Asn 










165 










170 










175 




Val 


Val 


Asp 


Val 


Cys 


His 


Arg 


Lys 


Phe 


Phe 


Asn 


Asn 


Arg 


He 


He 


Val 








180 










185 










190 






Asn 


Pro 


He 


Ser 


Ser 


Ser 


Phe 


Ser 


Lys 


Asn 


Val 


Cys 


He 


He 


Pro 


Leu 






195 










200 










205 








Phe 


Ser 


Ala 


Ala 


Glu 


Glu 


Phe 


Ser 


Ser 


Leu 


Gly Glu 


Cys 


Arg 


Asp 


Leu 




210 










215 










220 










Phe 


Asn 


Gly 


He 


Cys 


Asp 


Asp 


Val 


Glu 


Arg 


Tyr 


He 


Asn 


Ser 


Tyr 


Phe 


225 










230 










235 










240 


Phe 


Tyr 


Pro 


Glu 


Asn 


Thr 


Thr 


Thr 


Thr 


Thr 


Thr 


Thr 


Ala 


Pro 


Ser 


Ser 










245 










250 










255 




Pro 


Glu 


Met 


Glu 


He 


Ala 


Asp 


Glu 


Glu 


Glu 


Gin 


Ser 


Pro 


Lys 


Thr 


He 








260 










265 










270 






Lys 


Arg 


Asn 


Asp 


Asn 


Ala 


Ser 


Arg 


Asn 


Trp 


Ser 


Gly 


Val 


Cys 


Leu 


He 
















2 80 










285 








Phe 


Glu 


Val 


Phe 


Lys 


Asn 


Thr 


Tyr 


Tyr 


He 


He 


Asn 


Arg 


Gly 


Asp 


Arg 




290 










295 










300 










Gly 


Gly 


Ser 


Phe 


Glu 


Lys 


Ala 


Val 


Lys 


Ser 


Ala 


He 


Ser 


Ser 


He 


Lys 


305 










310 










315 










320 


Glu 


Lys 


Arg 


Cys 


Lys 


He 


Thr 


Asp 


He 


Asn 


Gly 


Asn 


Lys 


Pro 


Arg 


Leu 










325 










330 










335 




Val 


Met 


Val 


He 


Thr 


Gly 


Cys 


Tyr 


Thr 


Glu 


Leu 


Tyr 


Phe 


Lys 


Asp 


Ala 








340 










345 










350 






Leu 


Lys 


Gin 


He 


Gly 


Glu 


Asn 


Arg 


Arg 


Lys 


Phe 


Leu 


Lys 


Met 


Asn 


Gly 






355 










360 










365 








Asn 


Tyr 


Phe 


Ser 


Leu 


He 


Asp 


Glu 


Gin 


Ala 


Asp 


Leu 


He 


Glu 


Phe 


Ala 




370 










375 










380 










Met 


Ser 


Val 


Ser 


Gly 


Ala 


Gly 


Glu 


Arg 


He 


Phe 


Val 


Asn 


Gly 


Leu 


Gly 


385 










390 










395 










400 


Met 


Phe 


Gin 


Asn 


Arg 


Lys 


Met 


He 


Pro 


Val 


He 


Asp 


Pro 


Leu 


Thr 


Tyr 










405 










410 










415 




Glu 


Asn 


Val 


Val 


Cys 


Gly 


Glu 


His 


Asp 


He 


Gin 


Lys 


Glu 


Asp 


Ala 


He 








420 










425 










430 






Leu 


Ser 


Val 


Arg 


Arg 


Ala 


He 


Ala 


Asp 


Tyr 


Asn 


Asp 


Phe 


Val 


Ser 


Lys 



wo 01/38351 



110 



PCT/USOO/28888 



435 440 445 

Asn Lys Arg Gly Lys Lys Arg Ser Ala Glu Glu Glu Asn Glu Asp Glu 

450 455 460 

Asp Ala Asp Ala Ser Ser Ser Ser Ser Ser Ser Pro Pro Pro Ser Ser 
465 470 475 480 

Pro Pro Ala His Lys Lys Ser Arg Leu Pro Asp Glu Gly Glu Lys Cys 
485 490 495 

Thr Leu Cys 



<210> 30 
<211> 846 
<212> DNA 
<213> SHRIMP 



<400> 30 

atggcggtaa 

aaactcgtat 

gcagggtttg 

aaagaaaagg 

cctttgacta 

actgaaaatt 

gtaagcgagg 

aaaactgcaa 

aatcttgcac 

tcggaattgg 

gacagtggca 

tgtttgtgta 

ggtgtcgctg 

ttaataccca 

acataa 



acttggataa 
ccgaggcaat 
acatgagacg 
ccataaaggc 
caatgcagta 
tattaggtga 
aagatattga 
cagtcatagt 
catcattcgg 
ataatgtcgc 
atactaaaaa 
gaatttttgc 
aaaacaacag 
aacaaccagc 



tgttcttgtg 
aaagcggcga 
tcaagttgaa 
at tcgatgag 
tgaagagtgg 
tgttgagcac 
aggtttcgct 
taaggcagat 
cgtcactcaa 
agattctttc 
attgttgtat 
agaaattgaa 
agatgaaa tt 
tggtgcaaaa 



aatatcaaca 
gctaaaactg 
gctgcattat 
ctcatacaag 
gtaaaccgta 
gccgattttt 
gcttctactt 
tgtgaaacgg 
gaaattaaaa 
catatttata 
gggttaagga 
tcagatggga 
gatgaaaacg 
ttgatcatct 



acaaggatga 
tatttgacac 
atgaagcaat 
aaagaggtga 
caataactcc 
tactggaccg 
ttaaggaggt 
gggatatcga 
tatataggtc 
aaatttctgc 
ataaaaaagc 
ttatggccaa 
aagaaggtaa 
acttcttttt 



agatcttaca 
taaaaatcaa 
atccaaaaag 
tgaaattaca 
ctcattgacg 
aatgacaccc 
atcagattca 
tgaagtgtat 
aaacaattcc 
aacagatagc 
aggttatacg 
tacaaatatc 
atatggtttt 
aaattgttgg 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

846 



<210> 31 
<211> 282 
<212> PRT 
<213> SHRIMP 



<220> 

<221> VARIANT 

<222> (1) . . . (282) 

<223> Xaa = Any Amino Acid 



<400> 31 




























Met 


Ala 


Val 


Asn 


Leu 


Asp 


Asn 


Val 


Leu 


Val 


Asn 


He 


Asn 


Asn 


Lys 


Asp 


1 








5 










10 










15 




Glu 


Asp 


Leu 


Thr 


Lys 


Leu 


Val 


Ser 


Glu 


Ala 


He 


Lys 


Arg 


Arg 


Ala 


Lys 








20 










25 










30 






Thr 


Val 


Phe 


Asp 


Thr 


Lys 


Asn 


Gin 


Ala 


Gly 


Phe 


Asp 


Met 


Arg 


Arg 


Gin 






35 










40 










45 








Val 


Glu 


Ala 


Ala 


Leu 


Tyr 


Glu 


Ala 


He 


Ser 


Lys 


Lys 


Lys 


Glu 


Lys 


Ala 




50 










55 










60 










He 


Lys 


Ala 


Phe 


Asp 


Glu 


Leu 


He 


Gin 


Glu 


Arg 


Gly 


Asp 


Glu 


He 


Thr 


65 










70 










75 










80 


Pro 


Leu 


Thr 


Thr 


Met 


Gin 


Tyr 


Glu 


Glu 


Trp 


Val 


Asn 


Arg 


Thr 


He 


Thr 










85 










90 










95 




Pro 


Ser 


Leu 


Thr 


Thr 


Glu 


Asn 


Leu 


Leu 


Gly 


Asp 


Val 


Glu 


His 


Ala 


Asp 








100 










105 










110 






Phe 


Leu 


Leu 


Asp 


Arg 


Met 


Thr 


Pro 


Val 


Ser 


Glu 


Glu 


Asp 


He 


Glu 


Gly 






115 










120 










125 








Phe 


Ala 


Ala 


Ser 


Thr 


Phe 


Lys 


Glu 


Val 


Ser 


Asp 


Ser 


Lys 


Thr 


Ala 


Thr 
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130 135 140 



Val 


He 


Val 


Lys 


Ala 


Asp 


Cys 


Glu 


Thr 


Gly 


Asp 


He 


Asp 


Glu 


Val 


Tyr 


145 










150 










155 










160 


Asn 


Leu 


Ala 


Pro 


Ser 


Phe 


Gly 


Val 


Thr 


Gin 


Glu 


He 


Lys 


He 


Tyr 


Arg 










165 










170 










175 




Ser 


Asn 


Asn 


Ser 


Ser 


Glu 


Leu 


Asp 


Asn 


Val 


Ala 


Asp 


Ser 


Phe 


His 


He 








180 










185 










190 






Tyr 


Lys 


He 


Ser 


Ala 


Thr 


Asp 


Ser 


Asp 


Ser 


Gly 


Asn 


Thr 


Lys 


Lys 


Leu 






195 










200 










205 








Leu 


Tyr 


Gly 


Leu 


Arg 


Asn 


Lys 


Lys 


Ala 


Gly 


Tyr 


Thr 


Cys 


Leu 


Cys 


Arg 




210 










215 










220 










He 


Phe 


Ala 


Glu 


He 


Glu 


Ser 


Asp 


Gly 


He 


Met 


Ala 


Asn 


Thr 


Asn 


He 


225 










230 










235 










240 


Gly 


Val 


Ala 


Glu 


Asn 


Asn 


Arg 


Asp 


Glu 


He 


Asp 


Glu 


Asn 


Glu 


Glu 


Gly 










245 










250 










255 




Lys 


Tyr 


Gly 


Phe 


Leu 


He 


Pro 


Lys 


Gin 


Pro 


Ala 


Gly 


Ala 


Lys 


Leu 


He 








260 










265 










270 






He 


Tyr 


Phe 


Phe 


Leu 


Asn 


Cys 


Trp 


Thr 


Xaa 















275 280 



<210> 32 
<211> 1089 
<212> DNA 
<213> SHRIMP 



<400> 32 

atgactgtat 

gaagatgaaa 

cttaacgctg 

gaagcagcat 

tctgtcgaaa 

ttggcaggcg 

gcagcattaa 

aacaaagata 

tcagatcttg 

aacaaaatat 

gatactggtg 

ttagtcgaag 

gttgataaaa 

ttcgttatta 

actgaccaag 

tttcaattca 

tcagaaagga 

atattggcct 

gacgattaa 



tagctgtata 
atgaggaaga 
tcaaggaaaa 
taaaaactgc 
tagatgtgga 
caacaagcga 
taacagaaga 
ataaagacgg 
tggggttgta 
tccctcatga 
ttgtgactga 
gggatgaacc 
taaaccacgt 
ttccacggtt 
tcaaacctaa 
caatggataa 
acttgaggaa 
catatgggaa 



cactgcacct 
acccgtaaag 
acctgcagag 
cgaagaaaca 
ggaagtactt 
aacgctaaca 
agatatcact 
ggaagcaact 
cttgaacgaa 
gatggttttt 
tcaagccatt 
tgaagtattc 
cttgaaaatt 
ctctcctgta 
aacattctta 
gcaacattct 
tttacccgat 
gcgtctagga 



caaa taaaga 
actttggaag 
tactttgagt 
gcccttcgag 
gaagagaaac 
aacacaatca 
attaaaatgt 
cctgaagaaa 
gtagtagaaa 
gaaagatgtg 
cagataccct 
cctggtgact 
tctgcaaaga 
ggaagtgttt 
tttttagcca 
gtagggtgtg 
tcaaaaccta 
actggtgttt 



aatcgaagaa 
attttgtaaa 
tgctaatatc 
attttgttat 
caagagaata 
tcgcagaggt 
taaaacaatt 
aggaagattt 
aaacaacaaa 
ctattttaat 
ccaacaaata 
gcttggatct 
acggatgtga 
cttccatgat 
acaaaaatga 
agttggacat 
gacctctaag 
tcacaacaga 



gagaaaaatt 
gggtcggctc 
tgcagacact 
tgagaacgac 
tgtcttcaaa 
acaaaaaaag 
cagggctgcg 
taccaataat 
tattgtcatt 
tgaagatttt 
caaaatcaga 
tgcagtttca 
aaacaactgc 
attgggcagc 
cagtacacat 
gttaattttt 
tgatgcagac 
aaatttggta 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1089 



<210> 33 
<211> 356 
<212> PRT 
<213> SHRIMP 



<400> 33 

Met Thr Val Leu Ala Val Tyr Thr Ala Pro Gin He Lys Lys Ser Lys 

15 10 15 

Lys Arg Lys He Glu Asp Glu Asn Glu Glu Glu Pro Val Lys Thr Leu 

20 25 30 

Glu Asp Phe Val Lys Gly Arg Leu Leu Asn Ala Val Lys Glu Lys Pro 

35 40 45 

Ala Glu Tyr Phe Glu Leu Leu He Ser Ala Asp Thr Glu Ala Ala Leu 
50 55 60 
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Lys 


Thr 


Ala 


Glu 


Glu 


Thr 


Ala 


Leu 


Arg 


Asp 


Phe 


Val 


He 


Glu 


Asn 


Asp 


65 










70 










75 










80 


Ser 


Val 


Glu 


He 


Asp 


Val 


Glii 


Glu 


Val 


Leu 


Glu 


Glu 


Lys 


Pro 


Arg 


Glu 










85 










90 










95 




Tyr 


Val 


Phe 


Lys 


Leu 


Ala 


Gly 


Ala 


Thr 


Ser 


Glu 


Thr 


Leu 


Thr 


Asn 


Thr 








100 










105 










110 






lie 


He 


Ala 


Glu 


Val 


Gin 


Lys 


Lys 


Ala 


Ala 


Leu 


He 


Thr 


Glu 


Glu 


Asp 






115 










120 










125 








lie 


Thr 


He 


Lys 


Met 


Leu 


Lys 


Gin 


Phe 


Arg 


Ala 


Ala 


Asn 


Lys 


Asp 


Asn 




130 










135 










140 










Lys 


Asp 


Gly 


Glu 


Ala 


Thr 


Pro 


Glu 


Glu 


Lys 


Glu 


Asp 


Phe 


Thr 


Asn 


Asn 


145 










150 










155 










160 


Ser 


Asp 


Leu 


Val 


Gly 


Leu 


Asn 


Glu 


Val 


Val 


Glu 


Lys 


Thr 


Thr 


Asn 


He 










165 










170 










175 




Val 


He 


Asn 


Lys 


He 


Phe 


Phe 


Met 


Val 


Phe 


Glu 


Arg 


Cys 


Ala 


He 


Leu 








180 










185 










190 






lie 


Glu 


Asp 


Phe 


Asp 


Thr 


Gly 


Val 


Val 


Thr 


Asp 


Gin 


Ala 


He 


Gin 


He 






195 










200 










205 








Pro 


Ser 


Asn 


Lys 


Tyr 


Lys 


He 


Arg 


Leu 


Val 


Glu 


Gly 


Asp 


Glu 


Pro 


Glu 




210 










215 










220 










Val 


Phe 


Pro 


Gly 


Asp 


Cys 


Leu 


Asp 


Leu 


Ala 


Val 


Ser 


Val 


Asp 


Lys 


He 


225 










230 










235 










240 


Asn 


His 


Val 


Leu 


Lys 


He 


Ser 


Ala 


Lys 


Asn 


Gly 


Cys 


Glu 


Asn 


Asn 


Cys 










245 










250 










255 




Phe 


Val 


He 


He 


Pro 


Arg 


Phe 


Ser 


Pro 


Val 


Gly 


Ser 


Val 


Ser 


Ser 


Met 








260 










265 










270 






He 


Leu 


Gly 


Ser 


Thr 


Asp 


Gin 


Val 


Lys 


Pro 


Lys 


Thr 


Phe 


Leu 


Phe 


Leu 






275 










280 










285 








Ala 


Asn 


Lys 


Asn 


Asp 


Ser 


Thr 


His 


Phe 


Gin 


Phe 


Thr 


Met 


Asp 


Lys 


Gin 




290 










295 










300 










His 


Ser 


Val 


Gly 


Cys 


Glu 


Leu 


Asp 


Met 


Leu 


He 


Phe 


Ser 


Asn 


Leu 


Arg 


305 










310 










315 










320 


Asn 


Leu 


Pro 


Asp 


Ser 


Lys 


Pro 


Arg 


Pro 


Leu 


Ser 


Asp 


Ala 


Asp 


He 


Leu 










325 










330 










335 




Ala 


Ser 


Tyr 


Gly 


Lys 


Arg 


Leu 


Gly 


Thr 


Gly 


Val 


Phe 


Thr 


Thr 


Glu 


Asn 








340 










345 










350 






Leu 


Val 


Asp 


Asp 



























<210> 34 
<2H> 930 
<212> DNA 
<213> SHRIMP 



<400> 34 

atgtcttctt 

aaagacat tg 

attatcaata 

gttgatgaac 

tttacaagtg 

ggagccaacg 

aa taaaatga 

aaacgatttt 

atgatgcgtg 

atggacggtt 

aacaaagtag 

agagtagacg 

cttgataatt 

caaagagtca 

tctgatatgt 



cgtcttctga 
taaatgctct 
atcacatctc 
ggtacgatag 
ctcagactat 
ggactggacc 
ggggaaatat 
ccagttctat 
gttctgacgg 
caagaaagaa 
gggatgagtg 
actatgaaag 
tcgaacaagc 
tgttcttacc 
tttctataat 



aactcctaag 
agataataat 
acgaaaaacg 
aaaaatagcc 
gtatgaccaa 
agtcatgggg 
gctcgaatac 
gacaacgctt 
gtattttgca 
agaagaccca 
ggaagagggt 
agatatccat 
attaggagaa 
caagaaatct 
attcccacct 



acttccaccg 
ggcgagtggt 
gcagaaactg 
gacaaaatca 
tatgcaatag 
ccagtgaaca 
gctgaagata 
gaatttgatc 
ttccctttcc 
attaattgta 
aaagaacgtg 
gtttcaattt 
aatgcaaact 
aaactgttca 
gcatctgacc 



atactgggga 
tgtcttccta 
tccaaaaaat 
acgaaa taaa 
acacatttca 
cggttatcga 
tgtgggacgg 
taagt tactc 
gtggaacaaa 
tcatttcagt 
aagtgaat 1 1 
tgtgcatg 1 1 
ctttttattt 
atagacctac 
aagattttgc 



agaaaggatt 
tattgatccg 
caaccaagaa 
atcatccatc 
agaaggaaaa 
tacaacttta 
agatgactgg 
tgatttaact 
aaagataaag 
aacatatcca 
taacctagaa 
acatgcacaa 
taaaaagggg 
tgtagaagat 
agatgatatt 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 
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tattatcgaa taattgtaac atgttcataa 930 

<210> 35 
<211> 307 
<212> PRT 
<213> SHRIMP 



<400> 35 



Met 


Ser 


Ser 


Ser 


Ser 


Ser 


Glu 


Thr 


Pro 


Lys 


Thr 


Ser 


Thr 


Asp 


Thr 


Gly 


1 








5 










10 










15 




Glu 


Glu 


Arg 


He 


Lys 


Asp 


He 


Val 


Asn 


Ala 


Leu 


Asp 


Asn 


Asn 


Gly 


Glu 








20 










25 










30 






Trp 


Leu 


Ser 


Ser 


Tyr 


He 


Asp 


Pro 


He 


He 


Asn 


Asn 


His 


Arg 


Lys 


Thr 






35 










40 










45 








Ala 


Glu 


Thr 


Val 


Gin 


Lys 


He 


Asn 


Gin 


Glu 


Val 


Asp 


Glu 


Arg 


Tyr 


Asp 




50 










55 










60 










Arg 


Lys 


He 


Ala 


Asp 


Lys 


He 


Asn 


Glu 


He 


Lys 


Ser 


Ser 


He 


Phe 


Thr 


65 










70 










75 










80 


Ser 


Ala 


Gin 


Thr 


Met 


Tyr 


Asp 


Gin 


Tyr 


Ala 


He 


Asp 


Thr 


Phe 


Gin 


Glu 










85 










90 










95 




Gly 


Lys 


Gly 


Ala 


Asn 


Gly 


Thr 


Gly 


Pro 


Val 


Met 


Gly 


Pro 


Val 


Asn 


Thr 








100 










105 










110 






Val 


He 


Asp 


Thr 


Thr 


Leu 


Asn 


Lys 


Met 


Arg 


Gly 


Asn 


Met 


Leu 


Glu 


Tyr 






115 










120 










125 








Ala 


Glu 


Asp 


Met 


Trp 


Asp 


Gly 


Asp 


Asp 


Trp 


Lys 


Arg 


Phe 


Ser 


Ser 


Ser 




130 










135 










140 










Met 


Thr 


Thr 


Leu 


Glu 


Phe 


Asp 


Leu 


Ser 


Tyr 


Ser 


Asp 


Leu 


Thr 


Met 


Met 


145 










150 










155 










160 


Arg 


Gly 


Ser 


Asp 


Gly 


Tyr 


Phe 


Ala 


Phe 


Pro 


Phe 


Arg 


Gly 


Thr 


Lys 


Lys 










165 










170 










175 




lie 


Lys 


Met 


Asp 


Gly 


Ser 


Arg 


Lys 


Lys 


Glu 


Asp 


Pro 


He 


Asn 


Cys 


He 








180 










185 










190 






lie 


Ser 


Val 


Thr 


Tyr 


Pro 


Asn 


Lys 


Val 


Gly 


Asp 


Glu 


Trp 


Glu 


Glu 


Gly 






195 










200 










205 








Lys 


Glu 


Arg 


Glu 


Val 


Asn 


Phe 


Asn 


Leu 


Glu 


Arg 


Val 


Asp 


Asp 


Tyr 


Glu 




210 










215 










220 










Arg 


Asp 


He 


His 


Val 


Ser 


He 


Leu 


Cys 


Met 


Leu 


His 


Ala 


Gin 


Leu 


Asp 


225 










230 










235 










240 


Asn 


Phe 


Glu 


Gin 


Ala 


Leu 


Gly 


Glu 


Asn 


Ala 


Asn 


Ser 


Phe 


Tyr 


Phe 


Lys 










245 










250 










255 




Lys 


Gly 


Gin 


Arg 


Val 


Met 


Phe 


Leu 


Pro 


Lys 


Lys 


Ser 


Lys 


Leu 


Phe 


Asn 








260 










265 










270 






Arg 


Pro 


Thr 


Val 


Glu 


Asp 


Ser 


Asp 


Met 


Phe 


Ser 


He 


He 


Phe 


Pro 


Pro 






275 










280 










285 








Ala 


Ser 


Asp 


Gin 


Asp 


Phe 


Ala 


Asp 


Asp 


He 


Tyr 


Tyr 


Arg 


He 


He 


Val 




290 










295 










300 











Thr Cys Ser 
305 



<210> 36 
<211> 3657 
<212> DNA 
<213> SHRIMP 

<400> 36 

atggagacaa ctatggataa 
gttgctactg ttacaacagc 
cttaaggccg aatgtcctca 
aattttggaa aatccatgtc 
aaacacttta aggacccaaa 



tgtcgttcag aataacgacg 
aactgagaaa cgtcagtcat 
agtactgaga gcactaaaat 
ggctattttt gctcaacatt 
gacaaagaag attttagaac 



taacaaaacc aacaccagat 60 
gcaaagagaa aaaggat caa 120 
tgtccaatac tttaaaggca 180 
tagtggacat gacaaacgct 24 0 
tggatggaag tagtagcagt 300 
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gacagtgaag 
agaagtgctt 
gctcaaggtg 
cttagagacg 
caggacatgg 
tcagaaatag 
aggtcggctg 
aacagtaata 
catcatgtca 
agttctgtaa 
agtgatgaat 
tcgtccgata 
tcgtctttac 
gatacggcca 
aagcccgtta 
cgaaatgttt 
acgaatgcga 
actcgccata 
agtaaaaaga 
gattcagaaa 
tcatccgaag 
cgcaaaacaa 
ctaaaaacag 
atagacaagg 
tctacgccaa 
gaacaagtaa 
gctttagata 
gctgaattgg 
cgccatgaag 
aaaactgttg 
aatgaagaaa 
caacatgaac 
aaatcaatag 
aagcaaaact 
atatctatcg 
gggaataata 
gcccaagcta 
caaaactcag 
tttcccggaa 
gcacgaggat 
gatgaaggag 
gttaattttg 
aatatgaacc 
atgtttgaaa 
gatgtatcaa 
aaaaatagag 
cattcccagc 
atggaagcag 
tctacaaatg 
tgtacgacca 
atgagtgtat 
ctgatgttgc 
ggaggaggag 
agagattgga 
agtaatgccg 
caagacgaca 



aagaggaaga 
cttcaaagaa 
tattccgcca 
aagtaaagga 
aaaaggtggt 
ccaaaggttt 
ttctagactc 
tttttgacac 
aatatttatt 
agtatgtaaa 
ttaagtccct 
ataatgttgc 
aaaaagtacg 
gagatattgt 
cattattgca 
tcaccgacgg 
acctattatc 
atcttctgag 
aggagaaaaa 
cagattcttc 
acgaagaaga 
aaaagaaacc 
gaggttactt 
atgattttgc 
tgaatcaaag 
aatatagtgt 
gtgctaaatt 
caaggctagc 
cattcattct 
tagagtcatt 
tgatgtcggt 
aacaaccgga 
tgaactca tt 
actcgcctat 
gatctaatat 
ctgtaactgg 
ataatataat 
cagccatgtc 
acgatccttc 
tttattccat 
gagttaaagc 
ccgctcagtc 
cttctaatgt 
aaaatggcag 
atcgatactt 
cagaaagagc 
tagcaaacgc 
aagcagcgga 
ctattaatat 
tcgccgtgag 
taaacactgc 
agcaacaaca 
tattaggatc 
tattaaacat 
gccggacatt 
agagtatgat 



aactagttct 
agaaaaatgc 
gtttgcagat 
tgaacaaact 
ggaagaagtt 
ggcccgtttc 
gtccatatcc 
actaaacaca 
gcaaaatctt 
gaaatcatat 
tctcactgga 
tactccttac 
tgaaactaaa 
actacttcca 
gttggtgtcc 
cttttttaat 
taacgacttt 
tatgaaaatg 
gaaggataaa 
ctcatcatca 
agaaaaagga 
atcaaaggac 
ccacgacacg 
gggcgtagcc 
a ttagtatct 
tgatactgaa 
gacatctcaa 
tgccttcttt 
aacatcaaaa 
ctttaaaaat 
gatgccgttc 
t t tgaaaaga 
cccttccaat 
gggaagacgc 
ctcgcccaat 
tctaaggcta 
caaaaatgcc 
catactactc 
a tctataaaa 
cgcagaagga 
gtacacttta 
tctggaaaaa 
attagaagga 
tgactgtgaa 
gaaaaaattc 
actcgtcgaa 
catgggggta 
atctgaaatg 
aacaaataca 
tgcagccgcc 
aaataacagt 
accaacacat 
aggaactgaa 
gatatctcca 
aggata tgga 
ggataaactt 



tcatccaaac 
cccaatacta 
atcatcatta 
gagctaaaga 
ttagggcgcc 
gatacccacg 
aaagagttgg 
ctcaaggaag 
acaccggagg 
caatactaca 
gtcaacatta 
aaacacatca 
cctgtttcaa 
gagacgggtg 
tacatcaacg 
gcagcgtgcg 
cctaaaccca 
cttcaagaag 
aagaagggcg 
tcatcttctt 
gaagcagtag 
gacgatttag 
agtgaactcg 
caatatgcag 
agtcttttag 
agtacttcct 
caagtggtta 
ttcgtggtgg 
cttttaccct 
ttaacaatta 
gaagacgaac 
gtggtgggag 
aagagtgtac 
ataaatttgg 
atactatttt 
accaacctat 
aacacgctcg 
tttcctccaa 
ttacaggata 
tgtatcgggg 
ctagtggact 
tcaatgtctg 
ggatcttttg 
cccacccctc 
aacaatgata 
caggttacaa 
gctgttatag 
agagcagcaa 
attggaatga 
gacatgtcaa 
catagcagca 
tcagcatttc 
cagacaaagg 
gaagacaaaa 
tctaatataa 
tctgagatgt 



ggaaaagagg 
tcaaaaattg 
atcttccctc 
ccatatatga 
aagacctgtt 
tctcgttgct 
aaaaaaatag 
aaatcaaaga 
atgcaaattt 
tacaaacatc 
aaatattgga 
ctaatcccag 
aggattatcc 
gcatttctga 
ctctcttttc 
tcctaatttc 
ttgaactggc 
gttcatccag 
gtggtggtgg 
cttcttcatc 
aaaaaggcaa 
atacaattag 
gcaataaaat 
taacaatcac 
atttgataat 
ccaccgccaa 
caatgatggt 
tggataatac 
cgaatgaaaa 
gtaacaaagt 
aacaacaaca 
aagtatttct 
aattaacggc 
cggccaagat 
ctaacctccc 
tgaaaaacat 
taaacaatac 
catcaaaaga 
tgactacaat 
tcgtacgctc 
caaatactat 
aagccttaac 
tagacggtgc 
tagcaaaata 
aaaatacaca 
ataagcctac 
gcgcagcatc 
attatcaagc 
tacgtaatac 
agctcgccaa 
gacgaggaga 
tggaacagac 
atcatgtgga 
atactactac 
ctggcataaa 
ctagcttcag 



tagtggtgct 
gctcaatgat 
ttttgatgat 
cttgtataga 
tgatcacaag 
cccttcggat 
caagggcccg 
acttttatgt 
tgtgttcaat 
agagatggaa 
gaagataatc 
gaacattatt 
gttcagagtg 
tctccctatc 
cctcgagcgc 
ccagtgccta 
agctaatgtt 
tgaaaagaag 
tgacgattct 
ctcctcctcc 
gaaaactaaa 
taaactgatt 
tagaaattta 
tgagatgcaa 
gagactaaaa 
atctaataat 
ggattctgga 
tgtagttaat 
tagaggactt 
ctctacctcg 
acaatgccct 
agaaatggga 
tgacgcgttc 
aaaaacggct 
agaatctgta 
atcccaaagt 
gatggaccaa 
aacttctatc 
gtcaaatctg 
gagggaattt 
ggacatggca 
aaataatgcg 
tctttcttac 
tacaatgaaa 
agatttatat 
aagtgttgtg 
aatcaagctc 
aacatcaaaa 
tactcacctg 
caaccatttt 
cagatcgtcc 
aagaggaaga 
acgtatgaag 
tactacaccc 
cacgataaaa 
aacttaa 



360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3657 



<210> 37 
<211> 1210 
<212> PRT 
<213> SHRIMP 
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<400> 37 



Met 


Thr 


Met 


Asp 


Asn 


Val 


Val 


Gin 


Asn 


Asn 


Asp 


Val 


Thr 


Lys 


Pro 


Thr 


1 








5 










10 










15 




Pro 


Asp 


Val 


Ala 


Thr 


Val 


Thr 


Thr 


Ala 


Thr 


Glu 


Lys 


Arg 


Gin 


Ser 


Cys 








20 










25 










30 






Lys 


Glu 


Lys 


Lys 


Asp 


Gin 


Leu 


Lys 


Ala 


Glu 


Cys 


Pro 


Gin 


Val 


Leu 


Arg 






35 










40 










45 








Ala 


Leu 


Lys 


Leu 


Ser 


Asn 


Thr 


Leu 


Lys 


Ala 


Asn 


Phe 


Gly 


Lys 


Ser 


Met 




50 










55 










60 










Ser 


Ala 


He 


Phe 


Ala 


Gin 


His 


Leu 


Val 


Asp 


Met 


Thr 


Asn 


Ala 


Lys 


His 


65 










70 










75 










80 


Phe 


Lys 


Asp 


Pro 


Lys 


Thr 


Lys 


Lys 


He 


Leu 


Asp 


Gly 


Ser 


Ser 


Ser 


Ser 










85 










90 










95 




Asp 


Ser 


Glu 


Glu 


Glu 


Glu 


Glu 


Thr 


Ser 


Ser 


Ser 


Ser 


Lys 


Arg 


Lys 


Arg 








100 










105 










110 






Gly 


Ser 


Gly Ala 


Arg 


Ser 


Ala 


Ser 


Ser 


Lys 


Lys 


Glu 


Lys 


Cys 


Pro 


Asn 






115 










120 










125 








Thr 


He 


Lys 


Asn 


Trp 


Leu 


Asn 


Asp 


Ala 


Gin 


Gly 


Val 


Phe 


Arg 


Gin 


Phe 




130 










135 










140 










Ala 


Asp 


He 


He 


He 


Asn 


Leu 


Pro 


Ser 


Phe 


Asp 


Asp 


Leu 


Arg 


Asp 


Glu 


145 










150 










155 










160 


Val 


Lys 


Asp 


Glu 


Gin 


Thr 


Glu 


Leu 


Lys 


Thr 


He 


Tyr 


Asp 


Leu 


Tyr 


Arg 










165 










170 










175 




Gin 


Asp 


Met 


Glu 


Lys 


Val 


Val 


Glu 


Glu 


Val 


Leu 


Gly 


Arg 


Gin 


Asp 


Leu 








180 










185 










190 






Phe 


Asp 


His 


Lys 


Ser 


Glu 


He 


Ala 


Lys 


Gly 


Leu 


Ala 


Arg 


Phe 


Asp 


Thr 






195 










200 










205 








His 


Val 


Ser 


Leu 


Leu 


Pro 


Ser 


Asp 


Arg 


Ser 


Ala 


Val 


Leu 


Asp 


Ser 


Ser 




210 










215 










220 










lie 


Ser 


Lys 


Glu 


Leu 


Glu 


Lys 


Asn 


Ser 


Lys 


Gly 


Pro 


Asn 


Ser 


Asn 


He 


225 










230 










235 










240 


Phe 


Asp 


Thr 


Leu 


Asn 


Thr 


Leu 


Lys 


Glu 


Glu 


He 


Lys 


Glu 


Leu 


Leu 


Cys 










245 










250 










255 




His 


His 


Val 


Lys 


Tyr 


Leu 


Leu 


Gin 


Asn 


Leu 


Thr 


Pro 


Glu 


Asp 


Ala 


Asn 








260 










265 










270 






Phe 


Val 


Phe 


Asn 


Ser 


Ser 


Val 


Lys 


Tyr 


Val 


Lys 


Lys 


Ser 


Tyr 


Gin 


Tyr 






275 










280 










285 








Tyr 


He 


Gin 


Thr 


Ser 


Glu 


Met 


Glu 


Ser 


Asp 


Glu 


Phe 


Lys 


Ser 


Leu 


Leu 




290 










295 










300 










Thr 


Gly 


Val 


Asn 


He 


Lys 


He 


Lys 


He 


He 


Ser 


Ser 


Asp 


Asn 


Asn 


Val 


305 










310 










315 










320 


Ala 


Thr 


Pro 


Tyr 


Lys 


His 


He 


Thr 


Asn 


Pro 


Arg 


Asn 


He 


He 


Ser 


Ser 










325 










330 










335 




Leu 


Gin 


Lys 


Val 


Arg 


Glu 


Thr 


Lys 


Pro 


Val 


Ser 


Lys 


Asp 


Tyr 


Pro 


Phe 








340 










345 










350 






Arg 


Val 


Asp 


Thr 


Ala 


Arg 


Asp 


He 


Val 


Leu 


Leu 


Pro 


Glu 


Thr 


Gly 


Gly 






355 










360 










365 








He 


Ser 


Asp 


Leu 


Pro 


He 


Lys 


Pro 


Val 


Thr 


Leu 


Leu 


Gin 


Leu 


Val 


Ser 




370 










375 










380 










Tyr 


He 


Asn 


Ala 


Leu 


Phe 


Ser 


Leu 


Glu 


Arg 


Arg 


Asn 


Val 


Phe 


Thr 


Asp 


385 










390 










395 










400 


Gly 


Phe 


Phe 


Asn 


Ala 


Ala 


Cys 


Val 


Leu 


He 


Ser 


Gin 


Cys 


Leu 


Thr 


Asn 










405 










410 










415 




Ala 


Asn 


Leu 


Leu 


Ser 


Asn 


Asp 


Phe 


Pro 


Lys 


Pro 


He 


Glu 


Leu 


Ala 


Ala 








420 










425 










430 






Asn 


Val 


Thr 


Arg 


His 


Asn 


Leu 


Leu 


Ser 


Met 


Lys 


Met 


Leu 


Gin 


Glu 


Gly 






435 










440 










445 






Ser 


Ser 


Ser 


Glu 


Lys ^ 


Lys 


Ser 


Lys 


Lys 


Lys 


Glu 


Lys 


Lys 


Lys 


Asp 


Lys 




450 










455 










460 










Lys 


Lys 


Gly 


Gly 


Gly 


Gly 


Gly 


Asp 


Asp 


Ser 


Asp 


Ser 


Glu 


Thr 


Asp 


Ser 
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470 










475 










480 


Ser 


Ser 


Ser 


Ser 


Ser 


Ser 


Ser 


Ser 


Ser 


Ser 


Ser 


Ser 


Ser 


Ser 


Ser 


Ser 










485 










490 










4 95 




Glu 


Asp 


Glu 


Glu 


Glu 


Glu 


Lys 


Gly 


Glu 


Ala 


Val 


Glu 


Lys 


Gly 


Lys 


Lys 








500 










505 










510 






•■p „ 

rnr 


Lys 


Arg 


Lys 


Thr 


Lys 


Lys 


Lys 


Pro 


Ser 


Lys 


Asp 


Asp 


Asp 


Leu 


Asp 






515 










520 










525 








Thr 


lie 


Ser 


Lys 


Leu 


lie 


Leu 


Lys 


Thr 


Gly 


Gly 


Tyr 


Phe 


His 


Asp 


Thr 




530 










535 










540 










Ser 


Glu 


Leu 


Gly 


Asn 


Lys 


He 


Arg 


Asn 


Leu 


He 


Asp 


Lys 


Asp 


Asp 


Phe 


S'l 5 










550 










555 








560 


Ala 


Gly 


Val 


Ala 


Gin 


Tyr 


Ala 


Val 


Thr 


He 


Thr 


Glu 


Met 


Gin 


Ser 


Thr 










565 










570 










575 




Pro 


Met 


Asn 


Gin 


Arg 


Leu 


Val 


Ser 


Ser 


Leu 


Leu 


Asp 


Leu 


He 


Met 


Arg 








580 










585 










590 




Leu 


Lys 


Glu 


Gin 


Val 


Lys 


Tyr 


Ser 


Val 


Asp 


Thr 


Glu 


Ser 


Thr 


Ser 


Ser 






595 










600 










605 








Thr 


Ala 


Lys 


Ser 


Asn 


Asn 


Ala 


Leu 


Asp 


Ser 


Ala 


Lys 


Leu 


Thr 


Ser 


Gin 




bl U 










615 










620 










Cjln 


Val 


Val 


Thr 


Met 


Met 


Val 


Asp 


Ser 


Gly 


Ala 


Glu 


Leu 


Ala 


Arg 


Leu 


O c: 










630 










635 








640 


Ala 


Ala 


Phe 


Phe 


Phe 


Val 


Val 


Val 


Asp 


Asn 


Thr 


Val 


Val 


Asn 


Arg 


His 










645 










650 










655 




Glu 


Ala 


Phe 


lie 


Leu 


Thr 


Ser 


Lys 


Leu 


Leu 


Pro 


Ser 


Asn 


Glu 


Asn 


Arg 








660 










665 










670 




Gly 


Leu 


Lys 


Thr 


Val 


Val 


Glu 


Ser 


Phe 


Phe 


Lys 


Asn 


Leu 


Thr 


He 


Ser 






675 










680 










685 








Asn 


Lys 


Val 


Ser 


Thr 


Ser 


Asn 


Glu 


Glu 


Met 


Met 


Ser 


Val 


Met 


Pro 


Phe 




690 










695 










700 










Glu 


Asp 


Glu 


Gin 


Gin 


Gin 


Gin 


Gin 


Cys 


Pro 


Gin 


His 


Glu 


Gin 


Gin 


Pro 


705 










710 










715 










720 


Asp 


Leu 


Lys 


Arg 


Val 


Val 


Gly 


Glu 


Val 


Phe 


Leu 


Glu 


Met 


Gly 


Lys 


Ser 










725 










730 










735 




lie 


Val 


Asn 


Ser 


Phe 


Pro 


Ser 


Asn 


Lys 


Ser 


Val 


Gin 


Leu 


Thr 


Ala 


Asp 








740 










745 










750 




Ala 


Phe 


Lys 


Gin 


Asn 


Tyr 


Ser 


Pro 


Met 


Gly 


Arg 


Arg 


He 


Asn 


Leu 


Ala 






755 










760 










765 








Ala 


Lys 


I le 


Lys 


Thr 


Ala 


He 


Ser 


He 


Gly 


Ser 


Asn 


He 


Ser 


Pro 


Asn 














775 










780 










i xe 


Leu 


Pne 


Ser 


Asn 


Leu 


Pro 


Glu 


Ser 


Val 


Gly 


Asn 


Asn 


Thr 


Val 


Thr 


T O C 










790 










795 










800 


Gly 


Leu 


Arg 


Leu 


Thr 


Asn 


Leu 


Leu 


Lys 


Asn 


He 


Ser 


Gin 


Ser 


Ala 


Gin 










805 










810 










815 




Ala 


Asn 


Asn 


lie 


lie 


Lys 


Asn 


Ala 


Asn 


Thr 


Leu 


Val 


Asn 


Asn 


Thr 


Met 








82 0 










825 










830 






Asp 


Gin 


Gin 


Asn 


Ser 


Ala 


Ala 


Met 


Ser 


He 


Leu 


Leu 


Phe 


Pro 


Pro 


Thr 






835 










840 










845 








Ser 


Lys 


Glu 


Thr 


Ser 


He 


Phe 


Pro 


Gly 


Asn 


Asp 


Pro 


Ser 


Ser 


He 


Lys 




850 










855 










860 








Leu 


Gin 


Asp 


Met 


Thr 


Thr 


Met 


Ser 


Asn 


Leu 


Arg 


Phe 


Tyr 


Ser 


He 


Ala 


865 










870 










875 








880 


Glu 


Gly 


Cys 


lie 


Gly 


Val 


Val 


Arg 


Ser 


Arg 


Glu 


Phe 


Asp 


Glu 


Gly 


Gly 










885 










890 










895 




Val 


Lys 


Ala 


Tyr 


Thr 


Leu 


Leu 


Val 


Asp 


Ser 


Asn 


Thr 


Met 


Asp 


Met 


Ala 








900 










905 










910 






Val 


Asn 


Phe 


Ala 


Ala 


Gin 


Ser 


Leu 


Glu 


Lys 


Ser 


Met 


Ser 


Glu 


Ala 


Leu 






915 










920 








925 








Thr 


Asn 


Asn 


Ala 


Asn 


Met 


Asn 


Pro 


Ser 


Asn 


Val 


Leu 


Glu 


Gly 


Gly 


Ser 




930 










935 










940 






Phe 


Val 


Asp 


Gly Ala 


Leu 


Ser 


Tyr 


Met 


Phe 


Glu 


Lys 


Asn 


Gly 


Ser 


Asp 
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Cys 


Glu 


Pro 


Thr 


Pro 
965 


Leu 


Ala 


Lys 


Tyr 


Thr 
970 


Met 


Lys 


Asp 


Val 


Ser 
975 


Asn 


Arg 


Tyr 


Leu 


Lys 
980 


Lys 


Phe 


Asn 


Asn 


Asp 
985 


Lys 


Asn 


Thr 


Gin 


Asp 
990 


Leu 


Tyr 


Lys 


Asn 


Arg 


Ala 


Glu 


Arg 


Ala 


Leu 


Val 


Glu 


Gin 


Val 


Thr 


Asn 


Lys 


Pro 






995 










1000 








1005 






Thr 


Ser 


Val 


Val 


His 


Ser 


Gin 


Leu 


Ala 


Asn 


Ala 


Met 


Gly 

) 


Val 


Ala 


Val 




1010 








1015 








102C 








lie 


Gly 


Ala 


Ala 


Ser 


He 


Lys 


Leu 


Met 


Glu 


Ala 


Glu 


Ala 


Ala 


Glu 


Ser 


1025 








1030 








1035 








1040 


Glu 


Met 


Arg 


Ala 


Ala 


Asn 


Tyr 


Gin 


Ala 


Thr 


Ser 


Lys 


Ser 


Thr 


Asn 


Ala 










1045 








1050 








1055 


lie 


Asn 


He 


Thr 


Asn 


Thr 


He 


Gly 


Met 


He 


Arg 


Asn 


Thr 


Thr 


His 


Leu 








1060 








1065 








1070 




Cys 


Thr 


Thr 


He 


Ala 


Val 


Ser 


Ala 


Ala 


Ala 


Asp 


Met 


Ser 


Lys 


Leu 


Ala 






1075 








1080 








1085 






Asn 


Asn 


His 


Phe 


Met 


Ser 


Val 


Leu 


Asn 


Thr 


Ala 


Asn 


Asn 


Ser 


His 


Ser 




1090 








1095 








1100 








Ser 


Arg 


Arg 


Gly 


Asp 


Arg 


Ser 


Ser 


Leu 


Met 


Leu 


Gin 


Gin 


Gin 


Gin 


Pro 


1105 








1110 








1115 








1120 


Thr 


His 


Ser 


Ala 


Phe 


Leu 


Glu 


Gin 


Thr 


Arg 


Gly Arg 


Gly 


Gly 


Gly 


Val 










1125 








1130 








1135 


Leu 


Gly 


Ser 


Gly 


Thr 


Glu 


Gin 


Thr 


Lys 


Asp 


His 


Val 


Glu 


Arg 


Met 


Lys 








1140 








1145 








1150 




Arg 


Asp 


Trp 


He 


Leu 


Asn 


Met 


He 


Ser 


Pro 


Glu 


Asp 


Lys 


Asn 


Thr 


Thr 






1155 








1160 








1165 






Thr 


Thr 


Thr 


Pro 


Ser 


Asn 


Ala 


Gly Arg 


Thr 


Leu 


Gly 


Tyr 


Gly 


Ser 


Asn 




1170 








1175 








1180 








He 


Thr 


Gly 


He 


Asn 


Thr 


He 


Lys 


Gin 


Asp 


Asp 


Lys 


Ser 


Met 


Met 


Asp 


1185 








1190 








1195 








1200 


Lys 


Leu 


Ser 


Glu 


Met 


Ser 


Ser 


Phe 


Arg 


Thr 















1205 1210 



<210> 38 
<211> 2376 
<212> DNA 
<213> SHRIMP 



<400> 38 

atgtctcttg 

gttgtggagg 

tcttcttctt 

cccccaaaga 

ccagcatctc 

cctctaatcg 

aaaacgctac 

tcaacct t ta 

ggaaataaca 

acacaagctg 

ctccctgaag 

aaaaatat tg 

ttgttccaca 

gttaagaaag 

atcggggatt 

gttaagaggc 

actacttcca 

tcca tttcta 

caagaagaat 

catcctactt 

tactgeatct 



ttgagaataa 
gagcagaagt 
cagct tcgga 
agaaacgtaa 
cagaaatgct 
ataatagttt 
tcactgaaac 
a tcaactcga 
actacagata 
ttttggacgc 
gggagggaat 
tccaggctgc 
aga tggagaa 
taaggaggaa 
ctcttcaaga 
cattctccaa 
ctggtgttgt 
agggatgttt 
tcgtttctga 
tcaagaagct 
gcccctacta 



tacccaagaa 
tgcccctaga 
tagtgaagat 
tatcaatagc 
cagtgctgta 
tggcgttcag 
tattgccgtg 
aaggaccgga 
ctggcaaatt 
aatcgttgaa 
cgggcttcaa 
cgatattgaa 
ggccgatgaa 
caagtcccag 
gcgtattgat 
ttctgctgct 
taatccacga 
cactgtgagg 
gttgtacact 
tattca tgac 
caccacagga 



gaaatgattc 
ggtgtcaaga 
gatgaaggag 
ggaaagtact 
aatgatattg 
tttaaaaaga 
gaatatggaa 
gagcctctta 
cgaattgaag 
ggaaatgaca 
tttaacaaga 
tttggacaag 
tcttctaatt 
cctacaaatt 
aacgcaatca 
gctgctgaag 
gggattaagg 
aatattgttg 
aaccttctaa 
cgtacaatga 
ggcgtccccc 



tggaaactac 
gaccccttcc 
gagagcaacc 
ggaaaatcga 
ataacgtgtc 
gtgtatctga 
caatcacaaa 
agaagaagcg 
ccgctgcagc 
ccgtgatcaa 
gtgttagctc 
ttgcgcacat 
cttctggtga 
cttactacac 
aagttatcga 
aagacaccac 
atattcactt 
cagcaaatgg 
aggttgaaga 
atcgacacat 
ctgcagctga 



tgttgaaggt 
ttcttcctca 
acaaacgaaa 
aacaatcgaa 
caaaactatt 
agaacaga tc 
tgttaaatat 
ctcaaacaat 
agagaatgtt 
ggcaatcctc 
tcaacaagct 
gaagtgtaat 
atcaccaaag 
tt tcactatg 
aatgtctcca 
tactacgacc 
tttcgattca 
tgaagttcca 
gaagg tggat 
taaggcatgg 
taaggtttcc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 
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gctaagggaa 
ggggcacata 
tctatgctct 
gaaggtttgg 
gatagcaggg 
gctgatgttt 
gatactatgg 
attaagaagg 
actgcaatca 
cgagcacaca 
tcaacaacag 
tttgcaaagc 
aaggaaatgg 
aacgaagaga 
gtaaagatca 
ttccctgctg 
tttgatgttg 
tctgaaaata 
ggagagaagc 



ttgctaccta 
ca tctactac 
tccagagtcc 
agcctatttc 
caacacgctg 
tgtctattgt 
aatatggtgt 
gaggagatac 
cttcggagga 
agaaacgaaa 
caaatactcc 
ttactttaga 
tgcagagaat 
agatggaaat 
agacgaatgc 
ataaggtgtg 
taagggagga 
aggcaagcgg 
tgttgcacga 



cagaatatac 
cccagcacag 
tggaactgat 
atcgggcgaa 
cttcaaattt 
tcatactgat 
tgcaaagagc 
tttccatagc 
acttaatagg 
gagcaactct 
aaagaagacg 
ttatgttgat 
tttggctcag 
tgtagaggga 
aaaggcatac 
tctaaagcac 
taatgttaat 
aggagatgat 
attgttgaat 



gaggacagga 
gcagccgagg 
attcaaaagt 
attgtgtacc 
tattcctctt 
ggacttttca 
aagagtaaaa 
gaggaggata 
gaatgtaaca 
tctactacta 
aagaagagcg 
agtacatcat 
gagagagtaa 
gaagaagcac 
aatcttgcca 
acattggaag 
aagacggtcg 
gaagaaactc 
gaataa 



ctggagtgtt 
caactggtgc 
tcctcgatgc 
gttcaaagtg 
cagacgagaa 
gtagtgtaca 
ttatccccaa 
ttgaagtgcc 
ccaagggaat 
ctacttccac 
cttctgctgc 
ttgtatttta 
agacgctaaa 
aagaaactta 
acaaaacatc 
atttggggga 
cttctactac 
caatggaatt 



ccaattcgat 
tattcacaag 
taagaaggca 
gagccccaat 
gatgaacatt 
ctttagaaag 
aactatcaag 
tgtaaaattc 
gaacagcctc 
cacttccact 
cagtgaccca 
caatatcagt 
ggcagtcaag 
tagaggaatc 
tggagtactt 
tgtacttgat 
tactacatca 
tgaaactgac 



1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2376 



<210> 39 
<211> 783 
<212> PRT 
<213> SHRIMP 



<400> 39 
Met Ser Leu Val 
1 

Glu Gly Val Val 
20 

Pro Leu Pro Ser 
35 

Asp Glu Gly Gly 
50 

Asn lie Asn Ser 
65 

Glu Met Leu Ser 

Pro Leu lie Asp 
100 

Glu Glu Gin lie 
115 

Gly Thr lie Thr 
130 

Thr Gly Glu Pro 
145 

Tyr Arg Tyr Trp 

Thr Gin Ala Val 
180 

Lys Ala lie Leu 
195 

Lys Ser Val Ser 
210 

lie Glu Phe Gly 
225 

Met Glu Lys Ala 

Val Lys Lys Val 

260 

Thr Phe Thr Met 



Glu Asn Asn Thr 

5 

Glu Gly Ala Glu 

Ser Ser Ser Ser 
40 

Glu Gin Pro Gin 

55 

Gly Lys Tyr Trp 
70 

Ala Val Asn Asp 
85 

Asn Ser Phe Gly 

Lys Thr Leu Leu 
120 

Asn Val Lys Tyr 
135 

Leu Lys Lys Lys 
150 

Gin lie Arg He 

165 

Leu Asp Ala He 

Leu Pro Glu Gly 
200 

Ser Gin Gin Ala 
215 

Gin Val Ala His 
230 

Asp Glu Ser Ser 
245 

Arg Arg Asn Lys 
He Gly Asp Ser 



Gin Glu Glu Met 
10 

Val Ala Pro Arg 
25 

Ser Ser Ala Ser 

Thr Lys Pro Pro 
60 

Lys He Glu Thr 
75 

He Asp Asn Val 
90 

Val Gin Phe Lys 
105 

Thr ^ Glu Thr He 

Ser Thr Phe Asn 
140 

Arg Ser Asn Asn 
155 

Glu Ala Ala Ala 
170 

Val Glu Gly Asn 
185 

Glu Gly He Gly 

Lys Asn He Val 
220 

Met Lys Cys Asn 
235 

Asn Ser Ser Gly 
250 

Ser Gin Pro Thr 
265 

Leu Gin Glu Arg 



He Thr Thr Val 
15 

Gly Val Lys Arg 
30 

Asp Ser Glu Asp 
45 

Lys Lys Lys Arg 

He Glu Pro Asp 
80 

Ser Lys Thr He 
95 

Lys Ser Val Ser 
110 

Ala Val Glu Tyr 
125 

Gin Leu Glu Arg 

Gly Asn Asn Asn 
160 

Ala Glu Asn Val 
175 

Asp Thr Val He 
190 

Leu Gin Phe Asn 
205 

Gin Ala Ala Asp 

Leu Phe His Lys 
240 

Glu Ser Pro Lys 
255 

Asn Ser Tyr Tyr 
270 

He Asp Asn Ala 
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275 










280 










285 








lie 


Lys 


Val 


lie 


Glu 


Met 


Ser 


Pro 


Val 


Lys 


Arg 


Pro 


Phe 


Ser 


Asn 


Ser 




290 










295 










300 










Ala 


Ala 


Ala 


Ala 


Glu 


Glu 


Asp 


Thr 


Thr 


Thr 


Thr 


Thr 


Thr 


Thr 


Ser 


Thr 


305 










310 










315 










320 


Gly 


Val 


Val 


Asn 


Pro 


Arg 


Gly 


He 


Lys 


Asp 


He 


His 


Phe 


Phe 


Asp 


Ser 










325 










330 










335 




Ser 


lie 


Ser 


Lys 


Gly 


Cys 


Phe 


Thr 


Val 


Arg 


Asn 


He 


Val 


Ala 


Ala 


Asn 








340 










345 










350 






Gly 


Glu 


Val 


Pro 


Gin 


Glu 


Glu 


Phe 


Val 


Ser 


Glu 


Leu 


Tyr 


Thr 


Asn 


Leu 






355 










360 










365 








Leu 


Lys 


Val 


Glu 


Glu 


Lys 


Val 


Asp 


His 


Pro 


Thr 


Phe 


Lys 


Lys 


Leu 


He 




370 










375 










380 










His 


Asp 


Arg 


Thr 


Met 


Asn 


Arg 


His 


He 


Lys 


Ala 


Trp 


Tyr 


Cys 


He 


Cys 


385 










390 










395 










400 


Pro 


Tyr 


Tyr 


Thr 


Thr 


Gly 


Gly 


Val 


Pro 


Pro 


Ala 


Ala 


Asp 


Lys 


Val 


Ser 










405 










410 










415 




Ala 


Lys 


Gly 


lie 


Ala 


Tyr 


He 


Tyr 


Glu 


Asp 


Arg 


Thr 


Gly 


Val 


Phe 


Gin 








420 










425 










430 






Phe 


Asp 


Gly 


Ala 


His 


Thr 


Ser 


Thr 


Thr 


Pro 


Ala 


Gin 


Ala 


Ala 


Glu 


Ala 






435 










440 










445 








Thr 


Gly 


Ala 


lie 


His 


Lys 


Ser 


Met 


Leu 


Phe 


Gin 


Ser 


Pro 


Gly 


Thr 


Asp 




450 










455 










460 










lie 


Gin 


Lys 


Phe 


Leu 


Asp 


Ala 


Lys 


Lys 


Ala 


Glu 


Gly 


Leu 


Glu 


Pro 


He 


465 










470 










475 










480 


Ser 


Ser 


Gly 


Glu 


He 


Val 


Tyr 


Arg 


Ser 


Lys 


Trp 


Ser 


Pro 


Asn 


Asp 


Ser 










485 










490 










495 




Arg 


Ala 


Thr 


Arg 


Cys 


Phe 


Lys 


Phe 


Tyr 


Ser 


Ser 


Ser 


Asp 


Glu 


Lys 


Met 








500 










505 










510 






Asn 


lie 


Ala 


Asp 


Val 


Leu 


Ser 


He 


Val 


His 


Thr 


Asp 


Gly 


Leu 


Phe 


Ser 






515 










520 










525 








Ser 


Val 


His 


Phe 


Arg 


Lys 


Asp 


Thr 


Met 


Glu 


Tyr 


Gly 


Val 


Ala 


Lys 


Ser 




530 










535 










540 










Lys 


Ser 


Lys 


He 


He 


Pro 


Lys 


Thr 


He 


Lys 


He 


Lys 


Lys 


Gly 


Gly 


Asp 


545 










550 










555 










560 


Thr 


Phe 


His 


Ser 


Glu 


Glu 


Asp 


He 


Glu 


Val 


Pro 


Val 


Lys 


Phe 


Thr 


Ala 










565 










570 










575 




lie 


Thr 


Ser 


Glu 


Glu 


Leu 


Asn 


Arg 


Glu 


Cys 


Asn 


Thr 


Lys 


Gly 


Met 


Asn 








580 










585 










590 






Ser 


Leu 


Arg 


Ala 


His 


Lys 


Lys 


Arg 


Lys 


Ser 


Asn 


Ser 


Ser 


Thr 


Thr 


Thr 






595 










600 










605 








Thr 


Ser 


Thr 


Thr 


Ser 


Thr 


Ser 


Thr 


Thr 


Ala 


Asn 


Thr 


Pro 


Lys 


Lys 


Thr 




610 










615 










620 








Lys 


Lys 


Ser 


Ala 


Ser 


Ala 


Ala 


Ser 


Asp 


Pro 


Phe 


Ala 


Lys 


Leu 


Thr 


Leu 


625 










630 










635 










640 


Asp 


Tyr 


Val 


Asp 


Ser 


Thr 


Ser 


Phe 


Val 


Phe 


Tyr 


Asn 


He 


Ser 


Lys 


Glu 










645 










650 










655 




Met 


Val 


Gin 


Arg 


He 


Leu 


Ala 


Gin 


Glu 


Arg 


Val 


Lys 


Thr 


Leu 


Lys 


Ala 








660 










665 










670 






Val 


Lys 


Asn 


Glu 


Glu 


Lys 


Met 


Glu 


He 


Val 


Glu 


Gly 


Glu 


Glu 


Ala 


Gin 






675 










680 










685 








Glu 


Tyr 


Gly 


He 


Val 


Lys 


He 


Lys 


Thr 


Asn 


Ala 


Lys 


Ala 


Tyr 


Asn 


Leu 




690 










695 










700 










Ala 


Asn 


Lys 


Thr 


Ser 


Gly 


Val 


Leu 


Phe 


Pro 


Ala 


Asp 


Lys 


Val 


Cys 


Leu 


705 










710 










715 










720 


Lys 


His 


Thr 


Leu 


Glu 


Asp 


Leu 


Gly 


Asp 


Val 


Leu 


Asp 


Phe 


Asp 


Val 


Val 










725 










730 










735 




Arg 


Glu 


Asp 


Asn 


Val 


Asn 


Lys 


Thr 


Val 


Ala 


Ser 


Thr 


Thr 


Thr 


Thr 


Ser 








740 










745 










750 






Ser 


Glu 


Asn 


Lys 


Ala 


Ser 


Gly 


Gly 


Asp 


Asp 


Glu 


Glu 


Thr 


Pro 


Met 


Glu 






755 










760 










765 
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Phe Glu Thr Asp Gly Glu Lys Leu Leu His Glu Leu Leu Asn Glu 
770 775 780 



<210> 40 
<211> 1878 
<212> DNA 
<213> SHRIMP 



<400> 40 

atgagctgtt 

gtagaaggag 

cgaaagagat 

gattctgact 

acaaggaaac 

ctctacatgt 

ggtgaagaag 

cttggttgta 

gagtatgaag 

gattgtgtta 

ttccagttct 

atgaattcag 

ttgggtgaca 

tcttcatcat 

gcaatgggaa 

aatttccgcc 

aacaagaaac 

tttagagaaa 

atgaataatg 

aatattattg 

agagatgcac 

aagagactcc 

aatgatgttc 

tttgcttctg 

ttcaatagta 

gctaatttaa 

agaacattaa 

tctagtggtt 

caattaatgt 

ggaaactttg 

gtgaagactg 

aacgtccctt 



cttcttcttc 
gaggaggcag 
cttctgttaa 
ctgatagaga 
cgcccaaagt 
ctagttcatc 
aaagtgacga 
agaaggaaaa 
aagaattcaa 
tacctgatcg 
ctcagccagt 
cattctcttc 
catctaaaat 
cgacgttcgt 
gagtagcaga 
ctgtagtttc 
tcaatagtaa 
tacacagtgt 
acaacacaaa 
gtaaaattcc 
tatatttgta 
gtatccctca 
ccattacctc 
ctatggtgga 
gttattccgt 
tcctatctat 
ctcagagcgt 
tgaccagtga 
atgagggtgt 
aggatgaaga 
ttggactatc 
tctactga 



atcgtgctca 
aatcggacca 
aagtacatca 
agaaaaagaa 
aagaaaactg 
ttcttcttcc 
tgatgatgat 
atctccccaa 
gagaatggca 
gattttaacc 
ttctttcctc 
catgttatcc 
ttcctccttt 
caataattgt 
attgctgtca 
cttaatgcga 
caggcaaact 
tatcgcgcta 
ttctagcgga 
ttataatgaa 
ccagaatgtg 
acaacaagct 
agatttactt 
tcctgcatat 
atataaagtc 
tttatcagct 
gtatttgttc 
agagagttct 
gactcgtcaa 
cggtgtaaca 
tgcattgcta 



agttcatctg 
acagaggcca 
tcttcttctt 
ggaagaaaac 
gatactcctt 
tcctcatctt 
gacgattatg 
gacatagaag 
ttaccttcac 
ctcttttcta 
cggttggtga 
agttctggta 
ataacacctc 
acagatgaag 
aatattgcag 
ggaccaactt 
attccccaag 
tatttatcgt 
tacgctgaag 
atgagtagag 
atcacggata 
gatttttgtt 
ttaactggaa 
cgcctggctg 
ctagatcttg 
agaaataagt 
ctaaatcatc 
ctaggaacag 
accatcgagg 
ctgaaatgtt 
tctgataggc 



aagaggagaa 
agaaaaagat 
cctcatcatc 
tatacgtgga 
cacaaacttt 
ccgactcatc 
acccagataa 
ctgaaaagga 
ggataaacac 
cacttttgaa 
tgaagcaagt 
tgaggttggt 
aaacggatac 
atatcaagaa 
ccagt tctaa 
gtggaggttc 
tattaaacaa 
ccgtctgtgt 
ggatggttac 
aaaaattcat 
tgactggccc 
acat tatagc 
aggcgacaaa 
tccataaaat 
accataaaat 
gtctcagtga 
ttttgcggaa 
ccgtaaaatt 
acggatgtag 
tgggagccga 
ttagaaaaaa 



tgaggtagga 
cctccgtaaa 
atcatcagac 
tattgcagat 
agagaacgac 
atcgtcttct 
tgtccatgtt 
aaaggaggaa 
atctgtagat 
gaaaaatagt 
aaacgaggcc 
ggaggactct 
tagtaattca 
gcgtaacatt 
tgaggagaat 
taatgcatcc 
ggttatattt 
gcagcgagca 
taaaatcttg 
atccgttgga 
caaacataac 
catgttggtt 
tttagtacaa 
ggcgtctgtt 
gttattaagg 
aagaaaacct 
caaattgagg 
ggtgtcacag 
catgattagc 
tgttaaggac 
cattagaaga 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1878 



<210> 41 
<211> 621 
<212> PRT 
<213> SHRIMP 



<400> 41 

Met Ser Cys Ser Ser Ser Ser Ser Cys Ser Ser Ser Ser Glu Glu Glu 

15 10 15 

Asn Glu Val Gly Val Glu Gly Gly Gly Gly Arg lie Gly Pro Thr Glu 

20 25 30 

Ala Lys Lys Lys lie Leu Arg Lys Arg Lys Arg Ser Ser Val Lys Ser 

35 40 45 

Thr Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Asp Asp Ser Asp Ser 

50 55 60 

Asp Arg Glu Glu Lys Glu Gly Arg Lys Leu Tyr Val Asp lie Ala Asp 
65 70 75 80 

Thr Arg Lys Pro Pro Lys Val Arg Lys Leu Asp Thr Pro Ser Gin Thr 
85 90 95 
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Leu 


Glu 


Asn 


Asp 


Leu 


Tyr 


Met 


Ser 


Ser 


Ser 


Ser 


Ser 


Ser 


Ser 


Ser 


Ser 








100 










105 










110 






Ser 


Ser 


Asp 


Ser 


Ser 


Ser 


Ser 


Ser 


Gly 


Glu 


Glu 


Glu 


Ser 


Asp 


Asp 


Asp 






115 










120 










125 








Asp 


Asp 


Asp 


Asp 


Tyr 


Asp 


Pro 


Asp 


Asn 


Val 


His 


Val 


Leu 


Gly 


Cys 


Lys 




130 










135 










140 










Lys 


Glu 


Lys 


Ser 


Pro 


Gin 


Asp 


He 


Glu 


Ala 


Glu 


Lys 


Glu 


Lys 


Glu 


Glu 


145 










150 










155 










160 


Glu 


Tyr 


Glu 


Glu 


Glu 


Phe 


Lys 


Arg 


Met 


Ala 


Leu 


Pro 


Ser 


Arg 


He 


Asn 










165 










170 










175 




Thr 


Ser 


Val 


Asp 


Asp 


Cys 


Val 


He 


Pro 


Asp 


Arg 


He 


Leu 


Thr 


Leu 


Phe 








180 










185 










190 






Ser 


Thr 


Leu 


Leu 


Lys 


Lys 


Asn 


Ser 


Phe 


Gin 


Phe 


Ser 


Gin 


Pro 


Val 


Ser 






195 










200 










205 








Phe 


Leu 


Arg 


Leu 


Val 


Met 


Lys 


Gin 


Val 


Asn 


Glu 


Ala 


Met 


Asn 


Ser 


Ala 




210 










215 










220 










Phe 


Ser 


Ser 


Met 


Leu 


Ser 


Ser 


Ser 


Gly 


Met 


Arg 


Leu 


Val 


Glu 


Asp 


Ser 


225 










230 










235 










240 


Leu 


Gly 


Asp 


Thr 


Ser 


Lys 


He 


Ser 


Ser 


Phe 


He 


Thr 


Pro 


Gin 


Thr 


Asp 










24 5 










250 










255 




Thr 


Ser 


Asn 


Ser 


Ser 


Ser 


Ser 


Ser 


Thr 


Phe 


Val 


Asn 


Asn 


Cys 


Thr 


Asp 








260 










265 










270 






Glu 


Asp 


lie 


Lys 


Lys 


Arg 


Asn 


He 


Ala 


Met 


Gly 


Arg 


Val 


Ala 


Glu 


Leu 






275 










280 










285 








Leu 


Ser 


Asn 


lie 


Ala 


Ala 


Ser 


Ser 


Asn 


Glu 


Glu 


Asn 


Asn 


Phe 


Arg 


Pro 




290 










295 










300 








Val 


Val 


Ser 


Leu 


Met 


Arg 


Gly 


Pro 


Thr 


Cys 


Gly 


Gly 


Ser 


Asn 


Asn 


Lys 


305 










310 










315 










320 


Lys 


Leu 


Asn 


Ser 


Asn 


Arg 


Gin 


Thr 


He 


Pro 


Gin 


Val 


Leu 


Asn 


Lys 


Val 










325 










330 










335 




lie 


Phe 


Phe 


Arg 


Glu 


He 


His 


Ser 


Val 


He 


Ala 


Leu 


Tyr 


Leu 


Ser 


Ser 








340 










345 










350 






Val 


Cys 


Val 


Gin 


Arg 


Ala 


Met 


Asn 


Asn 


Asp 


Asn 


Thr 


Asn 


Ser 


Ser 


Gly 






355 










360 










365 








Tyr 


Ala 


Glu 


Gly 


Met 


Val 


Thr 


Lys 


He 


Leu 


Asn 


He 


He 


Gly 


Lys 


He 




370 










375 










380 










Pro 


Tyr 


Asn 


Glu 


Met 


Ser 


Arg 


Glu 


Lys 


Phe 


He 


Ser 


Val 


Gly 


Arg 


Asp 


385 










390 










395 










400 


Ala 


Leu 


Tyr 


Leu 


Tyr 


Gin 


Asn 


Val 


He 


Thr 


Asp 


Met 


Thr 


Gly 


Pro 


Lys 










405 










410 










415 




His 


Asn 


Lys 


Arg 


Leu 


Arg 


He 


Pro 


Gin 


Gin 


Gin 


Ala 


Asp 


Phe 


Cys 


Tyr 








420 










425 










430 






lie 


lie 


Ala 


Met 


Leu 


Val 


Asn 


Asp 


Val 


Pro 


He 


Thr 


Ser 


Asp 


Leu 


Leu 






435 










440 










445 








Leu 


Thr 


Gly 


Lys 


Ala 


Thr 


Asn 


Leu 


Val 


Gin 


Phe 


Ala 


Ser 


Ala 


Met 


Val 




450 










455 










460 










Asp 


Pro 


Ala 


Tyr 


Arg 


Leu 


Ala 


Val 


His 


Lys 


Met 


Ala 


Ser 


Val 


Phe 


Asn 


H DO 










470 










4 75 










4 80 


Ser 


Ser 


Tyr 


Ser 


Val 


Tyr 


Lys 


Val 


Leu 


Asp 


Leu 


Asp 


His 


Lys 


Met 


Leu 










485 










490 










495 




Leu 


Arg 


Ala 


Asn 


Leu 


He 


Leu 


Ser 


He 


Leu 


Ser 


Ala 


Arg 


Asn 


Lys 


Cys 








500 










505 










510 






Leu 


Ser 


Lys 


Pro 


Arg 


Thr 


Leu 


Thr 


Gin 


Ser 


Val 


Tyr 


Leu 


Phe 


Leu 


Asn 






515 










520 










525 








His 


Leu 


Leu 


Arg 


Asn 


Lys 


Leu 


Arg 


Ser 


Ser 


Gly 


Leu 


Thr 


Ser 


Glu 


Glu 




530 










535 










540 










Ser 


Ser 


Leu 


Gly 


Thr 


Ala 


Val 


Lys 


Leu 


Val 


Ser 


Gin 


Gin 


Leu 


Met 


Tyr 


545 










550 










555 










560 


Glu 


Gly 


Val 


Thr 


Arg 


Gin 


Thr 


He 


Glu 


Asp 


Gly 


Cys 


Ser 


Met 


He 


Ser 










565 










570 










575 




Gly 


Asn 


Phe 


Glu 


Asp 


Glu 


Asp 


Gly 


Val 


Thr 


Leu 


Lys 


Cys 


Leu 


Gly 


Ala 
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580 585 590 

Asp Val Lys Asp Val Lys Thr Val Gly Leu Ser Ala Leu Leu Ser Asp 

595 600 605 

Arg Leu Arg Lys Asn He Arg Arg Asn Val Pro Phe Tyr 
610 615 620 



<210> 42 
<211> 3303 
<212> DNA 
<213> SHRIMP 



<400> 42 

atgtttggaa 

gctgccgcat 

aagaaagttg 

cccaacttta 

tcatcgaagg 

tctaaaggag 

aaggtagagg 

ccctctaaag 

gtcaaattga 

tccaatgata 

ccattcctaa 

gtgtggttct 

aattccacct 

aagaacttta 

gagcatgttt 

tttgaaagag 

tcatacatgg 

ggaggcggaa 

aataacaata 

tcactttctc 

gtgtggggga 

cagaacggtg 

ctttctcaga 

ctactcaaga 

atgtgtacag 

gaactataca 

ctgtttgtca 

gtacaaaatc 

tgtacagata 

aagagagctt 

gaggcaaatc 

aaccacaatt 

aggatgaaca 

acaacagacg 

ctttctgcct 

agcttcaata 

gaagaagtta 

attaaaatga 

ctctgcaaca 
tctgtgtgtg 
gaattgggtg 
aagacatatg 
aatgcactcg 
aggagagggt 
aacgctatcc 
tattatacgt 
agcaatgcag 



gttctgctaa 
catctgacga 
ctgccat tct 
agaatgtgat 
cagctgaaag 
gaagtgatgg 
ctgaagaaat 
ataatccatt 
ctcagaagaa 
ggagta tccg 
tgaccatcag 
tcaacagatt 
atgtagcaac 
tttctggaga 
ataacaagat 
tggacaacac 
aa tacaggtg 
agagtggtgt 
gtagtgcagc 
catttagcag 
agaagatgat 
gtggtgcatc 
aggaaaatgt 
acattatcaa 
atgaaaatat 
aggctat tgt 
accttattct 
ttccttcaaa 
tgcgcta tga 
gtgagtgcag 
tcacccctag 
ctcacaggag 
agctcaagaa 
attttctcct 
ctct tcatca 
ttaactttgc 
ct tcctcaca 
agagagggga 
ctaagaaggt 
actgcaacgt 
tctaca tcca 
aagaattgac 
atgatattat 
ttaaattctg 
acatgtgca t 
agta tcacga 
ctcgcaataa 
atgtagccaa 



taactttaat 
tcagcagtta 
tgccaacaga 
taacaaccct 
tggtagtgca 
aaagaaatca 
ggcctttaaa 
gagggatgac 
tctggaatat 
tagtggaaga 
tgttgacatt 
cttccgcatg 
tagcgatgct 
aaatgtgccc 
cagtaaatgt 
gattggattt 
ctttgacaca 
cctgtcttct 
tgcagcttct 
tgatggagat 
attcaacaca 
aacttacaag 
tcgtctcatg 
ctttctgaat 
cttcgatgag 
ttgctccaac 
tccccgtctt 
caatggatcg 
catccctcca 
aaaactgtgc 
ccagaaggca 
taacgctcac 
ggattcaaag 
gcaagatagg 
cattttctgt 
aaacaacaaa 
aacggtcaac 
tgatattata 
tagtaagatt 
taactcttct 
caacattatt 
caagcttgtt 
ccgcaattac 
tcacaggaat 
ggtgagcaac 
atcactatgc 
caattctgaa 
gactatggcc 



ggtgacaaga 
ggtccccttg 
acagagtccc 
aaccagatat 
aacaagaatg 
tcgcagcaga 
cgtgtggctg 
cctgatgcta 
cttttctggg 
tatcttcaag 
agaatcctgc 
gtttctggac 
attgcccaga 
caggcactga 
ggacgtcagc 
gtcaattctg 
atcaggaaga 
ggaacattct 
gcccctgctg 
gatgatgacg 
tcaggagacg 
agatttaggt 
gcgatgccaa 
agtgcactta 
gatcaagccg 
ccagccaatg 
aggaacccaa 
gtgagaacaa 
tatgcaaagg 
aaggacgtga 
ggaagggagg 
gactttactt 
aagaaggtaa 
aacgcttttg 
cctgatgtgc 
ctcgagtgct 
gccaaggaag 
gatgttgtga 
gtgagaaggt 
aatggagatg 
cctgtgctcg 
aaggagtgta 
gaaattactg 
tataatgtgg 
cttgtcgggt 
tctttgcact 
gatggaggag 
tctttctatg 



aatcttcctc 
gactgtctac 
tatatctgtt 
ctattgtgcc 
aaaaccaggc 
acaagtttaa 
aactcattgc 
ttccatcacg 
aagcagtaac 
ccagtgaagt 
aaagaatggc 
ttggagtaga 
tctgggtaga 
agtatttgaa 
catcctattt 
atactgaaca 
atgcatcatc 
tcattgataa 
tttctgctgg 
acgattgtag 
gatcaggaga 
gtggagaaaa 
agggaaatga 
actctgttga 
agcattatac 
tgtacagagt 
ttgtgagtga 
agaagatggt 
gaaagattcg 
ggtgctttga 
tggaggaacc 
tctatgacaa 
agaagattga 
atctacttag 
ttatggtgca 
ataatgaacg 
cacttgagga 
agagtaaagg 
ttaatgagat 
tggatttcca 
aagatatctc 
gagacgtggc 
taaagtactt 
cagtaacctc 
attattgtaa 
ctagcatctc 
gaaactcttc 
accagttcga 



atcatcatca 
tgctgatttc 
gccagattct 
gtttctgggt 
tgaaaattct 
cctattgaac 
ggatactcct 
caacccatgg 
aattgaagtc 
gggggagaat 
acttaatgtc 
aaacagggcc 
gatgctcctc 
ggaacattat 
tgttgttgaa 
taatggatca 
tgggccaagt 
cgaaatgggg 
agtttctcca 
tggtgatgac 
atcttctggg 
tactgcttct 
agataaacaa 
aaaccatg ta 
gtcaaacaag 
aatggttgaa 
cattgaaact 
tgaacatgga 
tcttagtgcc 
caagtctaga 
ttttccccgc 
gtacagggca 
cacctttaca 
aaagtgcttc 
tagaggagac 
taacggaatt 
tattacaaaa 
actttctttg 
cacaaaccaa 
cgtctttact 
catttttgca 
tggagaggac 
taagctct tt 
tgccattaac 
gctgtctgat 
ttatgcagac 
ttcagaaaag 
taagagtgaa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 
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gacagcaaga 
gatagggttt 
gagaatgaag 
gtggaagatg 
aagcaggaag 
aaagaggaac 
gacgagaaag 
taa 



aaaataagaa 
tggatggcat 
aggaggagga 
ttgatagcaa 
aagatgtgga 
aggaggcgaa 
cagtagcatc 



caaaacttca 
ggatgatgat 
agaggaggaa 
taagaagaca 
gatggaggaa 
ggaggaagac 
tgatgaagat 



aa tgagatcc 
gatgatgaag 
attgtaaaga 
ctgccaaagg 
gtgaaggaag 
gctactgagt 
gaagatgatg 



ttataaaaat 
atagtgatag 
aaccagcaaa 
aacctgccgt 
cagcagcaga 
atgacgacga 
aagattctaa 



gttccaaatg 
tagtagcagt 
gaagaggaaa 
taagaaggtg 
agaagaaaag 
tacagaagag 
agctattttc 



2940 
3000 
3060 
3120 
3180 
3240 
3300 
3303 



<210> 43 
<211> HOC 
<212> PRT 
<213> SHRIMP 



<400> 43 
Met Phe Gly Ser 
1 

Ser Ser Ser Ser 
20 

Leu Gly Leu Ser 

35 

Asn Arg Thr Glu 
50 

Asn Val He Asn 
65 

Ser Ser Lys Ala 

Ala Glu Asn Ser 
100 

Gin Asn Lys Phe 
115 

Phe Lys Arg Val 
130 

Asn Pro Leu Arg 
145 

Val Lys Leu Thr 

Thr He Glu Val 
180 

Gin Ala Ser Glu 
195 

Asp He Arg He 
210 

Asn Arg Phe Phe 
225 

Asn Ser Thr Tyr 

Glu Met Leu Leu 
260 

Leu Lys Tyr Leu 
275 

Lys Cys Gly Arg 
290 

Asp Asn Thr He 
305 

Ser Tyr Met Glu 

Ser Gly Pro Ser 
340 

Phe Phe He Asp 
355 



Ser Ala Asn Asn 
5 

Ala Ala Ala Ser 

Thr Ala Asp Phe 
40 

Ser Leu Tyr Leu 
55 

Asn Pro Asn Gin 
70 

Ala Glu Ser Gly 
85 

Ser Lys Gly Gly 

Asn Leu Leu Asn 
120 

Ala Glu Leu He 
135 

Asp Asp Pro Asp 
150 

Gin Lys Asn Leu 
165 

Ser Asn Asp Arg 

Val Gly Glu Asn 
200 

Leu Gin Arg Met 
215 

Arg Met Val Ser 
230 

Val Ala Thr Ser 
245 

Lys Asn Phe He 

Lys Glu His Tyr 
280 

Gin Pro Ser Tyr 
295 

Gly Phe Val Asn 
310 

Tyr Arg Cys Phe 
325 

Gly Gly Gly Lys 

Asn Glu Met Gly 

360 



Phe Asn Gly Asp 
10 

Ser Asp Asp Gin 

25 

Lys Lys Val Ala 

Leu Pro Asp Ser 
60 

He Ser He Val 
75 

Ser Ala Asn Lys 
90 

Ser Asp Gly Lys 
105 

Lys Val Glu Ala 

Ala Asp Thr Pro 
140 

Ala He Pro Ser 
155 

Glu Tyr Leu Phe 
170 

Ser He Arg Ser 
185 

Pro Phe Leu Met 

Ala Leu Asn Val 
220 

Gly Leu Gly Val 
235 

Asp Ala He Ala 
250 

Ser Gly Glu Asn 
265 

Glu His Val Tyr 

Phe Val Val Glu 
300 

Ser Asp Thr Glu 
315 

Asp Thr He Arg 
330 

Ser Gly Val Leu 
345 

Asn Asn Asn Ser 



Lys Lys Ser Ser 
15 

Gin Leu Gly Pro 
30 

Ala He Leu Ala 
45 

Pro Asn Phe Lys 

Pro Phe Leu Gly 
80 

Asn Glu Asn Gin 
95 

Lys Ser Ser Gin 
110 

Glu Glu Met Ala 
125 

Pro Ser Lys Asp 

Arg Asn Pro Trp 
160 

Trp Glu Ala Val 
175 

Gly Arg Tyr Leu 
190 

Thr He Ser Val 
205 

Val Trp Phe Phe 

Glu Asn Arg Ala 
240 

Gin He Trp Val 
255 

Val Pro Gin Ala 
270 

Asn Lys He Ser 
285 

Phe Glu Arg Val 

His Asn Gly Ser 
320 

Lys Asn Ala Ser 
335 

Ser Ser Gly Thr 
350 

Ser Ala Ala Ala 
365 



wo 01/38351 



124 



PCT/USOO/28888 



Ala 


Ser 


Ala 


Pro 


Ala 


Val 


Ser 


Ala 


Gly 


Val 


Ser 


Pro 


Ser 


Leu 


Ser 


Pro 




370 










375 










380 










Phe 


Ser 


Ser 


Asp 


Gly 


Asp 


Asp 


Asp 


Asp 


Asp 


Asp 


Cys 


Ser 


Gly 


Asp 


Asp 


385 










390 










395 










400 


Val 


Trp 


Gly 


Lys 


Lys 


Met 


He 


Phe 


Asn 


Thr 


Ser 


Gly 


Asp 


Gly 


Ser 


Gly 










405 










410 










415 




Glu 


Ser 


Ser 


Gly 


Gin 


Asn 


Gly 


Gly 


Gly 


Ala 


Ser 


Thr 


Tyr 


Lys 


Arg 


Phe 








420 










425 










430 






Arg 


Cys 


Gly 


Glu 


Asn 


Thr 


Ala 


Ser 


Leu 


Ser 


Gin 


Lys 


Glu 


Asn 


Val 


Arg 






435 










440 










445 








Leu 


Met 


Ala 


Met 


Pro 


Lys 


Gly 


Asn 


Glu 


Asp 


Lys 


Gin 


Leu 


Leu 


Lys 


Asn 




450 










455 










460 










lie 


lie 


Asn 


Phe 


Leu 


Asn 


Ser 


Ala 


Leu 


Asn 


Ser 


Val 


Glu 


Asn 


His 


Val 


465 










470 










475 










480 


Met 


Cys 


Thr 


Asp 


Glu 


Asn 


He 


Phe 


Asp 


Glu 


Asp 


Gin 


Ala 


Glu 


His 


Tyr 










485 










490 










495 




Thr 


Ser 


Asn 


Lys 


Glu 


Leu 


Tyr 


Lys 


Ala 


He 


Val 


Cys 


Ser 


Asn 


Pro 


Ala 








500 










505 










510 






Asn 


Val 


Tyr 


Arg 


Val 


Met 


Val 


Glu 


Leu 


Phe 


Val 


Asn 


Leu 


He 


Leu 


Pro 






515 










520 










525 








Arg 


Leu 


Arg 


Asn 


Pro 


He 


Val 


Ser 


Asp 


He 


Glu 


Thr 


Val 


Gin 


Asn 


Leu 




530 










535 










540 










Pro 


Ser 


Asn 


Asn 


Gly 


Ser 


Val 


Arg 


Thr 


Lys 


Lys 


Met 


Val 


Glu 


His 


Gly 


545 










550 










555 










560 


Cys 


Thr 


Asp 


Met 


Arg 


Tyr 


Asp 


He 


Pro 


Pro 


Tyr 


Ala 


Lys 


Gly 


Lys 


He 










565 










570 










575 




Arg 


Leu 


Ser 


Ala 


Lys 


Arg 


Ala 


Cys 


Glu 


Cys 


Arg 


Lys 


Leu 


Cys 


Lys 


Asp 








580 










585 










590 






Val 


Arg 


Cys 


Phe 


Asp 


Lys 


Ser 


Arg 


Glu 


Ala 


Asn 


Leu 


Thr 


Pro 


Ser 


Gin 






595 










600 










605 








Lys 


Ala 


Gly 


Arg 


Glu 


Val 


Glu 


Glu 


Pro 


Phe 


Pro 


Arg 


Asn 


His 


Asn 


Ser 




610 










615 










620 










His 


Arg 


Ser 


Asn 


Ala 


His 


Asp 


Phe 


Thr 


Phe 


Tyr 


Asp 


Lys 


Tyr 


Arg 


Ala 


625 










630 










635 










640 


Arg 


Met 


Asn 


Lys 


Leu 


Lys 


Lys 


Asp 


Ser 


Lys 


Lys 


Lys 


Val 


Lys 


Lys 


He 










645 










650 










655 




Asp 


Thr 


Phe 


Thr 


Thr 


Thr 


Asp 


Asp 


Phe 


Leu 


Leu 


Gin 


Asp 


Arg 


Asn 


Ala 








660 










665 










670 






Phe 


Asp 


Leu 


Leu 


Arg 


Lys 


Cys 


Phe 


Leu 


Ser 


Ala 


Ser 


Leu 


His 


His 


He 






675 










680 










685 








Phe 


Cys 


Pro 


Asp 


Val 


Leu 


Met 


Val 


His 


Arg 


Gly 


Asp 


Ser 


Phe 


Asn 


He 




690 










695 










700 










Asn 


Phe 


Ala 


Asn 


Asn 


Lys 


Leu 


Glu 


Cys 


Tyr 


Asn 


Glu 


Arg 


Asn 


Gly 


He 


705 










710 










715 










720 


Glu 


Glu 


Val 


Thr 


Ser 


Ser 


Gin 


Thr 


Val 


Asn 


Ala 


Lys 


Glu 


Ala 


Leu 


Glu 










725 










730 










735 




Asp 


lie 


Thr 


Lys 


He 


Lys 


Met 


Lys 


Arg 


Gly 


Asp 


Asp 


He 


He 


Asp 


Val 








740 










745 










750 






Val 


Lys 


Ser 


Lys 


Gly 


Leu 


Ser 


Leu 


Arg 


Glu 


Phe 


Ser 


Lys 


Lys 


Val 


Ser 






755 










760 










7 65 








Lys 


lie 


Val 


Arg 


Arg 


Phe 


Asn 


Glu 


He 


Thr 


Asn 


Gin 


Leu 


Cys 


Asn 


Asn 




770 










775 










780 










Cys 


Asn 


Val 


Asn 


Ser 


Ser 


Asn 


Gly 


Asp 


Val 


Asp 


Phe 


His 


Val 


Phe 


Thr 


785 










790 










795 










800 


Ser 


Val 


Cys 


Val 


Tyr 


He 


His 


Asn 


He 


He 


Pro 


Val 


Leu 


Glu 


Asp 


He 










805 










810 










815 




Ser 


lie 


Phe 


Ala 


Glu 


Leu 


Gly 


Glu 


Glu 


Leu 


Thr 


Lys 


Leu 


Val 


Lys 


Glu 








820 










825 










830 






Cys 


Arg 


Asp 


Val 


Ala 


Gly 


Glu 


Asp 


Lys 


Thr 


Tyr 


Asp 


Asp 


He 


He 


Arg 






835 










840 










845 








Asn 


Tyr 


Glu 


lie 


Thr 


Val 


Lys 


Tyr 


Phe 


Lys 


Leu 


Phe 


Asn 


Ala 


Leu 


Val 
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850 










855 










860 










Lys 


Phe 


Cys 


His 


Arg 


Asn 


Tyr 


Asn 


Val 


Ala 


Val 


Thr 


Ser 


Ala 


He 


Asn 


fl £ c. 

o b J 










870 










875 










880 


Arg 


Arg 


Gly 


Tyr 


Met 


Cys 


Met 


Val 


Ser 


Asn 


Leu 


Val 


Gly 


Tyr 


Tyr 


Cys 










885 










890 










895 




Lys 


Leu 


Ser 


Asp 


Asn 


Ala 


He 


Gin 


Tyr 


His 


Glu 


Ser 


Leu 


Cys 


Ser 


Leu 








900 










905 










910 






His 


Ser 


Ser 


He 


Ser 


Tyr 


Ala 


Asp 


Tyr 


Tyr 


Thr 


Ser 


Arg 


Asn 


Asn 


Asn 






915 










920 










925 








Ser 


Glu 


Asp 


Gly 


Gly 


Gly 


Asn 


Ser 


Ser 


Ser 


Glu 


Lys 


Ser 


Asn 


Ala 


Asp 




930 










935 










940 










Val 


Ala 


Lys 


Thr 


Met 


Ala 


Ser 


Phe 


Tyr 


Asp 


Gin 


Phe 


Asp 


Lys 


Ser 


Glu 


945 










950 










955 










960 


Asp 


Ser 


Lys 


Lys 


Asn 


Lys 


Asn 


Lys 


Thr 


Ser 


Asn 


Glu 


He 


Leu 


He 


Lys 










965 










970 










975 




Met 


Phe 


Gin 


Met 


Asp 


Arg 


Val 


Leu 


Asp 


Gly 


Met 


Asp 


Asp 


Asp 


Asp 


Asp 








980 










985 










990 






Glu 


Asp 


Ser 


Asp 


Ser 


Ser 


Ser 


Ser 


Glu 


Asn 


Glu 


Glu 


Glu 


Glu 


Glu 


Glu 






995 










1000 








1005 






Glu 


Glu 


He 


Val 


Lys 


Lys 


Pro 


Ala 


Lys 


Lys 


Arg 


Lys 


Val 


Glu 


Asp 


Val 




1010 








1015 








1020 








Asp 


Ser 


Asn 


Lys 


Lys 


Thr 


Leu 


Pro 


Lys 


Glu 


Pro 


Ala 


Val 


Lys 


Lys 


Val 


1025 








1030 








1035 








1041 


Lys 


Gin 


Glu 


Glu 


Asp 


Val 


Glu 


Met 


Glu 


Glu 


Val 


Lys 


Glu 


Ala 


Ala 


Ala 










1045 








1050 








1055 


Glu 


Glu 


Glu 


Lys 


Lys 


Glu 


Glu 


Gin 


Glu 


Ala 


Lys 


Glu 


Glu 


Asp 


Ala 


Thr 








1060 








1065 








1070 




Glu 


Tyr 


Asp 


Asp 


Asp 


Thr 


Glu 


Glu 


Asp 


Glu 


Lys 


Ala 


Val 


Ala 


Ser 


Asp 






1075 








1080 








1085 






Glu 


Asp 


Glu 


Asp 


Asp 


Glu 


Asp 


Ser 


Lys 


Ala 


He 


Phe 











1090 1095 1100 



<210> 44 
<211> 618 
<212> DNA 
<213> SHRIMP 



<400> 44 

atggcaggaa 

aaggaaatac 

tttaaca tgc 

cctttccctc 

gagct tctgg 

cccagaacct 

actaccggtc 

gttaggcacg 

gttggtaagc 

gttgtgatta 

aagttctcaa 



tcggtagaag 
cctataacgt 
agaagtgctc 
ca taccctgg 
ttgaccaatc 
accctctcga 
acagcgatat 
gttaccacta 
tcactgaaaa 
atctgtcaag 
aaaactag 



agataataga 
tccaccaacc 
agactgtgct 
tactgcagta 
tttccctccc 
atttcccgag 
ccccttcatc 
cttaatcaac 
gcaagaaaac 
cactttgaag 



<210> 45 
<211> 205 
<212> PRT 
<213> SHRIMP 



cccgtattac 
cctattatct 
cctttccctc 
gaagaggagg 
ccattccctg 
aagaaggaga 
gatctggaga 
cccaacaagg 
ctgaacaaat 
gaacttgaga 



atctggacat 
gtgaaaaaaa 
cctaccccgg 
agaagcaaaa 
gaaataagct 
aggatttccc 
aaaccccacc 
ttggggagct 
tggtgttgga 
agctgcgagc 



tgatccaaat 
ccccttcgtg 
cactgagaag 
ggaaattgag 
gagagatatc 
ttgcgttgac 
ccgtagtgac 
taaccatatc 
tgttgatgac 
tggcctgtgc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

618 



<400> 45 

Met Ala Gly He Gly Arg Arg Asp Asn Arg Pro Val Leu His Leu Asp 

15 10 15 

He Asp Pro Asn Lys Glu He Pro Tyr Asn Val Pro Pro Thr Pro He 
20 25 30 
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He 


Cys 


Glu 


Lys 


Asn 


Pro 


Phe 


Val 


Phe 


Asn 


Met 


Gin 


Lys 


Cys 


Ser 


Asp 






35 










40 










45 








Cys 


Ala 


Pro 


Phe 


Pro 


Pro 


Tyr 


Pro 


Gly 


Thr 


Glu 


Lys 


Pro 


Phe 


Pro 


Pro 




50 










55 










60 










Tyr 


Pro 


Gly 


Thr 


Ala 


Val 


Glu 


Glu 


Glu 


Glu 


Lys 


Gin 


Lys 


Glu 


He 


Glu 


65 










70 










75 










80 


Glu 


Leu 


Leu 


Val 


Asp 


Gin 


Ser 


Phe 


Pro 


Pro 


Pro 


Phe 


Pro 


Gly 


Asn 


Lys 










85 










90 










95 




Leu 


Arg 


Asp 


He 


Pro 


Arg 


Thr 


Tyr 


Pro 


Leu 


Glu 


Phe 


Pro 


Glu 


Lys 


Lys 








100 










105 










110 






Glu 


Lys 


Asp 


Phe 


Pro 


Cys 


Val 


Asp 


Thr 


Thr 


Gly 


His 


Ser 


Asp 


He 


Pro 






115 










120 










125 








Phe 


He 


Asp 


Leu 


Glu 


Lys 


Thr 


Pro 


Pro 


Arg 


Ser 


Asp 


Val 


Arg 


His 


Gly 




130 










135 










140 










Tyr 


His 


Tyr 


Leu 


He 


Asn 


Pro 


Asn 


Lys 


Val 


Gly 


Glu 


Leu 


Asn 


His 


He 


145 










150 










155 










160 


Val 


Gly 


Lys 


Leu 


Thr 


Glu 


Lys 


Gin 


Glu 


Asn 


Leu 


Asn 


Lys 


Leu 


Val 


Leu 










165 










170 










175 




Asp 


Val 


Asp 


Asp 


Val 


Val 


He 


Asn 


Leu 


Ser 


Ser 


Thr 


Leu 


Lys 


Glu 


Leu 








180 










185 










190 






Glu 


Lys 


Leu 


Arg 


Ala 


Gly 


Leu 


Cys 


Lys 


Phe 


Ser 


Lys 


Asn 









195 200 205 



<210> 46 

<2H> 630 

<212> DNA 

<213> SHRIMP 



<400> 46 

atggcctcat 

gaaagaga tg 

gttagcgccg 

cttgaagatt 

agcgatgacg 

ctgtgctctt 

gtgttccttc 

actgaagaag 

ggagatagtg 

ggatctcttt 

gatggtggag 



catcatcatc 
aagaaaatac 
cctgggtgcc 
tttcttcaga 
atgacgatga 
ttgtgagggg 
ttacaagcgc 
gcgcagcaac 
gtat tgctgc 
tggaatctat 
aagaaaccac 



ccctgttgcc 
actgtccctc 
tgttgatgaa 
tgaagaagac 
cgaggaggat 
tttctttggc 
agccgttgct 
catggaagaa 
tgatgttgtc 
tgcaacaaca 
agaattgtaa 



ctctcctctg 
aggaacagga 
gaggatgaag 
gatgataata 
ccttcatgct 
ttcctcagga 
gctattttta 
aattcaacag 
tctctcgcta 
ctcatcaaga 



tcgcatcatc 


cgtgatgatg 


60 


atgtgaacaa 


accaacacct 


120 


atagggagga 


aatgagaaga 


180 


aatcatgtca 


t tgtgaccat 


240 


ttaagggttt 


ttcagctggc 


300 


agtcacttac 


caagaaacag 


360 


agactagaga 


tgtggctaaa 


420 


atgtgattac 


tggaggaga t 


480 


gtgagggaga 


gggagaaaat 


540 


ctacaattga 


aaatct tgta 


600 






630 



<210> 47 

<211> 207 

<212> PRT 

<213> SHRIMP 



<400> 47 



Met 


Ala 


Ser 


Ser 


Ser 


Ser 


Ser 


Pro 


Val 


Ser 


Ser 


Val 


Ala 


Ser 


Ser 


Val 


1 








5 










10 










15 




Met 


Met 


Glu 


Arg 


Asp 


Glu 


Glu 


Asn 


Thr 


Leu 


Ser 


Leu 


Arg 


Asn 


Arg 


Asn 








20 










25 










30 






Val 


Asn 


Lys 


Pro 


Thr 


Pro 


Val 


Ser 


Ala 


Ala 


Trp 


Val 


Pro 


Val 


Asp 


Glu 






35 










40 










45 








Glu 


Asp 


Glu 


Asp 


Arg 


Glu 


Glu 


Met 


Arg 


Arg 


Leu 


Glu 


Asp 


Phe 


Ser 


Ser 




50 










55 










60 










Asp 


Glu 


Glu 


Asp 


Asp 


Asp 


Asn 


Lys 


Ser 


Cys 


His 


Cys 


Asp 


His 


Ser 


Asp 


65 










70 










75 










80 


Asp 


Asp 


Asp 


Asp 


Asp 


Glu 


Glu 


Asp 


Pro 


Ser 


Cys 


Phe 


Lys 


Gly 


Phe 


Ser 










85 










90 










95 




Ala 


Gly 


Leu 


Cys 


Ser 


Phe 


Val 


Arg 


Gly 


Phe 


Phe 


Gly 


Phe 


Leu 


Arg 


Lys 
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100 

Ser Leu Thr Lys Lys 
115 

Ala lie Phe Lys Thr 
130 

Thr Met Glu Glu Asn 
145 

Ser Gly He Ala Ala 
165 

Glu Asn Gly Ser Leu 
180 

Thr He Glu Asn Leu 
195 



105 

Gin Val Phe Leu Leu Thr 
120 

Arg Asp Val Ala Lys Thr 
135 

Ser Thr Asp Val He Thr 
150 155 
Asp Val Val Ser Leu Ala 
170 

Leu Glu Ser lie Ala Thr 
185 

Val Asp Gly Gly Glu Glu 
200 



110 

Ser Ala Ala Val Ala 
125 

Glu Glu Gly Ala Ala 
140 

Gly Gly Asp Gly Asp 
160 

Ser Glu Gly Glu Gly 
175 

Thr Leu He Lys Thr 
190 

Thr Thr Glu Leu 
205 



<210> 48 
<211> 2685 
<212> DNA 
<213> SHRIMP 



<400> 48 

atgacgagac 

gtcgactaca 

gacaatatca 

aagaaaataa 

gccaccaagg 

gaaaacgcca 

ctatcagaaa 

cgagacacgt 

ttgatatttg 

ttgtgtgtga 

gctaacgtgt 

cgactcggga 

cccggtatgt 

ggggaatact 

tctgactggt 

ggagaaggag 

ttccattact 

gaaaaatacg 

tacccgtacg 

atttctgaag 

gcaaatttag 

caactttttg 

ctttctttcg 

aaagagagat 

agtatgatga 

aaggataaac 

tcaaagggtt 

ccttatgttt 

aataatagta 

aaacaccgtg 

aattacatct 

agtttattac 

gaaagggccc 

actctatttg 

aaacgtaaat 

gtcaatttga 

aatagtacca 

cgt tttaaag 

tcctcttcat 

cctagtgatt 



atggtgtgct 
ctttctgtac 
cagaccaaaa 
agaaggaaga 
aattcgaaca 
tcgcggccgc 
gtgaaggaaa 
cccagaaaca 
aaactggatc 
ttaaagccac 
ctatagtgaa 
aattcttcga 
ttttcaagct 
acacaaaggt 
tgaggataaa 
agagtgggtg 
ttatggacaa 
gcatcccgca 
tcaaaccatc 
tggataggac 
tgccttcaaa 
gacat ttgtg 
atccta taaa 
gtgacccgtt 
atagtcgcat 
ttgatcatat 
ttattgcctc 
tattagcgag 
actatttgtc 
taactttagc 
caacagacat 
atttaatgac 
tttcaagtaa 
aaaaatccat 
tttctcgaat 
acaagggtag 
gttacataga 
cagctgcaac 
ctccttcttc 
ttatagatta 



tgttccaaaa 
tactgacaa t 
cattcagtta 
tgcacctgaa 
gacaataaaa 
agctgccgct 
atcaatggtc 
gttcactgca 
ag tgatagtt 
gactgatatt 
tacagtgtcc 
aagaaattgc 
gcgagtgccc 
tgcaatgatg 
aactgcatta 
cggtgatgtt 
caacat taaa 
tc tagaatgg 
ggccgagcaa 
atactatgga 
agaagagagt 
taaacct tea 
ccaagacagg 
ttctgttgca 
ttcttgtcaa 
tgtggatttg 
cccatttttg 
agcctgtaat 
gggttcaagt 
caggttgaac 
tgcacctgat 
aaacttgcct 
tgaaagtaca 
tataaataac 
cggacaagaa 
tgatgaaatc 
gaatgcatta 
ccatccgcca 
ttcttcttct 
tgtgtacaaa 



ggccgttcta 
aattgtgtca 
ttgaacaaga 
acaaaggaaa 
ggtctacaga 
gccactgctg 
atcaacagag 
ttgaagaaca 
gtcgggcttc 
gctgatattc 
acttttaata 
atctcttaca 
gcaaagcctc 
cgcgatagta 
acatttaaag 
tctgtcgtat 
atgtccccca 
tacttgtaca 
gtgaaaaggg 
acaaagaaca 
atctccttca 
aaagaaacta 
tggtggaata 
cgtttagtgt 
gggaaatggt 
tgtacagaag 
aggaagcacc 
caaaaaaatg 
agggcgaaga 
acgatgatgg 
tttgccaagt 
aaatcccgtg 
tataaaacac 
caagaaactg 
aagagctctt 
ataaagggta 
tctgacatta 
gtgaagcaag 
cagacgtcta 
ttcactgacg 



ggcatgttat 
gtctagacat 
aattgggtaa 
atagtgacga 
caaaaaaagg 
ctgcggtaga 
ctagaatggt 
gggaatcttt 
aagatccttc 
tacagaaaaa 
gattccactt 
gttataaccc 
tcttgcctgg 
aggatcccaa 
ttgggaaaat 
ccaaattact 
aagaagcaca 
ttgacatgtt 
cgatggtgga 
gtatggacgc 
ttaaaaaaat 
ctcgacgtgc 
aaaatgacca 
ctgtttctga 
ggctagatga 
aaatagtgga 
agaaggaaaa 
gtaacaaaat 
ggaatgcaaa 
cgtcgtaccg 
tatttggtaa 
gacatgctct 
ctggaaatgc 
ctaataaagg 
ttctgtgcaa 
tttgtacaag 
acagagacaa 
aattggtaga 
acaagaacaa 
aaacaacagg 



cctaggaaat 
cgatttcaag 
gaaaacagca 
agacatatac 
tgccaccgag 
aaaggctatg 
gctttctaag 
cttcagtgtt 
gcttacaaaa 
catcagtgtg 
gaactttatt 
agaaacgttc 
agagactata 
ttttaaaatg 
tactgtgctc 
atttggttta 
aagagtcaga 
gctccactcc 
ccaacaacat 
tgccatgtct 
aagatcacaa 
tatagacaca 
atattacggt 
aaatacaaac 
aaacgaatac 
ggaatgtgaa 
aataccaacg 
gagtattaac 
actacaggaa 
atttttgaat 
tgatgtatat 
aacatacaac 
atactttagt 
taacaataga 
cgcgtgtggt 
ttgcgatcaa 
gaagattaaa 
ttctttatcc 
tagatgcacc 
tgctccaaag 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 
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gtgggcttag tgtttaaaat gtgtgatatt cttgcatcct tagcaagcag gagagggatg 2460 
gaagatcgtc ccacagccaa ctatagaacc tccttacatt cagctactca aaataaaacc 2520 
aatttgaata aactattagt ttctgctatc aaggaaacag gagccactga aactgaagca 2580 
cagatattca acaagattat tggtagtgaa aagggactat caattctctg tcaacttgtg 2640 
gaaaggagga acaaagacaa taatgtcttc gactgatttg tctaa 2 685 



<210> 49 
<211> 886 
<212> PRT 
<213> SHRIMP 



<400> 49 



Met 


Thr 


Arg 


His 


Gly 


Val 


Leu 


Val 


Pro 


Lys 


Gly 


Arg 


Ser 


Arg 


His 


Val 


1 








5 










10 










15 




lie 


Leu 


Gly 


Asn 


Val 


Asp 


Tyr 


Thr 


Phe 


Cys 


Thr 


Thr 


Asp 


Asn 


Asn 


Cys 








20 










25 










30 






Val 


Ser 


Leu 


Asp 


He 


Asp 


Phe 


Lys 


Asp 


Asn 


He 


Thr 


Asp 


Gin 


Asn 


He 






35 










40 










45 








Gin 


Leu 


Leu 


Asn 


Lys 


Lys 


Leu 


Gly 


Lys 


Lys 


Thr 


Ala 


Lys 


Lys 


He 


Lys 




50 










55 










60 










Lys 


Glu 


Asp 


Ala 


Pro 


Glu 


Thr 


Lys 


Glu 


Asn 


Ser 


Asp 


Glu 


Asp 


He 


Tyr 


65 










70 










75 










80 


Ala 


Thr 


Lys 


Glu 


Phe 


Glu 


Gin 


Thr 


He 


Lys 


Gly 


Leu 


Gin 


Thr 


Lys 


Lys 










85 










90 










95 




Gly 


Ala 


Thr 


Glu 


Glu 


Asn 


Ala 


He 


Ala 


Ala 


Ala 


Ala 


Ala 


Ala 


Ala 


Thr 








100 










105 










110 






Ala 


Ala 


Ala 


Val 


Glu 


Lys 


Ala 


Met 


Leu 


Ser 


Glu 


Ser 


Glu 


Gly 


Lys 


Ser 






115 










120 










125 








Met 


Val 


He 


Asn 


Arg 


Ala 


Arg 


Met 


Val 


Leu 


Ser 


Lys 


Arg 


Asp 


Thr 


Ser 




130 










135 










140 










Gin 


Lys 


Gin 


Phe 


Thr 


Ala 


Leu 


Lys 


Asn 


Arg 


Glu 


Ser 


Phe 


Phe 


Ser 


Val 


145 










150 










155 










160 


Leu 


He 


Phe 


Glu 


Thr 


Gly 


Ser 


Val 


He 


Val 


Val 


Gly 


Leu 


Gin 


Asp 


Pro 










165 










170 










175 




Ser 


Leu 


Thr 


Lys 


Leu 


Cys 


Val 


He 


Lys 


Ala 


Thr 


Thr 


Asp 


He 


Ala 


Asp 








180 










185 










190 






He 


Leu 


Gin 


Lys 


Asn 


He 


Ser 


Val 


Ala 


Asn 


Val 


Ser 


He 


Val 


Asn 


Thr 






195 










200 










205 








Val 


Ser 


Thr 


Phe 


Asn 


Arg 


Phe 


His 


Leu 


Asn 


Phe 


He 


Arg 


Leu 


Gly 


Lys 




210 










215 










220 










Phe 


Phe 


Glu 


Arg 


Asn 


Cys 


He 


Ser 


Tyr 


Ser 


Tyr 


Asn 


Pro 


Glu 


Thr 


Phe 


225 










230 










235 










240 


Pro 


Gly 


Met 


Phe 


Phe 


Lys 


Leu 


Arg 


Val 


Pro 


Ala 


Lys 


Pro 


Leu 


Leu 


Pro 










245 










250 










255 




Gly 


Glu 


Thr 


He 


Gly 


Glu 


Tyr 


Tyr 


Thr 


Lys 


Val 


Ala 


Met 


Met 


Arg 


Asp 








260 










265 










270 






Ser 


Lys 


Asp 


Pro 


Asn 


Phe 


Lys 


Met 


Ser 


Asp 


Trp 


Leu 


Arg 


He 


Lys 


Thr 






275 










280 










285 








Ala 


Leu 


Thr 


Phe 


Lys 


Val 


Gly 


Lys 


He 


Thr 


Val 


Leu 


Gly 


Glu 


Gly 


Glu 




290 










295 










300 










Ser 


Gly 


Cys 


Gly 


Asp 


Val 


Ser 


Val 


Val 


Ser 


Lys 


Leu 


Leu 


Phe 


Gly 


Leu 


305 










310 










315 










320 


Phe 


His 


Tyr 


Phe 


Met 


Asp 


Asn 


Asn 


He 


Lys 


Met 


Ser 


Pro 


Lys 


Glu 


Ala 










325 










330 










335 




Gin 


Arg 


Val 


Arg 


Glu 


Lys 


Tyr 


Gly 


He 


Pro 


His 


Leu 


Glu 


Trp 


Tyr 


Leu 








340 










345 










350 






Tyr 


He 


Asp 


Met 


Leu 


Leu 


His 


Ser 


Tyr 


Pro 


Tyr 


Val 


Lys 


Pro 


Ser 


Ala 






355 










360 










365 








Glu 


Gin 


Val 


Lys 


Arg 


Ala 


Met 


Val 


Asp 


Gin 


Gin 


His 


Glu 


Val 


Asp 


Arg 




370 










375 










380 










Thr 


Tyr 


Tyr 


Gly 


Thr 


Lys 


Asn 


Ser 


Met 


Asp 


Ala 


Ala 


Met 


Ser 


Ala 


Asn 



wo 01/38351 

129 



385 390 



Leu 


Val 


Pro 


Ser 


Lys 
405 


Glu 


Glu 


Ser 


He 


Ser 
410 


Ser 


Gin 


Gin 


Leu 
420 


Phe 


Gly 


His 


Leu 


Cys 
425 


Lys 


Arg 


Arg 


Ala 
435 


He 


Asp 


Thr 


Leu 


Ser 
440 


Phe 


Asp 


Trp 


Trp 
450 


Asn 


Lys 


Asn 


Asp 


Gin 
455 


Tyr 


Tyr 


Gly 


Phe 


Ser 


Val 


Ala 


Arg 


Leu 


Val 


Ser 


Val 


Ser 


465 










470 










Met 


Asn 


Ser 


Arg 


He 
485 


Ser 


Cys 


Gin 


Gly 


Lys 
490 


Glu 


Tyr 


Lys 


Asp 
500 


Lys 


Leu 


Asp 


His 


He 
505 


Val 


He 


Val 


Glu 
515 


Glu 


Cys 


Glu 


Ser 


Lys 
520 


Gly 


Phe 


His 


Gin 
530 


Lys 


Glu 


Lys 


He 


Pro 
535 


Thr 


Pro 


Tyr 


Cys 


Asn 


Gin 


Lys 


Asn 


Gly 


Asn 


Lys 


Met 


Ser 


545 










550 










Tyr 


Leu 


Ser 


Gly 


Ser 
565 


Ser 


Arg 


Ala 


Lys 


Arg 
570 


Lys 


His 


Arg 


Val 
580 


Thr 


Leu 


Ala 


Arg 


Leu 
585 


Asn 


Arg 


Phe 


Leu 
595 


Asn 


Asn 


Tyr 


He 


Ser 
600 


Thr 


Asp 


Lys 


Leu 
610 


Phe 


Gly 


Asn 


Asp 


Val 
615 


Tyr 


Ser 


Leu 


Leu 


Lys 


Ser 


Arg 


Gly 


His 


Ala 


Leu 


Thr 


Tyr 


625 










630 










Ser 


Asn 


Glu 


Ser 


Thr 
645 


Tyr 


Lys 


Thr 


Pro 


Gly 
650 


Leu 


Phe 


Glu 


Lys 
660 


Ser 


He 


He 


Asn 


Asn 
665 


Gin 


Asn 


Asn 


Arg 
675 


Lys 


Arg 


Lys 


Phe 


Ser 
680 


Arg 


He 


Phe 


Leu 
690 


Cys 


Asn 


Ala 


Cys 


Gly 
695 


Val 


Asn 


Leu 


He 


He 


Lys 


Gly 


He 


Cys 


Thr 


Ser 


Cys 


Asp 


705 










710 










He 


Glu 


Asn 


Ala 


Leu 
725 


Ser 


Asp 


He 


Asn 


Arg 
730 


Phe 


Lys 


Ala 


Ala 
740 


Ala 


Thr 


His 


Pro 


Pro 
745 


Val 


Ser 


Leu 


Ser 
755 


Ser 


Ser 


Ser 


Ser 


Pro 
760 


Ser 


Ser 


Asn 


Lys 
770 


Asn 


Asn 


Arg 


Cys 


Thr 
775 


Pro 


Ser 


Asp 


Lys 


Phe 


Thr 


Asp 


Glu 


Thr 


Thr 


Gly 


Ala 


Pro 


785 










790 










Lys 


Met 


Cys 


Asp 


He 
805 


Leu 


Ala 


Ser 


Leu 


Ala 
810 


Asp 


Arg 


Pro 


Thr 
820 


Ala 


Asn 


Tyr 


Arg 


Thr 
825 


Ser 


Asn 


Lys 


Thr 
835 


Asn 


Leu 


Asn 


Lys 


Leu 
840 


Leu 


Val 


Gly 


Ala 
850 


Thr 


Glu 


Thr 


Glu 


Ala 
855 


Gin 


He 


Phe 


Glu 


Lys 


Gly 


Leu 


Ser 


He 


Leu 


Cys 


Gin 


Leu 
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395 










400 


Phe 


He 


Lys 


Lys 


He 


Arg 










415 




Pro 


Ser 


Lys 


Glu 


Thr 


Thr 








430 






Pro 


He 


Asn 


Gin 


Asp 


Arg 






445 








Lys 


Glu 


Arg 


Cys 


Asp 


Pro 




460 










Glu 


Asn 


Thr 


Asn 


Ser 


Met 


475 










480 


Trp 


Trp 


Leu 


Asp 


Glu 


Asn 










495 




Asp 


Leu 


Cys 


Thr 


Glu 


Glu 








510 






He 


Asp 


Phe 


Leu 


Arg 


Lys 






525 








Val 


Leu 


Leu 


Ala 


Arg 


Ala 




540 










He 


Asn 


Asn 


Asn 


Ser 


Asn 


555 










560 


Asn 


Ala 


Lys 


Leu 


Gin 


Glu 










575 




Thr 


Met 


Met 


Ala 


Ser 


Tyr 








590 






He 


Ala 


Pro 


Asp 


Phe 


Ala 






605 








Leu 


His 


Leu 


Met 


Thr 


Asn 




620 










Asn 


Glu 


Arg 


Ala 


Leu 


Ser 


635 










640 


Asn 


Ala 


Tyr 


Phe 


Ser 


Thr 










655 




Glu 


Thr 


Ala 


Asn 


Lys 


Gly 








670 






Gly 


Gin 


Glu 


Lys 


Ser 


Ser 






685 








Asn 


Lys 


Gly 


Ser 


Asp 


Glu 




700 










Gin 


Asn 


Ser 


Thr 


Ser 


Tyr 


715 










720 


Asp 


Lys 


Lys 


He 


Lys 


Arg 










735 




Lys 


Gin 


Glu 


Leu 


Val 


Asp 








750 






Ser 


Ser 


Ser 


Gin 


Thr 


Ser 






765 








Phe 


He 


Asp 


Tyr 


Val 


Tyr 




780 










Lys 


Val 


Gly 


Leu 


Val 


Phe 


795 










800 


Ser 


Arg 


Arg 


Gly 


Met 


Glu 










815 




Leu 


His 


Ser 


Ala 


Thr 


Gin 








830 






Ser 


Ala 


He 


Lys 


Glu 


Thr 






845 








Asn 


Lys 


He 


He 


Gly 


Ser 




860 










Val 


Glu 


Arg 


Arg 


Asn 


Lys 


875 










880 
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Asp Asn Asn Val Phe Asp 
885 



<210> 50 
<211> 1268 
<212> DNA 
<213> SHRIMP 



<400> 50 

atgtcttcga 

tgtgaggtgt 

acgtactctg 

tgtgccagga 

atta taggta 

tggtttatca 

acgatgcatc 

aaacggatgg 

caaattgata 

gctggcaccg 

agcgttagtg 

agagggttta 

cttaaggaac 

gtatctacgc 

tttagcgacc 

ttaacagggc 

aaaaggactg 

gctgtgtgta 

ttcttgtgtg 

gaaccgtctc 

tttggtgtaa 

tttgataa 



ctgatttgtc 
ttgatgtaca 
t tgacggaaa 
gtgtt ta tag 
tcatcaagaa 
atcacaatgg 
taagcatagg 
cctcatcatt 
aagttaaata 
aagaggacaa 
aaattgtgga 
tccagaaaaa 
t tttccctga 
gtgaagattt 
ctgattctgg 
ctgcgtgtaa 
gaagacctga 
aatcgacaaa 
cgcctgtagc 
tcattgtcga 
actggcccta 



taagaatgcc 
ctgtgagacg 
ggaggttaca 
tgcgtcttct 
aaatccaaaa 
aggagatttg 
gaaactggat 
gagagagcac 
tgaaaaggca 
taaagaagaa 
tgttttaacg 
aatatatcct 
agaaactaca 
atgcaatgaa 
agagtacgtg 
gcacaaaaca 
atttctaact 
tccaaacttg 
tgaacttgta 
gagagatata 
tgtgaatctc 



ttccatgact 
gacagagatt 
aaattctctt 
ctagaaagag 
ttggaggaag 
tttgtgaata 
aatgtggaca 
ctactgagga 
aagaaatgga 
gaagatgcga 
ggcacacatg 
ttgtcaagaa 
tctcctcagg 
agtatgaatg 
gctacttgtg 
tacaggtatg 
gatcctgtac 
agggcaattt 
aagacggcat 
aatcaagctg 
ctaaatcgca 



gggtggtatc 
gtggcgctgc 
gtaaccaaca 
cagccaatga 
aactccctga 
agcgagccgc 
ctcttgccca 
agttggactc 
tattggatat 
aaaaagagga 
accctatgcc 
acgaattgag 
ttttgagtag 
cagggagggc 
catgtcttta 
ttatagacta 
ttcattttaa 
atagtccaga 
taacttttag 
aaaatctacc 
ttcaagacca 



aaagacagat 
ttgcgagaac 
gtcgggaaga 
tcttggccac 
atcatttttg 
gtactacgac 
gggtttagat 
tatactttta 
aacacaggag 
tcaatctctt 
cctgagggct 
agaattagcc 
gcaacatgat 
agaatccatt 
ctcggaatat 
cgacaaatgg 
aaaggcagaa 
taataaaggt 
gggttcacac 
atccaattca 
gtacacgtaa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1268 



<210> 51 

<211> 413 

<212> PRT 

<213> SHRIMP 



<400> 51 



Met 


Ser 


Ser 


Thr 


Asp 


Leu 


Ser 


Lys 


Asn 


Ala 


Phe 


His 


Asp 


Trp 


Val 


Val 


1 








5 










10 










15 




Ser 


Lys 


Thr 


Asp 
20 


Cys 


Glu 


Val 


Phe 


Asp 
25 


Val 


His 


Cys 


Glu 


Thr 

30 


Asp 


Arg 


Asp 


Cys 


Gly 
35' 


Ala 


Ala 


Cys 


Glu 


Asn 
40 


Thr 


Tyr 


Ser 


Val 


Asp 
45 


Gly 


Lys 


Glu 


Val 


Thr 

50 


Lys 


Phe 


Ser 


Cys 


Asn 
55 


Gin 


Gin 


Ser 


Gly 


Arg 
60 


Cys 


Ala 


Arg 


Ser 


Val 


Tyr 


Ser 


Ala 


Ser 


Ser 


Leu 


Glu 


Arg 


Ala 


Ala 


Asn 


Asp 


Leu 


Gly 


His 


65 










70 










75 










80 


He 


He 


Gly 


He 


He 
85 


Lys 


Lys 


Asn 


Pro 


Lys 
90 


Leu 


Glu 


Glu 


Glu 


Leu 
95 


Pro 


Glu 


Ser 


Phe 


Leu 
100 


Trp 


Phe 


He 


Asn 


His 
105 


Asn 


Gly 


Gly 


Asp 


Leu 
110 


Phe 


Val 


Asn 


Lys 


Arg 
115 


Ala 


Ala 


Tyr 


Tyr 


Asp 
120 


Thr 


Met 


His 


Leu 


Ser 
125 


He 


Gly 


Lys 


Leu 


Asp 
130 


Asn 


Val 


Asp 


Thr 


Leu 
135 


Ala 


Gin 


Gly 


Leu 


Asp 
140 


Lys 


Arg 


Met 


Ala 


Ser 


Ser 


Leu 


Arg 


Glu 


His 


Leu 


Leu 


Arg 


Lys 


Leu 


Asp 


Ser 


He 


Leu 


Leu 


145 










150 










155 










160 


Gin 


He 


Asp 


Lys 


Val 
165 


Lys 


Tyr 


Glu 


Lys 


Ala 
170 


Lys 


Lys 


Trp 


He 


Leu 
175 


Asp 
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lie 


Thr 


Gin 


Glu 


Ala 


Gly 


Thr 


Glu 


Glu 


Asp 


Asn 


Lys 


Glu 


Glu 


Glu 


Asp 








180 










185 










190 






Ala 


Lys 


Lys 


Glu 


Asp 


Gin 


Ser 


Leu 


Ser 


Val 


Ser 


Glu 


He 


Val 


Asp 


Val 






195 










200 










205 








Leu 


Thr 


Gly 


Thr 


His 


Asp 


Pro 


Met 


Pro 


Leu 


Arg 


Arg 


Phe 


He 


Gin 


Lys 




210 










215 










220 










Lys 


lie 


Tyr 


Pro 


Leu 


Ser 


Arg 


Asn 


Glu 


Leu 


Arg 


Glu 


Leu 


Ala 


Leu 


Lys 


225 










230 










235 










240 


Glu 


Leu 


Phe 


Pro 


Glu 


Glu 


Thr 


Thr 


Ser 


Pro 


Gin 


Val 


Leu 


Ser 


Arg 


Gin 










245 










250 










255 




His 


Asp 


Val 


Ser 


Thr 


Arg 


Glu 


Asp 


Leu 


Cys 


Asn 


Glu 


Ser 


Met 


Asn 


Ala 








260 










265 










270 






Gly 


Arg 


Ala 


Glu 


Ser 


He 


Phe 


Ser 


Asp 


Pro 


Asp 


Ser 


Gly 


Glu 


Tyr 


Val 






275 










280 










285 








Ala 


Thr 


Cys 


Ala 


Cys 


Lys 


Glu 


Tyr 


Leu 


Thr 


Gly 


Pro 


Ala 


Cys 


Lys 


His 




290 










2 95 










300 










Lys 


Tyr 


Tyr 


Val 


He 


Asp 


Tyr 


Asp 


Lys 


Trp 


Lys 


Arg 


Thr 


Gly 


Arg 


Pro 


305 










310 










315 










320 


Glu 


Phe 


Leu 


Thr 


Asp 


Pro 


Val 


Leu 


His 


Phe 


Lys 


Lys 


Ala 


Glu 


Ala 


Val 










325 










330 










335 




Cys 


Lys 


Ser 


Thr 


Asn 


Pro 


Asn 


Leu 


Arg 


Ala 


He 


Tyr 


Ser 


Pro 


Asp 


Asn 








340 










345 










350 






Lys 


Gly 


Phe 


Leu 


Cys 


Ala 


Pro 


Val 


Ala 


Glu 


Leu 


Val 


Lys 


Thr 


Ala 


Leu 






355 










360 










365 








Thr 


Phe 


Arg 


Gly 


Ser 


His 


Glu 


Pro 


Ser 


Leu 


He 


Val 


Glu 


Arg 


Asp 


He 




370 










375 










380 










Asn 


Gin 


Ala 


Glu 


Asn 


Leu 


Pro 


Ser 


Asn 


Ser 


Phe 


Gly 


Val 


Asn 


Trp 


Pro 


385 










390 










395 










400 


Tyr 


Val 


Asn 


Leu 


Leu 


Asn 


Arg 


He 


Gin 


Asp 


Gin 


Tyr 


Thr 









405 410 



<210> 52 

<211> 1401 

<212> DNA 

<213> SHRIMP 



<400> 52 

atgtctgcat 

gtagctgact 

ctttctgttg 

atcactggag 

atgtctgttg 

gaaattagaa 

agcccgctcg 

attgtttgtg 

gattttcaaa 

aaggtcgact 

atggatataa 

gattctacac 

aaaacagcag 

cagctaacct 

gcactcatgt 

aaggaagatg 

tctggaaacc 

gggctctgtt 

cctaccattc 

cttagaaaat 

gaaagagcca 

gcagtagcaa 

agtgcgcaga 



ctttaatatt 
catttgagaa 
ctctctatgg 
aaagaacatc 
ctcgcgttgg 
ccatcatgga 
aggaagaaaa 
gatatta tac 
accccactgt 
ctgctagg tc 
agtcatcaat 
ctaacccttg 
ttaatgatat 
tta tgagcga 
ccgatactag 
catcaaatac 
ccagacagcc 
tcatgt t taa 
cccattcaac 
acggttcttc 
atgcagtgaa 
atctcaccgg 
gatcgtgttc 



ggacgaatac 
aatcaaggga 
agcacctcca 
tt taaacgac 
acaagaaaat 
aga tattaca 
taaggtgcat 
aatggacaca 
cat tgccaat 
tacaagattc 
aga ttgggca 
ttcagttgac 
atacgcttct 
gat tgaaaaa 
ggagagggca 
caattcaaaa 
cataaaattg 
ggtaatgcct 
acaataccat 
attcgaaaag 
aaaatttggt 
acctgtgtat 
aactaaattt 



ctcaagaaga 
gaaatccaat 
aaaccttcag 
aaatatctat 
aatcgcaaga 
gggagtttgt 
ataggcatca 
tctatttctt 
gtgactaagc 
gacgcttttg 
aatatggtag 
actattgtgt 
gttggaaaat 
ctgcgaaagg 
ttcgtagagt 
ttgtttgttc 
gtacgccgtt 
ccagaattca 
ggtctatatg 
tcctgggctc 
gtaaacgata 
gttctcattt 
ctccgtgaaa 



ctgcttcagc 
cacctgagga 
cttcggctgt 
cggataatgt 
gagccgacca 
ccggtgcgta 
tgaataacaa 
ccgaacctct 
ggatggagag 
ttaatggtgt 
aaaatgtgat 
ccagagacgc 
cttattgtcg 
ctgcagttgt 
tcctatttta 
agaataagct 
ctgctgagga 
tgaactgtat 
gtacatgttt 
att ttgagga 
cgaggataga 
tagatcttgt 
ttaaggaaaa 



cgttctggat 
ggctgcggct 
ggcctctatc 
cctattgaaa 
ggcagctgat 
caggcaatat 
aacgcctagc 
ttctctaaca 
cattttttca 
tgcgaataat 
caaattacca 
aagtgtagtt 
tcctgcaaca 
atgttttgag 
cgttagcttt 
atcttccatg 
aacactattc 
atttaacttc 
aacccctcta 
aattttaagc 
ttgtctagat 
acgtactcta 
ctatcttttg 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 
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tggaataggt ttgtgtcata a 

<210> 53 
<211> 459 
<212> PRT 
<213> SHRIMP 

<220> 

<221> VARIANT 

<222> (1) , . . (459) 

<223> Xaa = Any Amino Acid 



<400> 53 



Met 


Ser 


Ala 


Ser 


Leu 


He 


Leu 


Asp 


1 








5 








Ala 


Val 


Leu 


Asp 


Val 


Ala 


Asp 


Ser 








20 










Gin 


Ser 


Pro 


Glu 


Glu 


Ala 


Ala 


Ala 






35 










40 


I*ys 


Pro 


Ser 


Ala 


Ser 


Ala 


Val 


Ala 




50 










55 




Ser 


Leu 


Asn 


Asp 


Lys 


Tyr 


Leu 


Ser 


65 










70 






Val 


Ala 


Arg 


Val 


Gly 


Gin 


Glu 


Asn 










85 








Ala 


Asp 


Glu 


He 


Arg 


Thr 


He 


Met 








100 










Gly 


Ala 


Tyr 


Arg 


Gin 


Tyr 


Ser 


Pro 






115 










120 


lie 


Gly 


lie 


Met 


Asn 


Asn 


Lys 


Thr 




130 










135 




Thr 


Met 


Asp 


Thr 


Ser 


He 


Ser 


Ser 


145 










150 






Gin 


Asn 


Pro 


Thr 


Val 


He 


Ala 


Asn 










165 








Phe 


Ser 


Lys 


Val 


Asp 


Ser 


Ala 


Arg 








180 










Asn 


Gly 


Val 


Ala 


Asn 


Asn 


Met 


Asp 






195 










200 


Asn 


Met 


Val 


Glu 


Asn 


Val 


He 


Lys 




210 










215 




Cys 


Ser 


Val 


Asp 


Thr 


He 


Val 


Ser 


225 










230 






Ala 


Val 


Asn 


Asp 


He 


Tyr 


Ala 


Ser 










245 








Ala 


Thr 


Gin 


Leu 


Thr 


Phe 


Met 


Ser 








260 










Ala 


Val 


Val 


Cys 


Phe 


Glu 


Ala 


Leu 






275 










280 


Phe 


Val 


Glu 


Phe 


Leu 


Phe 


Tyr 


Val 




290 










295 




Ser 


Lys 


Leu 


Phe 


Val 


Gin 


Asn 


Lys 


305 










310 






Arg 


Gin 


Pro 


He 


Lys 


Leu 


Val 


Arg 










325 








Gly 


Leu 


Cys 


Phe 


Met 


Phe 


Lys 


Val 








340 










lie 


Phe 


Asn 


Phe 


Pro 


Thr 


He 


Pro 






355 










360 


Thr 


Cys 


Leu 


Thr 


Pro 


Leu 


Leu 


Arg 




370 










375 





1401 



Glu 


Tyr 


Leu 


Lys 


Lys 


Thr 


Ala 


Ser 




10 










15 




Phe 


Glu 


Lys 


He 


Lys 


Gly 


Glu 


He 


25 










30 






Leu 


Ser 


Val 


Tyr 


Gly 


Ala 


Pro 


Pro 










45 








Ser 


He 


He 


Thr 


Gly 


Glu 


Arg 


Thr 








60 










Asp 


Asn 


Val 


Leu 


Leu 


Lys 


Met 


Ser 






75 










80 


Asn 


Arg 


Lys 


Arg 


Ala 


Asp 


Gin 


Ala 




90 










95 




Glu 


Asp 


He 


Thr 


Gly 


Ser 


Leu 


Ser 


105 










110 






Leu 


Glu 


Glu 


Glu 


Asn 


Lys 


Val 


His 










125 








Pro 


Ser 


He 


Val 


Cys 


Gly 


Tyr 


Tyr 








140 










Glu 


Pro 


Leu 


Ser 


Leu 


Thr 


Asp 


Phe 






155 










160 


Val 


Thr 


Lys 


Arg 


Met 


Glu 


Ser 


He 




170 










175 




Ser 


Thr 


Arg 


Phe 


Asp 


Ala 


Phe 


Val 


185 










190 






He 


Lys 


Ser 


Ser 


He 


Asp 


Trp 


Ala 










205 








Leu 


Pro 


Asp 


Ser 


Thr 


Pro 


Asn 


Pro 








220 










Arg 


Asp 


Ala 


Ser 


Val 


Val 


Lys 


Thr 






235 










240 


Val 


Gly 


Lys 


Ser 


Tyr 


Cys 


Arg 


Pro 




250 










255 




Glu 


He 


Glu 


Lys 


Leu 


Arg 


Lys 


Ala 


265 










270 






Met 


Ser 


Asp 


Thr 


Arg 


Glu 


Arg 


Ala 










285 








Ser 


Phe 


Lys 


Glu 


Asp 


Asn 


Thr 


Asn 








300 










Leu 


Ser 


Ser 


Met 


Ser 


Gly 


Asn 


Pro 






315 










320 


Arg 


Ser 


Ala 


Glu 


Glu 


Thr 


Leu 


Phe 




330 










335 




Met 


Pro 


Pro 


Glu 


Phe 


Met 


Asn 


Cys 


345 










350 






His 


Ser 


Thr 


Gin 


Tyr 


His 


Gly 


Gly 










365 








Lys 


Tyr 


Gly 


Ser 


Ser 


Phe 


Glu 


Lys 



380 
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Ser 


Trp 


Ala 


His 


Phe 


Glu 


Glu 


lie 


Leu 


Ser 


Ala 


Asn 


Ala 


Val 


Lys 


Lys 


385 










390 










395 










400 


Phe 


Gly 


Val 


Asn 


Asp 
405 


Thr 


Arg 


He 


Asp 


Cys 
410 


Leu 


Asp 


Ala 


Val 


Ala 
415 


Asn 


Leu 


Thr 


Gly 


Pro 
420 


Val 


Tyr 


Val 


Leu 


He 
425 


Leu 


Asp 


Leu 


Val 


Arg 
430 


Thr 


Leu 


Ser 


Ala 


Gin 
435 


Arg 


Ser 


Cys 


Ser 


Thr 
440 


Lys 


Phe 


Leu 


Arg 


Glu 
445 


He 


Lys 


Glu 


Asn 


Tyr 
450 


Leu 


Leu 


Trp 


Asn 


Arg 
455 


Phe 


Val 


Ser 


Xaa 













<210> 54 
<211> 819 
<212> DNA 
<213> SHRIMP 



<400> 54 

atggctcaga 

gaaaggcaac 

agttcatcca 

gaacaacttc 

aaggtcaagc 

tt tgggatcg 

a tcactacta 

cacctctatc 

gaaaaactga 

tccgattacg 

gctattgaaa 

actgttgtgt 

tttatttgta 

gtaatgagtc 



catcaaagat 
aacctttcac 
catcttctaa 
tagaattcgt 
aagaagttga 
ctcaagacag 
aaattgttat 
attttaagga 
gcgtcattct 
ttgatcgcct 
gtaaggaatt 
ccaacattcc 
catttgctaa 
aggtaacaag 



gggaactaac 
aaagaaatct 
gaagaagagc 
gaaggatctg 
agaaaagtcc 
caagttcaag 
cgatgcaatt 
gatgcagaat 
taagagtaag 
taacaggatt 
gttgcagagt 
ttacaacatg 
tgatgatttg 
atacatttca 



<210> 55 
<211> 272 
<212> PRT 
<213> SHRIMP 



aagaggtgtt 
aaatcggaac 
aaatccaata 
gagcggagcg 
cctgaagcta 
agccttcttc 
aatcagcctg 
gttgtggaga 
aaatccccca 
ctggttggtg 
aacagcatga 
aagattaata 
tacacattct 
aaggattag 



ttgaggagga 
cacccagttt 
aacacaccaa 
accccactgt 
ttgctgaaat 
ccattgaacg 
tgcgcaagat 
aatataagga 
aagaatttga 
taattaagag 
tcatgaacag 
tttgtgtgtt 
ttagggatga 



agtggaggaa 
tgaagacaag 
gaccaaggaa 
tcctgatgag 
tttttcaatg 
cataaagagc 
gttggttgac 
cgatagcgac 
cctctccttt 
ggtggccgga 
tgttctgggt 
tttgactaac 
tgagaaattt 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

819 



<400> 55 



Met 


Ala 


Gin 


Thr 


Ser 


Lys 


Met 


Gly 


Thr 


Asn 


Lys 


Arg 


Cys 


Phe 


Glu 


Glu 


1 








5 










10 










15 




Glu 


Val 


Glu 


Glu 
20 


Glu 


Arg 


Gin 


Gin 


Pro 
25 


Phe 


Thr 


Lys 


Lys 


Ser 
30 


Lys 


Ser 


Glu 


Pro 


Pro 
35 


Ser 


Phe 


Glu 


Asp 


Lys 
40 


Ser 


Ser 


Ser 


Thr 


Ser 
45 


Ser 


Lys 


Lys 


Lys 


Ser 
50 


Lys 


Ser 


Asn 


Lys 


His 
55 


Thr 


Lys 


Thr 


Lys 


Glu 
60 


Glu 


Gin 


Leu 


Leu 


Glu 


Phe 


Val 


Lys 


Asp 


Leu 


Glu 


Arg 


Ser 


Asp 


Pro 


Thr 


Val 


Pro 


Asp 


Glu 


65 










70 










75 










80 


Lys 


Val 


Lys 


Gin 


Glu 

85 


Val 


Glu 


Glu 


Lys 


Ser 
90 


Pro 


Glu 


Ala 


He 


Ala 
95 


Glu 


He 


Phe 


Ser 


Met 
100 


Phe 


Gly 


He 


Ala 


Gin 
105 


Asp 


Ser 


Lys 


Phe 


Lys 
110 


Ser 


Leu 


Leu 


Pro 


He 
115 


Glu 


Arg 


He 


Lys 


Ser 
120 


He 


Thr 


Thr 


Lys 


He 
125 


Val 


He 


Asp 


Ala 


He 
130 


Asn 


Gin 


Pro 


Val 


Arg 
135 


Lys 


Met 


Leu 


Val 


Asp 
140 


His 


Leu 


Tyr 


His 


Phe 


Lys 


Glu 


Met 


Gin 


Asn 


Val 


Val 


Glu 


Lys 


Tyr 


Lys 


Asp 


Asp 


Ser 


Asp 


145 










150 










155 










160 


Glu 


Lys 


Leu 


Ser 


Val 
165 


He 


Leu 


Lys 


Ser 


Lys 
170 


Lys 


Ser 


Pro 


Lys 


Glu 
175 


Phe 
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Asp Leu Ser Phe Ser 
180 

Gly Val He Lys Arg 
195 

Gin Ser Asn Ser Met 
210 

Asn He Pro Tyr Asn 
225 

Phe He Cys Thr Phe 
245 

Asp Glu Lys Phe Val 
260 



Asp Tyr Val Asp Arg Leu 
185 

Val Ala Gly Ala He Glu 
200 

He Met Asn Ser Val Leu 
215 

Met Lys He Asn He Cys 
230 235 
Ala Asn Asp Asp Leu Tyr 
250 

Met Ser Gin Val Thr Arg 
265 



Asn Arg He Leu Val 
190 

Ser Lys Glu Leu Leu 
205 

Gly Thr Val Val Ser 
220 

Val Phe Leu Thr Asn 
240 

Thr Phe Phe Arg Asp 
255 

Tyr He Ser Lys Asp 
270 



<210> 56 
<211> 1398 
<2X2> DNA 
<213> SHRIMP 



<400> 56 

atggctgtag 

atgattttaa 

gacgtttata 

gtggtttctt 

caatttgtta 

cctaagaacg 

gattataaaa 

aacaagagga 

tatgtcttta 

tacgt tagaa 

agtaagctcg 

tacgtgatca 

tcttcttctg 

acggacccaa 

gcaaaagaag 

cttaaggctc 

gctgcaaaac 

acatctgctg 

gtgaagctat 

tatggtacat 

tgcaccgaca 

gatatggtgg 

tcttcttcgt 

ataaagaaaa 



gggattatct 
attttataaa 
gaagatgcaa 
cgactttaca 
gaggtttagt 
tgggggaata 
agggtaagaa 
ttatagggcc 
agagattgag 
aattttcccc 
tgtatcctca 
acaaagcata 
tgctcgtccc 
cacatagaat 
aaggttatgt 
ctctaattac 
gtgttttatt 
ctcatcaact 
cagatacggg 
ctt tagtaaa 
aggctgatgg 
ggtatcttaa 
ggttgacaac 
ttgtgtaa 



ttctatgtcc 
ctttgtcaca 
gcggccgtct 
agaagctaat 
ccccagaaaa 
tattttacct 
ggttggtctt 
aagggatctg 
caaagatccg 
tttcagagca 
ccctatacgg 
cggaatggat 
agactctaat 
cggcatctac 
agaaatgttg 
agagtattct 
ccctgcccct 
agacggagct 
tgactatgca 
tcttccagaa 
aatatatcaa 
cgccaaccct 
tggtaacaaa 



tctgtagggg 
atattatctt 
accaatggat 
ctcgttacca 
atgatggaaa 
tcagaaaaag 
ttaactgccc 
attagtagag 
ctcgtctact 
aaaaaattca 
tatggtactg 
aatgaggatc 
aatgatagat 
ggctttggag 
ttatgtaatt 
acaaatccaa 
ggttccgagc 
actggcgagc 
gttggatcac 
acaggatctc 
gtcaatcaaa 
gtggaagctg 
atatcttctg 



aagcaacgct 
taataatata 
attctggttg 
ctgaaaaaga 
aatatagatc 
aaacagacaa 
tgagtaatgg 
atgatgtgaa 
actcttctgc 
tgacatcaac 
cttttgtact 
tacacacttg 
taacagtaga 
ggtctgatga 
gtgacaacca 
ctgaaattca 
ctgtaaaatc 
acgatatttc 
ccattgtatt 
ctctggcatt 
agggaggga t 
catcactttc 
ttacatgtga 



cgtgggtttt 
tgccgtgaca 
tacaaccaat 
taaacctgtt 
ggacttgtct 
attgaaaagt 
tcatgacagt 
ggacaaaagt 
aacctctaaa 
acagttgggg 
acccacggga 
gaacccaccc 
atgtgctaaa 
taatagacgt 
caaggacttg 
agtagatgtt 
ttcccaagtg 
ccatgagccc 
caagccagtt 
gaactgcccc 
attatataga 
ttcctcggac 
aggagaaaaa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1398 



<210> 57 
<211> 463 
<212> PRT 
<213> SHRIMP 



<400> 57 



Met 


Ala 


Val 


Gly 


Asp 


Tyr 


Leu 


Ser 


Met 


Ser 


Ser 


Val 


Gly 


Glu 


Ala 


Thr 


1 








5 










10 










15 




Leu 


Val 


Gly 


Phe 


Met 


He 


Leu 


Asn 


Phe 


He 


Asn 


Phe 


Val 


Thr 


He 


Leu 








20 










25 










30 






Ser 


Leu 


He 


He 


Tyr 


Ala 


Val 


Thr 


Asp 


Val 


Tyr 


Arg 


Arg 


Cys 


Lys 


Arg 






35 










40 










45 








Pro 


Ser 


Thr 


Asn 


Gly 


Tyr 


Ser 


Gly 


Cys 


Thr 


Thr 


Asn 


Val 


Val 


Ser 


Ser 




50 










55 










60 










Thr 


Leu 


Gin 


Glu 


Ala 


Asn 


Leu 


Val 


Thr 


Thr 


Glu 


Lys 


Asp 


Lys 


Pro 


Val 


65 










70 










75 










80 
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Gin 


Phe 


Val 


Arg 


Gly 
85 


Leu 


Val 


Pro 


Arg 


Lys 
90 


Met 


Met 


Glu 


Lys 


Tyr 
95 


Arg 


Ser 


Asp 


Leu 


Ser 
100 


Pro 


Lys 


Asn 


Val 


Gly 
105 


Glu 


Tyr 


He 


Leu 


Pro 
110 


Ser 


Glu 


Lys 


Glu 


Thr 
115 


Asp 


Lys 


Leu 


Lys 


Ser 
120 


Asp 


Tyr 


Lys 


Lys 


Gly 
125 


Lys 


Lys 


Val 


Gly 


Leu 
130 


Leu 


Thr 


Ala 


Leu 


Ser 
135 


Asn 


Gly 


His 


Asp 


Ser 
140 


Asn 


Lys 


Arg 


He 


lie 


Gly 


Pro 


Arg 


Asp 


Leu 


He 


Ser 


Arg 


Asp 


Asp 


Val 


Lys 


Asp 


Lys 


Ser 


145 










150 










155 










160 


Tyr 


Val 


Phe 


Lys 


Arg 
165 


Leu 


Ser 


Lys 


Asp 


Pro 
170 


Leu 


Val 


Tyr 


Tyr 


Ser 
175 


Ser 


Ala 


Thr 


Ser 


Lys 
180 


Tyr 


Val 


Arg 


Lys 


Phe 
185 


Ser 


Pro 


Phe 


Arg 


Ala 
190 


Lys 


Lys 


Phe 


Met 


Thr 
195 


Ser 


Thr 


Gin 


Leu 


Gly 
200 


Ser 


Lys 


Leu 


Val 


Tyr 
205 


Pro 


His 


Pro 


lie 


Arg 
210 


Tyr 


Gly 


Thr 


Ala 


Phe 
215 


Val 


Leu 


Pro 


Thr 


Gly 
220 


Tyr 


Val 


He 


Asn 


Lys 


Ala 


Tyr 


Gly 


Met 


Asp 


Asn 


Glu 


Asp 


Leu 


His 


Thr 


Trp 


Asn 


Pro 


Pro 


225 










230 










235 










240 


Ser 


Ser 


Ser 


Val 


Leu 
245 


Val 


Pro 


Asp 


Ser 


Asn 
250 


Asn 


Asp 


Arg 


Leu 


Thr 
255 


Val 


Glu 


Cys 


Ala 


Lys 
260 


Thr 


Asp 


Pro 


Thr 


He 
265 


Gly 


He 


Tyr 


Gly 


Phe 
270 


Gly 


Gly 


Ser 


Asp 


Asp 
275 


Asn 


Arg 


Arg 


Ala 


Lys 
280 


Glu 


Glu 


Gly 


Tyr 


Val 
285 


Glu 


Met 


Leu 


Leu 


Cys 
290 


Asn 


Cys 


Asp 


Asn 


His 
295 


Lys 


Asp 


Leu 


Leu 


Lys 
300 


Ala 


Pro 


Leu 


He 


Thr 


Glu 


Tyr 


Ser 


Thr 


Asn 


Pro 


Thr 


Glu 


He 


Gin 


Val 


Asp 


Val 


Ala 


Ala 


305 










310 










315 










320 


Lys 


Arg 


Val 


Leu 


Phe 

325 


Pro 


Ala 


Pro 


Gly 


Ser 
330 


Glu 


Pro 


Val 


Lys 


Ser 
335 


Ser 


Gin 


Val 


Thr 


Ser 
34 0 


Ala 


Ala 


His 


Gin 


Leu 
345 


Asp 


Gly 


Ala 


Thr 


Gly 
350 


Glu 


His 


Asp 


He 


Ser 
355 


His 


Glu 


Pro 


Val 


Lys 
360 


Leu 


Ser 


Asp 


Thr 


Gly 
365 


Asp 


Tyr 


Ala 


Val 


Gly 
370 


Ser 


Pro 


He 


Val 


Phe 
375 


Lys 


Pro 


Val 


Tyr 


Gly 
380 


Thr 


Ser 


Leu 


Val 


Asn 


Leu 


Pro 


Glu 


Thr 


Gly 


Ser 


Pro 


Leu 


Ala 


Leu 


Asn 


Cys 


Pro 


Cys 


Thr 


385 










390 










395 










400 


Asp 


Lys 


Ala 


Asp 


Gly 
405 


He 


Tyr 


Gin 


Val 


Asn 
410 


Gin 


Lys 


Gly 


Gly 


He 
415 


Leu 


Tyr 


Arg 


Asp 


Met 
420 


Val 


Gly 


Tyr 


Leu 


Asn 

425 


Ala 


Asn 


Pro 


Val 


Glu 
430 


Ala 


Ala 


Ser 


Leu 


Ser 
435 


Ser 


Ser 


Asp 


Ser 


Ser 
440 


Ser 


Trp 


Leu 


Thr 


Thr 
445 


Gly 


Asn 


Lys 


He 


Ser 


Ser 


Val 


Thr 


Cys 


Glu 


Gly 


Glu 


Lys 


He 


Lys 


Lys 


He 


Val 





450 455 460 



<210> 58 
<211> 2361 
<212> DNA 
<213> SHRIMP 

<400> 58 

atggagtaca tggaagaagg agacatcgct 
ctggacgaaa actctgcttg tgtagttaat 
atggacgccg aggaggcaca gtacgcacag 
atccgtctgg cccactgctc cgaatccaac 
ggtcttttca tcaacaacat ctttgacaac 



gaaaggcgct cagaaggtgt cgactatatt 50 
gtgaagagta tccgtaacag gctcggtgcc 120 
gacatttccg cccaacttgt cacccatatt 180 
aagattaagg ataccattgc cagtattgcg 240 
aattcaacaa agaacaaact taaaacgtat 300 
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aatcaattca 
gatcctctga 
aatttggaca 
attgacgata 
aaggaagcca 
aaccataata 
atgagatact 
tgcctcgaca 
gctgattcag 
gtagacagag 
gtatctggag 
gcgattgaaa 
tttttgagct 
ctaaaagcct 
atttccggag 
gacatctctc 
aaggctagca 
tatatcccgt 
cacaacatca 
ttacacattc 
gatcaaactg 
aagcaggggt 
actacttcat 
tacgtcatct 
ccccattcta 
cccgacactc 
aatctttctg 
gacaaactca 
gctatcatta 
tcttcctcag 
ctccaaagta 
agtgcagtaa 
tctgaagaac 
agcctgatcg 
atcagagcta 



aggcagagtc 
gtatgctttc 
aatctttggg 
ttgtccttct 
tccgccagga 
ggttcaattt 
ctcaggacga 
gggatgatcc 
ataacgcttg 
ccgttattgc 
acagcttcaa 
ttttgttctg 
atgttaatgc 
accgttcatg 
gatggaagaa 
aagtcatcaa 
acgttacatc 
ttggaatact 
gccgtccagc 
taccttcaat 
tcaatgtttt 
cggggaagat 
tccctagtag 
tctcttcaac 
atgttctgaa 
gattccactt 
ctctaactac 
ccaccagagc 
agaacgtttc 
cgcctaactt 
ttcgcaaatt 
acaatgggaa 
tgttcgagct 
cctcatctga 
atgcttctta 



acaaaacaag 
tagcttcatg 
tgtgctcttt 
ggaaatgtgc 
acaacccatg 
tggaagcgac 
acgcgcagtc 
agctcatact 
ggtttcttcc 
tcacttttac 
gcagtttgtc 
tga tactgaa 
catggttaac 
ggtggtggat 
gaactacccc 
tea tct tgca 
tggcctggat 
tgaaaa taaa 
aagagaacaa 
taagggctgt 
tgataatttt 
tcttggactg 
tgaagcggaa 
acccatcaac 
ctcccctatc 
tctgctcatg 
aagtcagcta 
gttcttcaat 
tggaagcggc 
tttccaaatc 
cattggagaa 
cagaggagta 
gctctacaaa 
attcttgagc 

g 



tctagcgttc 
ggttctgatc 
gaggtgcttc 
ccatccaagt 
gaagcaatgt 
a taaagtcag 
gttgtgcctc 
ctttcctctt 
ctgtttgcag 
gtgtacacaa 
tataccgtat 
aattcgtctg 
gtatccgtgc 
caagcatccg 
tcacctgacc 
tcaccttcta 
agtatcaggt 
gcagggtatg 
tccaacggaa 
gaggcacttg 
gtcgcatctc 
ctcactagca 
tacaagaaga 
gacgaactag 
agct tgagaa 
atgtggcagc 
gaattgttgc 
atcgacagga 
ttcctagatg 
ttcagtggtg 
tcta tgcagc 
gaaaattatg 
ttgatcatcg 
aactacgtca 



tcaatatctt 
cagcaaagag 
aaaattacaa 
gcgccgcctg 
tgttgttttt 
catacgcctc 
tgaggagtat 
tcggggatac 
ccgtctctag 
tgctcagccg 
ttgttcgtat 
tagaatgtga 
tgggttctac 
tcgcaccggt 
acatcaagag 
gaatggttaa 
ctgttcgtca 
gtgtaataaa 
ggaactttaa 
gggcacaaaa 
atatggatat 
tgattgacag 
gaatccatga 
tcaattctcg 
atattgaccc 
ggccaaatat 
ttagcaagaa 
tcaatttcca 
ggagtaaaac 
ctgaatgcac 
atgtacaaaa 
acggactcaa 
aggaggatat 
acgccatgga 



tggctctcta 
tggaggggaa 
cccttgcaag 
caccggtctc 
caaatgtatc 
tgaaacatgc 
cctcctcggc 
tatcgagtat 
aatgcctatg 
acataggcga 
gatttactct 
tggaaagcac 
gtttaacgta 
tctagacatt 
ggtggcgtac 
aggtaacaac 
agcagaaaaa 
cattgccaag 
ttgcaacgct 
ggggagcgca 
tgccatgaaa 
gcaaggtctg 
tttcacaaga 
ctgtattctt 
agaatcagtc 
cgatgaacct 
ccaaaaatgg 
gatggcagac 
tgcctcttct 
tgcaaagcag 
ggaatggagt 
tgctcagttc 
gcggccatcc 
tgaacttctt 



360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2351 



<210> 59 
<211> 778 
<212> PRT 
<213> SHRIMP 



<400> 59 



Met 


Glu 


Tyr 


Met 


Glu 


Glu 


Gly 


Asp 


He 


Ala 


Glu 


Arg 


Arg 


Ser 


Glu 


Gly 


1 








5 










10 










15 




Val 


Asp 


Tyr 


He 


Leu 


Asp 


Glu 


Asn 


Ser 


Ala 


Cys 


Val 


Val 


Asn 


Val 


Lys 








20 










25 










30 






Ser 


He 


Arg 


Asn 


Arg 


Leu 


Gly 


Ala 


Met 


Asp 


Ala 


Glu 


Glu 


Ala 


Gin 


Tyr 






35 










40 










45 








Ala 


Gin 


Asp 


He 


Ser 


Ala 


Gin 


Leu 


Val 


Thr 


His 


He 


He 


Arg 


Leu 


Ala 




50 










55 










60 










His 


Cys 


Ser 


Glu 


Ser 


Asn 


Lys 


He 


Lys 


Asp 


Thr 


He 


Ala 


Ser 


He 


Ala 


65 










70 










75 










80 


Gly 


Leu 


Phe 


He 


Asn 


Asn 


He 


Phe 


Asp 


Asn 


Asn 


Ser 


Thr 


Lys 


Asn 


Lys 










85 










90 










95 




Leu 


Lys 


Thr 


Tyr 


Asn 


Gin 


Phe 


Lys 


Ala 


Glu 


Ser 


Gin 


Asn 


Lys 


Ser 


Ser 








100 










105 










110 






Val 


Leu 


Asn 


He 


Phe 


Gly 


Ser 


Leu 


Asp 


Pro 


Leu 


Ser 


Met 


Leu 


Ser 


Ser 






115 










120 










125 








Phe 


Met 


Gly 


Ser 


Asp 


Pro 


Ala 


Lys 


Ser 


Gly 


Gly 


Glu 


Asn 


Leu 


Asp 


Lys 




130 










135 










140 










Ser 


Leu 


Gly 


Val 


Leu 


Phe 


Glu 


Val 


Leu 


Gin 


Asn 


Tyr 


Asn 


Pro 


Cys 


Lys 
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145 150 155 160 



lie 


Asp 


Asp 


He 


Val 


Leu 


Leu 


Glu 


Met 


Cys 


Pro 


Ser 


Lys 


Cys 


Ala 


Ala 










165 










170 










175 




Cys 


Thr 


Gly 


Leu 


Lys 


Glu 


Ala 


He 


Arg 


Gin 


Glu 


Gin 


Pro 


Met 


Glu 


Ala 








180 










185 










190 






Met 


Leu 


Leu 


Phe 


Phe 


Lys 


Cys 


He 


Asn 


His 


Asn 


Arg 


Phe 


Asn 


Phe 


Gly 






195 










200 










205 








Ser 


Asp 


He 


Lys 


Ser 


Ala 


Tyr 


Ala 


Ser 


Glu 


Thr 


Cys 


Met 


Arg 


Tyr 


Ser 




210 










215 










220 










Gin 


Asp 


Glu 


Arg 


Ala 


Val 


Val 


Val 


Pro 


Leu 


Arg 


Ser 


He 


Leu 


Leu 


Gly 


225 










230 










235 










240 


Cys 


Leu 


Asp 


Arg 


Asp 


Asp 


Pro 


Ala 


His 


Thr 


Leu 


Ser 


Ser 


Phe 


Gly 


Asp 










245 










250 










255 




Thr 


He 


Glu 


Tyr 


Ala 


Asp 


Ser 


Asp 


Asn 


Ala 


Trp 


Val 


Ser 


Ser 


Leu 


Phe 








260 










265 










270 






Ala 


Ala 


Val 


Ser 


Arg 


Met 


Pro 


Met 


Val 


Asp 


Arg 


Ala 


Val 


He 


Ala 


His 






275 










280 










285 








Phe 


Tyr 


Val 


Tyr 


Thr 


Met 


Leu 


Ser 


Arg 


His 


Arg 


Arg 


Val 


Ser 


Gly 


Asp 




290 










295 










300 










Ser 


Phe 


Lys 


Gin 


Phe 


Val 


Tyr 


Thr 


Val 


Phe 


Val 


Arg 


Met 


He 


Tyr 


Ser 


305 










310 










315 










320 


TV 1 

Ala 


He 


Glu 


He 


Leu 


Phe 


Cys 


Asp 


Thr 


Glu 


Asn 


Ser 


Ser 


Val 


Glu 


Cys 










325 










330 










335 




Asp 


Gly 


Lys 


His 


Phe 


Leu 


Ser 


Tyr 


Val 


Asn 


Ala 


Met 


Val 


Asn 


Val 


Ser 








340 










345 










350 






Val 


Leu 


Gly 


Ser 


Thr 


Phe 


Asn 


Val 


Leu 


Lys 


Ala 


Tyr 


Arg 


Ser 


Trp 


Val 






355 










360 










365 








Val 


Asp 


Gin 


Ala 


Ser 


Val 


Ala 


Pro 


Val 


Leu 


Asp 


He 


He 


Ser 


Gly 


Gly 




370 










375 










380 










Trp 


Lys 


Lys 


Asn 


Tyr 


Pro 


Ser 


Pro 


Asp 


His 


He 


Lys 


Arg 


Val 


Ala 


Tyr 


385 










390 










395 










400 


Asp 


He 


Ser 


Gin 


Val 


He 


Asn 


His 


Leu 


Asp 


Ser 


Arg 


Met 


Val 


Lys 


Gly 










405 










410 










415 




Asn 


Asn 


Lys 


Asn 


Val 


Thr 


Ser 


Gly 


Leu 


Asp 


Ser 


He 


Arg 


Ser 


Val 


Arg 








420 










425 










430 






Gin 


Ala 


Glu 


Lys 


Tyr 


He 


Pro 


Phe 


Gly 


He 


Asn 


Lys 


Ala 


Gly 


Tyr 


Gly 






435 










440 










445 








Val 


He 


Asn 


He 


Ala 


Lys 


His 


Asn 


He 


Ser 


Arg 


Pro 


Ala 


Arg 


Glu 


Gin 




450 










455 










460 










Ser 


Asn 


Gly 


Arg 


Asn 


Phe 


Asn 


Cys 


Asn 


Ala 


Leu 


His 


He 


Leu 


Pro 


Ser 


465 










470 










475 










480 


lie 


Lys 


Gly 


Cys 


Glu 


Ala 


Leu 


Gly 


Ala 


Gin 


Lys 


Gly 


Ser 


Ala 


Asp 


Gin 










485 










490 










495 




Thr 


Val 


Asn 


Val 


Phe 


Asp 


Asn 


Phe 


Val 


Ala 


Ser 


His 


Met 


Asp 


He 


Ala 








500 










505 










510 






Met 


Lys 


Lys 


Gin 


Gly 


Ser 


Gly 


Lys 


He 


Leu 


Gly 


Leu 


Leu 


Thr 


Ser 


Met 






ate 
bl5 










520 










525 








He 


Asp 


Arg 


Gin 


Gly 


Leu 


Thr 


Thr 


Ser 


Phe 


Pro 


Ser 


Ser 


Glu 


Ala 


Glu 




530 










535 










540 










Tyr 


Lys 


Lys 


Arg 


He 


His 


Asp 


Phe 


Thr 


Arg 


Tyr 


Val 


He 


Phe 


Ser 


Ser 


545 










550 










555 










560 


Thr 


Pro 


He 


Asn 


Asp 


Glu 


Leu 


Val 


Asn 


Ser 


Arg 


Cys 


He 


Leu 


Pro 


His 










565 










570 










575 




Ser 


Asn 


Val 


Leu 


Asn 


Ser 


Pro 


He 


Ser 


Leu 


Arg 


Asn 


He 


Asp 


Pro 


Glu 








580 










585 










590 






Ser 


Val 


Pro 


Asp 


Thr 


Arg 


Phe 


His 


Phe 


Leu 


Leu 


Met 


Met 


Trp 


Gin 


Arg 






595 










600 










605 




Pro 


Asn 


He 


Asp 


Glu 


Pro 


Asn 


Leu 


Ser 


Ala 


Leu 


Thr 


Thr 


Ser 


Gin 


Leu 




610 










615 










620 










Glu 


Leu 


Leu 


Leu 


Ser 


Lys 


Asn 


Gin 


Lys 


Trp 


Asp 


Lys 


Leu 


Thr 


Thr 


Arg 


625 










630 










635 










640 
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Ala Phe Phe Asn 
lie Lys 
Ser Ser 



Glu Cys 
690 
Ser Met 
705 

Asn Arg 

Phe Glu 

Ser Leu 

Asp Glu 
770 



Asn Val 
660 
Ser Ser 
675 

Thr Ala 



Gin His 

Gly Val 

Leu Leu 
740 
He Ala 
755 

Leu Leu 



He 
645 
Ser 

Ser 

Lys 

Val 

Glu 
725 
Tyr 

Ser 

He 



Asp Arg He Asn 
Gly 
Ala 
Gin 



Gin 
710 
Asn 

Lys 

Ser 

Arg 



Ser Gly Phe 
665 

Pro Asn Phe 
680 

Leu Gin Ser 
695 

Lys Glu Trp 



Tyr Asp Gin 

Leu He He 
745 

Glu Phe Leu 
760 

Ala Asn Ala 
775 



Phe Gin 
650 

Leu Asp 

Phe Gin 

He Arg 

Ser Ser 
715 
Ala Gin 
730 

Glu Glu 
Ser Asn 
Ser 



Met Ala 

Gly Ser 

He Phe 
685 
Lys Phe 
700 

Ala Val 

Phe Ser 

Asp Met 

Tyr Val 
765 



Asp Ala He 

655 
Lys Thr Ala 
670 

Ser Gly Ala 

He Gly Glu 

Asn Asn Gly 
720 

Glu Glu Leu 

735 

Arg Pro Ser 
750 

Asn Ala Met 



<210> 60 
<211> 18234 
<212> DNA 
<213> SHRIMP 



<400> 60 

atggaccagt 

caacaacaac 

tacagtaaca 

aggtcaatcg 

cgtgtccaat 

ggtagttc tg 

cccaactaca 

tcaactagtc 

agtaatagca 

tgt t t ttcca 

aacgccgcca 

gcagcatgga 

aagtacag tc 

cagtcacaat 

gata tcctaa 

gatactgctc 

gcagccgcag 

aatagcaggc 

tcaaaata tt 

gctaaga tag 

agaagagcag 

acttccatta 

gagtatgaat 

ggaagtttat 

gctatcgaaa 

ccttcctctc 

cacgtttcag 

ttgag ta tea 

aggga taggc 

tccacccctt 

tctggt tgcc 

attagtggtc 

tcggctgcag 

tcacccatga 

atcaagaagg 



acccagaagt 
aagcagcaac 
ctgtttctgc 
cagcttcgtg 
tccatccact 
ttgtgttttt 
ttgcagtgcc 
tataccaatg 
aacatagtca 
aacaaaaaaa 
aggcattttc 
acgcgggaag 
ccgatagaaa 
caggtacttc 
aacgtcacaa 
tttctagggt 
attcagatca 
gtgagggtat 
ctgtcgcaga 
ggacaaagat 
atacttcagt 
aaaataccgg 
tttctctcca 
tgagctgtgc 
ggattatacg 
tttcagatca 
atcctaagga 
ttactagatc 
ttcttgttaa 
ctcaaagaag 
tacctatcat 
ttgtgggtga 
ctattgccgc 
tgaaaaaatt 
ctatgcgtag 



gagggatact 
aacaacagct 
agaaacttta 
ctggatcaat 
tagttccacg 
gaagcctaga 
tactcttcgt 
ttcaatgttc 
attcggtgat 
tgtagaaaat 
tcatgcagtt 
tgctgcaaac 
atacaaagca 
ttcttcctca 
gggaacgtcc 
ttttaccgag 
cctatccgcg 
tctagattca 
gtacttgatg 
tgcaacagat 
cgatgatctc 
cgatattaat 
catcacacag 
tttcggtgta 
taaaacagat 
atatctcttg 
tatcgttttc 
taaaaatgat 
cgataccgtg 
agtccccacc 
caggggaccc 
ttggtatatc 
tggccaccag 
ctcaaagaag 
gaatgcagat 



cctcagacag 
gccgctgctg 
tccgccattt 
aaccttaacc 
accacatacg 
gccctcccca 
gctgcgtcag 
aattcgtgga 
cgtgtcatca 
cttttaaagg 
caacagaaat 
ctcgaaaaat 
ggaggtcttt 
gttgaacata 
cttgatttag 
tttaaggaac 
tcggacccta 
gtgccaaata 
gcagacagag 
tttgaagcat 
aaagaatcac 
tcggtcacaa 
cttttcgcgc 
cagttcccct 
cctgatactg 
ttggtaggaa 
ggtagacaag 
aagaatgaaa 
ttgagagacg 
gccgcaggtg 
caggtagtga 
tctttggggg 
agagctctcg 
ggaggaaagt 
cgttctgcta 



aacaggaaca 
ctgccgctgc 
ctgaagatgg 
ctgatgaaaa 
attcagaaaa 
ccgggggcac 
aaattatcga 
atct tattcc 
agagatcaat 
aattgcgcag 
cagcagtaaa 
tagtagatt t 
ttagtgcgtc 
ct tctaatga 
attcagcaac 
aggctagagc 
ttttctctat 
tcggtatgt t 
acgagt ctgc 
tgagaggaga 
tagcggat tc 
atattcccac 
aggcattt tt 
tctcagatga 
gaaaggt ttc 
atttccaggt 
ttacgcctaa 
cgtctactat 
ccacgcaaaa 
aaccaaaaaa 
ccagggaaag 
tctattatgc 
cttcagccga 
atacagagga 
ggattt tggc 



ggcggcggcg 
tcctacgcag 
aaaattggag 
aatggctcaa 
tgtgaaccct 
gtgtttagcc 
ttctattgca 
catttttatg 
gatccgtaat 
gagaaaggtt 
tactgccctt 
ttgcaaattg 
agcaacagct 
tttcttactc 
aaacacattt 
agccgtagat 
tg ttcgacac 
ggctccacga 
agatatcgct 
taataataag 
tattgaaaaa 
agatacagag 
ggaaactatg 
aggttttgcg 
agaaatggac 
ttctcccttc 
tacgcctatt 
tattaatttc 
cgtctcaact 
gccaatgctc 
tgatgatatg 
catgggttca 
gtcaattaat 
ggaaaagcga 
tt tgttgggg 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 
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caaactgatg cccagtatgg gtatgtcgaa 
tcgaatgccg ctattcgtgc aaaggcaaag 
gcagttagga aacaattgga cggaaaatgt 
gagagatatc ttagagactc atttttcagg 
atgtttgatc aaggttttga tatgggaaga 
gccaggaacg cttggcaaca gtacgcagaa 
cgcgtattca acattgaagg tcttttcagt 
ccagaacagg ggcgtaaaaa gactgtaggt 
gccaataaga tcattaacgg catcacagag 
ggaatctctg atattattgg ttcagttagc 
tctcgcgtca aagctcttaa aacattgtcc 
attccagtta gtcgagcagt aaagtgtgcc 
tgtgtggaca ttccttcagt cattattgca 
caactttgcg gaggagggat gaacctcgct 
gcaggtaaag aacatacagg aaaggaatcc 
cttttgcgtt atatttggtt caacgcagcg 
atgcccctca acacaatgtc cgagggcact 
atcagaggca tggttaataa ttacaacagt 
tccgataggt acaattgcgg tagtggaaat 
aagactcaag gtgaattgtt aactgtcctt 
accaacaagg gaggcgtggg tgcaaccccc 
ccaattgcta atgcggatgt agttaagaac 
atcactgaga ccatcaactt cttttcattt 
attgaagagt atcttaggag atataggcta 
aattttgtgt atccaaatat tgcagctatt 
gcattgtcca gtaatctcga tactgatcgt 
cctttgatcg taaaggctag caagggttat 
aaaacaacaa gaacagcagc agaacaagct 
gccaatacta ttcctcaatt agtaaacagg 
actgccatca atgtcgttaa agctttcaca 
cacttggggg ttatgggtaa cgcgattaat 
aacgttgcga acaagcgttt aacagttaac 
gggttaaccg tatctctcat tcttgctcac 
ctcgtgcaag atttcgctaa gcttttactg 
tctaggtccc agaagagttt cttccccatt 
ttcaaaattg atagggaaat gttcgataat 
agacagctat ctaagaatgc taccgccgcc 
gctaggattg ccaagtcagg tgaaatttat 
cccggaactt cttcctccgc tttaaccttg 
gtatggtcta tgggagctct cccccatttc 
atttctcacg atcaaatgtt ccgcctatct 
cttaacagcg attgcaaacc agaagaatgg 
aaattctttg gcctttcttc ggtgagcgat 
actcttttgg cttcacctgc agagatttgc 
agtaacgata ttgtgcataa tattggatcg 
cttcaagttg gtgcctcagc agtagaaaaa 
actgacgtat attcccttgt ttctgctctg 
tcatctttgt cgtctgaggg acatctcacc 
cccgctcttc tcggtaccgc taaaactaca 
gctcctcttt cagccgtgtt ggactttaga 
tacgaagttg cagctgtttg tagtgtcatg 
agaaagatta tgggaatggt ggaacaagaa 
cgcattgcta gtcttgttag tacagttgct 
acagtaaacg attacaaaaa ttatctcatt 
tcaagattgg gaggaatatc ccctactagt 
tatgatggtg ttgtttcttc ctcatcatca 
agattctggt cgggagtatt ttctcagtgc 
gctggtcatg gaggtagtaa catgttccaa 
gtcaacacct acgcagctct gagctctggc 
gcgactcagg aaagaaagaa taggattgca 
accgatgtgg tggggggaga tactttggat 



cataattcta ccctagattc gttctggtct 2160 
gaggatgctc ttagccgtgc agaaatcttg 2220 
tcctcttcaa gggatgaata ttccatggta 2280 
tcagttaata ggtcaggagg aggatatgaa 2340 
tttgccgact ttttgagtga caactctgct 2400 
gtaatgagag gactttctaa gcatgagaaa 2460 
gctctaaatt ctttcaagtt cccccttgtt 2520 
ggaaggcata ggcttaacaa tttgaaggcg 2580 
atgactctcc agtcagccat cgatggtact 2640 
gatgggtggg gaaatactac agctcagccg 2700 
aatttcagcg gaaacggaaa tgttgtgtca 2760 
gcgggaagtc gaggtgggga aactctcaag 2820 
aacctaatct ctgataagag aattctagat 2880 
cacgaaatca caaactttat cgagacgatt 2940 
gttttcttgt ctcctagatt gtctgtcatt 3000 
gttgtttccc ttacggatag caacattaaa 3060 
ggcgatgaca tttataggga ctatttggcc 3120 
agtctctcct ctatttcagt caaggctatt 3180 
acttcaacca gtaataagaa tgtgaccatt 3240 
caacagactg ctaatgcctt gtccgccttt 3300 
gatgctgcca atatggccaa cgttatttcc 3360 
accaacgtgg ttgtttcagg gctagatagg 3420 
ttgtctcaga tcaaaacaat gaacgagaac 3480 
ggagaaggac tagataagaa agaattggat 3540 
gttaagcgag aattgggggt aagtggttcc 3600 
ccaattacta tcgacctgaa cactgaacag 3660 
gcctctaacc gctacgctaa attattcaac 3720 
cagatggagc agtataatgc acaaatggct 3780 
ttgaccatcc ctggatccat cacggcagac 3840 
gaaaatggag aatttagtaa cgcagaaaca 3900 
gaaatgcaac ctcttttcac ggacggattc 3960 
gtgggttcag ttagtaagct gattcagaat 4020 
tcaaaggcta gcccctatgt ctttaagcct 4080 
gcagtcactg cagagacttc tctggttgtc 4140 
cctccttcag tattttcttc aggtggtctt 4200 
atgaagacag attatgtagt ggaagtaatt 4260 
atcgaaaggt gcaatgattc cgattcagct 4320 
aacaaggatg ttgcatcaac cactgcagct 4380 
ttcgccaata atctccagaa ccctgcaaag 4440 
gatatggccg tggtaccaaa acttcatggc 4500 
acatattatc agggtattca taagatggaa 4560 
gataattctc ttcctggaaa tagggctagc 4 620 
aacaaccgtt cattcaattt ggcattggat 4680 
gatctggtga cgagggaaat ggtaaagacc 4740 
aattccaaca cggacgcgct tcaaaagagc 4800 
tacgacgagt ctactctttc tactaaagaa 4860 
gctaagagca aatctcctct atcttcttcc 4920 
tctaaggaga ttgataggac atggaacacc 4 980 
tcctattctg tttctgaaga cgctctcaat 5040 
aggaatgttg tggatgctac taaatctctg 5100 
agtaaagagg aggatgtgcg ttcttcgagt 5160 
tcgcccgtta tgcaagacat tggcattgac 5220 
acccccaaac agcatcgcag attcttacag 5280 
agaaaagttg catcgaatcc ccttctctct 5340 
ggtaacaccg attacaacct taaagctgta 5400 
atgaccccct cgtccatgtc tgtctctgac 54 60 
ctagagactg gcccttcaat gtttgccgat 5520 
atcactgcac ctaaacttta cggttctaga 558 0 
gttgagcggc tgagagactc tatttcttct 5640 
aagagcatcg aagctctgga aacgttcgta 5700 
caattgcgta aggcccagaa catgtacaac 5760 
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aaactgtcag 
aaaatcatga 
cttagctctc 
actcaatttg 
aatatccacg 
tataatgtgt 
ctcggcggaa 
ttctcccagg 
ttggttaaac 
tcttcaatta 
tcactttatt 
actattaaca 
aggacttcct 
gaggatgtac 
agggcagctg 
atggaatctc 
gaaatgatgg 
gatgatattg 
tcagatgaat 
tctccttatt 
aagcctctta 
ctctatgaag 
agagatcttc 
gaacgttcaa 
ggtttcctat 
cgaggcaaga 
ctcctcttct 
ttcagcaagg 
agagttgtac 
atcaccccag 
aagagggaaa 
ttgcccgacg 
aggtctctta 
atctttgacg 
ttgggtgata 
ctaatgaaag 
aaagacaccc 
gccggctatc 
gttgatcgtc 
attgtttttg 
atgacctccg 
tctcagcgtc 
cacactgaag 
tctgattaca 
aattgtagcc 
acatcacgta 
tccattgttg 
gaggtaggaa 
gtcgattttt 
tttgttccca 
gtcaaggatg 
ggaaaattca 
aacctaactg 
acaccaacag 
ctcaaaacac 
gcaattgtag 
tccaagtcaa 
tccacatctg 
aatctgagaa 
catattagta 
ttaacccccg 



atattacttc 
agaatgtgac 
tcttgcacga 
ctctggcgag 
agaaggaaac 
cggcagctgc 
ccatgctcca 
tttctgataa 
tggtaggatt 
gacgtgtaca 
cagcatacag 
aacttttcgg 
tggtacatga 
ttgaggctcc 
cttctaggcg 
tttacgggga 
atatagagta 
cctttattga 
attccgattc 
caactacatc 
ctgccatcta 
aagaacaaga 
atgagagtga 
tgtcagaaac 
ctcattctaa 
aattcaggcc 
ctgacgaatc 
ggagaaaatg 
gtgcttttgt 
tgactagtga 
gggcgcgtct 
aatacgtaga 
tcaaggcagc 
ctctcaccac 
cacttttgtt 
acctaactga 
aaatgatggc 
taggtgtccc 
ttttggctac 
cacgttcatt 
ctttgacaga 
taaaagtaaa 
cagtcatggg 
cacagcgcag 
aattgccacg 
aatttgcagg 
atgctctttc 
agtggcagga 
gttcaaagaa 
atggaactaa 
acgcttcaat 
ttacagcctc 
aaaaactata 
aaatggccac 
cattcggggg 
ttagagccaa 
ggggaatcaa 
aatttgaaag 
ccattgagaa 
atgatggact 
tgtatgcgga 



caactctatc 
gagcaagaaa 
actcgctggc 
ccatgttatc 
attcagtcaa 
catgtgtcag 
acaaggcctg 
tattaaaatg 
ttgcggtaca 
gaacgaagag 
agatttgagg 
acatcaaaac 
cgctgtttcc 
ttccacggtt 
agtgaagagg 
agaagtcttg 
tgg tgaggga 
ttccgaagaa 
atccgatgag 
ttattcgtat 
cgggtgcagg 
aaggaggaga 
tgatgacgac 
tat tgcaacc 
tattcttaac 
ttctgcgtac 
taccact tct 
caagcgccga 
gcccaccaga 
aaatacagta 
gatcgaagaa 
gggtagagca 
tgcttatgtt 
aaca tcaaac 
cgccaaacaa 
aatctctcct 
cga tgccaaa 
tctcagaact 
caaaaataag 
caacga tact 
catgtacagc 
gagtactctt 
tctcgtatac 
agagatgcaa 
aaaggatatt 
ttatggagga 
t tgcccctct 
tatgggagag 
caatatttct 
catggctgat 
gatcaggctt 
agccatgggt 
cgactcgttc 
tgctat tatc 
agatattgcc 
ggaaatgcgt 
ctcgttctca 
gatactagaa 
tagactcgcc 
ttccgagaca 
tgacaccagc 



tatagtgatt 
atgaccgcta 
ctggtacaca 
aaggcaaagt 
ttgatggccg 
cgtctagttg 
ttcgtttcat 
aacgaattga 
gtttcagatg 
gataagaaat 
aagaagactg 
tttatgtctt 
ggccctaggc 
cacaaatcgt 
gctggactca 
aacgatatga 
ggattcatga 
gagtctgaat 
tatgattttg 
gatgctctag 
ggagaaggtg 
cgtcgctcat 
gatgactact 
agaagagctg 
cgtccggcta 
gatagattct 
tcttcctctt 
acaagcgagg 
gtaacaatga 
ggattctatg 
tacaaaattg 
tctaaacaag 
gcccgcaccc 
gccactctag 
ctagaggcaa 
tcacttttca 
cagatcgttt 
cttgcttcat 
aaccatctcg 
actttccatg 
gctttcacca 
ttagattcta 
cctacagcgt 
tcactcgctc 
ggagacactg 
gaaaggggag 
gacaatcggc 
gaaatcttct 
ctggaaaatg 
attggcatga 
aggcgcgcag 
aatttgtacg 
gttctgctcc 
aaccgtatga 
acctataaga 
aactctatta 
tcccgcagtg 
acatctgctg 
gaacactaca 
cgcgcagtcg 
gagagaggag 



tcggaaacat 
gacaacaatc 
aacaacaacc 
atgtcactaa 
tggccggtgt 
cttccgacgt 
tccttcttaa 
acgatgaaac 
cgctaggatc 
tagacaggag 
aactatacag 
acgaatcttc 
caagaaggta 
tcatggtttc 
gggctctggc 
ggtcttcggc 
tgatgattag 
catctactga 
atgatgataa 
accgtctgaa 
aagacgatga 
caaagatggg 
ttgatgacga 
gccgtattca 
aagcacgcgc 
ttatggagga 
ccgattctcc 
accaatgtgc 
tcaatggtcg 
aaaattacca 
ttaagggtgc 
tgtctcccag 
aagaaagtaa 
ttaatgaccc 
ttaccgagag 
catcattcgg 
caggaggaaa 
gtattaaggg 
aatggatgac 
cactcgaaga 
acctcgtcgg 
ttttcaacac 
tcatcaacca 
ttaacattct 
ctggcctgtt 
gtttgtctct 
tcaagggagc 
acaagaggag 
ccgtaggtcc 
ccgatatcat 
aagagggcgc 
gaggtattga 
aagattcaga 
agtcgaggaa 
acttcccatc 
gcactatcgt 
gttctacttt 
ttctttcaaa 
acaaactaaa 
ttgccgtaat 
catctgttag 



tgactgcgct 
agatactatt 
tcaattggct 
tgacctcaat 
tgccgattac 
aacaatgttc 
caacgtactt 
aaagtctctt 
taggcacgtg 
ttttgttaca 
ggaaactgat 
catgctcaag 
cagcaccctt 
ttacccagag 
tgataacagg 
ggtctcttcc 
tgatgatgag 
tttctcctca 
taatggccag 
ttctgccgct 
ggaaaatgac 
gaagatcctt 
atttgatggc 
atatggtcca 
tttcttgaca 
tgacgattcc 
attctcctcc 
ctttgttaag 
agttagcatg 
gaaggccaac 
ttcagctacc 
ggaactgagg 
cttgaatatt 
atctactctt 
gaggaatagg 
tgatgcaagt 
tttcaagtct 
cactaataca 
cacagccgct 
tacactaaaa 
atcggaacat 
taggatggct 
tgaaatgccc 
taggggagtt 
gacctttatt 
atacagaatg 
agtctctcta 
caacgatctg 
tattgctagg 
ttctagaaca 
tggcgcagca 
taccgttgtg 
ctcgttcaat 
acataaggct 
ctcctctgaa 
gatggacatt 
ggccaagatt 
tacaaaggcc 
acaattcagc 
tgctgaatct 
tgaactattg 



5820 
5880 
5940 
6000 
6060 
6120 
6180 
6240 
6300 
6360 
6420 
6480 
6540 
6600 
6660 
6720 
6780 
6840 
6900 
6960 
7020 
7080 
7140 
7200 
7260 
7320 
7380 
7440 
7500 
7560 
7620 
7680 
7740 
7800 
7860 
7920 
7980 
8040 
8100 
8160 
8220 
8280 
8340 
8400 
8460 
8520 
8580 
8640 
8700 
8760 
8820 
8880 
8940 
9000 
9060 
9120 
9180 
9240 
9300 
9360 
9420 
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acagacaata 
aaacgtcacg 
ctgctcgttg 
aaattgactg 
ataaagcaca 
atcaaggaca 
agaaatgcaa 
gacatttttg 
ggcgaagcac 
tctaacacaa 
ttctcaatgg 
caaaacaaac 
tcttcttcat 
caaacacaag 
gcgtctcata 
gttgttgcta 
ggaattctta 
ccatcatcat 
aaccagtacg 
aaggaagcag 
gaaatggtca 
caagctaaat 
gaggtttaca 
atggtttctt 
tcatcctctt 
attgatgaaa 
gctgccgttc 
gctattgaca 
tcagacacta 
gacaaagaac 
gacaagataa 
aacaacaaaa 
gatgtagcaa 
ctcttccaga 
gacatgagca 
cagattctcg 
ttttcaaagg 
tcttcagttg 
ataatgatgg 
aatactgtta 
aggctcacag 
aacgacgaga 
ttgatcaagg 
cagcagcttc 
agggttgttt 
cgtatttctt 
cccactagcg 
gactcttctt 
cctggactct 
tctcctcatt 
aacgctgcca 
gagtccaaca 
atcacccctg 
gttatccttc 
tccatggtta 
atcgtggaga 
gaccagaggc 
tcttcatctc 
ggcttcccag 
actatggaag 
caaacttata 



ctctcctcaa 
tgaccgccgc 
cctcagccga 
aaggaaatag 
aagaaacagg 
atgatcagcg 
ccatccgttc 
cacttgatga 
tcgcagagaa 
catcctcctc 
actattcaaa 
aatctaatgt 
ctgccaatat 
aatctaacga 
atcaagctgc 
taccaaatgc 
cccgcttctc 
catcatcatc 
ccaactcaat 
aaaggaaaat 
ccgtgtctaa 
cccttaagaa 
ccccagattc 
cttcctcggg 
cctctaatgt 
atactgaaga 
gcatccctct 
agttgaacga 
aacaggcttc 
aggtgatttc 
agatggctat 
ctaggagctt 
acttcaagga 
gcctgaaagc 
atggaagaga 
agggaggaat 
ctttggaggc 
tggagaggag 
agcgcaccga 
ataatgtaga 
agattaggaa 
ttgtagagtc 
tagatagagt 
ccaaattgac 
acgatccttc 
ctgtctataa 
atgccgactt 
ccacttcctc 
gccagcaatg 
cattcaattt 
cttcatccag 
atggttacgt 
ttaatacact 
ctacttctga 
acaggacaac 
ctcttgctcg 
gacacttcaa 
aattgaacac 
tgtttacagg 
gagctcgcaa 
tcgatatggg 



atttattgtt 
aattgaaggt 
catcaaccgt 
taactttgta 
tatctggctg 
tagagtagct 
tcgtctacag 
tgccgatatt 
ggcatctcct 
ttcttctccc 
caaccttgcc 
caattcttct 
tgataatgtt 
tttgagtaac 
aactgttggc 
aaacaaggct 
agcccctgaa 
aaatggagat 
cttatcatct 
cgatctggcc 
gtgcccctct 
aagtgccctc 
atctattgca 
atcaggatct 
gacagacta t 
aggggcagaa 
tctagtatca 
atactaccaa 
ctcatgggcc 
aaagattaat 
tagtgttttg 
tattgaaact 
gattatcgat 
attcgctgct 
cacaaaatca 
caaactcatc 
tttccctaga 
acagcgagaa 
gagtctgagg 
agaaactgtt 
tcaagcacaa 
tcctctcacc 
aggaagtatc 
agctacagaa 
atacacctgc 
ttcaaagaac 
gcaactgatg 
caagaaaatg 
cgcaatgatg 
cgagaacaaa 
tgacggtcct 
caaagatttt 
tctgggaggt 
gttgtttaac 
cgacaagaat 
cacgtctccc 
ctccatcacc 
tacttgtgat 
aagaaagcag 
ggacaagaac 
cacaaaattc 



caaaatgaac 
tcatcccaac 
atgtccgccc 
ccaatgacta 
aagaccgatg 
aaaaccatcc 
tctctttgct 
aagaatatgg 
tctagctcgg 
agttcttccc 
aaaactatcc 
gacgcatcat 
gagcacaaaa 
gtactttctg 
atcttcaacg 
aataa taatg 
aatgtttcct 
gataataagg 
attgaaaacg 
atccaggcag 
gctaaccaga 
gaattattgg 
gccgt ttctc 
gctccctctt 
ttcaactatg 
actgtccaga 
tatgctccat 
ctgattgatg 
atcaaggaaa 
aacttgcaac 
gacaacaaaa 
acaaagagcc 
tacgaaaaca 
gataactcgg 
gacagtaaat 
aatgagggac 
caa tccaacg 
cgtct tcagg 
aagaggt tgg 
aa tagtggta 
a tcgctgaaa 
ctctctttgg 
caacagcagc 
cagagaaagg 
ttcctgcaac 
aagggtcctc 
accatcactg 
ctgtacgaaa 
atcaccaacg 
agatccctga 
gccgtgagac 
ggattcactc 
actttcagtg 
tgcccaggag 
gtggctgacg 
aacgccgagc 
gacgccatca 
caaaatctgg 
ggcgaaagaa 
agtggcatcc 
atggttgctc 



tgaaaaacat 
tgcacgaaaa 
aaaataacct 
acgaccaagg 
aagaaaataa 
tcgcaattgt 
ttggaaaata 
acaaactcat 
ccatttcttc 
catcatcatc 
cctacatgcc 
cctcatcacc 
aagtggctct 
ttaccaccaa 
gaaggcaaca 
ccaccgtttc 
ccaccagcat 
taccagtaac 
catcagaatt 
cttccaccac 
ctgccatcac 
aaagagttat 
ttcccgttaa 
catcatcctc 
cttacggaaa 
aaaacatggt 
tcagcgaaat 
ccatcaaaac 
cggacaagga 
aaaacttttc 
ggaacgaatt 
gtatcgaggc 
catctgaaaa 
ggacagttta 
ttgtcgacat 
aaaatactgt 
gtgcttcaga 
ctgtcgagat 
cagattcggc 
tggttaacat 
gcactgcact 
gagcacgagt 
aacagcaaca 
aacaacaata 
ctcttcacga 
tcagtaacac 
acttgtctag 
atgttccctc 
tccacgaagc 
agcagctgac 
acgatgtact 
accgccaaaa 
gaaatgttgc 
ttgaaaatga 
cacccaagtc 
acctttactt 
tttctggtat 
taaacattga 
ggattgtgca 
cttcatgtac 
caggctctct 



tgaagaggca 
aatgttgagc 
cgaatgtaag 
tggtacattc 
caccagttct 
agaggacaat 
tgccatgaac 
tgaaaaacta 
ttcctcatca 
atcttcctct 
tatcgtcttc 
atcatcatct 
ccaacaactt 
gcacagattt 
cgcagagaca 
cgcaggccaa 
gcaattgcct 
tgtcaggctt 
taaggacttg 
agaaacaaag 
tgccatctct 
caaggcagtc 
tggagattct 
ctcctcatcc 
attgaagaac 
cgaacaagat 
gatgagacgt 
aaagatcgtg 
gcttgatatg 
aaacgaatca 
agagcttcag 
tggtggagga 
tgacaacaat 
cacccccact 
gtacaacaaa 
aaaggtagac 
gcctgtatct 
gtttatggca 
tgctcagtgg 
caagagtgaa 
aaactccatc 
cgaccagctc 
gcagcaacag 
cgctgcagat 
gacaattaaa 
acgtggtgtt 
gtctgtactc 
atcaattgtt 
cactcatact 
agaaatgttg 
aacaatgtta 
ggttgcgtgt 
acctaatact 
caaatttaga 
atctgcaagc 
ccccttcaag 
gagcggcgaa 
tcaaactact 
cactgaaaac 
aaaggaccgt 
tctgaatgct 



9480 

9540 

9600 

9660 

9720 

9780 

9840 

9900 

9960 

10020 

10080 

10140 

10200 

10260 

10320 

10380 

10440 

10500 

10560 

10620 

10680 

10740 

10800 

10860 

10920 

10980 

11040 

11100 

11160 

11220 

11280 

11340 

11400 

11460 

11520 

11580 

11640 

11700 

11760 

11820 

11880 

11940 

12000 

12060 

12120 

12180 

12240 

12300 

12360 

12420 

12480 

12540 

12600 

12660 

12720 

12780 

12840 

12900 

12960 

13020 

13080 
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aacaaggaag 
ggcactgatg 
gccgcctcct 
ggatctgtat 
ttgttgagca 
tgttcctcta 
cagcgcctgt 
atggatttga 
aaaatgacat 
tctaacgttc 
aggaataggc 
gtacaggcca 
cgccagaatg 
agtggccgtc 
tcccgcatca 
tcttccttct 
ggaaatgttg 
aacaagagat 
gccatccagc 
gacttgtgcg 
ttgaagaggt 
accaacaatg 
atgaacaaca 
ctcgttaggc 
aggagcatgc 
cctcttctca 
gacagactac 
ccagccttcc 
ca tacaaagt 
ttcaagaaca 
tcccacaagc 
aacgagcatc 
agagccacac 
tccaagatcg 
gaaatgagaa 
aactctggca 
agaagtgtac 
ggaactgtcg 
tctgaactga 
gctcaaactg 
a tcgtgaacg 
accaaggaaa 
ggagtagaat 
atctcagaag 
ccggaaaatg 
tcaaagggag 
tctacctatt 
gccatctcta 
aatatttctt 
gctatcaaga 
acacccgcat 
ct tctttcaa 
tcagttgatc 
gacacattca 
acagctctct 
c t taccacct 
gtgggtgcag 
tctaccgtca 
tctgttacac 
gttaacaaca 
gtatccatgc 



aaactctccg 
ttcatgtggc 
cattccctga 
ctgccatctc 
ccaacacgtc 
attacttggg 
tccctaacgg 
tgggaagtac 
ctactaacaa 
ctgctatcaa 
tttctcccct 
tgaccgatac 
gcagaaatac 
catcactctc 
atagagacgc 
ttaactcttt 
cagcagcgga 
tcgagatgct 
gcaatacaat 
aaattgaaaa 
ctatgctgat 
ttcccctttc 
acagtgccaa 
acatgatgtt 
tgacccaacc 
ctgaaatctg 
taacttcagc 
ctgccgccct 
catctttcat 
ttgaacgatt 
aaaattgtaa 
tggtatcctc 
cctatcaggc 
attctagagt 
tgaactccaa 
tgacaacggc 
taggaatatt 
cttcagccat 
tccagaatgc 
tcgtgtttgc 
aactggccgg 
tgattgatag 
caatggatta 
ccatgaagaa 
taccattcgc 
atacattcga 
cttcctcttc 
agcaagtaac 
ctgactatac 
gagaattcag 
tagtcgaccc 
ccaaatcaaa 
ttagccatct 
tgtacagaaa 
tgcaacagga 
tccttaatca 
gaggaggaaa 
caacatcaaa 
aagatcctgt 
acaatactgg 
catctgaatc 



tctaaacagg 
aggcgcaaac 
cggagataag 
tgctcaaaaa 
tatccagaat 
gtctgctgaa 
agatgacgaa 
aaagcgctac 
aaaggggtca 
cactgccttt 
ttatgaagac 
tgctgtggat 
tcttctttct 
ttattcttcg 
tagcctcctg 
ccttcgttct 
aattgttctg 
tggaggagga 
gtcctctatt 
taaaattgag 
gactccagga 
tcttctcatg 
tgtaatggaa 
ggatagtgga 
aagagctctc 
cctctacaac 
ctatctagta 
cacctgcgct 
ggacaacatc 
tgagaaattc 
ctgccccttc 
tttcgcattc 
caacaagctt 
aacaggatct 
ctttaatgga 
aggagtcaac 
ggaatgtcat 
gccagccgtc 
tctgccaagg 
taatgttttg 
caccatcgct 
actaggctct 
tgaagatagc 
tgccgtctat 
ctcatgcgcc 
agaaaagaac 
taacactact 
tgatgctgaa 
taactgtcca 
cagaattgtt 
taaaggcgcg 
actagaaaac 
aaaacccatt 
tgtccaccct 
acaaatggac 
tcccaacact 
cccaatgggc 
ccccgtcaca 
tatggtagtg 
aattgatgtc 
atctggcctc 



ctttcagaca 
tctgcatgga 
gaatcggcta 
tctgccagtc 
ctggtcaagg 
tctacgttcg 
aacgttacaa 
aatgacgctt 
aattgtgaaa 
ggagcctttg 
agcaccaaat 
gctttgtccg 
ctccctactt 
gacatgaaat 
tcaatgggag 
tcttccatcc 
gggactattc 
aaaatggtcg 
ctccagatga 
actaggcaac 
ggcgtgggag 
tcacgtgtcg 
gctgtggata 
aagtcccccg 
accgcccgcg 
acccgcgaca 
aaacaagcta 
tctcacctca 
aaattgcaca 
ttgggaagat 
catctccacc 
gcccgcccag 
attagtgaca 
tccctcctta 
acatttgaac 
ctcgacgtta 
cgccagcacg 
ttccaggcaa 
aacagataca 
gaacaactta 
gaatctgtac 
gacgacctct 
gaaacaacat 
cacacactaa 
agcggccctc 
gccgaacaag 
cttcgtaagc 
ttcaaggata 
ccaaatacta 
tccttcttaa 
ttacacgaga 
tttttccaat 
aattgtagca 
attcttatta 
cccgatactg 
gcttcaatgg 
ttgtatcttt 
gacaccacag 
aaccccttca 
ttgaatgata 
gtcaccaata 



ttaacaacgt 
gaattggtga 
tgaaaaagat 
acattaacga 
aagctttccc 
ccactcaact 
ccgtttcaaa 
tcaacaacat 
atctactgaa 
aagaagcttc 
attcgtccaa 
ctgtttctac 
ctattacttc 
ctaacctcat 
acagccaagt 
ctgtcaccac 
tcgacaagac 
ccgggagtcc 
acgaaaatga 
tgagggatgc 
ccatttct tc 
atgcatccag 
gtttcaatac 
ttcccatggc 
ctctactgag 
ctcaaccaga 
aaagattcga 
tgctttcttc 
tgactgatac 
atggggacga 
acacttttac 
aagtctccat 
aacattacgt 
agaagg ttag 
catcaagact 
ttgtcaaacc 
tgtgcaccgc 
ccgatggaaa 
tccaaaagag 
tcgccgatct 
cagaaagcgt 
tcaaatctaa 
ccaacaatgg 
tttccggcaa 
tcgcctttga 
gtgcagcagc 
att tggctcg 
tcctcaacga 
accaaaatgc 
ccattcttcg 
aagtagccat 
acggtctcag 
acaatgtcaa 
tggccctccc 
ccattgaaag 
cgaacggtgc 
cttcccacat 
aaaacgtcaa 
aggattctgc 
agtcgtgcaa 
ctggatgctc 



gagacattat 
ggtggtgaga 
gcttcttcta 
tcctactgct 
agaccctgtt 
cgcctaccgc 
tatctgccct 
ctttggctct 
atctgccatg 
atcttctgtc 
ccaacttgct 
tgttgtcggt 
tatcgcaacc 
caagacaatt 
agctgcaggt 
cagccaggat 
tgtggagatc 
tgaagctcgt 
actcgctcgt 
tttccaggat 
tggagcaagt 
cggtcttctg 
tactcctttg 
caaggaaatt 
cgaatcttcc 

^^gggcagtc 

cggagttgac 
catggattcc 
tcaatgcttc 
atacgccatg 
tccctcagat 
ggaagaaatt 
gatgaacatg 
cgaatggact 
cgccctctcc 
aaataatgca 
cgacgccaag 
cggtaacgaa 
cacaatgaac 
tggaaaggtt 
atatgaaaac 
taataatgga 
tcccgtcctc 
ggcagctcgc 
tttccttctg 
tgccgtatcc 
agttttcgaa 
tatcgaacgt 
ctttgctcta 
taagaacatt 
ctatttgacc 
taattcgtcc 
gaatattgaa 
agaaaatttc 
cagacgctcc 
aagagccgct 
tcttcacgag 
ctatcattcc 
taggttgatc 
ctacttgcaa 
ttcttcttct 



13140 
13200 
13260 
13320 
13380 
13440 
13500 
13560 
13620 
13680 
13740 
13800 
13860 
13920 
13980 
14040 
14100 
14160 
14220 
14280 
14340 
14400 
14460 
14520 
14580 
14640 
14700 
14760 
14820 
14880 
14940 
15000 
15060 
15120 
15180 
15240 
15300 
15360 
15420 
15480 
15540 
15600 
15660 
15720 
15780 
15840 
15900 
15960 
16020 
16080 
16140 
16200 
16260 
16320 
16380 
16440 
16500 
16560 
16620 
16680 
16740 
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tcctcatctt 
ccccgtgtca 
tccagggcag 
attgctgccg 
tctcagaaca 
gtcgatccag 
ctatacactg 
aaaatgcacg 
gctaacggag 
atcgcaggca 
cgcctctcat 
caaacattca 
cccagaagcc 
gaggaatgtc 
ttgaccggtg 
atgactggtc 
ctgggaggtg 
aacgggaaaa 
gattcagact 
agggataaca 
ggggatttgg 
aacaatcgcg 
attctagaaa 
gtatctgctc 
ctgtcatcag 



cgtctgatac 
caccagccgt 
atattgtcct 
tctccaaatt 
acatcaacaa 
tcactgggga 
cagcaaatgg 
ccaaggcttt 
tgcctcttac 
tctctgaagg 
ccgacatgga 
agaaagcttc 
aagaaacaaa 
gtagggtgat 
gctacggagg 
t tattggtgc 
tttctgctgc 
cactaccaga 
ctattagcaa 
acaatttggg 
tttattctgc 
atcttaccaa 
a tgttgcaga 
aggaaatggc 
acgacaatgt 



cttcaagtac 
tctctgttct 
cgaaaacatg 
caaggggctg 
caacagcaac 
tattgttatc 
aggaacatca 
ccccaccttc 
atctgaggga 
atcaattggc 
acctgtcatg 
agataaactc 
cgctattatt 
ccaagacgag 
agatgctgct 
aatctccgcg 
taacgcagct 
acatggaatc 
cctatacaac 
aagaattatg 
tagacaactg 
gcgtcaagct 
taccatttac 
ttgtgcgtct 
tcaatccctg 



gtcaggagag 
gatgcttcct 
aacgtgagat 
accaaggaag 
aacaacaacg 
accaatgcca 
tcattcaaat 
tttattggta 
atttccctca 
acgggggctc 
aagggatgga 
actcatcttt 
aacaagatgc 
gctgctctgc 
ctggccatgg 
ccagttagag 
atccgcaagc 
gtacacaaat 
actgatcttc 
caatctttgg 
acggacctta 
atccttaaaa 
cttacaacag 
ctgacagtcg 
aaccgccttt 



acaatacgcc 
ctaatctct t 
ttggtttcat 
aggttattaa 
gaaatgggaa 
cattccccga 
ggggagatat 
acccaaccgc 
ctgaagaaaa 
tgcgtgcagc 
acaacattgt 
tgagatcggg 
acgacagctt 
tcgttgccac 
tttctccagt 
gtattagcca 
gcctcaacct 
cagccaagac 
aagacgttgt 
gacttaaggg 
ttactgtacc 
tgctcatttc 
gtaaaaatgc 
gaggaagtgg 
attttcgggt 



tgtgaatctt 
ggacgtgttc 
gcccgagatt 
gcaaatggtt 
gaaaacaacc 
cactcgtcct 
caacgacaga 
cgccgcaaca 
acgcaagaaa 
cgccaacacc 
tcagcttcaa 
aggaattcca 
caagacattg 
cagcgatctt 
acgtccagaa 
cttgttgaaa 
acctacatcc 
acttttgctt 
ctctaacgct 
gaataatgca 
agaatatgga 
taaccctgaa 
tctcgcaccg 
aggaggaaaa 
ctag 



16800 
16860 
16920 
16980 
17040 
17100 
17160 
17220 
17280 
17340 
17400 
17460 
17520 
17580 
17640 
17700 
17760 
17820 
17880 
17940 
18000 
18060 
18120 
18180 
18234 



<210> 61 
<211> 6025 
<212> PRT 
<213> SHRIMP 



<400> 61 



Met 


Asp 


Gin 


Tyr 


Pro 


Glu 


Val 


Arg 


Asp 


Thr 


Pro 


Gin 


Thr 


Glu 


Gin 


Glu 


1 








5 










10 










15 




Gin 


Ala 


Ala 


Ala 


Gin 


Gin 


Gin 


Gin 


Ala 


Ala 


Thr 


Thr 


Thr 


Ala 


Ala 


Ala 








20 










25 










30 






Ala 


Ala 


Ala 


Ala 


Ala 


Pro 


Thr 


Gin 


Tyr 


Ser 


Asn 


Thr 


Val 


Ser 


Ala 


Glu 






35 










40 










45 








Thr 


Leu 


Ser 


Ala 


lie 


Ser 


Glu 


Asp 


Gly 


Lys 


Leu 


Glu 


Arg 


Ser 


He 


Ala 




50 










55 










60 










Ala 


Ser 


Cys 


Trp 


He 


Asn 


Asn 


Leu 


Asn 


Pro 


Asp 


Glu 


Lys 


Met 


Ala 


Gin 


65 










70 










75 










80 


Arg 


Val 


Gin 


Phe 


His 


Pro 


Leu 


Ser 


Ser 


Thr 


Thr 


Thr 


Tyr 


Asp 


Ser 


Glu 










85 










90 










95 




Asn 


Val 


Asn 


Pro 


Gly 


Ser 


Ser 


Val 


Val 


Phe 


Leu 


Lys 


Pro 


Arg 


Ala 


Leu 








100 










105 










110 






Pro 


Thr 


Gly 


Gly 


Thr 


Cys 


Leu 


Ala 


Pro 


Asn 


Tyr 


He 


Ala 


Val 


Pro 


Thr 






115 










120 










125 








Leu 


Arg 


Ala 


Ala 


Ser 


Glu 


He 


He 


Asp 


Ser 


He 


Ala 


Ser 


Thr 


Ser 


Leu 




130 










135 










140 










Tyr 


Gin 


Cys 


Ser 


Met 


Phe 


Asn 


Ser 


Trp 


Asn 


Leu 


He 


Pro 


He 


Phe 


Met 


145 










150 










155 










160 


Ser 


Asn 


Ser 


Lys 


His 


Ser 


Gin 


Phe 


Gly 


Asp 


Arg 


Val 


He 


Lys 


Arg 


Ser 










165 










170 










175 




Met 


lie 


Arg 


Asn 


Cys 


Phe 


Ser 


Lys 


Gin 


Lys 


Asn 


Val 


Glu 


Asn 


Leu 


Leu 








180 










185 










190 






Lys 


Glu 


Leu 


Arg 


Arg 


Arg 


Lys 


Val 


Asn 


Ala 


Ala 


Lys 


Ala 


Phe 


Ser 


His 






195 










200 










205 








Ala 


Val 


Gin 


Gin 


Lys 


Ser 


Ala 


Val 


Asn 


Thr 


Ala 


Ala 


Trp 


Asn 


Ala 


Gly 




210 










215 










220 










Ser 


Ala 


Ala 


Asn 


Leu 


Glu 


Lys 


Leu 


Val 


Asp 


Phe 


Cys 


Lys 


Leu 


Lys 


Tyr 
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225 230 



Ser 


Pro 


Asp 


Arg 


Lys 
245 


Tyr 


Lys 


Ala 


Gly 


Gly 
250 


Thr 


Ala 


Gin 


Ser 
260 


Gin 


Ser 


Gly 


Thr 


Ser 
265 


Ser 


Ser 


Asn 


Asp 
275 


Phe 


Leu 


Leu 


Asp 


He 
280 


Leu 


Lys 


Leu 


Asp 
290 


Leu 


Asp 


Ser 


Ala 


Thr 
295 


Asn 


Thr 


Phe 


Val 


Phe 


Thr 


Glu 


Phe 


Lys 


Glu 


Gin 


Ala 


Arg 


305 










310 










Ala 


Asp 


Ser 


Asp 


His 
325 


Leu 


Ser 


Ala 


Ser 


Asp 
330 


Arg 


His 


Asn 


Ser 
340 


Arg 


Arg 


Glu 


Gly 


He 
345 


Leu 


Gly 


Met 


Leu 
355 


Ala 


Pro 


Arg 


Ser 


Lys 
360 


Tyr 


Ser 


Ala 


Asp 
370 


Arg 


Asp 


Glu 


Ser 


Ala 
375 


Asp 


He 


Ala 


He 


Ala 


Thr 


Asp 


Phe 


Glu 


Ala 


Leu 


Arg 


Gly 


385 










390 










Ala 


Asp 


Thr 


Ser 


Val 
405 


Asp 


Asp 


Leu 


Lys 


Glu 
410 


Glu 


Lys 


Thr 


Ser 
420 


He 


Lys 


Asn 


Thr 


Gly 
425 


Asp 


He 


Pro 


Thr 
435 


Asp 


Thr 


Glu 


Glu 


Tyr 
440 


Glu 


Phe 


Leu 


Phe 
450 


Ala 


Gin 


Ala 


Phe 


Leu 
455 


Glu 


Thr 


Met 


Ala 


Phe 


Gly 


Val 


Gin 


Phe 


Pro 


Phe 


Ser 


Asp 


4 65 










470 










Glu 


Arg 


He 


He 


Arg 
485 


Lys 


Thr 


Asp 


Pro 


Asp 
490 


Met 


Asp 


Pro 


Ser 
500 


Ser 


Leu 


Ser 


Asp 


Gin 
505 


Tyr 


Phe 


Gin 


Val 
515 


Ser 


Pro 


Phe 


His 


Val 

520 


Ser 


Asp 


Gly 


Arg 
530 


Gin 


Val 


Thr 


Pro 


Asn 
535 


Thr 


Pro 


He 


Ser 


Lys 


Asn 


Asp 


Lys 


Asn 


Glu 


Thr 


Ser 


Thr 


545 










550 










Arg 


Leu 


Leu 


Val 


Asn 
565 


Asp 


Thr 


Val 


Leu 


Arg 
570 


Ser 


Thr 


Ser 


Thr 
580 


Pro 


Ser 


Gin 


Arg 


Arg 
585 


Val 


Pro 


Lys 


Lys 
595 


Pro 


Met 


Leu 


Ser 


Gly 
600 


Cys 


Leu 


Gin 


Val 
610 


Val 


Thr 


Arg 


Glu 


Ser 
615 


Asp 


Asp 


Met 


Asp 


Trp 


Tyr 


He 


Ser 


Leu 


Gly 


Val 


Tyr 


Tyr 


625 










630 










Ala 


Ala 


He 


Ala 


Ala 
645 


Gly 


His 


Gin 


Arg 


Ala 
650 


Ser 


Pro 


Met 


Met 
660 


Lys 


Lys 


Phe 


Ser 


Lys 
665 


Lys 


Glu 


Glu 


Lys 
675 


Arg 


He 


Lys 


Lys 


Ala 
680 


Met 


Arg 


Ala 


Arg 
690 


He 


Leu 


Ala 


Leu 


Leu 
695 


Gly 


Gin 


Thr 


Val 


Glu 


His 


Asn 


Ser 


Thr 


Leu 


Asp 


Ser 


Phe 
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235 










240 


Leu 


Phe 


Ser 


Ala 


Ser 


Ala 










255 




Ser 


Ser 


Val 


Glu 


His 


Thr 








270 






Arg 


His 


Lys 


Gly 


Thr 


Ser 






285 








Asp 


Thr 


Ala 


Leu 


Ser 


Arg 




300 










Ala 


Ala 


Val 


Asp 


Ala 


Ala 


315 










320 


Pro 


He 


Phe 


Ser 


He 


Val 










335 




Asp 


Ser 


Val 


Pro 


Asn 


He 








350 






Val 


Ala 


Glu 


Tyr 


Leu 


Met 






365 








Ala 


Lys 


He 


Gly 


Thr 


Lys 




380 










Asp 


Asn 


Asn 


Lys 


Arg 


Arg 


395 










400 


Ser 


Leu 


Ala 


Asp 


Ser 


He 










415 




He 


Asn 


Ser 


Val 


Thr 


Asn 








430 






Ser 


Leu 


His 


He 


Thr 


Gin 






445 








Gly 


Ser 


Leu 


Leu 


Ser 


Cys 




460 










Glu 


Gly 


Phe 


Ala 


Ala 


He 


475 










480 


Thr 


Gly 


Lys 


Val 


Ser 


Glu 










495 




Leu 


Leu 


Leu 


Val 


Gly 


Asn 








510 






Pro 


Lys 


Asp 


He 


Val 


Phe 






525 








Leu 


Ser 


He 


He 


Thr 


Arg 




540 










He 


He 


Asn 


Phe 


Arg 


Asp 


555 










560 


Asp 


Ala 


Thr 


Gin 


Asn 


Val 










575 




Pro 


Thr 


Ala 


Ala 


Gly 


Glu 








590 






Pro 


He 


He 


Arg 


Gly 


Pro 






605 








He 


Ser 


Gly 


Leu 


Val 


Gly 




620 










Ala 


Met 


Gly 


Ser 


Ser 


Ala 


635 










640 


Ser 


Ala 


Glu 


Ser 


He 


Asn 










655 




Gly 


Gly 


Lys 


Tyr 


Thr 


Glu 








670 






Arg 


Asn 


Ala 


Asp 


Arg 


Ser 






685 








Asp 


Ala 


Gin 


Tyr 


Gly 


Tyr 




700 










Trp 


Ser 


Ser 


Asn 


Ala 


Ala 


715 










720 
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lie 


Arg 


Ala 


Lys 


Ala 


Lys 


Glu 


Asp 


Ala 


Leu 


Ser 


Arg 


Ala 


Glu 


He 


Leu 










725 










730 










735 




Ala 


Val 


Arg 


Lys 


Gin 


Leu 


Asp 


Gly 


Lys 


Cys 


Ser 


Ser 


Ser 


Arg 


Asp 


Glu 








740 










745 










750 






Tyr 


Ser 


Met 


Val 


Glu 


Arg 


Tyr 


Leu 


Arg 


Asp 


Ser 


Phe 


Phe 


Arg 


Ser 


Val 






755 










760 










765 








Asn 


Arg 


Ser 


Gly 


Gly 


Gly 


Tyr 


Glu 


Met 


Phe 


Asp 


Gin 


Gly 


Phe 


Asp 


Met 




770 










775 










780 










Gly 


Arg 


Phe 


Ala 


Asp 


Phe 


Leu 


Ser 


Asp 


Asn 


Ser 


Ala 


Ala 


Arg 


Asn 


Ala 


785 










790 










795 










800 


Trp 


Gin 


Gin 


Tyr 


Ala 


Glu 


Val 


Met 


Arg 


Gly 


Leu 


Ser 


Lys 


His 


Glu 


Lys 










805 










810 










815 




Arg 


Val 


Phe 


Asn 


He 


Glu 


Gly 


Leu 


Phe 


Ser 


Ala 


Leu 


Asn 


Ser 


Phe 


Lys 








820 










825 










830 






Phe 


Pro 


Leu 


Val 


Pro 


Glu 


Gin 


Gly 


Arg 


Lys 


Lys 


Thr 


Val 


Gly 


Gly 


Arg 






835 










840 










845 








His 


Arg 


Leu 


Asn 


Asn 


Leu 


Lys 


Ala 


Ala 


Asn 


Lys 


He 


He 


Asn 


Gly 


He 




850 










855 










860 










Thr 


Glu 


Met 


Thr 


Leu 


Gin 


Ser 


Ala 


He 


Asp 


Gly 


Thr 


Gly 


He 


Ser 


Asp 


865 










870 










875 










880 


lie 


He 


Gly 


Ser 


Val 


Ser 


Asp 


Gly 


Trp 


Gly 


Asn 


Thr 


Thr 


Ala 


Gin 


Pro 










885 










890 










895 




Ser 


Arg 


Val 


Lys 


Ala 


Leu 


Lys 


Thr 


Leu 


Ser 


Asn 


Phe 


Ser 


Gly 


Asn 


Gly 








900 










905 










910 






Asn 


Val 


Val 


Ser 


He 


Pro 


Val 


Ser 


Arg 


Ala 


Val 


Lys 


Cys 


Ala 


Ala 


Gly 






915 










920 










925 








Ser 


Arg 


Gly 


Gly 


Glu 


Thr 


Leu 


Lys 


Cys 


Val 


Asp 


He 


Pro 


Ser 


Val 


He 




930 










935 










940 










lie 


Ala 


Asn 


Leu 


He 


Ser 


Asp 


Lys 


Arg 


He 


Leu 


Asp 


Gin 


Leu 


Cys 


Gly 


945 










950 










955 










960 


Gly 


Gly 


Met 


Asn 


Leu 


Ala 


His 


Glu 


He 


Thr 


Asn 


Phe 


He 


Glu 


Thr 


He 










965 










970 










975 




Ala 


Gly 


Lys 


Glu 


His 


Thr 


Gly 


Lys 


Glu 


Ser 


Val 


Phe 


Leu 


Ser 


Pro 


Arg 








980 










985 










990 






Leu 


Ser 


Val 


He 


Leu 


Leu 


Arg 


Tyr 


He 


Trp 


Phe 


Asn 


Ala 


Ala 


Val 


Val 






995 










1000 








1005 






Ser 


Leu 


Thr 


Asp 


Ser 


Asn 


He 


Lys 


Met 


Pro 


Leu 


Asn 


Thr 


Met 


Ser 


Glu 




1010 








1015 








1020 








Gly 


Thr 


Gly 


Asp 


Asp 


He 


Tyr 


Arg 


Asp 


Tyr 


Leu 


Ala 


He 


Arg 


Gly 


Met 


1025 








1030 








1035 








1041 


Val 


Asn 


Asn 


Tyr 


Asn 


Ser 


Ser 


Leu 


Ser 


Ser 


He 


Ser 


Val 


Lys 


Ala 


He 










1045 








1050 








1055 


Ser 


Asp 


Arg 


Tyr 


Asn 


Cys 


Gly 


Ser 


Gly 


Asn 


Thr 


Ser 


Thr 


Ser 


Asn 


Lys 








1060 








1065 








1070 




Asn 


Val 


Thr 


He 


Lys 


Thr 


Gin 


Gly 


Glu 


Leu 


Leu 


Thr 


Val 


Leu 


Gin 


Gin 






1075 








1080 








1085 






Thr 


Ala 


Asn 


Ala 


Leu 


Ser 


Ala 


Phe 


Thr 


Asn 


Lys 


Gly 


Gly 


Val 


Gly 


Ala 




1090 








1095 








1100 








Thr 


Pro 


Asp 


Ala 


Ala 


Asn 


Met 


Ala 


Asn 


Val 


He 


Ser 


Pro 


He 


Ala 


Asn 


1105 








1110 








1115 








1121 


Ala 


Asp 


Val 


Val 


Lys 


Asn 


Thr 


Asn 


Val 


Val 


Val 


Ser 


Gly 


Leu 


Asp 


Arg 










1125 








1130 








1135 


He 


Thr 


Glu 


Thr 


He 


Asn 


Phe 


Phe 


Ser 


Phe 


Leu 


Ser 


Gin 


He 


Lys 


Thr 








1140 








1145 








1150 




Met 


Asn 


Glu 


Asn 


He 


Glu 


Glu 


Tyr 


Leu 


Arg 


Arg 


Tyr 


Arg 


Leu 


Gly 


Glu 






1155 








1160 








1165 






Gly 


Leu 


Asp 


Lys 


Lys 


Glu 


Leu 


Asp 


Asn 


Phe 


Val 


Tyr 


Pro 


Asn 


He 


Ala 




1170 








1175 








1180 








Ala 


He 


Val 


Lys 


Arg 


Glu 


Leu 


Gly 


Val 


Ser 


Gly 


Ser 


Ala 


Leu 


Ser 


Ser 


1185 








1190 








1195 








1201 


Asn 


Leu 


Asp 


Thr 


Asp 


Arg 


Pro 


He 


Thr 


He 


Asp 


Leu 


Asn 


Thr 


Glu 


Gin 
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1205 1210 1215 

Pro Leu lie Val Lys Ala Ser Lys Gly Tyr Asn Arg Tyr Ala Lys Leu 

1220 1225 1230 

Phe Asn Lys Thr Thr Arg Thr Ala Ala Glu Gin Ala Gin Met Glu Gin 

1235 1240 1245 

Tyr Asn Ala Gin Met Ala Ala Asn Thr lie Pro Gin Leu Val Asn Arg 

1250 1255 1260 

Leu Thr lie Pro Gly Ser lie Thr Ala Asp Thr Ala lie Asn Val Val 

1265 1270 1275 1280 

Lys Ala Phe Thr Glu Asn Gly Glu Phe Ser Asn Ala Glu Thr His Leu 

1285 1290 1295 

Gly Val Met Gly Asn Ala lie Asn Glu Met Gin Pro Leu Phe Thr Asp 

1300 1305 1310 

Gly Phe Asn Val Ala Asn Lys Arg Leu Thr Val Asn Val Gly Ser Val 

1315 1320 1325 

Ser Lys Leu lie Gin Asn Gly Leu Thr Val Ser Leu lie Leu Ala His 

1330 1335 1340 

Ser Lys Asp Tyr Val Phe Lys Pro Leu Val Gin Asp Phe Ala Lys Leu 

1345 1350 1355 1360 

Leu Leu Ala Val Thr Ala Glu Thr Ser Leu Val Val Ser Arg Ser Gin 

1365 1370 1375 

Lys Ser Phe Phe Pro lie Pro Pro Ser Val Phe Ser Ser Gly Gly Leu 

1380 1385 1390 

Phe Lys lie Asp Arg Glu Met Phe Asp Asn Met Lys Thr Asp Tyr Val 

1395 1400 1405 

Val Glu Val lie Arg Gin Leu Ser Lys Asn Ala Thr Ala Ala He Glu 

1410 1415 1420 

Arg Cys Asn Asp Ser Asp Ser Ala Ala Arg He Ala Lys Ser Gly Glu 

1425 1430 1435 1440 

He Tyr Asn Lys Asp Val Ala Ser Thr Thr Ala Ala Pro Gly Thr Ser 

1445 1450 1455 

Ser Ser Ala Leu Thr Leu Phe Ala Asn Asn Leu Gin Asn Pro Ala Lys 

1460 1465 1470 

Val Trp Ser Met Gly Ala Leu Pro His Phe Asp Met Ala Val Val Pro 

1475 1480 1485 

Lys Leu His Gly He Ser His Asp Gin Met Phe Arg Leu Ser Thr Tyr 

1490 1495 1500 

Tyr Gin Gly He His Lys Met Glu Leu Asn Ser Asp Cys Lys Pro Glu 

1505 1510 1515 1520 

Glu Trp Asp Asn Ser Leu Pro Gly Asn Arg Ala Ser Lys Phe Phe Gly 

1525 1530 1535 

Leu Ser Ser Val Ser Asp Asn Asn Arg Ser Phe Asn Leu Ala Leu Asp 

1540 1545 1550 

Thr Leu Leu Asp Ala Glu He Cys Asp Leu Val Thr Arg Glu Met Val 

1555 1560 1565 

Lys Thr Ser Asn Asp He Val His Asn He Gly Ser Asn Ser Asn Thr 

1570 1575 1580 

Asp Ala Leu Gin Lys Ser Leu Gin Val Gly Ala Ser Ala Val Glu Lys 

1585 1590 1595 1600 

Tyr Asp Glu Ser Thr Leu Ser Thr Lys Glu Thr Asp Val Tyr Ser Leu 

1605 1610 1615 

Val Ser Ala Lys Ser Lys Ser Pro Leu Ser Ser Ser Ser Ser Leu Ser 

1620 1625 1630 

Ser Glu Gly His Leu Thr Ser Lys Glu He Asp Arg Thr Trp Asn Thr 

1635 1640 1645 

Pro Ala Leu Leu Gly Thr Ala Lys Thr Thr Ser Tyr Ser Val Ser Glu 

1650 1655 1660 

Asp Ala Leu Asn Ala Pro Leu Ser Ala Val Leu Asp Phe Arg Arg Asn 

1665 1670 1675 1680 

Val Val Asp Ala Thr Lys Ser Leu Tyr Glu Val Ala Ala Val Cys Ser 
1685 1690 1695 
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Vai Met Ser Lys Glu Glu Asp Val Arg Ser Ser Ser Arg Lys lie Met 

1700 1705 1710 

Gly Met Val Glu Gin Glu Ser Pro Val Met Gin Asp lie Gly lie Asp 

1715 1720 1725 

Arg lie Ala Ser Leu Val Ser Thr Val Ala Thr Pro Lys Gin His Arg 

1730 1735 1740 

Arg Phe Leu Gin Thr Val Asn Asp Tyr Lys Asn Tyr Leu lie Arg Lys 
1745 1750 1755 1760 

Val Asn Pro Leu Leu Ser Ser Arg Leu Gly Gly lie Ser Pro Thr Ser 

1765 1770 1775 

Gly Asn Thr Asp Tyr Asn Leu Lys Ala Val Tyr Asp Gly Val Val Ser 

1780 1785 1790 

Ser Ser Ser Ser Met Thr Pro Ser Ser Met Ser Val Ser Asp Arg Phe 

1795 1800 1805 

Trp Ser Gly Val Phe Ser Gin Cys Leu Glu Thr Gly Pro Ser Met Phe 

1810 1815 1820 

Ala Asp Ala Gly His Gly Gly Ser Asn Met Phe Gin lie Thr Ala Pro 
1825 1830 1835 1840 

Lys Leu Tyr Gly Ser Arg Val Asn Thr Tyr Ala Ala Leu Ser Ser Gly 

1845 1850 1855 

Val Glu Arg Leu Arg Asp Ser lie Ser Ser Ala Thr Gin Glu Arg Lys 

1860 1865 1870 

Asn Arg lie Ala Lys Ser He Glu Ala Leu Glu Thr Phe Val Thr Asp 

1875 1880 1885 

Vai Val Gly Gly Asp Thr Leu Asp Gin Leu Arg Lys Ala Gin Asn Met 

1890 1895 1900 

Tyr Asn Lys Leu Ser Asp He Thr Ser Asn Ser He Tyr Ser Asp Phe 
1905 1910 1915 1920 

Gly Asn He Asp Cys Ala Lys He Met Lys Asn Val Thr Ser Lys Lys 

1925 1930 1935 

Met Thr Ala Arg Gin Gin Ser Asp Thr He Leu Ser Ser Leu Leu His 

1940 1945 1950 

Glu Leu Ala Gly Leu Val His Lys Gin Gin Pro Gin Leu Ala Thr Gin 

1955 1960 1965 

Phe Ala Ser His Val He Lys Ala Lys Tyr Val Thr Asn Asp Leu Asn 

1970 1975 1980 

Asn He His Glu Lys Glu Thr Phe Ser Gin Leu Met Ala Val Ala Gly 
1985 1990 1995 2000 

Val Ala Asp Tyr Tyr Asn Val Ser Ala Ala Ala Met Cys Gin Arg Leu 

2005 2010 2015 

Val Ala Ser Asp Val Thr Met Phe Leu Gly Gly Thr Met Leu Gin Gin 

2020 2025 2030 

Gly Leu Phe Val Ser Phe Leu Leu Asn Asn Val Leu Phe Ser Gin Val 

2035 2040 2045 

Ser Asp Asn He Lys Met Asn Glu Leu Asn Asp Glu Thr Lys Ser Leu 

2050 2055 2060 

Leu Val Lys Leu Val Gly Phe Cys Gly Thr Val Ser Asp Ala Leu Gly 
2065 2070 2075 2080 

Ser Arg His Val Ser Ser He Arg Arg Val Gin Asn Glu Glu Asp Lys 

2085 2090 2095 

Lys Leu Asp Arg Ser Phe Val Thr Ser Lys Ala Tyr Arg Asp Leu Arg 

2100 2105 2110 

Lys Lys Thr Glu Leu Tyr Arg Glu Thr Asp Thr He Asn Lys Leu Phe 

2115 2120 2125 

Gly His Gin Asn Phe Met Ser Tyr Glu Ser Ser Met Leu Lys Arg Thr 

2130 2135 2140 

Ser Leu Val His Asp Ala Val Ser Gly Pro Arg Pro Arg Arg Tyr Ser 
2145 2150 2155 2160 

Thr Leu Glu Asp Val Leu Glu Ala Pro Ser Thr Val His Lys Ser Phe 

2165 2170 2175 

Met Val Ser Tyr Pro Glu Arg Ala Ala Ala Ser Arg Arg Val Lys Arg 
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2180 2185 2190 

Ala Gly Leu Arg Ala Asp Asn Arg Met Glu Ser Leu Tyr Gly Glu Glu 

2195 2200 2205 

Val Leu Asn Asp Met Arg Ser Ser Ala Val Ser Ser Glu Met Met Asp 

2210 2215 2220 

lie Glu Tyr Gly Glu Gly Gly Phe Met Met Met lie Ser Asp Asp Glu 

2225 2230 2235 2240 

Asp Asp lie Ala Phe lie Asp Ser Glu Glu Glu Ser Glu Ser Ser Thr 

2245 2250 2255 

Asp Phe Ser Ser Ser Asp Glu Tyr Ser Asp Ser Ser Asp Glu Tyr Asp 

2260 2265 2270 

Phe Asp Asp Asp Asn Asn Gly Gin Ser Pro Tyr Ser Thr Thr Ser Tyr 

2275 2280 2285 

Ser Tyr Asp Ala Leu Asp Arg Leu Asn Ser Ala Ala Lys Pro Leu Thr 

2290 2295 2300 

Ala lie Tyr Gly Cys Arg Gly Glu Gly Glu Asp Asp Glu Glu Asn Asp 

2305 2310 2315 2320 

Leu Tyr Glu Glu Glu Gin Glu Arg Arg Arg Arg Arg Ser Ser Lys Met 

2325 2330 2335 

Gly Lys lie Leu Arg Asp Leu His Glu Ser Asp Asp Asp Asp Asp Asp 

2340 2345 2350 

Tyr Phe Asp Asp Glu Phe Asp Gly Glu Arg Ser Met Ser Glu Thr lie 

2355 2360 2365 

Ala Thr Arg Arg Ala Gly Arg lie Gin Tyr Gly Pro Gly Phe Leu Ser 

2370 2375 2380 

His Ser Asn He Leu Asn Arg Pro Ala Lys Ala Arg Ala Phe Leu Thr 

2385 2390 2395 2400 

Arg Gly Lys Lys Phe Arg Pro Ser Ala Tyr Asp Arg Phe Phe Met Glu 

2405 2410 2415 

Asp Asp Asp Ser Leu Leu Phe Ser Asp Glu Ser Thr Thr Ser Ser Ser 

2420 2425 2430 

Ser Ser Asp Ser Pro Phe Ser Ser Phe Ser Lys Gly Arg Lys Cys Lys 

2435 2440 2445 

Arg Arg Thr Ser Glu Asp Gin Cys Ala Phe Val Lys Arg Val Val Arg 

2450 2455 2460 

Ala Phe Val Pro Thr Arg Val Thr Met He Asn Gly Arg Val Ser Met 

2465 2470 2475 2480 

He Thr Pro Val Thr Ser Glu Asn Thr Val Gly Phe Tyr Glu Asn Tyr 

2485 2490 2495 

Gin Lys Ala Asn Lys Arg Glu Arg Ala Arg Leu He Glu Glu Tyr Lys 

2500 2505 2510 

He Val Lys Gly Ala Ser Ala Thr Leu Pro Asp Glu Tyr Val Glu Gly 

2515 2520 2525 

Arg Ala Ser Lys Gin Val Ser Pro Arg Glu Leu Arg Arg Ser Leu He 

2530 2535 2540 

Lys Ala Ala Ala Tyr Val Ala Arg Thr Gin Glu Ser Asn Leu Asn He 

2545 2550 2555 2560 

He Phe Asp Ala Leu Thr Thr Thr Ser Asn Ala Thr Leu Val Asn Asp 

2565 2570 2575 

Pro Ser Thr Leu Leu Gly Asp Thr Leu Leu Phe Ala Lys Gin Leu Glu 

2580 2585 2590 

Ala He Thr Glu Arg Arg Asn Arg Leu Met Lys Asp Leu Thr Glu He 

2595 2600 2605 

Ser Pro Ser Leu Phe Thr Ser Phe Gly Asp Ala Ser Lys Asp Thr Gin 

2610 2615 2620 

Met Met Ala Asp Ala Lys Gin He Val Ser Gly Gly Asn Phe Lys Ser 

2625 2630 2635 2640 

Ala Gly Tyr Leu Gly Val Pro Leu Arg Thr Leu Ala Ser Cys He Lys 

2645 2650 2655 

Gly Thr Asn Thr Val Asp Arg Leu Leu Ala Thr Lys Asn Lys Asn His 
2660 2665 2670 
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Leu Glu Trp Met Thr Thr Ala Ala lie Val Phe Ala Arg Ser Phe Asn 

2675 2680 2685 

Asp Thr Thr Phe His Ala Leu Glu Asp Thr Leu Lys Met Thr Ser Ala 

2690 2695 2700 

Leu Thr Asp Met Tyr Ser Ala Phe Thr Asn Leu Val Gly Ser Glu His 
2705 2710 2715 2720 

Ser Gin Arg Leu Lys Val Lys Ser Thr Leu Leu Asp Ser lie Phe Asn 

2725 2730 2735 

Thr Arg Met Ala His Thr Glu Ala Val Met Gly Leu Val Tyr Pro Thr 

2740 2745 2750 

Ala Phe lie Asn His Glu Met Pro Ser Asp Tyr Thr Gin Arg Arg Glu 

2755 2760 2765 

Met Gin Ser Leu Ala Leu Asn lie Leu Arg Gly Val Asn Cys Ser Gin 

2770 2775 2780 

Leu Pro Arg Lys Asp lie Gly Asp Thr Ala Gly Leu Leu Thr Phe lie 
2785 2790 2795 2800 

Thr Ser Arg Lys Phe Ala Gly Tyr Gly Gly Glu Arg Gly Gly Leu Ser 

2805 2810 2815 

Leu Tyr Arg Met Ser lie Val Asp Ala Leu Ser Cys Pro Ser Asp Asn 

2820 2825 2830 

Arg Leu Lys Gly Ala Val Ser Leu Glu Val Gly Lys Trp Gin Asp Met 

2835 2840 2845 

Gly Glu Glu lie Phe Tyr Lys Arg Ser Asn Asp Leu Val Asp Phe Cys 

2850 2855 2860 

Ser Lys Asn Asn lie Ser Leu Glu Asn Ala Val Gly Pro lie Ala Arg 
2865 2870 2875 2880 

Phe Val Pro Asn Gly Thr Asn Met Ala Asp He Gly Met Thr Asp He 

2885 2890 2895 

He Ser Arg Thr Val Lys Asp Asp Ala Ser Met He Arg Leu Arg Arg 

2900 2905 2910 

Ala Glu Glu Gly Ala Gly Ala Ala Gly Lys Phe He Thr Ala Ser Ala 

2915 2920 2925 

Met Gly Asn Leu Tyr Gly Gly He Asp Thr Val Val Asn Leu Thr Glu 

2930 2935 2940 

Lys Leu Tyr Asp Ser Phe Val Leu Leu Gin Asp Ser Asp Ser Phe Asn 
2945 2950 2955 2960 

Thr Pro Thr Glu Met Ala Thr Ala He He Asn Arg Met Lys Ser Arg 

2965 2970 2975 

Lys His Lys Ala Leu Lys Thr Pro Phe Gly Gly Asp He Ala Thr Tyr 

2980 2985 2990 

Lys Asn Phe Pro Ser Ser Ser Glu Ala He Val Val Arg Ala Lys Glu 

2995 3000 3005 

Met Arg Asn Ser He Ser Thr He Val Met Asp He Ser Lys Ser Arg 

3010 3015 3020 

Gly He Asn Ser Phe Ser Ser Arg Ser Gly Ser Thr Leu Ala Lys He 
3025 3030 3035 3040 

Ser Thr Ser Glu Phe Glu Arg He Thr Ser Ala Val Leu Ser Asn Thr 

3045 3050 3055 

Lys Ala Asn Leu Arg Thr He Glu Asn Arg Leu Ala Glu His Tyr Asn 

3060 3065 3070 

Lys Leu Lys Gin Phe Ser His Asn Asp Gly Leu Ser Glu Thr Arg Ala 

3075 3080 3085 

Val Val Ala Val He Ala Glu Ser Leu Thr Pro Val Tyr Ala Asp Asp 

3090 3095 3100 

Thr Ser Gly Ala Ser Val Ser Glu Leu Leu Thr Asp Asn Thr Leu Leu 
3105 3110 3115 3120 

Lys Phe He Val Gin Asn Glu Leu Lys Asn He Glu Glu Ala Lys Arg 

3125 3130 3135 

His Val Thr Ala Ala He Glu Gly Ser Ser Gin Leu His Glu Lys Met 

3140 3145 3150 

Leu Ser Leu Leu Val Ala Ser Ala Asp He Asn Arg Met Ser Ala Gin 
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3155 3160 3165 

Asn Asn Leu Glu Cys Lys Lys Leu Thr Glu Gly Asn Ser Asn Phe Val 

3170 3175 3180 

Pro Met Thr Asn Asp Gin Gly Gly. Thr Phe He Lys His Lys Glu Thr 
3185 3190 3195 3200 

Gly He Trp Leu Lys Thr Asp Glu Glu Asn Asn Thr Ser Ser He Lys 

3205 3210 3215 

Asp Asn Asp Gin Arg Arg Val Ala Lys Thr He Leu Ala He Val Glu 

3220 3225 3230 

Asp Asn Arg Asn Ala Thr He Arg Ser Arg Leu Gin Ser Leu Cys Phe 

3235 3240 3245 

Gly Lys Tyr Ala Met Asn Asp He Phe Ala Leu Asp Asp Ala Asp He 

3250 3255 3260 

Lys Asn Met Asp Lys Leu He Glu Lys Leu Gly Glu Ala Glu Lys Asp 
3265 3270 3275 3280 

Ser Ser Ser Ala He Ser Ser Ser Ser Ser Ser Asn Thr Thr Ser Ser 

3285 3290 3295 

Ser Ser Ser Pro Ser Ser Ser Pro Ser Ser Ser Ser Ser Ser Phe Ser 

3300 3305 3310 

Met Asp Tyr Ser Asn Asn Leu Ala Lys Thr He Pro Tyr Met Pro He 

3315 3320 3325 

Val Phe Gin Asn Lys Gin Ser Asn Val Asn Ser Ser Asp Ala Ser Ser 

3330 3335 3340 

Ser Ser Pro Ser Ser Ser Ser Ser Ser Ser Ala Asn He Asp Asn Val 
3345 3350 3355 3360 

Glu His Lys Lys Val Gin Gin Leu Gin Thr Gin Glu Ser Asn Asp Leu 

3365 3370 3375 

Ser Asn Val Leu Ser Val Thr Thr Lys His Arg Phe Ala Ser His Asn 

3380 3385 3390 

Gin Ala Ala Thr Val Gly He Phe Asn Gly Arg Gin His Ala Glu Thr 

3395 3400 3405 

Val Val Ala He Pro Asn Ala Asn Lys Ala Asn Asn Asn Ala Thr Val 

3410 3415 3420 

Ser Ala Gly Gin Gly He Leu Thr Arg Phe Ser Ala Pro Glu Asn Val 
3425 3430 3435 3440 

Ser Ser Thr Ser Met Gin Leu Pro Pro Ser Ser Ser Ser Ser Ser Asn 

3445 3450 3455 

Gly Asp Asp Asn Lys Val Pro Val Thr Val Arg Leu Asn Gin Tyr Ala 

3460 3465 3470 

Asn Ser He Leu Ser Ser He Glu Asn Ala Ser Glu Phe Lys Asp Leu 

3475 3480 3485 

Lys Glu Ala Glu Arg Lys He Asp Leu Ala He Gin Ala Ala Ser Thr 

3490 3495 3500 

Thr Glu Thr Lys Glu Met Val Thr Val Ser Lys Cys Pro Ser Ala Asn 
3505 3510 3515 3520 

Gin Thr Ala He Thr Ala He Ser Gin Ala Lys Ser Leu Lys Lys Ser 

3525 3530 ^ 3535 

Ala Leu Glu Leu Leu Glu Arg Val He Lys Ala Val Glu Vai Tyr Thr 

3540 3545 3550 

Pro Asp Ser Ser He Ala Ala Val Ser Leu Pro Val Asn Gly Asp Ser 

3555 3560 3565 

Met Val Ser Ser Ser Ser Gly Ser Gly Ser Ala Pro Ser Ser Ser Ser 

3570 3575 3580 

Ser Ser Ser Ser Ser Ser Ser Ser Ser Asn Val Thr Asp Tyr Phe Asn 
3585 3590 3595 3600 

Tyr Ala Tyr Gly Lys Leu Lys Asn He Asp Glu Asn Thr Glu Glu Gly 

3605 3610 3615 

Ala Glu Thr Val Gin Lys Asn Met Val Glu Gin Asp Ala Ala Val Arg 

3620 3625 3630 

He Pro Leu Leu Val Ser Tyr Ala Pro Phe Ser Glu Met Met Arg Arg 
3635 3640 3645 
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Ala lie Asp Lys Leu Asn Glu Tyr Tyr Gin Leu lie Asp Ala lie Lys 

3650 3655 3660 

Thr Lys lie Val Ser Asp Thr Lys Gin Ala Ser Ser Trp Ala lie Lys 
3665 3670 3675 3680 

Glu Thr Asp Lys Glu Leu Asp Met Asp Lys Glu Gin Val lie Ser Lys 

3685 3690 3695 

lie Asn Asn Leu Gin Gin Asn Phe Ser Asn Glu Ser Asp Lys lie Lys 

3700 3705 3710 

Met Ala lie Ser Val Leu Asp Asn Lys Arg Asn Glu Leu Glu Leu Gin 

3715 3720 3725 

Asn Asn Lys Thr Arg Ser Phe lie Glu Thr Thr Lys Ser Arg lie Glu 

3730 3735 3740 

Ala Gly Gly Gly Asp Val Ala Asn Phe Lys Glu lie lie Asp Tyr Glu 
3745 3750 3755 3760 

Asn Thr Ser Glu Asn Asp Asn Asn Leu Phe Gin Ser Leu Lys Ala Phe 

3765 3770 3775 

Ala Ala Asp Asn Ser Gly Thr Val Tyr Thr Pro Thr Asp Met Ser Asn 

3780 3785 3790 

Gly Arg Asp Thr Lys Ser Asp Ser Lys Phe Val Asp Met Tyr Asn Lys 

3795 3800 3805 

Gin lie Gly Gly lie Lys Leu lie Asn Glu Gly Gin Asn Thr Val Lys 

3810 3815 3820 

Val Asp Phe Ser Lys Ala Leu Glu Ala Phe Pro Arg Gin Ser Asn Gly 
3825 3830 3835 3840 

Ala Ser Glu Pro Val Ser Ser Ser Val Val Glu Arg Arg Gin Arg Glu 

3845 3850 3855 

Arg Leu Gin Ala Val Glu Met Phe Met Ala He Met Met Glu Arg Thr 

3860 3865 3870 

Glu Ser Leu Arg Lys Arg Leu Ala Asp Ser Ala Ala Gin Trp Asn Thr 

3875 3880 3885 

Val Asn Asn Val Glu Glu Thr Val Asn Ser Gly Met Val Asn He Lys 

3890 3895 3900 

Ser Leu Thr Glu He Arg Asn Gin Ala Gin He Ala Glu Ser Thr Ala 
3905 3910 3915 3920 

Leu Asn Ser He Asn Asp Glu He Val Glu Ser Pro Leu Thr Leu Ser 

3925 3930 3935 

Leu Gly Ala Arg Val Asp Gin Leu Leu He Lys Val Asp Arg Val Gly 

3940 3945 3950 

Ser He Gin Gin Gin Gin Gin Gin Gin Gin Gin Gin Gin Gin Leu Pro 

3955 3960 3965 

Lys Leu Thr Ala Thr Glu Gin Arg Lys Glu Gin Gin Tyr Ala Ala Asp 

3970 3975 3980 

Arg Val Val Tyr Asp Pro Ser Tyr Thr Cys Phe Leu Gin Pro Leu His 
3985 3990 3995 4000 

Glu Thr He Lys Arg He Ser Ser Val Tyr Asn Ser Lys Asn Lys Gly 

4005 4010 4015 

Pro Leu Ser Asn Thr Arg Gly Val Pro Thr Ser Asp Ala Asp Leu Gin 

4020 4025 4030 

Leu Met Thr He Thr Asp Leu Ser Arg Ser Val Leu Asp Ser Ser Ser 

4035 4040 4045 

Thr Ser Ser Lys Lys Met Leu Tyr Glu Asn Val Pro Ser Ser He Val 

4050 4055 4060 

Pro Gly Leu Cys Gin Gin Cys Ala Met Met He Thr Asn Val His Glu 
4065 4070 4075 4080 

Ala Thr His Thr Ser Pro His Ser Phe Asn Phe Glu Asn Lys Arg Ser 

4085 4090 4095 

Leu Lys Gin Leu Thr Glu Met Leu Asn Ala Ala Thr Ser Ser Ser Asp 

4100 4105 4110 

Gly Pro Ala Val Arg His Asp Val Leu Thr Met Leu Glu Ser Asn Asn 

4115 4120 4125 

Gly Tyr Val Lys Asp Phe Gly Phe Thr Gin Lys Val Ala Cys He Thr 
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4130 4135 4140 

Pro Val Asn Thr Leu Leu Gly Gly Thr Phe Ser Gly Asn Val Ala Pro 
4145 4150 4155 4160 

Asn Thr Val lie Leu Pro Thr Ser Glu Leu Phe Asn Cys Pro Gly Val 

4165 4170 4175 

Glu Asn Asp Lys Phe Arg Ser Met Val Asn Arg Thr Thr Asp Lys Asn 

4180 4185 4190 

Val Ala Asp Ala Pro Lys Ser Ser Ala Ser lie Val Glu Thr Leu Ala 

4195 4200 4205 

Arg Thr Ser Pro Asn Ala Glu His Leu Tyr Phe Pro Phe Lys Asp Gin 

4210 4215 4220 

Arg Arg His Phe Asn Ser lie Thr Asp Ala lie lie Ser Gly Met Ser 
4225 4230 4235 4240 

Gly Glu Ser Ser Ser Gin Leu Asn Thr Thr Cys Asp Gin Asn Leu Val 

4245 4250 4255 

Asn lie Asp Gin Thr Thr Gly Phe Pro Val Phe Thr Gly Arg Lys Gin 

4260 4265 4270 

Gly Glu Arg Arg He Val His Thr Glu Asn Thr Met Glu Gly Ala Arg 

4275 4280 4285 

Lys Asp Lys Asn Ser Gly He Pro Ser Cys Thr Lys Asp Arg Gin Thr 

4290 4295 4300 

Tyr He Asp Met Gly Thr Lys Phe Met Val Ala Pro Gly Ser Leu Leu 
4305 4310 4315 4320 

Asn Ala Asn Lys Glu Glu Thr Leu Arg Leu Asn Arg Leu Ser Asp He 

4325 4330 4335 

Asn Asn Val Arg His Tyr Gly Thr Asp Val His Val Ala Gly Ala Asn 

4340 4345 4350 

Ser Ala Trp Arg He Gly Glu Val Val Arg Ala Ala Ser Ser Phe Pro 

4355 4360 4365 

Asp Gly Asp Lys Glu Ser Ala Met Lys Lys Met Leu Leu Leu Gly Ser 

4370 4375 4380 

Val Ser Ala He Ser Ala Gin Lys Ser Ala Ser His He Asn Asp Pro 
4385 4390 4395 4400 

Thr Ala Leu Leu Ser Thr Asn Thr Ser He Gin Asn Leu Val Lys Glu 

4405 4410 4415 

Ala Phe Pro Asp Pro Val Cys Ser Ser Asn Tyr Leu Gly Ser Ala Glu 

4420 4425 4430 

Ser Thr Phe Ala Thr Gin Leu Ala Tyr Arg Gin Arg Leu Phe Pro Asn 

4435 4440 4445 

Gly Asp Asp Glu Asn Val Thr Thr Val Ser Asn He Cys Pro Met Asp 

4450 4455 4460 

Leu Met Gly Ser Thr Lys Arg Tyr Asn Asp Ala Phe Asn Asn He Phe 
4465 4470 4475 4480 

Gly Ser Lys Met Thr Ser Thr Asn Lys Lys Gly Ser Asn Cys Glu Asn 

4485 4490 4495 

Leu Leu Lys Ser Ala Met Ser Asn Val Pro Ala He Asn Thr Ala Phe 

4500 4505 4510 

Gly Ala Phe Glu Glu Ala Ser Ser Ser Val Arg Asn Arg Leu Ser Pro 

4515 4520 4525 

Leu Tyr Glu Asp Ser Thr Lys Tyr Ser Ser Asn Gin Leu Ala Val Gin 

4530 4535 4540 

Ala Met Thr Asp Thr Ala Val Asp Ala Leu Ser Ala Val Ser Thr Val 
4545 4550 4555 4560 

Val Gly Arg Gin Asn Gly Arg Asn Thr Leu Leu Ser Leu Pro Thr Ser 

4565 4570 4575 

He Thr Ser He Ala Thr Ser Gly Arg Pro Ser Leu Ser Tyr Ser Ser 

4580 4585 4590 

Asp Met Lys Ser Asn Leu He Lys Thr He Ser Arg He Asn Arg Asp 

4595 4600 4605 

Ala Ser Leu Leu Ser Met Gly Asp Ser Gin Val Ala Ala Gly Ser Ser 
4610 4615 4620 
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Phe Phe Asn Ser Phe Leu Arg Ser Ser Ser lie Pro Val Thr Thr Ser 
4625 4630 4635 464{ 

Gin Asp Gly Asn Val Ala Ala Ala Glu lie Val Leu Gly Thr lie Leu 

4645 4650 4655 

Asp Lys Thr Val Glu He Asn Lys Arg Phe Glu Met Leu Gly Gly Gly 

4660 4665 4670 

Lys Met Val Ala Gly Ser Pro Glu Ala Arg Ala He Gin Arg Asn Thr 

4675 4680 4685 

Met Ser Ser He Leu Gin Met Asn Glu Asn Glu Leu Ala Arg Asp Leu 

4690 4695 4700 

Cys Glu He Glu Asn Lys He Glu Thr Arg Gin Leu Arg Asp Ala Phe 
4705 4710 4715 472( 

Gin Asp Leu Lys Arg Ser Met Leu Met Thr Pro Gly Gly Val Gly Ala 

4725 4730 . 4735 

He Ser Ser Gly Ala Ser Thr Asn Asn Val Pro Leu Ser Leu Leu Met 

4740 4745 4750 

Ser Arg Val Asp Ala Ser Ser Gly Leu Leu Met Asn Asn Asn Ser Ala 

4755 4760 4765 

Asn Val Met Glu Ala Val Asp Ser Phe Asn Thr Thr Pro Leu Leu Val 

4770 4775 4780 

Arg His Met Met Leu Asp Ser Gly Lys Ser Pro Val Pro Met Ala Lys 
4785 4790 4795 480( 

Glu He Arg Ser Met Leu Thr Gin Pro Arg Ala Leu Thr Ala Arg Ala 

4805 4810 4815 

Leu Leu Ser Glu Ser Ser Pro Leu Leu Thr Glu He Cys Leu Tyr Asn 

4820 4825 4830 

Thr Arg Asp Thr Gin Pro Glu Arg Ala Val Asp Arg Leu Leu Thr Ser 

4835 4840 4845 

Ala Tyr Leu Val Lys Gin Ala Lys Arg Phe Asp Gly Val Asp Pro Ala 

4850 4855 4860 

Phe Pro Ala Ala Leu Thr Cys Ala Ser His Leu Met Leu Ser Ser Met 
4865 4870 4875 4881 

Asp Ser His Thr Lys Ser Ser Phe Met Asp Asn He Lys Leu His Met 

4885 4890 4895 

Thr Asp Thr Gin Cys Phe Phe Lys Asn He Glu Arg Phe Glu Lys Phe 

4900 4905 4910 

Leu Gly Arg Tyr Gly Asp Glu Tyr Ala Met Ser His Lys Gin Asn Cys 

4915 4920 4925 

Asn Cys Pro Phe His Leu His His Thr Phe Thr Pro Ser Asp Asn Glu 

4930 4935 4940 

His Leu Val Ser Ser Phe Ala Phe Ala Arg Pro Glu Val Ser Met Glu 
4945 4950 4955 496i 

Glu He Arg Ala Thr Pro Tyr Gin Ala Asn Lys Leu He Ser Asp Lys 

4965 4970 4975 

His Tyr Val Met Asn Met Ser Lys He Asp Ser Arg Val Thr Gly Ser 

4980 4985 4990 

Ser Leu Leu Lys Lys Val Ser Glu Trp Thr Glu Met Arg Met Asn Ser 

4995 5000 5005 

Asn Phe Asn Gly Thr Phe Glu Pro Ser Arg Leu Ala Leu Ser Asn Ser 

5010 5015 5020 

Gly Met Thr Thr Ala Gly Val Asn Leu Asp Val He Val Lys Pro Asn 
5025 5030 5035 504 

Asn Ala Arg Ser Val Leu Gly He Cys His Arg Gin His Val Cys Thr 

5045 5050 5055 

Ala Asp Ala Lys Gly Thr Val Ala Ser Ala Met Pro Ala Val Phe Gin 

5060 5065 5070 

Ala Thr Asp Gly Asn Gly Asn Glu Ser Glu Leu He Gin Asn Ala Leu 

5075 5080 5085 

Pro Arg Asn Arg Tyr He Gin Lys Ser Thr Met Asn Ala Gin Thr Val 

5090 5095 5100 

Val Phe Ala Asn Val Leu Glu Gin Leu He Ala Asp Leu Gly Lys Val 
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5105 5110 5115 5120 

lie Val Asn Glu Leu Ala Gly Thr He Ala Glu Ser Val Pro Glu Ser 

5125 5130 5135 

Val Tyr Glu Asn Thr Lys Glu Met He Asp Arg Leu Gly Ser Asp Asp 

5140 5145 5150 

Leu Phe Lys Ser Asn Asn Asn Gly Gly Val Glu Ser Met Asp Tyr Glu 

5155 5160 5165 

Asp Ser Glu Thr Thr Ser Asn Asn Gly Pro Val Leu He Ser Glu Ala 

5170 5175 5180 

Met Lys Asn Ala Val Tyr His Thr Leu lie Ser Gly Lys Ala Ala Arg 
5185 5190 5195 5200 

Pro Glu Asn Val Pro Phe Ala Ser Cys Ala Ser Gly Pro Leu Ala Phe 

5205 5210 5215 

Asp Phe Leu Leu Ser Lys Gly Asp Thr Phe Glu Glu Lys Asn Ala Glu 

5220 5225 5230 

Gin Gly Ala Ala Ala Ala Val Ser Ser Thr Tyr Ser Ser Ser Ser Asn 

5235 5240 5245 

Thr Thr Leu Arg Lys His Leu Ala Arg Val Phe Glu Ala He Ser Lys 

5250 5255 5260 

Gin Val Thr Asp Ala Glu Phe Lys Asp He Leu Asn Asp He Glu Arg 
5265 5270 5275 5280 

Asn He Ser Ser Asp Tyr Thr Asn Cys Pro Pro Asn Thr Asn Gin Asn 

5285 5290 5295 

Ala Phe Ala He Lys Arg Glu Phe Ser Arg He Val Ser Phe Leu Thr 

5300 5305 5310 

He Leu Arg Lys Asn He Thr Pro Ala Leu Val Asp Pro Lys Gly Ala 

5315 5320 5325 

Leu His Glu Lys Val Ala He Tyr Leu Thr Leu Leu Ser Thr Lys Ser 

5330 5335 5340 

Lys Leu Glu Asn Phe Phe Gin Tyr Gly Leu Ser Asn Ser Ser Ser Val 
5345 5350 5355 5360 

Asp Leu Ser His Leu Lys Pro He Asn Cys Ser Asn Asn Val Lys Asn 

5365 5370 5375 

He Glu Asp Thr Phe Met Tyr Arg Asn Val His Pro He Leu He Met 

5380 5385 5390 

Ala Leu Pro Glu Asn Phe Thr Ala Leu Leu Gin Gin Glu Gin Met Asp 

5395 5400 5405 

Pro Asp Thr Ala He Glu Ser Arg Arg Ser Leu Thr Thr Phe Leu Asn 

5410 5415 5420 

His Pro Asn Thr Ala Ser Met Ala Asn Gly Ala Arg Ala Ala Val Gly 
5425 5430 5435 5440 

Ala Gly Gly Gly Asn Pro Met Gly Leu Ser Ser His He Leu His Glu 

5445 5450 5455 

Ser Thr Val Thr Thr Ser Asn Pro Val Thr Asp Thr Thr Glu Asn Val 

5460 5465 5470 

Asn Tyr His Ser Ser Val Thr Gin Asp Pro Val Met Val Val Asn Pro 

5475 5480 5485 

Phe Lys Asp Ser Ala Arg Leu He Val Asn Asn Asn Asn Thr Gly He 

5490 5495 5500 

Asp Val Leu Asn Asp Lys Ser Cys Asn Tyr Leu Gin Val Ser Met Pro 
5505 5510 5515 5520 

Ser Glu Ser Ser Gly Leu Val Thr Asn Thr Gly Cys Ser Ser Ser Ser 

5525 5530 5535 

Ser Ser Ser Ser Ser Asp Thr Phe Lys Tyr Val Arg Arg Asp Asn Thr 

5540 5545 5550 

Pro Val Asn Leu Pro Arg Val Thr Pro Ala Val Leu Cys Ser Asp Ala 

5555 5560 5565 

Ser Ser Asn Leu Leu Asp Val Phe Ser Arg Ala Asp He Val Leu Glu 

5570 5575 5580 

Asn Met Asn Val Arg Phe Gly Phe Met Pro Glu He He Ala Ala Val 
5585 5590 5595 5600 
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Ser Lys Phe Lys Gly Leu Thr Lys Glu Glu Val lie Lys Gin Met Val 

5605 5610 5615 

Ser Gin Asn Asn lie Asn Asn Asn Ser Asn Asn Asn Asn Gly Asn Gly 

5620 5625 5630 

Lys Lys Thr Thr Val Asp Pro Val Thr Gly Asp lie Val He Thr Asn 

5635 5640 5645 

Ala Thr Phe Pro Asp Trp Leu Tyr Thr Ala Ala Asn Gly Gly Thr Ser 

5650 5655 5660 

Ser Phe Lys Trp Gly Asp He Asn Asp Arg Lys Met His Ala Lys Ala 
5665 5670 5675 5680 

Phe Pro Thr Phe Phe He Gly Asn Pro Thr Ala Ala Ala Thr Ala Asn 

5685 5690 5695 

Gly Val Pro Leu Thr Ser Glu Gly He Ser Leu Thr Glu Glu Lys Arg 

5700 5705 5710 

Lys Lys He Ala Gly He Ser Glu Gly Ser He Gly Thr Gly Ala Leu 

5715 5720 5725 

Arg Ala Ala Ala Asn Thr Arg Leu Ser Ser Asp Met Glu Pro Val Met 

5730 5735 5740 

Lys Gly Trp Asn Asn He Val Gin Leu Gin Gin Thr Phe Lys Lys Ala 
5745 5750 5755 5760 

Ser Asp Lys Leu Thr His Leu Leu Arg Ser Gly Gly He Pro Pro Arg 

5765 5770 5775 

Ser Gin Glu Thr Asn Ala He He Asn Lys Met His Asp Ser Phe Lys 

5780 5785 5790 

Thr Leu Glu Glu Cys Arg Arg Val He Gin Asp Glu Ala Ala Leu Leu 

5795 5800 5805 

Val Ala Thr Ser Asp Leu Leu Thr Gly Gly Tyr Gly Gly Asp Ala Ala 

5810 5815 5820 

Met Val Ser Pro Val Arg Pro Glu Met Thr Gly Leu He Gly Ala He 
5825 5830 5835 5840 

Ser Ala Pro Val Arg Gly He Ser His Leu Leu Lys Leu Gly Gly Val 

5845 5850 5855 

Ser Ala Ala Asn Ala Ala He Arg Lys Arg Leu Asn Leu Pro Thr Ser 

5860 5865 5870 

Asn Gly Lys Thr Leu Pro Glu His Gly He Val His Lys Ser Ala Lys 

5875 5880 5885 

Thr Leu Leu Leu Asp Ser Asp Ser He Ser Asn Leu Tyr Asn Thr Asp 

5890 5895 5900 

Leu Gin Asp Val Val Ser Asn Ala Arg Asp Asn Asn Asn Leu Gly Arg 
5905 5910 5915 5920 

He Met Gin Ser Leu Gly Leu Lys Gly Asn Asn Ala Gly Asp Leu Val 

5925 5930 5935 

Tyr Ser Ala Arg Gin Leu Thr Asp Leu He Thr Val Pro Glu Tyr Gly 

5940 5945 5950 

Asn Asn Arg Asp Leu Thr Lys Arg Gin Ala He Leu Lys Met Leu He 

5955 5960 5965 

Ser Asn Pro Glu He Asn Val Ala Asp Thr He Tyr Leu Thr Thr Gly 

5970 5975 5980 

Lys Asn Ala Pro Val Ser Ala Gin Glu Met Ala Cys Ala Ser Leu Thr 
5985 5990 5995 6000 

Val Gly Gly Ser Gly Gly Gly Lys Leu Ser Ser Asp Asp Asn Val Gin 

6005 6010 6015 

Ser Leu Asn Arg Leu Tyr Phe Arg Val 
6020 6025 



<210> 62 

<211> 2190 

<212> DNA 

<213> SHRIMP 
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<400> 62 

atggataaag 

ctactgtgcg 

atcgagacaa 

aatccaaatg 

gcacaaggaa 

aggagattaa 

gatttctccc 

attgatgaag 

gcactagtgc 

aaaaaaataa 

cgtataccaa 

gtaaatgttg 

cgatgttcca 

gaagaaagag 

tgttttaggg 

gaaactgaaa 

atgccctatt 

tgttttgtgg 

agatcactaa 

aaaaaaggtg 

attttaaaga 

catgttaacc 

catgtaaagt 

atttattttg 

gctagtatgg 

atggaattgg 

tttgccaagt 

ttttctgtta 

cttttttgta 

tatgcatgta 

tacaaattct 

acaagaggac 

gaagaaga tg 

aatgaggaca 

tataatatta 

ttgaggagtg 

tgtgcaactg 



tttgtgttat 
ttgcaacaga 
gagtggtggt 
atgaaggatt 
aaaaacccat 
tcagtggagg 
tttaccctaa 
acacgtacaa 
tgataagggt 
tgctaaggaa 
aatctaaact 
aaaaaaatat 
gatggtacct 
ttggacctcg 
acatagacct 
gcagatatgt 
ttggaccaaa 
atgaaatttc 
gaaatgtttc 
tgtctattca 
gaataccaaa 
ttgaacagta 
cgcgggtagt 
ataaagacac 
catctaaaat 
aagatttgac 
taaaatcttg 
taagattctt 
ggtttattcc 
aaagaggaga 
ccaacgctcc 
gtaagggtat 
gaagaagata 
aaaccttcaa 
accatttgta 
gtttcatttc 
gaaagggggc 



atcaaatact 
accggaaatc 
gttttcaagg 
ttctgtccaa 
tggtttgtac 
aactgcatac 
ttattcaata 
tggtgttgtt 
gactttttct 
agtgtttttc 
ggacaaattt 
ccctggttac 
ttacaacacc 
ccagttagtg 
cagaaaaaag 
ggtcgtaaca 
gtgttcagtg 
ttataatgat 
tattagacat 
tgctagattt 
aacttatttt 
cgaatggctt 
gactcgagca 
tttaaattta 
atgtaatgca 
tgagagcgca 
cccggatgtt 
tagaaacatg 
cgactcacac 
atctatgcat 
ccatgcggcc 
acacatagaa 
tgaagctaaa 
atctcttctt 
tgagcaatat 
ttatgacacg 
gtgcatctag 



agggagcgca 
tctaccaagg 
tgcgtctcgg 
Gttttcaagg 
atccaaataa 
ctggatccgg 
ttcaatgaca 
ttctctaaca 
acgcatgaaa 
aaggatggag 
actccctatt 
atatggggag 
gactcggaat 
aaaaaatatg 
gaagcaaagg 
Gtaacccata 
gtgaggttgg 
gaagatgtag 
aaggaaaatg 
acccttaatg 
gaagatgagg 
tgttctaata 
gttaagcgta 
aactacaaat 
aaacacgact 
tatttcaagg 
aaatatgtgc 
acaaagggtg 
actattacac 
atgtgcacaa 
atcgaacaaa 
tacgcgatcg 
tacactggaa 
gctcctcatt 
ggaaattttg 
tatgttactg 



cgtttaaggt 
aagaagatgc 
tccaggaact 
actacctgaa 
aggctggaga 
caacacacct 
tttcatcccg 
gtgaagaaaa 
aggcaattga 
atcttgattt 
ttcggagtca 
aaattatgaa 
gggaatataa 
gtgccaagtg 
aaaaaaggga 
agcatgaaat 
atgaaactag 
acgaaatttt 
tacctgtaca 
gtttggatga 
aactacaagc 
atagagggaa 
ggagaaaatg 
actttgataa 
gtttagtttt 
tagaaccttc 
agaaaaaaac 
atcttattca 
ttttgagtag 
acaaacaccg 
taaccaatat 
aaaatgtaca 
ctttaaccga 
taacacctgt 
atgaagaatt 
caaaagacaa 



acctgccgat 
aggtatagaa 
acatacaata 
attgcaatct 
ggatcttgaa 
tttctatctt 
tctgaaaatc 
agagaaggat 
agcagccata 
cgggtactta 
atacgggata 
gcaacgagtg 
aaatgtggcc 
tgaaaattta 
tatagaaaga 
gcctgaaaat 
aa tactttta 
gtctgagaat 
cacgttatta 
tgctttaata 
cgcttgtgcg 
taaagtagaa 
tagacactgg 
aaaagttact 
cca tagaaaa 
cccaataaat 
agatggtaca 
aaggatggat 
ggcggatttt 
tattcttcac 
catcagtgat 
agaaatgtac 
gtacaaaaga 
caacaaacca 
agaagacaag 
ctggggcagg 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2190 



<210> 63 
<211> 727 
<212> PRT 
<213> SHRIMP 



<400> 63 



Met 


Asp 


Lys 


Val 


Cys 


Val 


lie 


Ser 


Asn 


Thr 


Arg 


Glu 


Arg 


Thr 


Phe 


Lys 


1 








5 










10 










15 




Val 


Pro 


Ala 


Asp 


Leu 


Leu 


Cys 


Val 


Ala 


Thr 


Glu 


Pro 


Glu 


He 


Ser 


Thr 








20 










25 










30 






Lys 


Glu 


Glu 


Asp 


Ala 


Gly 


He 


Glu 


He 


Glu 


Thr 


Arg 


Val 


Val 


Val 


Phe 






35 










40 










45 








Ser 


Arg 


Cys 


Val 


Ser 


Val 


Gin 


Glu 


Leu 


His 


Thr 


He 


Asn 


Pro 


Asn 


Asp 




50 










55 










60 










Glu 


Gly 


Phe 


Ser 


Vai 


Gin 


Leu 


Phe 


Lys 


Asp 


Tyr 


Leu 


Lys 


Leu 


Gin 


Ser 


65 










70 










75 










80 


Ala 


Gin 


Gly 


Lys 


Lys 


Pro 


He 


Gly 


He 


Gin 


He 


Lys 


Ala 


Gly 


Glu 


Asp 










85 










90 










95 




Leu 


Glu 


Arg 


Arg 


Leu 


lie 


Ser 


Gly 


Gly 


Thr 


Ala 


Tyr 


Leu 


Asp 


Pro 


Ala 








100 










105 










110 






Thr 


His 


Leu 


Phe 


Tyr 


Leu 


Asp 


Phe 


Ser 


Leu 


Tyr 


Pro 


Asn 


Tyr 


Ser 


He 






115 










120 










125 









wo 01/38351 PCT/USOO/28888 

157 



Phe 


Asn 


Asp 


He 


Ser 


Ser 


Arg 


Leu 


Lys 


He 


He 


Asp 


Glu 


Asp 


Thr 


Tyr 




130 










135 










140 










Asn 


Gly 


Val 


Val 


Phe 


Ser 


Asn 


Ser 


Glu 


Glu 


Lys 


Glu 


Lys 


Asp 


Ala 


Leu 


145 










150 










155 










160 


Val 


Leu 


He 


Arg 


Val 


Thr 


Phe 


Ser 


Thr 


His 


Glu 


Lys 


Ala 


He 


Glu 


Ala 










1 65 










170 










175 




Ala 


lie 


Lys 


Lys 


He 


Met 


Leu 


Arg 


Lys 


Val 


Phe 


Phe 


Lys 


Asp 


Gly 


Asp 








180 










185 










190 






Leu 


Asp 


Phe 


Gly 


Tyr 


Leu 


Arg 


He 


Pro 


Lys 


Ser 


Lys 


Leu 


Asp 


Lys 


Phe 






195 










200 










205 








Thr 


Pro 


Tyr 


Phe 


Arg 


Ser 


Gin 


Tyr 


Gly 


He 


Val 


Asn 


Val 


Glu 


Lys 


Asn 




210 










215 










220 










lie 


Pro 


Gly 


Tyr 


He 


Trp 


Gly 


Glu 


He 


Met 


Lys 


Gin 


Arg 


Val 


Arg 


Cys 


225 










230 










235 










240 


Ser 


Arg 


Trp 


Tyr 


Leu 


Tyr 


Asn 


Thr 


Asp 


Ser 


Glu 


Trp 


Glu 


Tyr 


Lys 


Asn 










245 










250 










255 




Val 


Ala 


Glu 


Glu 


Arg 


Val 


Gly 


Pro 


Arg 


Gin 


Leu 


Val 


Lys 


Lys 


Tyr 


Gly 








260 










265 










270 






Ala 


Lys 


Cys 


Glu 


Asn 


Leu 


Cys 


Phe 


Arg 


Asp 


He 


Asp 


Leu 


Arg 


Lys 


Lys 






275 










280 










285 








Glu 


Ala 


Lys 


Glu 


Lys 


Arg 


Asp 


He 


Glu 


Arg 


Glu 


Thr 


Glu 


Ser 


Arg 


Tyr 




290 










295 










300 










Val 


Val 


Val 


Thr 


Leu 


Thr 


His 


Lys 


His 


Glu 


Met 


Pro 


Glu 


Asn 


Met 


Pro 


305 










310 










315 










320 


Tyr 


Phe 


Gly 


Pro 


Lys 


Cys 


Ser 


Val 


Val 


Arg 


Leu 


Asp 


Glu 


Thr 


Arg 


He 










325 










330 










335 




Leu 


Leu 


Cys 


Phe 


Val 


Asp 


Glu 


He 


Ser 


Tyr 


Asn 


Asp 


Glu 


Asp 


Val 


Asp 








340 










345 










350 






Glu 


He 


Leu 


Ser 


Glu 


Asn 


Arg 


Ser 


Leu 


Arg 


Asn 


Val 


Ser 


He 


Arg 


His 






355 










360 










365 








Lys 


Glu 


Asn 


Val 


Pro 


Val 


His 


Thr 


Leu 


Leu 


Lys 


Lys 


Gly 


Val 


Ser 


He 




370 










375 










380 










His 


Ala 


Arg 


Phe 


Thr 


Leu 


Asn 


Gly 


Leu 


Asp 


Asp 


Ala 


Leu 


He 


He 


Leu 


385 










390 










395 










400 


Lys 


Arg 


He 


Pro 


Lys 


Thr 


Tyr 


Phe 


Glu 


Asp 


Glu 


Glu 


Leu 


Gin 


Ala 


Ala 










405 










410 










415 




Cys 


Ala 


His 


Val 


Asn 


Leu 


Glu 


Gin 


Tyr 


Glu 


Trp 


Leu 


Cys 


Ser 


Asn 


Asn 








4 20 










425 










430 






Arg 


Gly 


Asn 


Lys 


Val 


Glu 


His 


Val 


Lys 


Ser 


Arg 


Val 


Val 


Thr 


Arg 


Ala 






435 










440 










445 








Val 


Lys 


Arg 


Arg 


Arg 


Lys 


Cys 


Arg 


His 


Trp 


He 


Tyr 


Phe 


Asp 


Lys 


Asp 




450 










455 










460 










Thr 


Leu 


Asn 


Leu 


Asn 


Tyr 


Lys 


Tyr 


Phe 


Asp 


Lys 


Lys 


Val 


Thr 


Ala 


Ser 


465 










470 










475 










480 


Met 


Ala 


Ser 


Lys 


He 


Cys 


Asn 


Ala 


Lys 


His 


Asp 


Cys 


Leu 


Val 


Phe 


His 










485 










490 










4 95 




Arg 


Lys 


Met 


Glu 


Leu 


Glu 


Asp 


Leu 


Thr 


Glu 


Ser 


Ala 


Tyr 


Phe 


Lys 


Val 








500 










c c 










Oil) 






Glu 


Pro 


Ser 


Pro 


He 


Asn 


Phe 


Ala 


Lys 


Leu 


Lys 


Ser 


Cys 


Pro 


Asp 


Val 






515 










520 










525 








Lys 


Tyr 


Val 


Gin 


Lys 


Lys 


Thr 


Asp 


Gly 


Thr 


Phe 


Ser 


Val 


He 


Arg 


Phe 




530 










535 










540 










Phe 


Arg 


Asn 


Met 


Thr 


Lys 


Gly Asp 


Leu 


He 


Gin 


Arg 


Met 


Asp 


Leu 


Phe 


545 










550 










555 










560 


Cys 


Arg 


Phe 


He 


Pro 


Asp 


Ser 


His 


Thr 


He 


Thr 


Leu 


Leu 


Ser 


Arg 


Ala 










565 










570 










575 




Asp 


Phe 


Tyr 


Ala 


Cys 


Lys 


Arg 


Gly 


Glu 


Ser 


Met 


His 


Met 


Cys 


Thr 


Asn 








580 










585 










590 






Lys 


His 


Arg 


He 


Leu 


His 


Tyr 


Lys 


Phe 


Ser 


Asn 


Ala 


Pro 


His 


Ala 


Ala 






595 










600 










605 








lie 


Glu 


Gin 


He 


Thr 


Asn 


He 


He 


Ser 


Asp 


Thr 


Arg 


Gly 


Arg 


Lys 


Gly 
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610 










615 










620 










lie 


His 


He 


Glu 


Tyr 


Ala 


He 


Glu 


Asn 


Val 


Gin 


Glu 


Met 


Tyr 


Glu 


Glu 


625 










630 










635 










640 


Asp 


Gly 


Arg 


Arg 


Tyr 
645 


Glu 


Ala 


Lys 


Tyr 


Thr 
650 


Gly 


Thr 


Leu 


Thr 


Glu 

655 


Tyr 


Lys 


Arg 


Asn 


Glu 
660 


Asp 


Lys 


Thr 


Phe 


Lys 
665 


Ser 


Leu 


Leu 


Ala 


Pro 
670 


His 


Leu 


Thr 


Pro 


Val 
675 


Asn 


Lys 


Pro 


Tyr 


Asn 
680 


He 


Asn 


His 


Leu 


Tyr 
685 


Glu 


Gin 


Tyr 


Gly 


Asn 
690 


Phe 


Asp 


Glu 


Glu 


Leu 
695 


Glu 


Asp 


Lys 


Leu 


Arg 
700 


Ser 


Gly 


Phe 


He 


Ser 


Tyr 


Asp 


Thr 


Tyr 


Val 


Thr 


Ala 


Lys 


Asp 


Asn 


Trp 


Gly 


Arg 


Cys 


Ala 


705 










710 










715 










720 


Thr 


Gly 


Lys 


Gly 


Ala 
725 


Cys 


He 





















<210> 64 
<211> 966 
<212> DNA 
<213> SHRIMP 



<400> 64 

atgtgcacat 

gatgtgttat 

cccttcacag 

acaacatgtg 

aaggacaaat 

aaggagttga 

atggtgtttg 

aatgaagata 

gatccagaaa 

gacgacaaca 

gaaaaggatg 

tctcctcctg 

cgggccccgt 

ggtgcagatg 

acaaattttg 

gaacatgatg 

gtataa 



taaaaacata 
ccatcaaggc 
aaggcaaggg 
gttcaagaga 
atgagcggaa 
cgcccaacag 
aaggattttg 
aaagttttct 
ataagaccag 
acaatgttga 
aaaaatatgc 
tatgtagagg 
cagcatttaa 
cactagaaga 
ctctcaagag 
gatctaaagc 



caaaatgact 
aactggagat 
aaatttacca 
ggcggcaaac 
aagagtaaaa 
atacttggta 
tgtcccacca 
actagcagat 
tagtgggggt 
cgtgtatgaa 
ctgtactttt 
gttattagta 
tccattcgga 
agaggatgaa 
aggacctgca 
agacctgtac 



acttcaacag 
tggtgcagta 
aacagtctcc 
gccacagagc 
cgtacaatcg 
gcagatgtat 
ggaaagtcgg 
accggaagat 
gatga tgacg 
gaaaacgacc 
tcaattttgg 
gagacagatg 
ggaagttcta 
gttgatggag 
actggccaga 
aacgtcacgt 



aaatatcaaa 
atatcaagac 
cgtttacgag 
attttatcac 
gattcaccct 
actcttggca 
gaacatttgt 
atatgaaaaa 
atgacgacga 
ccagaaatgt 
tctatagagc 
gaccctcatc 
tgttgaacgg 
ttcctgaaag 
actttgtatc 
gcttctccaa 



gaacctttca 
ggtattttca 
aagtcccaat 
cgtctttgca 
cgacaacaca 
agaagagaaa 
acgctactct 
gaagtacgat 
tgatgatgat 
attcgaggtc 
aatgaaaaag 
tcaccctaaa 
ttatggtgca 
agagaggatt 
tgttaaactg 
gcagcgtgga 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

966 



<210> 65 
<211> 321 
<212> PRT 
<213> SHRIMP 



<400> 65 



Met 


Cys 


Thr 


Leu 


Lys 


Thr 


Tyr 


Lys 


Met 


Thr 


Thr 


Ser 


Thr 


Glu 


He 


Ser 


1 








5 










10 










15 




Lys 


Asn 


Leu 


Ser 
20 


Asp 


Val 


Leu 


Ser 


He 
25 


Lys 


Ala 


Thr 


Gly 


Asp 
30 


Trp 


Cys 


Ser 


Asn 


He 
35 


Lys 


Thr 


Val 


Phe 


Ser 
40 


Pro 


Phe 


Thr 


Glu 


Gly 
45 


Lys 


Gly 


Asn 


Leu 


Pro 
50 


Asn 


Ser 


Leu 


Pro 


Phe 
55 


Thr 


Arg 


Ser 


Pro 


Asn 
60 


Thr 


Thr 


Cys 


Gly 


Ser 


Arg 


Glu 


Ala 


Ala 


Asn 


Ala 


Thr 


Glu 


His 


Phe 


He 


Thr 


Val 


Phe 


Ala 


65 










70 










75 










80 


Lys 


Asp 


Lys 


Tyr 


Glu 
85 


Arg 


Lys 


Arg 


Val 


Lys 
90 


Arg 


Thr 


He 


Gly 


Phe 
95 


Thr 


Leu 


Asp 


Asn 


Thr 
100 


Lys 


Glu 


Leu 


Thr 


Pro 
105 


Asn 


Arg 


Tyr 


Leu 


Val 
110 


Ala 


Asp 
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val 


Tyr 


Ser 


Trp 


Gin 


Glu 


Glu 


Lys 


Met 


Val 


Phe 


Glu 


Gly 


Phe 


Cys 


Val 






115 










120 










125 








Pro 


Pro 


Gly 


Lys 


Ser 


Gly 


Thr 


Phe 


Val 


Arg 


Tyr 


Ser 


Asn 


Glu 


Asp 


Lys 




130 










135 










140 










Ser 


Phe 


Leu 


Leu 


Ala 


Asp 


Thr 


Gly Arg 


Tyr 


Met 


Lys 


Lys 


Lys 


Tyr 


Asp 


145 










150 










155 










160 


Asp 


Pro 


Glu 


Asn 


Lys 


Thr 


Ser 


Ser 


Gly 


Gly 


Asp 


Asp 


Asp 


Asp 


Asp 


Asp 










165 










170 










175 




Asp 


Asp 


Asp 


Asp 


Asp 


Asp 


Asn 


Asn 


Asn 


Val 


Asp 


Val 


Tyr 


Glu 


Glu 


Asn 






180 










185 










190 






Asp 


Pro 


Arg 


Asn 


Val 


Phe 


Glu 


Val 


Glu 


Lys 


Asp 


Glu 


Lys 


Tyr 


Ala 


Cys 






195 










200 










205 








Thr 


Phe 


Ser 


He 


Leu 


Val 


Tyr 


Arg 


Ala 


Met 


Lys 


Lys 


Ser 


Pro 


Pro 


Val 




210 










215 










220 










Cys 


Arg 


Gly 


Leu 


Leu 


Val 


Glu 


Thr 


Asp 


Gly 


Pro 


Ser 


Ser 


His 


Pro 


Lys 


225 










230 










235 










240 


Arg 


Ala 


Pro 


Ser 


Ala 


Phe 


Asn 


Pro 


Phe 


Gly 


Gly 


Ser 


Ser 


Met 


Leu 


Asn 










245 










250 










255 




Gly 


Tyr 


Gly 


Ala 


Gly 


Ala 


Asp 


Ala 


Leu 


Glu 


Glu 


Glu 


Asp 


Glu 


Val 


Asp 








260 










265 










270 






Gly 


Val 


Pro 


Glu 


Arg 


Glu 


Arg 


He 


Thr 


Asn 


Phe 


Ala 


Leu 


Lys 


Arg 


Gly 






275 










280 










285 








Pro 


Ala 


Thr 


Gly 


Gin 


Asn 


Phe 


Val 


Ser 


Val 


Lys 


Leu 


Glu 


His 


Asp 


Gly 




290 










295 










300 










Ser 


Lys 


Ala 


Asp 


Leu 


Tyr 


Asn 


Val 


Thr 


Cys 


Phe 


Ser 


Lys 


Gin 


Arg 


Gly 


305 










310 










315 










320 


Val 

































<210> 66 
<211> 1197 
<212> DNA 
<213> SHRIMP 



<400> 66 

atgcaactca 

cccatgtttg 

aataaacatt 

acaatagtca 

gaactaatca 

ttcacggatt 

gcacttgatg 

acaaattcca 

gctcctttta 

gctgaaatgt 

aaggggaaac 

ctcttgttga 

aggagcagcc 

gatcatgcag 

cagttgttag 

ttctctttag 

cctcagcccg 

acttttggtg 

ctagacctcg 

attgaggagg 



ttctttctca 
cgggcaagtc 
gcctgtttgt 
ctcatgccgg 
gtgtgttacc 
tggtgctagt 
gaacttctga 
ttgttaagct 
ctgtaaggtt 
acgctgctgc 
ttgttgtact 
ccaataaaaa 
atacgggtaa 
ctcacttgtt 
acgatggaat 
ctagaggagt 
atcttgtatt 
tcgaaagatt 
caaataagga 
tgcaaactaa 



tcatctaacc 
tacctacctg 
caaacactcc 
agaagtgatt 
agaagttgtg 
caatagactg 
ccagcaaatg 
agcatctaaa 
tggtaatgac 
ctgccgggac 
tgaaggaggt 
ctcgcctctt 
actcatcaat 
attttctgca 
ccatgttgtg 
ggataccgtt 
gttgatgctt 
tgagacaaat 
cgaaaagaat 
aattataaat 



atggctggtc 
aaaaacatat 
ctagaaacta 
gaaggttgta 
gatgtgattc 
gctgacaagg 
ttcagtccta 
tgtatgattt 
aatgataata 
tgttacaaaa 
gacaggtgcg 
tatggaggag 
gattatttaa 
aataga tggg 
atggatagat 
gagtggtgct 
ttagatgttg 
tccattcaag 
gtatggatta 
attgtatata 



gtgtagagct 
accaacaaga 
ggtacggttg 
ctacagtttc 
tcattgacga 
ggaaaaggat 
ttcataagct 
gtaaaattga 
atgttatatg 
aaattaacaa 
gtaagagtac 
aatatatgtg 
ctaagaaaat 
aagtttgtag 
attactactc 
ctgctagcga 
aaaagtgttc 
aacgtgctag 
aggtagacgc 
atattgttga 



cgtcactgga 
aaatggaggc 
tggaactgga 
ttctatcaag 
agggcaattc 
tgtgattgca 
attgccttat 
taccaaagaa 
tgtaggagga 
gaaaaagaac 
ccaagccaaa 
ctttcccgac 
tgaactagat 
taaaattaag 
ggggattgtt 
tgagggactt 
aaatagggat 
agccctattc 
tcgcggcacc 
agaataa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1197 



<210> 67 

<211> 394 

<212> PRT 

<213> SHRIMP 
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Met 


Gin 


Leu 


He 


Leu 


Ser 


His 


His 


Leu 


Thr 


1 








5 










10 


Leu 


Val 


Thr 


Gly 


Pro 


Met 


Phe 


Ala 


Gly 


Lys 








20 










25 




He 


Tyr 


Gin 


Gin 


Glu 


Asn 


Gly Gly 


Asn 


Lys 






35 










40 






His 


Ser 


Leu 


Glu 


Thr 


Arg 


Tyr 


Gly 


Cys 


Gly 




50 










55 








His 


Ala 


Gly 


Glu 


Val 


He 


Glu 


Gly 


Cys 


Thr 


65 










70 










Glu 


Leu 


He 


Ser 


Val 


Leu 


Pro 


Glu 


Val 


Val 










85 










90 


Glu 


Gly 


Gin 


Phe 


Phe 


Thr 


Asp 


Leu 


Val 


Leu 








100 










105 




Lys 


Gly 


Lys 


Arg 


He 


Val 


He 


Ala 


Ala 


Leu 






115 










120 






Gin 


Met 


Phe 


Ser 


Pro 


He 


His 


Lys 


Leu 


Leu 




130 










135 








Val 


Lys 


Leu 


Ala 


Ser 


Lys 


Cys 


Met 


He 


Cys 


145 










150 










Ala 


Pro 


Phe 


Thr 


Val 


Arg 


Phe 


Gly 


Asn 


Asp 










165 










170 


Cys 


Val 


Gly 


Gly 


Ala 


Glu 


Met 


Tyr 


Ala 


Ala 








180 










185 




Lys 


Lys 


He 


Asn 


Lys 


Lys 


Lys 


Asn 


Lys 


Gly 






195 










200 






Gly 


Gly 


Asp 


Arg 


Cys 


Gly 


Lys 


Ser 


Thr 


Gin 




210 










215 








Asn 


Lys 


Asn 


Ser 


Pro 


Leu 


Tyr 


Gly 


Gly 


Glu 


225 










230 










Arg 


Ser 


Ser 


His 


Thr 


Gly 


Lys 


Leu 


He 


Asn 










245 










250 


He 


Glu 


Leu 


Asp 


Asp 


His 


Ala 


Ala 


His 


Leu 








260 










265 




Trp 


Glu 


Val 


Cys 


Ser 


Lys 


He 


Lys 


Gin 


Leu 






275 










280 






Val 


Val 


Met 


Asp 


Arg 


Tyr 


Tyr 


Tyr 


Ser 


Gly 




290 










295 








Val 


Asp 


Thr 


Val 


Glu 


Trp 


Cys 


Ser 


Ala 


Ser 


305 










310 










Pro 


Asp 


Leu 


Val 


Leu 


Leu 


Met 


Leu 


Leu 


Asp 










325 










330 


Arg 


Asp 


Thr 


Phe 


Gly 


Val 


Glu 


Arg 


Phe 


Glu 








340 










345 




Arg 


Ala 


Arg 


Ala 


Leu 


Phe 


Leu 


Asp 


Leu 


Ala 






355 










360 






Val 


Trp 


He 


Lys 


Val 


Asp 


Arg 


Thr 


He 


Glu 




370 










375 








He 


Asn 


He 


Val 


Tyr 


Asn 


He 


Val 


Glu 


Glu 



385 390 



<210> 68 

<211> 486 

<212> DNA 

<213> SHRIMP 



PCT/USOO/28888 



Met 


Ala 


Gly 


Arg 


Val 
15 


Glu 


Ser 


Thr 


Tyr 


Leu 
30 


Lys 


Asn 


His 


Cys 


Leu 
45 


Phe 


Val 


Lys 


Thr 


Gly 
60 


Thr 


He 


Val 


Thr 


Thr 


Val 


Ser 


Ser 


He 


Lys 


75 










80 


Asp 


Val 


He 


Leu 


He 
95 


Asp 


Val 


Asn 


Arg 


Leu 
110 


Ala 


Asp 


Asp 


Gly 


Thr 

125 


Ser 


Asp 


Gin 


Pro 


Tyr 
140 


Thr 


Asn 


Ser 


He 


Lys 


He 


Asp 


Thr 


Lys 


Glu 


155 










160 


Asn 


Asp 


Asn 


Asn 


Val 
175 


He 


Ala 


Cys 


Arg 


Asp 
190 


Cys 


Tyr 


Lys 


Leu 


Val 
205 


Val 


Leu 


Glu 


Ala 


Lys 
220 


Leu 


Leu 


Leu 


Thr 


Tyr 


Met 


Cys 


Phe 


Pro 


Asp 


235 










240 


Asp 


Tyr 


Leu 


Thr 


Lys 
255 


Lys 


Leu 


Phe 


Ser 


Ala 
270 


Asn 


Arg 


Leu 


Asp 


Asp 
285 


Gly 


He 


His 


He 


Val 
300 


Phe 


Ser 


Leu 


Arg 


Asp 


Glu 


Gly 


Leu 


Pro 


Gin 


315 










320 


Val 


Glu 


Lys 


Cys 


Ser 

335 


Asn 


Thr 


Asn 


Ser 


He 
350 


Gin 


Glu 


Asn 


Lys 


Asp 
365 


Glu 


Lys 


Asn 


Glu 


Val 


Gin 


Thr 


Lys 


He 



380 



<400> 68 

atgttaccta gaaagacttt gcccgacact gaaaatggtt attttgtctt ggacgagtct 60 
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cttctggaga 
tatatgcaga 
tatagtgtgt 
gatggggtat 
gggtgctaca 
cagcaaatga 
tctagaacgt 
tgttaa 



aggtgtacta 
tatgcaagtc 
tggaggatta 
ttttaggacc 
aagctctttt 
aaaccaacct 
acaatacacc 



tgataacaac 
aaaatacatc 
tcaccccatc 
gtgggagttt 
gtcattgcca 
ggaaaaaatg 
ccagatagaa 



aatgaactga 
ttccatcacg 
aaagagattg 
ttatcgcgca 
gaggacaaat 
gaagaaatac 
ttgtctgacc 



ttgtaagagt 
atgatccaga 
ttgaacgact 
aacaagtgaa 
attgtaacct 
agcgtactag 
agctagatgg 



tggtgggatt 120 
gaggttcttt 180 
agcagaagag 240 
cctccaacac 300 
attattaccc 360 
actcattcac 420 
atgtgttata 480 
486 



<210> 69 
<211> 161 
<212> PRT 
<213> SHRIMP 



<400> 69 



Met 


Leu 


Pro 


Arg 


Lys 


Thr 


Leu 


Pro 


Asp 


Thr 


Glu 


Asn 


Gly 


Tyr 


Phe 


Val 


1 








5 










10 










15 




Leu 


Asp 


Glu 


Ser 


Leu 


Leu 


Glu 


Lys 


Val 


Tyr 


Tyr 


Asp 


Asn 


Asn 


Asn 


Glu 








20 










25 










30 






Leu 


He 


Val 


Arg 


Val 


Gly 


Gly 


He 


Tyr 


Met 


Gin 


He 


Cys 


Lys 


Ser 


Lys 






35 










40 










45 








Tyr 


He 


Phe 


His 


His 


Asp 


Asp 


Pro 


Glu 


Arg 


Phe 


Phe 


Tyr 


Ser 


Val 


Leu 




50 










55 










60 










Glu 


Asp 


Tyr 


His 


Pro 


He 


Lys 


Glu 


He 


Val 


Glu 


Arg 


Leu 


Ala 


Glu 


Glu 


65 










70 










75 










80 


Asp 


Gly 


Val 


Phe 


Leu 


Gly 


Pro 


Trp 


Glu 


Phe 


Leu 


Ser 


Arg 


Lys 


Gin 


Val 










85 










90 










95 




Asn 


Leu 


Gin 


His 


Gly 


Cys 


Tyr 


Lys 


Ala 


Leu 


Leu 


Ser 


Leu 


Pro 


Glu 


Asp 








100 










105 










110 






Lys 


Tyr 


Cys 


Asn 


Leu 


Leu 


Leu 


Pro 


Gin 


Gin 


Met 


Lys 


Thr 


Asn 


Leu 


Glu 






115 










120 










125 








Lys 


Met 


Glu 


Glu 


He 


Gin 


Arg 


Thr 


Arg 


Leu 


He 


His 


Ser 


Arg 


Thr 


Tyr 




130 










135 










140 










Asn 


Thr 


Pro 


Gin 


He 


Glu 


Leu 


Ser 


Asp 


Gin 


Leu 


Asp 


Gly 


Cys 


Val 


He 


145 










150 










155 










160 



<210> 70 
<211> 1926 
<212> DNA 
<213> SHRIMP 



<400> 70 

atggttgctt 

acaaactttc 

gatgtagttt 

agtatagttt 

caaatggttt 

aatgaagaag 

ctcctcaact 

tttgacaggg 

tccacgtcaa 

aatatggaac 

tcttttacta 

ttcatatgca 

tgccctgata 

aattaccctg 

ggggatcata 

gaaattatag 



caactccgtg 
ttgaagctca 
attgtgactc 
ctaccattgg 
cttccgtc tt 
ataaaggtat 
cattgttggt 
agaatatatg 
cttttcagac 
gtggcaaaca 
actttgaaaa 
atgaaaagga 
aaactaa tga 
cattgctata 
atgaatctct 
agtttataag 



tccaggccca 
caagcttgtc 
tgagacgtac 
agactatgtc 
agaaaaattt 
tcctctgtgg 
tcataacaac 
tgatgcttca 
aatttgcaaa 
aggagggagt 
tgaagaaatg 
aaaactagac 
tgaagggtac 
tctcgtttgt 
caaagcgttt 
tgatcactac 



ggaccagttc 
gtggaacttc 
accaaaccta 
ttatcaaacc 
cccttgctat 
aagaagttgt 
aagaactgga 
ggaaggagtg 
aacaacacac 
tttcttcact 
gactctcatg 
tctttcatac 
atcccgctgg 
aggtatggag 
gcaataagaa 
agtttcaaca 



caacccaaga 
ttctcccgtc 
taccgatttt 
ccaatgaaga 
tccactgcac 
acaacaaaag 
ctcctgttcc 
ttcttatgag 
attacttgtt 
tctttgcatc 
tgctcagtaa 
ctgccaacgg 
aaatagcaat 
catcttgggc 
atgatgcaaa 
aaaatgtgac 



acttctttct 
ctacagtagt 
tgggaacaag 
tgtgagttac 
ttataagacg 
aaaattcaaa 
agctatcccg 
tgaaataatg 
tgatatgtta 
taggaagaat 
catagcgaaa 
aaaaatacca 
tatggaagac 
aaacaca tac 
agattgtctg 
gaaggaagaa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 
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tttgttaaag 
cgttgttaca 
tctaaagcta 
aaatgtcccc 
atggatttgt 
aaattagaca 
ctagcaaaaa 
aaaaggacag 
acagcagaaa 
aaaataattg 
tcaactgcag 
gattccgatc 
gaagaaacaa 
ctaactaaag 
ataattgcga 
acttctcata 
gaataa 



agaagactgt 
aactcccatg 
actttagatg 
caactataca 
tcaatcgtgc 
aaaaatcaaa 
ggacagcaga 
cagaaattga 
ttgaagaggc 
aaaagggaaa 
ccataaacat 
aggagttggc 
aacttgacag 
atgatgaaac 
agaaaaacaa 
ttgagaaact 



agaatgtgtt 
tggacatttc 
tgttaaatgt 
atggaaaatg 
atttgacaca 
acctaaacac 
aattgaagag 
agaggccata 
catgaagaaa 
aagacgactg 
gtacaaagag 
tgaggcgaaa 
atttttgttt 
gttagccttc 
ccaaaaaggc 
acctcaatcc 



ggatgtttat 
atgcatacat 
ttccaaacct 
ggtataaacc 
tatttagatt 
aaacctgaaa 
gccataaaga 
aagaaaaagg 
aaggaagaag 
aatgaagaat 
aaagtgagaa 
gagaggttga 
agaccgaaac 
aagttagccc 
agtgaaagaa 
actgctttgg 



atgatattga 
tttgcttgtc 
ttgatgacac 
aaacgactaa 
ttatttgctc 
acaaaaaggt 
aaaaggaaga 
aagaagaact 
aagaactctc 
gtgtcaagct 
ttaatggtgt 
ggaaaatttt 
gagtagaaga 
tagaaaagaa 
gagatggaga 
ctagtgtgtg 



agacgagaaa 
taataagtgt 
aatttttaga 
ccataaggaa 
atataacgtc 
ggaagaagaa 
agaactagca 
agcaaaaagg 
aaaatataat 
gagagatatt 
attactaaaa 
attgctagaa 
acgtatattc 
aacggaggac 
atatactata 
tgtgttaaac 



1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1926 



<210> 71 
<211> 637 
<212> PRT 
<213> SHRIMP 



<400> 71 



Met 


Val 


Ala 


Ser 


Thr 


Pro 


Cys 


Pro 


Gly 


Pro 


Gly 


Pro 


Val 


Pro 


Thr 


Gin 


1 








5 










10 










15 




Glu 


Leu 


Leu 


Ser 


Thr 


Asn 


Phe 


Leu 


Glu 


Ala 


His 


Lys 


Leu 


Val 


Val 


Glu 








20 










25 










30 






Leu 


Leu 


Leu 


Pro 


Ser 


Tyr 


Ser 


Ser 


Asp 


Val 


Val 


Tyr 


Cys 


Asp 


Ser 


Glu 






35 










40 










45 








Thr 


Tyr 


Thr 


Lys 


Pro 


lie 


Pro 


lie 


Phe 


Gly 


Asn 


Lys 


Ser 


He 


Val 


Ser 




50 










55 










60 










Thr 


lie 


Gly 


Asp 


Tyr 


Val 


Leu 


Ser 


Asn 


Pro 


Asn 


Glu 


Asp 


Val 


Ser 


Tyr 


65 










70 










75 










80 


Gin 


Met 


Val 


Ser 


Ser 


Val 


Leu 


Glu 


Lys 


Phe 


Pro 


Leu 


Leu 


Phe 


His 


Cys 










85 










90 










95 




Thr 


Tyr 


Lys 


Thr 


Asn 


Glu 


Glu 


Asp 


Lys 


Gly 


He 


Pro 


Leu 


Trp 


Lys 


Lys 








100 










105 










110 






Leu 


Tyr 


Asn 


Lys 


Arg 


Lys 


Phe 


Lys 


Leu 


Leu 


Asn 


Ser 


Leu 


Leu 


Val 


His 






115 










120 










125 








Asn 


Asn 


Lys 


Asn 


Trp 


Thr 


Pro 


Val 


Pro 


Ala 


He 


Pro 


Phe 


Asp 


Arg 


Glu 




130 










135 










140 










Asn 


lie 


Cys 


Asp 


Ala 


Ser 


Gly 


Arg 


Ser 


Val 


Leu 


Met 


Ser 


Glu 


He 


Met 


145 










150 










155 










160 


Ser 


Thr 


Ser 


Thr 


Phe 


Gin 


Thr 


lie 


Cys 


Lys 


Asn 


Asn 


Thr 


His 


Tyr 


Leu 










165 










170 










175 




Phe 


Asp 


Met 


Leu 


Asn 


Met 


Glu 


Arg 


Gly 


Lys 


Gin 


Gly 


Gly 


Ser 


Phe 


Leu 








180 










185 










190 






His 


Phe 


Phe 


Ala 


Ser 


Arg 


Lys 


Asn 


Ser 


Phe 


Thr 


Asn 


Phe 


Glu 


Asn 


Glu 






195 










200 










205 








Glu 


Met 


Asp 


Ser 


His 


Val 


Leu 


Ser 


Asn 


He 


Ala 


Lys 


Phe 


He 


Cys 


Asn 




210 










215 










220 










Glu 


Lys 


Glu 


Lys 


Leu 


Asp 


Ser 


Phe 


He 


Pro 


Ala 


Asn 


Gly 


Lys 


He 


Pro 


225 










230 










235 










240 


Cys 


Pro 


Asp 


Lys 


Thr 


Asn 


Asp 


Glu 


Gly 


Tyr 


He 


Pro 


Leu 


Glu 


He 


Ala 










245 










250 










255 




lie 


Met 


Glu 


Asp 


Asn 


Tyr 


Pro 


Ala 


Leu 


Leu 


Tyr 


Leu 


Val 


Cys 


Arg 


Tyr 








260 










265 










270 






Gly 


Ala 


Ser 


Trp 


Ala 


Asn 


Thr 


Tyr 


Gly 


Asp 


His 


Asn 


Glu 


Ser 


Leu 


Lys 






275 










280 










285 








Ala 


Phe 


Ala 


lie 


Arg 


Asn 


Asp 


Ala 


Lys 


Asp 


Cys 


Leu 


Glu 


He 


He 


Glu 
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290 










295 










300 










Phe 


lie 


Ser 


Asp 


His 


Tyr 


Ser 


Phe 


Asn 


Lys 


Asn 


Val 


Thr 


Lys 


Glu 


Glu 


305 










310 










315 










320 


Phe 


Val 


Lys 


Glu 


Lys 


Thr 


Val 


Glu 


Cys 


Val 


Gly 


Cys 


Leu 


Tyr 


Asp 


He 










325 










330 










335 




Glu 


Asp 


Glu 


Lys 


Arg 


Cys 


Tyr 


Lys 


Leu 


Pro 


Cys 


Gly 


His 


Phe 


Met 


His 








340 










345 










350 






Thr 


Phe 


Cys 


Leu 


Ser 


Asn 


Lys 


Cys 


Ser 


Lys 


Ala 


Asn 


Phe 


Arg 


Cys 


Val 






355 










360 










365 








Lys 


Cys 


Phe 


Gin 


Thr 


Phe 


Asp 


Asp 


Thr 


lie 


Phe 


Arg 


Lys 


Cys 


Pro 


Pro 




370 










375 










380 










Thr 


lie 


Gin 


Trp 


Lys 


Met 


Gly 


lie 


Asn 


Gin 


Thr 


Thr 


Asn 


His 


Lys 


Glu 


385 










390 










395 










400 


Met 


Asp 


Leu 


Phe 


Asn 


Arg 


Ala 


Phe 


Asp 


Thr 


Tyr 


Leu 


Asp 


Phe 


He 


Cys 










405 










410 










415 




Ser 


Tyr 


Asn 


Val 


Lys 


Leu 


Asp 


Lys 


Lys 


Ser 


Lys 


Pro 


Lys 


His 


Lys 


Pro 








420 










425 










430 






Glu 


Asn 


Lys 


Lys 


Val 


Glu 


Glu 


Glu 


Leu 


Ala 


Lys 


Arg 


Thr 


Ala 


Glu 


He 






435 










440 










445 








Glu 


Glu 


Ala 


lie 


Lys 


Lys 


Lys 


Glu 


Glu 


Glu 


Leu 


Ala 


Lys 


Arg 


Thr 


Ala 




450 










455 










460 










Glu 


lie 


Glu 


Glu 


Ala 


lie 


Lys 


Lys 


Lys 


Glu 


Glu 


Glu 


Leu 


Ala 


Lys 


Arg 


4 65 










470 










475 










480 


Thr 


Ala 


Glu 


lie 


Glu 


Glu 


Ala 


Met 


Lys 


Lys 


Lys 


Glu 


Glu 


Glu 


Glu 


Leu 










485 










490 










495 




Ser 


Lys 


Tyr 


Asn 


Lys 


lie 


lie 


Glu 


Lys 


Gly 


Lys 


Arg 


Arg 


Leu 


Asn 


Glu 








500 










505 










510 






Glu 


Cys 


Val 


Lys 


Leu 


Arg 


Asp 


lie 


Ser 


Thr 


Ala 


Ala 


He 


Asn 


Met 


Tyr 






bl D 










52 0 










525 








Lys 


Glu 


Lys 


Val 


Arg 


lie 


Asn 


Gly 


Val 


Leu 


Leu 


Lys 


Asp 


Ser 


Asp 


Gin 




530 










535 










540 










Glu 


Leu 


Ala 


Glu 


Ala 


Lys 


Glu 


Arg 


Leu 


Arg 


Lys 


He 


Leu 


Leu 


Leu 


Glu 


545 










550 










555 










560 


Glu 


Glu 


Thr 


Lys 


Leu 


Asp 


Arg 


Phe 


Leu 


Phe 


Arg 


Pro 


Lys 


Arg 


Val 


Glu 










565 










570 










575 




Glu 


Arg 


lie 


Phe 


Leu 


Thr 


Lys 


Asp 


Asp 


Glu 


Thr 


Leu 


Ala 


Phe 


Lys 


Leu 








580 










585 










590 






Ala 


Leu 


Glu 


Lys 


Lys 


Thr 


Glu 


Asp 


lie 


lie 


Ala 


Lys 


Lys 


Asn 


Asn 


Gin 






595 










600 










605 








Lys 


Gly 


Ser 


Arg 


Asp 


Gly 


Glu 


Tyr 


Thr 


He 


Thr 


Ser 


His 


He 


Glu 


Lys 




610 










615 










620 










Leu 


Pro 


Gin 


Ser 


Thr 


Ala 


Ser 


Val 


Cys 


Val 


Leu 


Asn 


Glu 









625 630 635 



<210> 72 

<211> 780 

<212> DNA 

<213> SHRIMP 



<400> 72 

atgtctactt 

ccattcgacc 

tttttcatta 

tcaaagatta 

aatatataca 

attatactga 

attcttccca 

aacaatgaac 

caaaatctag 

ataaaagcgt 



gttcgaattt 
tcgtccatac 
ctgaaacgtg 
ttgacacggg 
aagctataaa 
tggatgtaga 
agggtaaaaa 
ggcattacta 
tagattatta 
gcgcatattt 



gttgtcagta 
acgtcaagag 
taaaggagag 
taataatgat 
ccatttagaa 
aacaatgata 
tggaagtttc 
ccctgtattt 
taatgaaatt 
catgtacaac 



tttggtggag 
tgtgataaaa 
aatattggta 
tctacctcaa 
aatatcctaa 
ctggaaactc 
agtacatgcg 
gaaacagaga 
gtagctcaca 
tttctaactc 



gagattggac 
agagagagca 
tacattcgta 
tagaggaact 
aactcaacaa 
ataaaatttt 
tacgctttgc 
aagaagcgtt 
ccaatgacca 
tccacccttt 



aacaacattc 
agactactca 
tgaacacacg 
agaagtactg 
aggagaaaaa 
aatgaaaggg 
tgtaaataag 
caattctata 
aattaaaata 
caatgatggt 
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600 
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aatggaagaa cagctagatt attgtatagt tttctattga aaggtaatgg tatcgtacct 660 
catttttcac ccataacaca ccctagggat caatttgttg atactttagt gtattttaga 720 
gaacatggag atggacgacc tttattgtat gttttgctgg aatcaataaa aaataagtaa 780 



<210> 73 

<211> 255 

<212> PRT 

<213> SHRIMP 



<400> 73 
Met Ser Thr Cys 
1 

Thr Thr Thr Phe 
20 

Lys Lys Arg Glu 
35 

Gly Glu Asn lie 
50 

Asp Thr Gly Asn 
65 

Asn lie Tyr Lys 

Lys Gly Glu Lys 
100 

Lys lie Leu Met 
115 

Ser Thr Cys Val 
130 

Tyr Pro Val Phe 
145 

Leu Val Asp Tyr 

Lys lie lie Lys 
180 

His Pro Phe Asn 
195 

Leu Lys Gly Asn 
210 

Arg Asp Gin Phe 

225 

Gly Arg Pro Leu 



Ser 


Asn 


Leu 


Leu 


Ser 


Val 


Phe 


Gly 


Gly 


Gly 


Asp 


Trp 


5 










10 










15 




Pro 


Phe 


Asp 


Leu 


Val 


His 


Thr 


Arg 


Gin 


Glu 


Cys 


Asp 










25 










30 






Gin 


Asp 


Tyr 


Ser 


Phe 


Phe 


He 


Thr 


Glu 


Thr 


Cys 


Lys 








40 










45 








Gly 


He 


His 


Ser 


Tyr 


Glu 


His 


Thr 


Ser 


Lys 


He 


He 






55 










60 










Asn 


Asp 


Ser 


Thr 


Ser 


He 


Glu 


Glu 


Leu 


Glu 


Val 


Leu 




70 










75 










80 


Ala 


He 


Asn 


His 


Leu 


Glu 


Asn 


He 


Leu 


Lys 


Leu 


Asn 


85 










90 










95 




He 


He 


Leu 


Met 


Asp 


Val 


Glu 


Thr 


Met 


He 


Thr 


His 










105 










110 






Lys 


Gly 


He 


Leu 


Pro 


Lys 


Gly 


Lys 


Asn 


Gly 


Ser 


Phe 








120 










125 








Arg 


Phe 


Ala 


Val 


Asn 


Lys 


Asn 


Asn 


Glu 


Arg 


His 


Tyr 






135 










140 










Glu 


Thr 


Glu 


Lys 


Glu 


Ala 


Phe 


Asn 


Ser 


He 


Gin 


Asn 




150 










155 










160 


Tyr 


Asn 


Glu 


He 


Val 


Ala 


His 


Thr 


Asn 


Asp 


Gin 


He 


165 










170 










175 




Ala 


Cys 


Ala 


Tyr 


Phe 


Met 


Tyr 


Asn 


Phe 


Leu 


Thr 


Leu 










185 










190 






Asp 


Gly 


Asn 


Gly 


Arg 


Thr 


Ala 


Arg 


Leu 


Lys 


Phe 


Leu 








200 










205 








Gly 


He 


Val 


Pro 


His 


Phe 


Ser 


Pro 


He 


Thr 


His 


Pro 






215 










220 










Val 


Asp 


Thr 


Leu 


Val 


Tyr 


Phe 


Arg 


Glu 


His 


Gly 


Asp 




230 










235 










240 


Leu 


Tyr 


Val 


Leu 


Leu 


Glu 


Ser 


He 


Lys 


Asn 


Lys 




245 










250 










255 





<210> 74 
<211> 480 
<212> DNA 
<213> SHRIMP 



<400> 74 

atggaggacc 

caatggacaa 

gaggaacaaa 

ctcgcagtaa 

aaagaattga 

gcaacctcta 

acactcaaag 

atcaataaca 



taaaatccac 
atactgtaga 
aaagggccct 
aagaagggat 
taaatacgtg 
ccaataatca 
aaaagtacaa 
aaagggatta 



tatcgagaga 
ggaagaagaa 
ggacgcatgg 
atctgcactc 
gatagccgaa 
actgaagaac 
caaatattac 
tgaagcaagt 



gtatatgaag 
aggactgtct 
gaagcagcga 
gttttcaacg 
agggaaacgt 
cagatgtcat 
agaagaagtg 
caattttggg 



aaagag tgga 
cagcaatcga 
taaaggaacg 
cagcagacgc 
cagaaaaaag 
ctctagtcaa 
ccatactcaa 
tgta tacaaa 



gaatctagaa 
ttctgtcctg 
agaaaacgac 
caaaacacgt 
aagaaaggaa 
cacaaccaaa 
catgcaatac 
caatgcataa 
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<210> 75 
<211> 159 
<212> PRT 
<213> SHRIMP 



<400> 75 

Met Glu Asp Leu Lys Ser Thr lie Glu Arg Val Tyr Glu Glu Arg Val 

15 10 15 

Glu Asn Leu Glu Gin Trp Thr Asn Thr Val Glu Glu Glu Glu Arg Thr 

20 25 30 

Val Ser Ala lie Asp Ser Val Leu Glu Glu Gin Lys Arg Ala Leu Asp 

35 40 45 

Ala Trp Glu Ala Ala lie Lys Glu Arg Glu Asn Asp Leu Ala Val Lys 

50 55 60 

Glu Gly lie Ser Ala Leu Val Phe Asn Ala Ala Asp Ala Lys Thr Arg 
65 70 75 80 

Lys Glu Leu lie Asn Thr Trp lie Ala Glu Arg Glu Thr Ser Glu Lys 

85 90 95 

Arg Arg Lys Glu Ala Thr Ser Thr Asn Asn Gin Leu Lys Asn Gin Met 

100 105 110 

Ser Ser Leu Val Asn Thr Thr Lys Thr Leu Lys Glu Lys Tyr Asn Lys 

115 120 125 

Tyr Tyr Arg Arg Ser Ala lie Leu Asn Met Gin Tyr lie Asn Asn Lys 

130 135 140 

Arg Asp Tyr Glu Ala Ser Gin Phe Trp Val Tyr Thr Asn Asn Ala 
145 150 155 



<210> 76 
<211> 321 
<212> DNA 
<213> SHRIMP 



<400> 76 

atgcataagt 

gtccctgatg 

ggtggcggcg 

cctgttctta 

gtccttatca 

caggacgagg 



tttcaaataa attttatttt ataataaaag gtgttttaat tataattttt 60 

ttgtgttttc tatattttta ctgcctcctc ttggggtaag acataaaaac 120 

gaaacgagga acagaagagc ggacccagcc agaagcatca tatccctggt 180 

tatttgtcct catcatcgtt atcgttggca gtgtcgtcat catcatcggt 240 

gtgtcaggat cgctgtcctt ctttggtccc atccatacat tcatgacggc 300 

ataccaattg a 321 



<210> 77 
<211> 106 
<212> PRT 
<213> SHRIMP 



<400> 77 



Met 


His 


Lys 


Phe 


Ser 


Asn 


Lys 


Phe 


Tyr 


Phe 


He 


He 


Lys 


Gly 


Val 


Leu 


1 








5 










10 










15 




He 


lie 


He 


Phe 


V^l 


Pro 


Asp 


Val 


Val 


Phe 


Ser 


He 


Phe 


Leu 


Leu 


Pro 








20 










25 










30 






Pro 


Leu 


Gly 


Val 


Arg 


His 


Lys 


Asn 


Gly 


Gly 


Gly 


Gly 


Asn 


Glu 


Glu 


Gin 






35 










40 










45 








Lys 


Ser 


Gly 


Pro 


Ser 


Gin 


Lys 


His 


His 


He 


Pro 


Gly 


Pro 


Val 


Leu 


He 




50 










55 










60 










Phe 


Val 


Leu 


He 


He 


Val 


He 


Val 


Gly 


Ser 


Val 


Val 


He 


He 


He 


Gly 


65 










70 










75 










80 


Val 


Leu 


He 


Ser 


Val 


Arg 


He 


Ala 


Val 


Leu 


Leu 


Trp 


Ser 


His 


Pro 


Tyr 










85 










90 










95 




He 


His 


Asp 


Gly 


Gin 


Asp 


Glu 


Asp 


Thr 


Asn 















100 105 
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<210> 78 
<211> 1635 
<212> DNA 
<213> SHRIMP 



<400> 78 

atgttccgtc 

tccacggcgt 

ggagaaccac 

tgcgaggacc 

atacaggtaa 

attgatagga 

gacgggggtg 

aagaaattgt 

gaaaacagta 

gaatgtggat 

gagattaaag 

tcattaattg 

gcagtcaact 

tattgttgtg 

tctttaacag 

gagtgcgtgc 

agggactgct 

ttcctggaca 

tcttccaaca 

aggagtaact 

gcatcagcct 

atggtaagaa 

gcctctaaag 

gcatctttac 

aaatggatga 

tggagttaca 

ccctatcaga 

ccattatact 



aattctgttc 
ccgcttcggc 
cttctcataa 
cagaaagggt 
caaaagaatg 
gcggaattga 
gaaatggggg 
cctttgttgg 
cagaatttgc 
gcatcaaatc 
aggctgctga 
actggttttt 
ggtacttgaa 
tcttttccta 
tggcttcatc 
aacacttt tt 
tcttccctct 
caatgaagag 
acaagagaaa 
tttctacagc 
gtattcttat 
aatcaactat 
acactggtgc 
tcctccctgg 
ctctagcaga 
cgctagcaga 
catgcggcaa 
tttaa 



actatatctt 
^gcggcctca 
aatgagggga 
atatgttgat 
ggaattgttt 
gaataatggc 
aagaattgaa 
aagagaagat 
acgtcgttta 
aaaagaggaa 
cgttaggggt 
tgagttcgac 
atctcaagcg 
ca taagacgc 
cttttctccc 
aaaatcagac 
tttaactgaa 
acctacctta 
gagaaatact 
atccaataac 
cgaagggtat 
tattccagaa 
gaatatctta 
actttttaga 
aaacaacatc 
ttattgtcct 
ttttacttcg 



ct tcaacgcc 
ttaaaaggag 
ccttcttata 
attgtagtgt 
tccgataagt 
gaaggagaag 
gacagagaag 
ccagtcgctg 
acactcaagg 
ctctttgact 
atggcaaaca 
acatatagta 
tctccaattt 
caaacctttt 
acgcccgaca 
attaatatta 
atgccccgcc 
tctcttctac 
gccgctgcca 
aagagactga 
gcgaatggaa 
gtgtttaacc 
ttttttatca 
caccccaaac 
aacgacaaca 
ctgggctatt 
actacaaaca 



gggttaacga 
atggaactga 
gcgttttagg 
ctattttgca 
tgagaaaatt 
aagatggaga 
cacatcgacg 
tagatttacc 
aattgtgcga 
tcatttttga 
ggagtaaatt 
aatgtgtagt 
cattggtact 
taactagggc 
caaagctttt 
gccagatggc 
aacaaaaaaa 
cttccacctc 
atattcttct 
aaactgatga 
aaataagccc 
atcttttgtt 
aaatgaaatc 
aatttctcaa 
acataaactc 
acacccaaga 
agagactaca 



caatcttaga 
atttataacc 
acctgatccg 
gacaaataat 
gggtccatgg 
tgaaaatgaa 
aaaaatgatg 
cacgtggcga 
tttaatagtt 
agaaccgtgg 
caccaaggaa 
attttttgaa 
agatgatata 
aaaaaaccca 
ggctatcgac 
attaactgaa 
agtaaacacc 
ctcctcctct 
tccagtgtac 
tggggaaaat 
tataaggatt 
ccctgtcttt 
ctttgcaagt 
cgggccgtgc 
ttccacgatg 
gagccctcaa 
aaacgtgcag 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1635 



<210> 79 
<211> 540 
<212> PRT 
<213> SHRIMP 



<400> 79 



Met 


Phe 


Arg 


Gin 


Phe 


Cys 


Ser 


Leu 


1 








5 








Asp 


Asn 


Leu 


Arg 


Ser 


Thr 


Ala 


Ser 








20 










Gly 


Asp 


Gly 


Thr 


Glu 


Phe 


He 


Thr 






35 










40 


Arg 


Gly 


Pro 


Ser 


Tyr 


Ser 


Val 


Leu 




50 










55 




Glu 


Arg 


Val 


Tyr 


Val 


Asp 


He 


Val 


65 










70 






lie 


Gin 


Val 


Thr 


Lys 


Glu 


Trp 


Glu 










85 








Leu 


Gly 


Pro 


Trp 


He 


Asp 


Arg 


Ser 








100 










Glu 


Glu 


Asp 


Gly 


Asp 


Glu 


Asn 


Glu 






115 










120 


lie 


Glu 


Asp 


Arg 


Glu 


Ala 


His 


Arg 




130 










135 





Tyr 


Leu 


Leu 


Gin 


Arg 


Arg 


Val 


Asn 




10 










15 




Ala 


Ser 


Ala 


Ala 


Ala 


Ser 


Leu 


Lys 


25 










30 






Gly 


Glu 


Pro 


Pro 


Ser 


His 


Lys 


Met 










45 








Gly 


Pro 


Asp 


Pro 


Cys 


Glu 


Asp 


Pro 








60 










Val 


Ser 


He 


Leu 


Gin 


Thr 


Asn 


Asn 






75 










80 


Leu 


Phe 


Ser 


Asp 


Lys 


Leu 


Arg 


Lys 




90 










95 




Gly 


He 


Glu 


Asn 


Asn 


Gly 


Glu 


Gly 


105 










110 






Asp 


Gly 


Gly 


Gly 


Asn 


Gly 


Gly 


Arg 










125 








Arg 


Lys 


Met 


Met 


Lys 


Lys 


Leu 


Ser 



140 



wo 01/38351 



167 



PCT/USOO/28888 



Phe 


Val 


Gly 


Arg 


Glu 


Asp 


Pro 


Val 


Ala 


Val 


Asp 


Leu 


Pro 


Thr 


Trp 


Arg 


145 










150 










155 










160 


Glu 


Asn 


Ser 


Thr 


Glu 


Phe 


Ala 


Arg 


Arg 


Leu 


Thr 


Leu 


Lys 


Glu 


Leu 


Cys 










165 










170 










175 




Asp 


Leu 


lie 


Val 


Glu 


Cys 


Gly 


Cys 


He 


Lys 


Ser 


Lys 


Glu 


Glu 


Leu 


Phe 








180 










185 










190 






Asp 


Phe 


lie 


Phe 


Glu 


Glu 


Pro 


Trp 


Glu 


He 


Lys 


Glu 


Ala 


Ala 


Asp 


Val 






195 










200 










205 








Arg 


Gly 


Met 


Ala 


Asn 


Arg 


Ser 


Lys 


Phe 


Thr 


Lys 


Glu 


Ser 


Leu 


He 


Asp 




210 










215 










220 










Trp 


Phe 


Phe 


Glu 


Phe 


Asp 


Thr 


Tyr 


Ser 


Lys 


Cys 


Val 


Val 


Phe 


Phe 


Glu 


225 










230 










235 










240 


Ala 


Val 


Asn 


Trp 


Tyr 


Leu 


Lys 


Ser 


Gin 


Asp 


He 


Ser 


Leu 


Val 


Leu 


Asp 










245 










250 










255 




Asp 


lie 


Tyr 


Cys 


Cys 


Val 


Phe 


Ser 


Tyr 


He 


Arg 


Arg 


Gin 


Thr 


Phe 


Leu 








2 60 










265 










270 






Thr 


Arg 


Ala 


Lys 


Asn 


Pro 


Ser 


Leu 


Thr 


Val 


Ala 


Ser 


Ser 


Phe 


Ser 


Pro 






275 










2 80 










285 








Thr 


Pro 


Asp 


Thr 


Lys 


Leu 


Leu 


Ala 


He 


Asp 


Glu 


Cys 


Val 


Gin 


His 


Phe 




290 










295 










300 










Leu 


Lys 


Ser 


Asp 


lie 


Asn 


lie 


Ser 


Gin 


Met 


Ala 


Leu 


Thr 


Glu 


Arg 


Asp 


305 










310 










315 










320 


Cys 


Phe 


Phe 


Pro 


Leu 


Leu 


Thr 


Glu 


Met 


Pro 


Arg 


Gin 


Gin 


Lys 


Lys 


Val 










325 










330 










335 




Asn 


Thr 


Phe 


Leu 


Asp 


Thr 


Met 


Lys 


Arg 


Pro 


Thr 


Leu 


Ser 


Leu 


Leu 


Pro 








340 










345 










350 






Ser 


Thr 


Ser 


Ser 


Ser 


Ser 


Ser 


Ser 


Asn 


Asn 


Lys 


Arg 


Lys 


Arg 


Asn 


Thr 






355 










360 










365 








Ala 


Ala 


Ala 


Asn 


lie 


Leu 


Leu 


Pro 


Val 


Tyr 


Arg 


Ser 


Asn 


Phe 


Ser 


Thr 




370 










375 










380 










Asn 


Asn 


Lys 


Arg 


Leu 


Lys 


Thr 


Asp 


Asp 


Gly 


Glu 


Asn 


Ala 


Ser 


Ala 


Cys 


385 










390 










395 










400 


lie 


Leu 


lie 


Glu 


Gly 


Tyr 


Ala 


Asn 


Gly 


Lys 


He 


Ser 


Pro 


He 


Arg 


He 










405 










410 










415 




Met 


Val 


Arg 


Lys 


Ser 


Thr 


lie 


lie 


Pro 


Glu 


Val 


Phe 


Asn 


His 


Leu 


Leu 








420 










425 










430 






Phe 


Pro 


Val 


Phe 


Ala 


Ser 


Lys 


Asp 


Thr 


Gly Ala 


Asn 


He 


Leu 


Phe 


Phe 






435 










440 










445 








lie 


Lys 


Met 


Lys 


Ser 


Phe 


Ala 


Ser 


Ala 


Ser 


Leu 


Leu 


Leu 


Pro 


Gly 


Leu 




450 










455 










460 










Phe 


Arg 


His 


Pro 


Lys 


Gin 


Phe 


Leu 


Asn 


Gly 


Pro 


Cys 


Lys 


Trp 


Met 


Thr 


465 










470 










475 










480 


Leu 


Ala 


Glu 


Asn 


Asn 


lie 


Asn 


Asp 


Asn 


Asn 


He 


Asn 


Ser 


Ser 


Thr 


Met 










485 










490 










495 




Trp 


Ser 


Tyr 


Thr 


Leu 


Ala 


Asp 


Tyr 


Cys 


Pro 


Leu 


Gly 


Tyr 


Tyr 


Thr 


Gin 








500 










505 










510 






Glu 


Ser 


Pro 


Gin 


Pro 


Tyr 


Gin 


Thr 


Cys 


Gly 


Asn 


Phe 


Thr 


Ser 


Thr 


Thr 






515 










520 










525 








Asn 


Lys 


Arg 


Leu 


Gin 


Asn 


Val 


Gin 


Pro 


Leu 


Tyr 


Phe 












530 










535 










540 











<210> 80 
<211> 582 
<212> DNA 
<213> SHRIMP 

<400> 80 

atggatgact ctagcagaaa acaacatcaa cgacaacaac ataaactctt ccacgatgtg 60 
gagttacacg ctagcagatt attgtcctct gggctattac acccaagaga gccctcaacc 120 
ctatcagaca tgcggcaatt ttacttcgac tacaaacaag agactacaaa acgtgcagcc 180 
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attatacttt 
ccgtttaatc 
aaaataagtg 
gtttcttcgc 
gtaatggaat 
cgctccagaa 
agacaaatgt 



taaacactct 
tcttgcttaa 
caggtatcat 
ccaataataa 
atcctatcgg 
tggtcacctg 
ttgatataat 



tttggaatac 
tgtgatgaat 
atcgaaactc 
taataacaac 
gggtttattg 
tacaaagggc 
agaagcaaat 



tacaggacac 
aacaagtgga 
ccatggacca 
ggagactact 
cacacgcctg 
aaagaccacc 
ggacaactct 



cttcagaaga 
gtacactcat 
tgaaaacaat 
attctacatg 
ccataactaa 
agaagctata 
ga 



gtgggaaatt 240 
tccaggtgtc 300 
gtacgagatt 360 
caggcgaatg 420 
taagtatcca 4 80 
tgacatctct 540 
582 



<210> 81 
<211> 193 
<212> PRT 
<213> SHRIMP 



<400> 81 



Met 


Asp 


Asp 


Ser 


Ser 


Arg 


Lys 


Gin 


His 


Gin 


Arg 


Gin 


Gin 


His 


Lys 


Leu 


1 








5 










10 










15 




Phe 


His 


Asp 


Val 


Glu 


Leu 


His 


Ala 


Ser 


Arg 


Leu 


Leu 


Ser 


Ser 


Gly 


Leu 








20 










25 










30 






Leu 


His 


Pro 


Arg 


Glu 


Pro 


Ser 


Thr 


Leu 


Ser 


Asp 


Met 


Arg 


Gin 


Phe 


Tyr 






35 










40 










45 








Phe 


Asp 


Tyr 


Lys 


Gin 


Glu 


Thr 


Thr 


Lys 


Arg 


Ala 


Ala 


He 


He 


Leu 


Leu 




50 










55 










60 










Asn 


Thr 


Leu 


Leu 


Glu 


Tyr 


Tyr 


Arg 


Thr 


Pro 


Ser 


Glu 


Glu 


Trp 


Glu 


He 


65 










70 










75 










80 


Pro 


Phe 


Asn 


Leu 


Leu 


Leu 


Asn 


Val 


Met 


Asn 


Asn 


Lys 


Trp 


Ser 


Thr 


Leu 










85 










90 










95 




lie 


Pro 


Gly 


Val 


Lys 


He 


Ser 


Ala 


Gly 


He 


He 


Ser 


Lys 


Leu 


Pro 


Trp 








100 










105 










110 






Thr 


Met 


Lys 


Thr 


Met 


Tyr 


Glu 


He 


Val 


Ser 


Ser 


Pro 


Asn 


Asn 


Asn 


Asn 






115 










120 










125 








Asn 


Asn 


Gly 


Asp 


Tyr 


Tyr 


Ser 


Thr 


Cys 


Arg 


Arg 


Met 


Val 


Met 


Glu 


Tyr 




130 










135 










140 










Pro 


He 


Gly 


Gly 


Leu 


Leu 


His 


Thr 


Pro 


Ala 


He 


Thr 


Asn 


Lys 


Tyr 


Pro 


145 










150 










155 










160 


Arg 


Ser 


Arg 


Met 


Val 


Thr 


Cys 


Thr 


Lys 


Gly 


Lys 


Asp 


His 


Gin 


Lys 


Leu 










165 










170 










175 




Tyr 


Asp 


He 


Ser 


Arg 


Gin 


Met 


Phe 


Asp 


He 


He 


Glu 


Ala 


Asn 


Gly 


Gin 








180 










185 










190 







Leu 



<210> 82 

<211> 615 

<212> DNA 

<213> SHRIMP 



<400> 82 

atggatcttt 

gctgtattta 

acagacaata 

ggatcaggct 

aagatccgca 

actcccgtgg 

acattcaagg 

ccaaagatta 

gtctctattg 

attgcagcta 

actgagaccg 



ctttcactct 
ttgtgatttt 
tcgagacaaa 
acttcaagat 
atggaaagtc 
agggccgagc 
tgtggaacaa 
acccatcaaa 
atgaggatga 
ccgccggtgg 
agtaa 



ttcggtcgtg 
taggtatcac 
catggatgaa 
gactgatgtg 
tgatgcacag 
actcgaagtg 
cacatcaaga 
ggcctttgtc 
agttggcacc 
aaatcttttc 



tcggccatcc 
aacactgtga 
aacctccgca 
tcctttgaca 
atgaaggaag 
actgtggggc 
aagatcaaca 
ggtagctcca 
tttgtgtgtg 
gacatgtacg 



tcgccatcac 
ccaagaccat 
ttcctgtgac 
gcgacacctt 
aagatgcgga 
agaatctcac 
tcactggtat 
acacctcctc 
g taccacctt 
tgcacgtcac 



tgctgtgatt 
cgaaacccac 
tgctgaggtt 
gggcaaaatc 
tcttgtcatc 
ctttgaggga 
gcagatggtg 
cttcaccccc 
tggcgcacca 
ctactctggc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

615 



<210> 83 
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<211> 204 
<212> PRT 
<213> SHRIMP 

<400> 83 

Met Asp Leu Ser Phe Thr Leu Ser Val Val Ser Ala lie Leu Ala lie 

15 10 15 

Thr Ala Val lie Ala Val Phe lie Val lie Phe Arg Tyr His Asn Thr 

20 25 ' 30 

Val Thr Lys Thr lie Glu Thr His Thr Asp Asn lie Glu Thr Asn Met 

35 40 45 

Asp Glu Asn Leu Arg lie Pro Val Thr Ala Glu Val Gly Ser Gly Tyr 

50 55 60 

Phe Lys Met Thr Asp Val Ser Phe Asp Ser Asp Thr Leu Gly Lys lie 
65 70 75 80 

Lys lie Arg Asn Gly Lys Ser Asp Ala Gin Met Lys Glu Glu Asp Ala 

85 90 95 

Asp Leu Val lie Thr Pro Val Glu Gly Arg Ala Leu Glu Val Thr Val 

100 105 110 

Gly Gin Asn Leu Thr Phe Glu Gly Thr Phe Lys Val Trp Asn Asn Thr 

115 120 125 

Ser Arg Lys lie Asn lie Thr Gly Met Gin Met Val Pro Lys lie Asn 

130 135 140 

Pro Ser Lys Ala Phe Val Gly Ser Ser Asn Thr Ser Ser Phe Thr Pro 
145 150 155 160 

Val Ser lie Asp Glu Asp Glu Val Gly Thr Phe Val Cys Gly Thr Thr 

165 170 175 

Phe Gly Ala Pro lie Ala Ala Thr Ala Gly Gly Asn Leu Phe Asp Met 

180 185 190 

Tyr Val His Val Thr Tyr Ser Gly Thr Glu Thr Glu 
195 200 



<210> 84 
<211> 888 
<212> DNA 
<213> SHRIMP 



<400> 84 

atgaagaat t 

ttttattttt 

ttattatctc 

tcacgaccaa 

gaaaaagaag 

atcggaagag 

tttgaacaac 

cctggagaag 

cctactccta 

gaagatgatg 

gatagtggag 

agactagaaa 

caaccagatc 

gagtcgtgga 

ataaccaaca 



cgcggcagag 
ctggtgccat 
tcatactttt 
ataaaactac 
a tgacaccga 
acgaggaagg 
cgagtgtaaa 
gtttgttacc 
ctcctccacc 
atgatga tat 
aagaaatgga 
caggcatgag 
ctgtagttgt 
aagttttgaa 
agattaaagc 



gtcaggcgtg 
aattgaatgt 
tgtgtgtttt 
tacaacatcc 
aggagctgta 
tgctgtagaa 
tactgggtcc 
tccaccccct 
tcccccgaca 
agacatagac 
agaagacgaa 
cggcgaagaa 
caaaaacata 
cagatctgag 
catgacggaa 



tggaggggta 
aaaaaaataa 
gtgggtgtgg 
aataaaaaaa 
ttagggcgaa 
gacggaaaag 
aatacaggag 
cctactccta 
cgaaccccat 
tttgatgata 
gaagaggaag 
gtagattttg 
gaaaggtcag 
ggagatatta 
gatctgaagg 



actcgtgttt 
ggataataat 
tggttggtgt 
caaagaaaga 
gggaacctga 
aagaggagga 
gtggaggaac 
ctcctactcc 
ctccttcttc 
atgatataga 
atttggacac 
atgcatca tc 
attatactct 
gattcttcgt 
aactgtaa 



g tacaaatca 
gatgttcctt 
gattttcatg 
taaagagaag 
aaa taggccg 
agtttttgaa 
aggaactgtg 
tccacctact 
atctcttggg 
cgaattttta 
actcctttca 
tgcatatatt 
ggacccaatg 
agatcgaggg 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

888 



<210> 85 
<211> 295 
<212> PRT 
<213> SHRIMP 



<400> 85 
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Met 


Lys 


Asn 


Ser 


Arg 


Gin 


Arg 


Ser 


Gly 


Val 


Trp 


Arg 


Gly 


Asn 


Ser 


Cys 


1 








5 










10 










15 




Leu 


Tyr 


Lys 


Ser 


Phe 


Tyr 


Phe 


Ser 


Gly 


Ala 


He 


He 


Glu 


Cys 


Lys 


Lys 








20 










25 










30 






lie 


Arg 


He 


He 


Met 


Met 


Phe 


Leu 


Leu 


Leu 


Ser 


Leu 


He 


Leu 


Phe 


Val 






35 










40 










45 








Cys 


Phe 


Val 


Gly 


Val 


Val 


Val 


Gly 


Val 


He 


Phe 


Met 


Ser 


Arg 


Pro 


Asn 




50 










55 










60 










Lys 


Thr 


Thr 


Thr 


Thr 


Ser 


Asn 


Lys 


Lys 


Thr 


Lys 


Lys 


Asp 


Lys 


Glu 


Lys 


65 










70 










75 










80 


Glu 


Lys 


Glu 


Asp 


Asp 


Thr 


Glu 


Gly 


Ala 


Val 


Leu 


Gly 


Arg 


Arg 


Glu 


Pro 










85 










90 










95 




Glu 


Asn 


Arg 


Pro 


He 


Gly 


Arg 


Asp 


Glu 


Glu 


Gly 


Ala 


Val 


Glu 


Asp 


Gly 








100 










105 










110 






Lys 


Glu 


Glu 


Glu 


Glu 


Val 


Phe 


Glu 


Phe 


Glu 


Gin 


Pro 


Ser 


Val 


Asn 


Thr 






115 










120 










125 








Gly 


Ser 


Asn 


Thr 


Gly 


Gly 


Gly 


Gly 


Thr 


Gly 


Thr 


Val 


Pro 


Gly 


Glu 


Gly 




130 










135 










140 










Leu 


Leu 


Pro 


Pro 


Pro 


Pro 


Pro 


Thr 


Pro 


Thr 


Pro 


Thr 


Pro 


Pro 


Pro 


Thr 


145 










150 










155 










160 


Pro 


Thr 


Pro 


Thr 


Pro 


Pro 


Pro 


Pro 


Pro 


Thr 


Arg 


Thr 


Pro 


Ser 


Pro 


Ser 










165 










170 










175 




Ser 


Ser 


Leu 


Gly 


Glu 


Asp 


Asp 


Asp 


Asp 


Asp 


He 


Asp 


He 


Asp 


Phe 


Asp 








180 










185 










190 






Asp 


Asn 


Asp 


He 


Asp 


Glu 


Phe 


Leu 


Asp 


Ser 


Gly 


Glu 


Glu 


Met 


Glu 


Glu 






195 










200 










205 








Asp 


Glu 


Glu 


Glu 


Glu 


Asp 


Leu 


Asp 


Thr 


Leu 


Leu 


Ser 


Arg 


Leu 


Glu 


Thr 




210 










215 










220 










Gly 


Met 


Ser 


Gly 


Glu 


Glu 


Val 


Asp 


Phe 


Asp 


Ala 


Ser 


Ser 


Ala 


Tyr 


He 


225 










230 










235 










240 


Gin 


Pro 


Asp 


Pro 


Val 


Val 


Val 


Lys 


Asn 


He 


Glu 


Arg 


Ser 


Asp 


Tyr 


Thr 










245 










250 










255 




Leu 


Asp 


Pro 


Met 


Glu 


Ser 


Trp 


Lys 


Val 


Leu 


Asn 


Arg 


Ser 


Glu 


Gly 


Asp 








260 










265 










270 






lie 


Arg 


Phe 


Phe 


Val 


Asp 


Arg 


Gly 


He 


Thr 


Asn 


Lys 


He 


Lys 


Ala 


Met 






275 










280 










285 








Thr 


Glu 


Asp 


Leu 


Lys 


Glu 


Leu 





















290 295 



<210> 86 
<211> 603 
<212> DNA 
<213> SHRIMP 



<400> 86 

atgaccatgt 

ggtgttttct 

tctaaacaag 

ccagtcactt 

tttatggcat 

gacgaggctg 

cgttatggga 

acagtacaag 

gatggacacg 

ggaaaaacgt 

tag 

<210> 87 
<211> 196 
<212> PRT 



ggaacaaaat 
ttattctagc 
aaaaatactc 
acttgtcaaa 
atgttgatgt 
gctgggccca 
gaggagaat t 
atatgtcgga 
gaatgcaatc 
catcagaaac 



tgtaataact 
aataactgcc 
cacttctcat 
aacgggtaaa 
gcacaataga 
cttttgcctg 
tggaccagaa 
gccatatata 
tgttttggag 
ttataatctt 



actaaacgaa 
ttagctgtcg 
atcaatgaac 
ttaaaggaca 
acaaaaactc 
ctggcttctg 
aaacattccc 
acccacatct 
aagaatagga 
agtgataagt 



tgaattggcc 
ctctctatat 
aatttactgc 
tgcacttgac 
tgaagcaccc 
ctgaagctta 
tagctgaaac 
tcaagaagaa 
ataaaatcag 
ctatatctat 



tatggttgtt 
acgacatgct 
caaacagt tg 
ccattccgat 
catgtgtact 
tcgtcgcatc 
tatccaatcg 
cacggatg tt 
aatgggtgat 
tgttggtgta 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

603 
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<213> SHRIMP 



<400> 87 



Met 


Thr 


Met 


Trp 


Asn 


Lys 


He 


Val 


He 


Thr 


Thr 


Lys 


Arg 


Met 


Asn 


Trp 


1 








5 










10 










15 




Pro 


Met 


Val 


Val 


Gly 


Val 


Phe 


Phe 


He 


Leu 


Ala 


He 


Thr 


Ala 


Val 


Tyr 








20 










25 










30 






He 


Arg 


His 


Ala 


Ser 


Lys 


Gin 


Glu 


Lys 


Tyr 


Ser 


Thr 


Ser 


His 


He 


Asn 






35 










40 










45 








Glu 


Gin 


Phe 


Thr 


Ala 


Lys 


Gin 


Leu 


Pro 


Val 


Thr 


Tyr 


Leu 


Ser 


Lys 


Thr 




50 










55 










60 










Gly 


Lys 


Leu 


Lys 


Asp 


Met 


His 


Leu 


Thr 


His 


Ser 


Asp 


Phe 


Met 


Ala 


Tyr 


65 










70 










75 










80 


Val 


Asp 


Val 


His 


Asn 


Arg 


Thr 


Lys 


Thr 


Leu 


Lys 


His 


Pro 


Met 


Cys 


Thr 










85 










90 










95 




Asp 


Glu 


Ala 


Gly 


Trp 


Ala 


His 


Phe 


Cys 


Leu 


Leu 


Ala 


Ser 


Ala 


Glu 


Ala 








100 










105 










110 






Tyr 


Arg 


Arg 


He 


Arg 


Tyr 


Gly 


Arg 


Gly 


Glu 


Phe 


Gly 


Pro 


Glu 


Lys 


His 






115 










120 










125 








Ser 


Leu 


Ala 


Glu 


Thr 


He 


Gin 


Ser 


Thr 


Val 


Gin 


Asp 


Met 


Ser 


Glu 


Pro 




130 










135 










140 










Tyr 


He 


Thr 


His 


He 


Phe 


Lys 


Lys 


Asn 


Thr 


Asp 


Val 


Asp 


Gly 


His 


Gly 


145 










150 










155 










160 


Met 


Gin 


Ser 


Val 


Leu 


Glu 


Lys 


Asn 


Arg 


Asn 


Lys 


He 


Arg 


Met 


Gly 


Asp 










165 










170 










175 




Gly 


Lys 


Thr 


Ser 


Ser 


Glu 


Thr 


Tyr 


Asn 


Leu 


Ser 


Asp 


Lys 


Ser 


He 


Ser 








180 










185 










190 






He 


Val 


Gly 


Val 



























195 



<210> 88 
<211> 861 
<212> DNA 
<213> SHRIMP 



<400> 88 

atgagctcgg 

gaaggggaag 

attactgcca 

ggactaatag 

atgaggctag 

gatactattg 

gagaaaaaca 

aaaggggagg 

gattggatgg 

ttacaaagtg 

agcttttatc 

ttatcaaaag 

aaacaggaga 

gaatatccgc 

aacttttctg 



gtagtatcaa 
aacaggattt 
tgttatctgt 
caactgcgtc 
ataaagacgt 
aagttgactc 
taaaaaccaa 
cagtcaaact 
cgaacatctg 
ggcaattaat 
acattgctat 
gatcaaagga 
ttgctgcaat 
ctgtgcctat 
agaaattatg 



caaccatccc 
tgatgtatta 
tgcagctcct 
tgcgacagca 
gtgccaacta 
cattattcgc 
ctctactgaa 
ggctaagaat 
ccaaagcaac 
atggacaact 
aatgtggaaa 
caatcttttg 
gcttcaatct 
cactgcaatt 
a 



tcctcaaaca 
gaactggact 
cctcccaatt 
ccagcagccg 
atcgaaaggg 
caactgttat 
aaggagccag 
accccagttc 
aagatcgggg 
ttcccagctg 
cttttaggtt 
cctcgtgatg 
agacaaaata 
ctatcacgta 



tggacaccaa 
actccaaaat 
ccatactgga 
aaacaggaaa 
acatagaatt 
attttggaga 
tttacttccc 
tagatacgat 
ttgaaaattt 
ctgtatatgc 
cctttataaa 
atatacaggt 
ttctcggaag 
ctataattcc 



caaaatggag 
catccatgat 
cgcatctgat 
ctctaacagg 
agtgaagagt 
atctgcatca 
caaagaaccg 
aacaaaacta 
agcctctgcg 
ttcgttggac 
catcgaagct 
tgtccatgct 
aggaccgtcg 
cctgttgagg 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

861 



<210> 89 
<211> 286 
<212> PRT 
<213> SHRIMP 



<400> 89 

Met Ser Ser Gly Ser He Asn Asn His Pro Ser Ser Asn Met Asp Thr 
1 5 10 15 
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Asn Lys Met Glu 
20 

Asp Tyr Ser Lys 
35 

Ala Pro Pro Pro 
50 

Thr Ala Ser Ala 
65 

Met Arg Leu Asp 

Leu Val Lys Ser 
100 

Leu Tyr Phe Gly 
115 

Thr Glu Lys Glu 

130 

Val Lys Leu Ala 
145 

Asp Trp Met Ala 

Leu Ala Ser Ala 
180 

Ala Ala Val Tyr 
195 

Trp Lys Leu Leu 
210 

Ser Lys Asp Asn 
225 

Lys Gin Glu lie 

Arg Gly Pro Ser 
260 

Arg Thr He He 
275 



Glu Gly Glu Glu 

He He His Asp 
40 

Asn Ser He Leu 
55 

Thr Ala Pro Ala 
70 

Lys Asp Val Cys 
85 

Asp Thr He Glu 

Glu Ser Ala Ser 
120 

Pro Val Tyr Phe 
135 

Lys Asn Thr Pro 
150 

Asn He Cys Gin 
165 

Leu Gin Ser Gly 

Ala Ser Leu Asp 
200 

Gly Ser Phe He 
215 

Leu Leu Pro Arg 
230 

Ala Ala Met Leu 
245 

Glu Tyr Pro Pro 

Pro Leu Leu Arg 
280 



Gin Asp Phe Asp 
25 

He Thr Ala Met 

Asp Ala Ser Asp 
60 

Ala Glu Thr Gly 
75 

Gin Leu He Glu 
90 

Val Asp Ser He 
105 

Glu Lys Asn He 

Pro Lys Glu Pro 
140 

Val Leu Asp Thr 
155 

Ser Asn Lys He 
170 

Gin Leu He Trp 
185 

Ser Phe Tyr His 

Asn He Glu Ala 
220 

Asp Asp He Gin 
235 

Gin Ser Arg Gin 
250 

Val Pro He Thr 
265 

Asn Phe Ser Glu 



Val Leu Glu Leu 
30 

Leu Ser Val Ala 
45 

Gly Leu He Ala 

Asn Ser Asn Arg 
80 

Arg Asp He Glu 
95 

He Arg Gin Leu 
110 

Lys Thr Asn Ser 
125 

Lys Gly Glu Ala 

He Thr Lys Leu 
160 

Gly Val Glu Asn 
175 

Thr Thr Phe Pro 
190 

He Ala He Met 
205 

Leu Ser Lys Gly 

Val Val His Ala 
240 

Asn He Leu Gly 
255 

Ala He Leu Ser 

270 
Lys Leu 
285 



<210> 90 
<211> 696 
<212> DNA 
<213> SHRIMP 



<400> 90 

atggaccctg 

acaaggcatt 

gtcgaggagg 

tcttcttcct 

gaagaagaag 

ggagaggagg 

cgatcttctt 

tttatcaagt 

cgtaatcggc 

aagaaacaga 

atgtctcctg 

acttctataa 



gagcatcggc 
atcagcagca 
aacatgggca 
catcctcctc 
aagaagaaga 
aggaagaaga 
cacggaattc 
acgaaaagga 
aaccccaaat 
aaaccaataa 
caagtagact 
tagtactgtc 



tgcttccagg 
attgaaccgt 
ggtaacaaca 
ttcttctccc 
atatgactct 
ttcagataca 
cacgaccaga 
tattttactt 
acataagcta 
gaagaaaact 
gaaatttttc 
gataatggcc 



agggcttta t 
gcactgaata 
acaaacaaag 
acgtcttcgg 
gaatcagaca 
gaatccacat 
aacaggctaa 
gctgacagga 
aacaacaaaa 
tggagaagac 
tctgcttgta 
ctttag 



ggtctagcac 
aga t tgaaga 
aaatggcttc 
ccatcccttc 
ctaacgtcga 
ctgctgatgc 
taaaaaagta 
ataaaaggaa 
gactcaaaaa 
tccctaaatt 
taatatcagg 



tgtaacaaac 
agaagatgat 
tacttctaca 
aagcgatgag 
ttctcttctt 
aaactttcta 
cgttgataga 
gaagaggcac 
accaacagac 
tattaaaaag 
gataaaaata 



<210> 91 
<211> 231 
<212> PRT 
<213> SHRIMP 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

696 



<400> 91 
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Met 


Asp 


Pro 


Gly 


Ala 


Ser 


Ala 


Ala 


Ser 


Arg 


Arg 


Ala 


Leu 


Trp 


Ser 


Ser 


1 








5 










10 










15 




Thr 


Val 


Thr 


Asn 


Thr 


Arg 


His 


Tyr 


Gin 


Gin 


Gin 


Leu 


Asn 


Arg 


Ala 


Leu 








20 










25 










30 






Asn 


Lys 


He 


Glu 


Glu 


Glu 


Asp 


Asp 


Val 


Glu 


Glu 


Glu 


His 


Gly 


Gin 


Val 






35 










40 










45 








Thr 


Thr 


Thr 


Asn 


Lys 


Glu 


Met 


Ala 


Ser 


Thr 


Ser 


Thr 


Ser 


Ser 


Ser 


Ser 




50 










55 










60 










Ser 


Ser 


Ser 


Ser 


Ser 


Pro 


Thr 


Ser 


Ser 


Ala 


He 


Pro 


Ser 


Ser 


Asp 


Glu 


65 










70 










75 










80 


Glu 


Glu 


Glu 


Glu 


Glu 


Glu 


Glu 


Tyr 


Asp 


Ser 


Glu 


Ser 


Asp 


Thr 


Asn 


Val 










85 










90 










95 




Asp 


Ser 


Leu 


Leu 


Gly 


Glu 


Glu 


Glu 


Glu 


Glu 


Asp 


Ser 


Asp 


Thr 


Glu 


Ser 








100 










105 










110 






Thr 


Ser 


Ala 


Asp 


Ala 


Asn 


Phe 


Leu 


Arg 


Ser 


Ser 


Ser 


Arg 


Asn 


Ser 


Thr 






115 










120 










125 








Thr 


Arg 


Asn 


Arg 


Leu 


He 


Lys 


Lys 


Tyr 


Val 


Asp 


Arg 


Phe 


He 


Lys 


Tyr 




130 










135 










140 










Glu 


Lys 


Asp 


He 


Leu 


Leu 


Ala 


Asp 


Arg 


Asn 


Lys 


Arg 


Lys 


Lys 


Arg 


His 


145 










150 










155 










160 


Arg 


Asn 


Arg 


Gin 


Pro 


Gin 


He 


His 


Lys 


Leu 


Asn 


Asn 


Lys 


Arg 


Leu 


Lys 










165 










170 










175 




Lys 


Pro 


Thr 


Asp 


Lys 


Lys 


Gin 


Lys 


Thr 


Asn 


Lys 


Lys 


Lys 


Thr 


Trp 


Arg 








180 










185 










190 






Arg 


Leu 


Pro 


Lys 


Phe 


He 


Lys 


Lys 


Met 


Ser 


Pro 


Ala 


Ser 


Arg 


Leu 


Lys 






195 










200 










205 








Phe 


Phe 


Ser 


Ala 


Cys 


He 


He 


Ser 


Gly 


He 


Lys 


He 


Thr 


Ser 


He 


He 




210 










215 










220 










Val 


Leu 


Ser 


He 


Met 


Ala 


Leu 





















225 230 



<210> 92 
<211> 4608 
<212> DNA 
<213> SHRIMP 



<400> 92 

atggctcaca 

cacccagaac 

aaaaatgtaa 

caggcactag 

aacatgtcca 

aaaacgagtg 

gcagccttcg 

gtagggccat 

gtgtggaaaa 

tgtcaacagg 

tcatggagaa 

aaaagtgggt 

gatattagat 

tcaa taggag 

cgcacaaa tc 

aaggttatgc 

caaaatatta 

agagttt tgc 

ttcaagagtg 

aataagggcc 

cgtattttct 

ttccagagca 

tacacgagca 



aacttttatt 
cttcattcgc 
ccgatgccta 
cagtgcgtct 
ttatagacct 
gaggatttat 
tcatgagtgt 
caatttttgg 
ggcacccctt 
gcatgcgaga 
aactacaagt 
ccatgacagt 
ttttcactct 
atttcttaac 
tggaatttgt 
ccattgacaa 
gtgactcttt 
gcaacataca 
caatgagtga 
gagttacaat 
tagagatctt 
atgctgattt 
tacaagcctt 



tctggaggaa 
tctttatgag 
ccccaaattt 
agccctgatt 
tgtgaaagat 
catcggagat 
catactacaa 
tcaagattcg 
ttttatatgg 
agtggttaca 
gccttgtgct 
agatgtggaa 
cagggatgtt 
agaaacgccc 
actgaaattt 
ctttgtcaca 
tagaaaggaa 
aaaaagggca 
cgacagaaac 
gtccaactac 
aaagtatgat 
gcacatgatt 
caaagaagtg 



gaagatgcta 
agtgaaacat 
cttcctcggc 
caattctaca 
aaggtcgaaa 
ggtactggtg 
gaaaaggcac 
gacaaggtaa 
ttgacgtgct 
aattctagag 
aataaaccaa 
aattctgtat 
aaggaatttc 
accattttat 
ataacaggag 
gcattattat 
ctggcaaaga 
aacccttcag 
tttaaagtga 
ttgaaactat 
acgttcacgg 
gaccacatac 
tccagtgcta 



aagagattgg 
tccggtcggt 
cgatggacat 
aagggagagg 
ggaattttaa 
taggaaaaac 
tcttagatgt 
ttacggccat 
ctaagcctct 
gtcttcgtga 
cgtctttcaa 
cgtctgctaa 
atagtaagcg 
ttatgacgta 
gaacagacct 
gtgacgagtt 
cttgggagga 
tgtcagacct 
agcgcatgaa 
tatctcaagc 
ttatggccag 
taaggaaaag 
ctccagatgc 



caccctctcc 
gggcttctgc 
taattctgtg 
gtggaagaag 
agtggacaag 
aagagaattg 
acaaaaacac 
caacagtggc 
tttcaacagc 
tcctaaattt 
gagcgacggg 
agactcggtg 
ttcttcaaga 
ctctgattta 
cgattccaac 
ccacaagact 
agaagatact 
tattaatagg 
atcatccaat 
tgatgcgttc 
cgctacgcca 
tgcgccagcc 
tatggctgag 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 
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cacagcgagt 
caattggtta 
gcctcccctc 
caagtactca 
atacgagaac 
gagattaatt 
gtaatggaaa 
catgatggga 
aaaaaggatg 
agcagcattg 
tccaaaggta 
tggggtgaag 
acagcaagta 
gtcaccgatg 
ctagagcaaa 
gccaaggatg 
cctgttgcta 
tcagctttca 
atgcttgtgc 
gattcaatca 
tgttcgcgta 
acaaacaaat 
cctaaaggca 
cgttaccatt 
aggactcaca 
cccgcagaga 
acatatgccg 
caatttatcg 
gccagtatga 
agaggagggg 
attgtcctcg 
tcagacgatc 
tccattgtta 
tttaaatctg 
gccctgatag 
gaaagtgatg 
atggatgttt 
gatgttcccg 
acgttagctc 
ctagtgacgg 
tgtttagatg 
gttatacaga 
ctagcacatt 
ttgtcccgtg 
gagtctaaga 
aggtatgatg 
gtcaggaaga 
catataataa 
gaagacgacg 
gaagaagaag 
gggcacgatg 
agtgtttttg 
ggttctttga 
ttttgcctgt 



acgtgacagt 
gtaggagtat 
ttcaaaaata 
tcgattctga 
accaagaagg 
ccccgtccag 
ata tcgacag 
aaacaacact 
cgctttctaa 
atatggttgc 
aaactgattt 
tatttgacga 
cttctgtggc 
cagtgaagcg 
ctggggattc 
aaagtgacgc 
atacaatatt 
caa tctcctt 
cttcagtccc 
cacaatcaat 
taacgaatag 
gtattagtgc 
ataccggtct 
gcgttttaga 
gaaacggtca 
gaagattttt 
atagatacag 
atcaagggtt 
caaaggtaca 
ttgcat tcgt 
cgatgcacat 
ttgggcatgc 
aa tccgcatc 
at tgtgacaa 
acactttgaa 
caccttcatt 
tctctaatat 
acaatgatga 
tatcaaactt 
tgagaattgt 
tacctgaatt 
gtttggctaa 
cccacagaga 
gtttcattaa 
ttatgtacga 
gcctttgtga 
agccagagtg 
atgacgaaga 
acgaagaatt 
gagaaagtgc 
tggacataat 
ttaatgaaca 
tacaaata tg 
acgactcctc 



gtttctggaa 
tagcatggct 
cgctatagac 
aaaagtagga 
aggaatattg 
aaagaggaaa 
gcgttttaag 
aagaagtata 
tggaggagga 
tcaagattta 
taacgaagat 
cgagtgtttt 
agcatgtaaa 
tttaaggaca 
ttttctcaag 
aaaatacggc 
ctcaggctat 
aaagaataaa 
tgacacggaa 
aggggaggat 
gggattgttt 
attcaacaac 
ctctttacat 
tgtaccttat 
actttcagtc 
tgatagtttg 
taataattcc 
agtcctaaag 
cttgatggaa 
agaaggtttg 
agcgctagta 
tctttctttc 
tcaattcgtt 
tcttctccca 
tacgaaaaat 
aactgctcta 
aatggcatat 
agacgaaggg 
cccccatgat 
tggtcaagga 
agatatgaca 
agaaaatcct 
agggtacgga 
ttttagacag 
tatattcttg 
tatgagaatg 
tgttttcatt 
ggaagaaacg 
tgaagacgag 
ttacgacatt 
actgtgcaat 
catagactct 
ctttgataac 
atctaaagac 



caggttatta 
ggagtcgact 
gaattggctt 
ggacatgtca 
gaagaggaag 
cgtgctgcaa 
gtggtggttg 
gtacaagatg 
ggaataataa 
tacgatgcta 
tatgacgacg 
gaaaagttga 
ggtgcactat 
acaaacgaat 
aatttgacta 
atcgtagacg 
agacttttgt 
accaatagaa 
ccgctcatgg 
agtaacgctg 
ctggttaaaa 
acgaaagaag 
gactcttcta 
aacgccattg 
cctcagttcc 
gataagagga 
atagatattg 
actatgggga 
catttttcaa 
acgttagaaa 
gttataggtg 
acgtctgtac 
ttttctaatt 
cgcgagaagc 
aacgaagtta 
atgctccctt 
aataacaata 
tgtttacctt 
tatgatagag 
gagaaagagg 
aatctcatac 
ggtttgcttt 
ggttctcatc 
ttccaaaacc 
agcgtcaaag 
aacagcatga 
acaaaattat 
agggagaggt 
gaggaggaac 
tctgtcatca 
agaaagaaac 
tttatggttg 
tgcacaggcg 
aaagacacca 



aacttttaag 
gctctactac 
cgtactgttt 
gaagggcctt 
atgttgaaaa 
atgatgatga 
tgcgcgatag 
ctataaaaac 
cgtctccgga 
tcaagaaaaa 
gtgctaatga 
ggagacaata 
tgaatatcaa 
caaaaaagat 
cacgcatctt 
tggggacatt 
gccgtgcagt 
gaacttcccc 
cgctagcagg 
aaataacaaa 
acaacactaa 
tcgacgttat 
ataactccat 
ctttcctaca 
tcatattctc 
tcaaagattc 
ctgccgcagt 
atatcgtaca 
agatgatgat 
atggtatctt 
cccaaaataa 
ttcctcataa 
tgtgtctgca 
gagtgcgtga 
cttccaagac 
ctggtcctag 
acggcatgga 
tgcaagaaga 
ccataaaaga 
gagtgattcc 
ctgtagtgac 
ttactataca 
ttctgggttt 
aacttttctc 
cgatcatggc 
tggatgcaag 
tggacaagaa 
ttggcgggga 
aagcagaaag 
atgataaaaa 
taacactcac 
gaaatttaat 
agtttgaagg 
ttccgtag 



aaatagagga 
taattgtaaa 
aaacgctaga 
cacaaagatt 
attagtagca 
tttatatgaa 
ggatgtcgca 
ctactcacag 
agttgacata 
ggaaaaacct 
agaagacgga 
cttcattaat 
ggcaacttct 
ggtcatgtca 
acaaacgatc 
cgattcatcg 
catgatggct 
agctcacgtt 
caacccta ta 
caggaaattg 
aactgccaac 
aatgctggga 
gtacgctaaa 
gactatcggt 
tacagactcg 
aaaggcagga 
tatgagggaa 
gattgtcaca 
gaggacaaat 
cactgaagta 
aatcacatca 
tcaaatattg 
tttagtccac 
tgcagcatct 
aaataaaata 
gaatagaaag 
ctttgatgaa 
aaatgctaca 
tgcacatcaa 
tatcagtgaa 
ggcaactaat 
taatgctgca 
ggctaagaaa 
acctaaaaaa 
cagagatgat 
tttcttaaag 
ttttaggaga 
ggaagaagaa 
ggagtggggg 
taacacaatc 
taaagaaaat 
aggtgcggaa 
tttacccaaa 



1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4608 



<210> 93 
<211> 1523 
<212> PRT 
<213> SHRIMP 



<400> 93 
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Met 


Ala 


His 


Lys 


Leu 


Leu 


Phe 


Leu 


Glu 


Glu 


Glu 


Asp 


Ala 


Lys 


.Glu 


He 


1 








5 










10 










15 




Gly 


Thr 


Leu 


Ser 


His 


Pro 


Glu 


Pro 


Ser 


Phe 


Ala 


Leu 


Tyr 


Glu 


Ser 


Glu 








20 










25 










30 






Thr 


Phe 


Arg 


Ser 


Val 


Gly 


Phe 


Cys 


Lys 


Asn 


Val 


Thr 


Asp 


Ala 


Tyr 


Pro 






35 










40 










45 








Lys 


Phe 


Leu 


Pro 


Arg 


Pro 


Met 


Asp 


He 


Asn 


Ser 


Val 


Gin 


Ala 


Val 


Arg 




50 










55 










60 










Leu 


Ala 


Leu 


lie 


Gin 


Phe 


Tyr 


Lys 


Gly 


Arg 


Gly 


Trp 


Lys 


Lys 


Asn 


Met 


65 










70 










75 










80 


Ser 


lie 


lie 


Asp 


Leu 


Val 


Lys 


Asp 


Lys 


Val 


Glu 


Arg 


Asn 


Phe 


Lys 


Val 










85 










90 










95 




Asp 


Lys 


Lys 


Thr 


Ser 


Gly 


Gly 


Phe 


He 


He 


Gly 


Asp 


Gly 


Thr 


Gly 


Val 








100 










105 










110 






Gly 


Lys 


Thr 


Arg 


Glu 


Leu 


Ala 


Ala 


Phe 


Val 


Met 


Ser 


Val 


He 


Leu 


Gin 






115 










120 










125 








Glu 


Lys 


Ala 


Leu 


Leu 


Asp 


Val 


Gin 


Lys 


His 


Val 


Gly 


Pro 


Ser 


He 


Phe 




130 










135 










140 










Gly 


Gin 


Asp 


Ser 


Asp 


Lys 


Val 


He 


Thr 


Ala 


He 


Asn 


Ser 


Gly 


Val 


Trp 


145 










150 










155 










160 


Lys 


Arg 


His 


Pro 


Phe 


Phe 


He 


Trp 


Leu 


Thr 


Cys 


Ser 


Lys 


Pro 


Leu 


Phe 










165 










170 










175 




Asn 


Ser 


Cys 


Gin 


Gin 


Gly 


Met 


Arg 


Glu 


Val 


Val 


Thr 


Asn 


Ser 


Arg 


Gly 








180 










185 










190 






Leu 


Arg 


Asp 


Pro 


Lys 


Phe 


Ser 


Trp 


Arg 


Lys 


Leu 


Gin 


Val 


Pro 


Cys 


Ala 






195 










200 










205 








Asn 


Lys 


Pro 


Thr 


Ser 


Phe 


Lys 


Ser 


Asp 


Gly 


Lys 


Ser 


Gly 


Ser 


Met 


Thr 




210 










215 










220 










Val 


Asp 


Val 


Glu 


Asn 


Ser 


Val 


Ser 


Ser 


Ala 


Lys 


Asp 


Ser 


Val 


Asp 


He 


225 










230 










235 










240 


Arg 


Phe 


Phe 


Thr 


Leu 


Arg 


Asp 


Val 


Lys 


Glu 


Phe 


His 


Ser 


Lys 


Arg 


Ser 










245 










250 










255 




Ser 


Arg 


Ser 


He 


Gly 


Asp 


Phe 


Leu 


Thr 


Glu 


Thr 


Pro 


Thr 


He 


Leu 


Phe 








260 










265 










270 






Met 


Thr 


Tyr 


Ser 


Asp 


Leu 


Arg 


Thr 


Asn 


Leu 


Glu 


Phe 


Val 


Leu 


Lys 


Phe 






275 










280 










285 








lie 


Thr 


Gly 


Gly 


Thr 


Asp 


Leu 


Asp 


Ser 


Asn 


Lys 


Val 


Met 


Pro 


He 


Asp 




290 










295 










300 










Asn 


Phe 


Val 


Thr 


Ala 


Leu 


Leu 


Cys 


Asp 


Glu 


Phe 


His 


Lys 


Thr 


Gin 


Asn 


305 










310 










315 










320 


lie 


Ser 


Asp 


Ser 


Phe 


Arg 


Lys 


Glu 


Leu 


Ala 


Lys 


Thr 


Trp 


Glu 


Glu 


Glu 










325 










330 










335 




Asp 


Thr 


Arg 


Val 


Leu 


Arg 


Asn 


He 


Gin 


Lys 


Arg 


Ala 


Asn 


Pro 


Ser 


Val 








340 










345 










350 






Ser 


Asp 


Leu 


He 


Asn 


Arg 


Phe 


Lys 


Ser 


Ala 


Met 


Ser 


Asp 


Asp 


Arg 


Asn 






355 










360 










365 








Phe 


Lys 


Val 


Lys 


Arg 


Met 


Lys 


Ser 


Ser 


Asn 


Asn 


Lys 


Gly 


Arg 


Val 


Thr 




o "7 n 










375 










380 










Met 


Ser 


Asn 


Tyr 


Leu 


Lys 


Leu 


Leu 


Ser 


Gin 


Ala 


Asp 


Ala 


Phe 


Arg 


He 


385 










390 










395 










400 


Phe 


Leu 


Glu 


He 


Leu 


Lys 


Tyr 


Asp 


Thr 


Phe 


Thr 


Val 


Met 


Ala 


Ser 


Ala 










405 










410 










415- 




Thr 


Pro 


Phe 


Gin 


Ser 


Asn 


Ala 


Asp 


Leu 


His 


Met 


He 


Asp 


His 


He 


Leu 








420 










425 










430 






Arg 


Lys 


Ser 


Ala 


Pro 


Ala 


Tyr 


Thr 


Ser 


He 


Gin 


Ala 


Phe 


Lys 


Glu 


Val 






435 










440 










445 








Ser 


Ser 


Ala 


Thr 


Pro 


Asp 


Ala 


Met 


Ala 


Glu 


His 


Ser 


Glu 


Tyr 


Val 


Thr 




450 










455 










460 










Val 


Phe 


Leu 


Glu 


Gin 


Val 


He 


Lys 


Leu 


Leu 


Arg 


Asn 


Arg 


Gly 


Gin 


Leu 


465 










470 










475 










480 


Val 


Ser 


Arg 


Ser 


He 


Ser 


Met 


Ala 


Gly 


Val 


Asp 


Cys 


Ser 


Thr 


Thr 


Asn 
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485 



Cys 


Lys 


Asp 


Leu 


Gin 


Lys 


Tyr 


Ala 








500 










Leu 


Asn 


Ala 


Arg 


Gin 


Val 


Leu 


He 






515 










520 


Val 


Arg 


Arg 


Ala 


Phe 


Thr 


Lys 


He 




530 










535 




lie 


Glu 


Glu 


Asp 


Val 


Glu 


Lys 


Leu 


545 










550 






Arg 


Lys 


Arg 


Lys 


Arg 


Ala 


Ala 


Asn 










565 








Glu 


Asn 


He 


Asp 


Arg 


Arg 


Phe 


Lys 








580 










Val 


Ala 


His 


Asp 


Gly 


Lys 


Thr 


Thr 






595 










600 


lie 


Lys 


Thr 


Tyr 


Ser 


Gin 


Lys 


Lys 




610 










615 




Gly 


He 


He 


Thr 


Ser 


Pro 


Glu 


Val 


625 










630 






Ala 


Gin 


Asp 


Leu 


Tyr 


Asp 


Ala 


He 










645 








Gly 


Lys 


Thr 


Asp 


Phe 


Asn 


Glu 


Asp 








660 










Asp 


Gly 


Trp 


Gly 


Glu 


Val 


Phe 


Asp 






675 










680 


Arg 


Gin 


Tyr 


Phe 


He 


Asn 


Thr 


Ala 




690 










695 




Gly 


Ala 


Leu 


Leu 


Asn 


He 


Lys 


Ala 


705 










710 






Arg 


Leu 


Arg 


Thr 


Thr 


Asn 


Glu 


Ser 










725 








Gin 


Thr 


Gly 


Asp 


Ser 


Phe 


Leu 


Lys 








740 










Thr 


He 


Ala 


Lys 


Asp 


Glu 


Ser 


Asp 






755 










760 


Gly 


Thr 


Phe 


Asp 


Ser 


Ser 


Pro 


Val 




770 










775 




Arg 


Leu 


Leu 


Cys 


Arg 


Ala 


Val 


Met 


785 










790 






Leu 


Lys 


Asn 


Lys 


Thr 


Asn 


Arg 


Arg 










805 








Val 


Pro 


Ser 


Val 


Pro 


Asp 


Thr 


Glu 








820 










Asp 


Ser 


He 


Thr 


Gin 


Ser 


He 


Gly 






835 










840 


Asn 


Arg 


Lys 


Leu 


Cys 


Ser 


Arg 


He 




850 










855 




Lys 


Asn 


Asn 


Thr 


Lys 


Thr 


Ala 


Asn 


865 










870 






Asn 


Asn 


Thr 


Lys 


Glu 


Val 


Asp 


Val 










885 








Thr 


Gly 


Leu 


Ser 


Leu 


His 


Asp 


Ser 








900 










Arg 


Tyr 


His 


Cys 


Val 


Leu 


Asp 


Val 






915 










920 


Gin 


Thr 


He 


Gly 


Arg 


Thr 


Asn 


Gly 




930 










935 




He 


Phe 


Ser 


Thr 


Asp 


Ser 


Pro 


Ala 


945 










950 






Asp 


Lys 


Arg 


He 


Lys 


Asp 


Ser 


Lys 



965 





490 










495 




He 


Asp 


Glu 


Leu 


Ala 


Ser 


Tyr 


Cys 


505 










510 






Asp 


Ser 


Glu 


Lys 


Val 


Gly 


Gly 


His 










525 








He 


Arg 


Glu 


His 


Gin 


Glu 


Gly 


Gly 








540 










Val 


Ala 


Glu 


He 


Asn 


Ser 


Pro 


Ser 






555 










560 


Asp 


Asp 


Asp 


Leu 


Tyr 


Glu 


Val 


Met 




570 










575 




Val 


Val 


Val 


Val 


Arg 


Asp 


Arg 


Asp 


585 










590 






Leu 


Arg 


Ser 


He 


Val 


Gin 


Asp 


Ala 










605 








Asp 


Ala 


Leu 


Ser 


Asn 


Gly 


Gly 


Gly 








620 










Asp 


He 


Ser 


Ser 


He 


Asp 


Met 


Val 






635 










640 


Lys 


Lys 


Lys 


Glu 


Lys 


Pro 


Ser 


Lys 




650 










655 




Tyr 


Asp 


Asp 


Gly 


Ala 


Asn 


Glu 


Glu 


665 










670 






Asp 


Glu 


Cys 


Phe 


Glu 


Lys 


Leu 


Arg 










685 








Ser 


Thr 


Ser 


Val 


Ala 


Ala 


Cys 


Lys 








700 










Thr 


Ser 


Val 


Thr 


Asp 


Ala 


Val 


Lys 






715 










720 


Lys 


Lys 


Met 


Val 


Met 


Ser 


Leu 


Glu 




730 










735 




Asn 


Leu 


Thr 


Thr 


Arg 


He 


Leu 


Gin 


745 










750 






Ala 


Lys 


Tyr 


Gly 


He 


Val 


Asp 


Val 










765 








Ala 


Asn 


Thr 


He 


Phe 


Ser 


Gly 


Tyr 








780 










Met 


Ala 


Ser 


Ala 


Phe 


Thr 


He 


Ser 






795 










800 


Thr 


Ser 


Pro 


Ala 


His 


Val 


Met 


Leu 




810 










815 




Pro 


Leu 


Met 


Ala 


Gly Asn 


Pro 


He 


825 










830 






Glu 


Asp 


Ser 


Asn 


Ala 


Glu 


He 


Thr 










845 








Thr 


Asn 


Arg 


Gly 


Leu 


Phe 


Leu 


Val 








860 










Thr 


Asn 


Lys 


Cys 


He 


Ser 


Ala 


Phe 






875 










880 


He 


Met 


Leu 


Gly 


Pro 


Lys 


Gly 


Asn 




890 










895 




Ser 


Asn 


Asn 


Ser 


Met 


Tyr 


Ala 


Lys 


905 










910 






Pro 


Tyr 


Asn 


Ala 


He 


Ala 


Phe 


Leu 










925 








Gin 


Leu 


Ser 


Val 


Pro 


Gin 


Phe 


Leu 








940 










Glu 


Arg 


Arg 


Phe 


Phe 


Asp 


Ser 


Leu 






955 










960 


Ala 


Gly 


Thr 


Tyr 


Ala 


Asp 


Arg 


Tyr 




97 0 










975 
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Ser Asn Asn Ser lie Asp He Ala Ala Ala Val Met Arg Glu Gin Phe 

980 985 990 

He Asp Gin Gly Leu Val Leu Lys Thr Met Gly Asn He Val Gin He 

995 1000 1005 

Val Thr Ala Ser Met Thr Lys Val His Leu Met Glu His Phe Ser Lys 

1010 1015 1020 

Met Met Met Arg Thr Asn Arg Gly Gly Val Ala Phe Val Glu Gly Leu 

1025 1030 1035 1040 

Thr Leu Glu Asn Gly He Phe Thr Glu Val He Val Leu Ala Met His 

1045 1050 1055 

He Ala Leu Val Val He Gly Ala Gin Asn Lys He Thr Ser Ser Asp 

1060 1065 1070 

Asp Leu Gly His Ala Leu Ser Phe Thr Ser Val Leu Pro His Asn Gin 

1075 1080 1085 

He Leu Ser He Val Lys Ser Ala Ser Gin Phe Val Phe Ser Asn Leu 

1090 1095 1100 

Cys Leu His Leu Val His Phe Lys Ser Asp Cys Asp Asn Leu Leu Pro 

1105 1110 1115 1120 

Arg Glu Lys Arg Val Arg Asp Ala Ala Ser Ala Leu He Asp Thr Leu 

1125 1130 1135 

Asn Thr Lys Asn Asn Glu Val Thr Ser Lys Thr Asn Lys He Glu Ser 

1140 1145 1150 

Asp Ala Pro Ser Leu Thr Ala Leu Met Leu Pro Ser Gly Pro Arg Asn 

1155 1160 1165 

Arg Lys Met Asp Val Phe Ser Asn He Met Ala Tyr Asn Asn Asn Asn 

1170 1175 1180 

Gly Met Asp Phe Asp Glu Asp Val Pro Asp Asn Asp Glu Asp Glu Gly 

1185 1190 1195 1200 

Cys Leu Pro Leu Gin Glu Glu Asn Ala Thr Thr Leu Ala Leu Ser Asn 

1205 1210 1215 

Phe Pro His Asp Tyr Asp Arg Ala He Lys Asp Ala His Gin Leu Val 

1220 1225 1230 

Thr Val Arg He Val Gly Gin Gly Glu Lys Glu Gly Val He Pro He 

1235 1240 1245 

Ser Glu Cys Leu Asp Val Pro Glu Leu Asp Met Thr Asn Leu He Pro 

1250 1255 1260 

Val Val Thr Ala Thr Asn Val He Gin Ser Leu Ala Lys Glu Asn Pro 

1265 1270 1275 1280 

Gly Leu Leu Phe Thr He His Asn Ala Ala His Ser His Arg Glu Gly 

1285 1290 1295 

Tyr Gly Gly Ser His Leu Leu Gly Leu Ala Lys Lys Leu Ser Arg Gly 

1300 1305 1310 

Phe He Asn Phe Arg Gin Phe Gin Asn Gin Leu Phe Ser Pro Lys Lys 

1315 1320 1325 

Glu Ser Lys He Met Tyr Asp He Phe Leu Ser Val Lys Ala He Met 

1330 1335 1340 

Ala Arg Asp Asp Arg Tyr Asp Gly Leu Cys Asp Met Arg Met Asn Ser 

1345 1350 1355 1360 

Met Met Asp Ala Ser Phe Leu Lys Val Arg Lys Lys Pro Glu Cys Val 

1365 1370 1375 

Phe He Thr Lys Leu Leu Asp Lys Asn Phe Arg Arg His He He Asn 

1380 1385 1390 

Asp Glu Glu Glu Glu Thr Arg Glu Arg Phe Gly Gly Glu Glu Glu Glu 

1395 1400 1405 

Glu Asp Asp Asp Glu Glu Phe Glu Asp Glu Glu Glu Glu Gin Ala Glu 

1410 1415 1420 

Arg Glu Trp Gly Glu Glu Glu Gly Glu Ser Ala Tyr Asp He Ser Val 

1425 1430 1435 1440 

He Asn Asp Lys Asn Asn Thr He Gly His Asp Val Asp He He Leu 

1445 1450 1455 

Cys Asn Arg Lys Lys Leu Thr Leu Thr Lys Glu Asn Ser Val Phe Val 
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1460 1465 1470 

Asn Glu His He Asp Ser Phe Met Val Gly Asn Leu He Gly Ala Glu 

1475 1480 1485 

Gly Ser Leu He Gin He Cys Phe Asp Asn Cys Thr Gly Glu Phe Glu 

1490 1495 1500 

Gly Leu Pro Lys Phe Cys Leu Tyr Asp Ser Ser Ser Lys Asp Lys Asp 
1505 1510 1515 1520 

Thr He Pro 



<210> 94 
<211> 2919 
<212> DNA 
<213> SHRIMP 



<400> 94 

atggatgaac 

gcgcaaataa 

gctgcaatgg 

ttagggatga 

gaatacctag 

caccccgctc 

tcagaaacga 

tcatctcttc 

tctaacaaat 

aaggataata 

atccatcacc 

tcagaaggac 

aacccagttc 

gttcgtgtga 

gatcctaaac 

accactggag 

gatgatgggg 

acatcgatga 

tcttcgacac 

ttccgcaact 

attaaatgca 

aagaaggatg 

agaggtatgt 

tttgttacgt 

tcaaacaggc 

atgggaattt 

aaatcaagta 

acgcctttaa 

agcactgttt 

ttacatagaa 

ttctgtactg 

caaaaaccct 

tcgggtagtg 

ccctcctctt 

ggtgaccaaa 

agattgtctc 

atatatctca 

atcataaaaa 

ggaccttatg 

aaacctgcat 

cccacatcct 

cacaa tagga 

cccggaagtg 

gacgacgatg 



gacgaagaga 
cgttattcgt 
cctataatgt 
aaagtaaatt 
aagccagagg 
tcctggacat 
aggaaaggaa 
tcgtgcctga 
atctagatgc 
tagaaaaaga 
atagtgcatc 
atgatgaaga 
ctttgaaaga 
tgacaccatt 
cttattgttg 
aacgtgtagt 
gctggcaatg 
ctgcttgcgc 
caatctatta 
cttcttacat 
acgatcctga 
tactaaataa 
atgaatgtat 
t tgatgtgag 
atgcaataat 
tcttggccga 
tcgacgaaac 
acctggatgc 
tacctcctct 
gt tgtttagc 
caccctt tta 
atgaacatag 
ttcacggcgg 
atatcagaac 
ggctggaaga 
agacagaata 
aatcaagttc 
aatctgagga 
ctgctactgt 
caaaaacttc 
actctgttct 
tatttgacag 
aaaaattgtt 
cgtggggttt 



tccgttactt 
gaccgtgttc 
agccaaacca 
gaaagcagtt 
taggcatgac 
gatgaatttg 
ttctcggttc 
acaccatata 
tgtatcgggg 
cgatactgaa 
aggcgtaagt 
tgtggatatt 
gggacaaact 
tgttgctgga 
gtcaggaaac 
taaagagtgt 
cagacctaaa 
tttcaatcct 
tgatgtatta 
ctccaaattg 
atttttatac 
cgaccttctt 
ggaaaacccc 
cacaatgaag 
aggcgatgac 
ccaatctaaa 
tactgccaag 
aacaaatact 
agaaaatatt 
tcccgtcctg 
cattgagcca 
tatgttggcc 
atctgaactt 
tccccctgga 
aataagggac 
cgtaataaag 
ttgggatagt 
tacttttgta 
gctagcgcga 
cagtacattg 
tcccgaagaa 
cgaaacaatc 
cggaacatca 
gcaatcgttt 



taccctacca 
gtgttaggct 
atgtctgtta 
caaggagcta 
ggggtggaag 
actgttaaag 
gaaagtcatg 
gatagattga 
aaaaagttta 
ttgtacgatt 
ggtgacgctc 
ttggcctaca 
tgcacatctg 
ggaacattta 
gtgcccggtg 
tctgtgcaca 
tacccaacat 
tcaacccaca 
aagaaacaac 
cgtcaatctt 
aaaaacccca 
tctcaagatg 
tgtgttatga 
tgtgtaccgg 
aggacaccgt 
agaggggacc 
aaaattgcac 
tctaagaatg 
ccccatgtaa 
aacaaacctt 
gcagctaatg 
acagaatgtt 
ttattctcta 
ggtactccag 
ttttttgaac 
aaacacgcga 
ttaatgaaga 
ttgaaagagg 
gatatgtttg 
aaagtgtcca 
ggagctaccg 
cccagctatt 
tcttcctcat 
cgtttggatc 



accgaagtat 
gtatttttgc 
attttcaagc 
accctgaaaa 
atgcaagcaa 
ggaataaatg 
atttagctgc 
gtgaggccac 
gacaaaggat 
ctttatttgg 
ctcctcctcc 
atactggcgg 
tgtgttatac 
tcacacataa 
accacataga 
catctatagt 
atttcggagg 
aaggaccacc 
aaatacggaa 
cttccttggc 
tcacctgttt 
taacaaagga 
tgcccaacat 
gcgttaataa 
tagtggggac 
agattcacca 
tggcccaagc 
tgctctttgt 
tcatccgccc 
cctcggggca 
ttttggccgg 
tgcgaaatag 
cacttttgag 
ctccagagta 
gtaattttaa 
gagggatgag 
ggaaagaatt 
ggttattaat 
atttagatta 
atcctttgca 
acgatatttt 
ttgattgctc 
ccgagtacag 
ataaccccaa 



gcgcttcacc 
gctagtctgc 
tatacacgaa 
gacactagaa 
ttacccacct 
gaacgttccg 
aaatagatcc 
ca tcgaaaaa 
ggtcaat t tg 
tgtacatgat 
tccatctaca 
gtactgttcc 
ctcgagagca 
atctggagaa 
aacatcccca 
cgttctgaca 
ttcgggagga 
acctccttca 
tcatacagaa 
ggaattcaaa 
ctgtaataat 
tatgaaattc 
agaccct teg 
tccccaagac 
tgt tcctgcc 
gcaaagacct 
gcccatca tc 
accaatacct 
gtcttcattg 
gcataggcca 
aaacatacct 
tcgcatggtt 
tcaaaataaa 
caattctaca 
cgatgagagg 
gacaagtgaa 
tctaagacat 
gcgttcatac 
cctaaaagga 
gtacgctttc 
ttcagtagac 
caatgtaact 
agtggatatt 
gagtggcccg 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 
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gtagtacagt cagaccctcg tctagcgttt gatgcgtcaa acatttcctc aacccccgaa 2700 

ggagctacaa taacacccct ctctctattc aagaagtcac tcgtcgagtg ggggcataag 27 60 

aaggcggacg tgcaagaaac ttcgtggttc agggacggtg tggatacttc tgaggcttat 2820 

agacgattac ttgtagagac tagtatggct gtacgtaact cctggttctc tttagcatgg 2880 

gaaaataaaa actattattt tgccaaaaat agcagctaa 2919 



<210> 95 
<211> 962 
<212> PRT 
<213> SHRIMP 



<400> 95 



Met 


Asp 


Glu 


Arg 


Arg 


Arg 


Asp 


Pro 


Leu 


Leu 


Tyr 


Pro 


Thr 


Asn 


Arg 


Ser 


1 








5 










10 










15 




Met 


Arg 


Phe 


Thr 


Ala 


Gin 


He 


Thr 


Leu 


Phe 


Val 


Thr 


Val 


Phe 


Val 


Leu 








20 










25 










30 






Gly 


Cys 


He 


Phe 


Ala 


Leu 


Val 


Cys 


Ala 


Ala 


Met 


Ala 


Tyr 


Asn 


Val 


Ala 






35 










40 










45 








Lys 


Pro 


Met 


Ser 


Val 


Asn 


Phe 


Gin 


Ala 


He 


His 


Glu 


Leu 


Gly 


Met 


Lys 




50 










55 










60 






Ser 


Lys 


Leu 


Lys 


Ala 


Val 


Gin 


Gly 


Ala 


Asn 


Pro 


Glu 


Lys 


Thr 


Leu 


Glu 


65 










70 










75 










80 


Glu 


Tyr 


Leu 


Glu 


Arg 


Arg 


His 


Asp 


Gly 


Val 


Glu 


Asp 


Asn 


Tyr 


Pro 


Pro 










85 










90 










95 




His 


Pro 


Ala 


Leu 


Leu 


Asp 


Met 


Met 


Asn 


Leu 


Thr 


Val 


Lys 


Gly 


Asn 


Lys 








100 










105 










110 






Trp 


Asn 


Val 


Pro 


Ser 


Glu 


Thr 


Lys 


Glu 


Arg 


Asn 


Ser 


Arg 


Phe 


Glu 


Ser 






115 










120 










125 








His 


Asp 


Leu 


Ala 


Ala 


Asn 


Arg 


Ser 


Ser 


Ser 


Leu 


Leu 


Val 


Pro 


Glu 


His 




130 




Arg 






135 










140 










His 


lie 


Asp 


Leu 


Ser 


Glu 


Ala 


Thr 


He 


Glu 


Lys 


Ser 


Asn 


Lys 


Tyr 


145 










150 










155 










160 


Leu 


Asp 


Ala 


Val 


Ser 


Gly 


Lys 


Lys 


Phe 


Arg 


Gin 


Arg 


Met 


Val 


Asn 


Leu 










165 










170 










175 




Lys 


Asp 


Asn 


He 


Glu 


Lys 


Asp 


Asp 


Thr 


Glu 


Leu 


Tyr 


Asp 


Ser 


Leu 


Phe 








18 0 










185 










190 






Gly 


Val 


His 


Asp 


He 


His 


His 


His 


Ser 


Ala 


Ser 


Gly 


Val 


Ser 


Gly 


Asp 






195 










200 










205 








Ala 


Pro 


Pro 


Pro 


Pro 


Pro 


Ser 


Thr 


Ser 


Glu 


Gly 


His 


Asp 


Glu 


Asp 


Val 




210 










215 










220 










Asp 


He 


Leu 


Ala 


Tyr 


Asn 


Thr 


Gly 


Gly 


Tyr 


Cys 


Ser 


Asn 


Pro 


Val 


Pro 


225 










230 










235 










240 


Leu 


Lys 


Glu 


Gly 


Gin 


Thr 


Cys 


Thr 


Ser 


Val 


Cys 


Tyr 


Thr 


Ser 


Arg 


Ala 










245 










250 










255 




Val 


Arg 


Val 


Met 


Thr 


Pro 


Phe 


Val 


Ala 


Gly 


Gly 


Thr 


Phe 


He 


Thr 


His 








260 










265 










270 






Lys 


Ser 


Gly 


Glu 


Asp 


Pro 


Lys 


Pro 


Tyr 


Cys 


Trp 


Ser 


Gly 


Asn 


Val 


Pro 






275 










280 










285 








Gly 


Asp 


His 


He 


Glu 


Thr 


Ser 


Pro 


Thr 


Thr 


Gly 


Glu 


Arg 


Val 


Val 


Lys 




290 










295 










300 










Glu 


Cys 


Ser 


Val 


His 


Thr 


Ser 


He 


Val 


Val 


Leu 


Thr 


Asp 


Asp 


Gly 


Gly 


305 










310 










315 










320 


Trp 


Gin 


Cys 


Arg 


Pro 


Lys 


Tyr 


Pro 


Thr 


Tyr 


Phe 


Gly 


Gly 


Ser 


Gly 


Gly 










325 










330 










335 




Thr 


Ser 


Met 


Thr 


Ala 


Cys 


Ala 


Phe 


Asn 


Pro 


Ser 


Thr 


His 


Lys 


Gly 


Pro 








340 










345 










350 






Pro 


Pro 


Pro 


Ser 


Ser 


Ser 


Thr 


Pro 


He 


Tyr 


Tyr 


Asp 


Val 


Leu 


Lys 


Lys 






355 










360 










365 








Gin 


Gin 


He 


Arg 


Asn 


His 


Thr 


Glu 


Phe 


Arg 


Asn 


Ser 


Ser 


Tyr 


He 


Ser 




370 










375 










380 










Lys 


Leu 


Arg 


Gin 


Ser 


Ser 


Ser 


Leu 


Ala 


Glu 


Phe 


Lys 


He 


Lys 


Cys 


Asn 



wo 01/38351 
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^5 Q 
Jj tf D 










390 










395 










400 


Asp 


Pro 


Glu 


Phe 


Leu 


Tyr 


Lys 


Asn 


Pro 


He 


Thr 


Cys 


Phe 


Cys 


Asn 


Asn 










405 










410 










415 




Lys 


Lys 


Asp 


Val 


Leu 


Asn 


Asn 


Asp 


Leu 


Leu 


Ser 


Gin 


Asp 


Val 


Thr 


Lys 








420 










425 










430 






Asp 


Met 


Lys 


Phe 


Arg 


Gly 


Met 


Tyr 


Glu 


Cys 


Met 


Glu 


Asn 


Pro 


Cys 


Val 






435 










440 










445 








Met 


Met 


Pro 


Asn 


He 


Asp 


Pro 


Ser 


Phe 


Val 


Thr 


Phe 


Asp 


Val 


Ser 


Thr 




4 50 










455 










460 








Met 


Lys 


Cys 


Val 


Pro 


Gly 


Val 


Asn 


Asn 


Pro 


Gin 


Asp 


Ser 


Asn 


Arg 


His 


4 65 










470 










475 








480 


Ala 


He 


He 


Gly 


Asp 


Asp 


Arg 


Thr 


Pro 


Leu 


Val 


Gly 


Thr 


Val 


Pro 


Ala 










485 










490 










495 




Met 


Gly 


He 


Phe 


Leu 


Ala 


Asp 


Gin 


Ser 


Lys 


Arg 


Gly 


Asp 


Gin 


He 


His 








500 










505 










510 






Gin 


Gin 


Arg 


Pro 


Lys 


Ser 


Ser 


He 


Asp 


Glu 


Thr 


Thr 


Ala 


Lys 


Lys 


He 






515 










520 










525 








Ala 


Gin 


Ala 


Pro 


He 


He 


Thr 


Pro 


Leu 


Asn 


Leu 


Asp 


Ala 


Thr 


Asn 


Thr 




530 










535 










540 










Ser 


Lys 


Asn 


Val 


Leu 


Phe 


Val 


Pro 


He 


Pro 


Ser 


Thr 


Val 


Leu 


Pro 


Pro 


/I 










550 










555 










560 


Leu 


Glu 


Asn 


He 


Pro 


His 


Val 


He 


He 


Arg 


Pro 


Ser 


Ser 


Leu 


Leu 


His 










565 










570 










575 




Arg 


Ser 


Cys 


Leu 


Ala 


Pro 


Val 


Leu 


Asn 


Lys 


Pro 


Ser 


Ser 


Gly 


Gin 


His 








580 










585 










590 






Arg 


Pro 


Phe 


Cys 


Thr 


Ala 


Pro 


Phe 


Tyr 


He 


Glu 


Pro 


Ala 


Ala 


Asn 


Val 






595 










600 










605 








Leu 


Ala 


Gly 


Asn 


He 


Pro 


Gin 


Lys 


Pro 


Tyr 


Glu 


His 


Ser 


Met 


Leu 


Ala 




bl U 










615 










620 










Thr 


Glu 


Cys 


Leu 


Arg 


Asn 


Ser 


Arg 


Met 


Val 


Ser 


Gly 


Ser 


Val 


His 


Gly 


iT O C 

625 










630 










635 










640 


Gly 


Ser 


Glu 


Leu 


Leu 


Phe 


Ser 


Thr 


Leu 


Leu 


Ser 


Gin 


Asn 


Lys 


Pro 


Ser 










645 










650 










655 




Ser 


Tyr 


He 


Arg 


Thr 


Pro 


Pro 


Gly 


Gly 


Thr 


Pro 


Ala 


Pro 


Glu 


Tyr 


Asn 








660 










665 










670 




Ser 


Thr 


Gly 


Asp 


Gin 


Arg 


Leu 


Glu 


Glu 


He 


Arg 


Asp 


Phe 


Phe 


Glu 


Arg 






675 










680 










685 






Asn 


Phe 


Asn 


Asp 


Glu 


Arg 


Arg 


Leu 


Ser 


Gin 


Thr 


Glu 


Tyr 


Val 


He 


Lys 




^ A 










695 










700 








Lys 


His 


Arg 


Met 


Arg 


Thr 


Ser 


Glu 


He 


Tyr 


Leu 


Lys 


Ser 


Ser 


Ser 


Trp 


"inc. 

/ub 










710 










715 










720 


Asp 


Ser 


Leu 


Met 


Lys 


Arg 


Lys 


Glu 


Phe 


Leu 


Arg 


His 


He 


He 


Lys 


Lys 










725 










730 










735 




Ser 


Glu 


Asp 


Thr 


Phe 


Val 


Leu 


Lys 


Glu 


Gly 


Leu 


Leu 


Met 


Arg 


Ser 


Tyr 








74 0 










745 










750 




Gly 


Pro 


Tyr 


Ala 


Ala 


Thr 


Val 


Leu 


Ala 


Arg 


Asp 


Met 


Phe 


Asp 


Leu 


Asp 
















7 60 










765 






Tyr 


Leu 


Lys 


Kjiy 


Lys 


Pro 


Ala 


Ser 


Lys 


Thr 


Ser 


Ser 


Thr 


Leu 


Lys 


Val 




770 










775 










780 








Ser 


Asn 


Pro 


Leu 


Gin 


Tyr 


Ala 


Phe 


Pro 


Thr 


Ser 


Tyr 


Ser 


Val 


Leu 


Pro 


785 










790 










795 








800 


Glu 


Glu 


Gly 


Ala 


Thr 


Asp 


Asp 


He 


Phe 


Ser 


Val 


Asp 


His 


Asn 


Arg 


He 










805 










810 










815 




Phe 


Asp 


Ser 


Glu 


Thr 


He 


Pro 


Ser 


Tyr 


Phe 


Asp 


Cys 


Ser 


Asn 


Val 


Thr 








820 










825 










830 






Pro 


Gly 


Ser 


Glu 


Lys 


Leu 


Phe 


Gly 


Thr 


Ser 


Ser 


Ser 


Ser 


Ser 


Glu 


Tyr 






835 










840 










845 






Arg 


Val 


Asp 


He 


Asp 


Asp 


Asp 


Ala 


Trp 


Gly 


Leu 


Gin 


Ser 


Phe 


Arg 


Leu 




850 










855 










860 








Asp 


His 


Asn 


Pro 


Lys 


Ser 


Gly 


Pro 


Val 


Val 


Gin 


Ser 


Asp 


Pro 


Arg 


Leu 


865 










870 










875 








880 
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Ala Phe Asp Asn 

Leu Ser Leu Phe 
900 

Asp Val Gin Glu 
915 

Ala Tyr Arg Arg 

930 

Trp Phe Ser Leu 
945 

Ser Ser 



lie Ser Ser Thr 
885 

Lys Lys Ser Leu 

Thr Ser Trp Phe 
920 

Leu Leu Val Glu 
935 

Ala Trp Glu Asn 
950 



Pro Glu Gly Ala 
890 

Val Glu Trp Gly 
905 

Arg Asp Gly Val 

Thr Ser Met Ala 
940 

Lys Asn Tyr Tyr 
955 



Thr lie Thr Pro 
895 

His Lys Lys Ala 
910 

Asp Thr Ser Glu 
925 

Val Arg Asn Ser 

Phe Ala Lys Asn 
960 



<210> 96 
<211> 681 
<212> DNA 
<213> SHRIMP 



<400> 96 

atggcctcag 

gttcctaccc 

attatggaga 

ttggcgtatt 

cagatcaagt 

gagatcaagg 

atcttcggaa 

aagggtgtaa 

ttgtggatgt 

aatgcatcta 

gtgtcgtacg 

tctgagtttt 



tctttgaaga 
gccccaatat 
accgcctgat 
tcttcccctc 
cgaattgcct 
actgtggacc 
acagtgtttt 
cattctatat 
gtgacaaact 
agccaaaatt 
ttattgaggg 
tggtacaata 



ccctgctgat 
cctgttcttc 
ccataaaggg 
caacaatgaa 
gtctttcttt 
atactgtact 
tggacaagac 
atttgttgat 
gaagagaatg 
ttctgtatgt 
ccacggaatg 



ctctttgcca 
gaaggcctac 
aagtgtggag 
gaaaatacga 
attagggatt 
ttttcccaat 
tttactattg 
ggatctgatt 
aatgccaatg 
aaaacatgta 
ggacattcag 



acatggactt 
tccccaattc 
cat ttgaagc 
agaaactcaa 
ttttgaacga 
acatggatgg 
tcgccatgga 
ctatggagaa 
tggtaaaagt 
ggtgggagtt 
atttgtcatg 



gactggcaaa 
tggcaaggag 
agatacacaa 
cattggtttc 
ctggttggag 
ggacaaagaa 
ctggatcgac 
tatggcctct 
gtttgttgat 
cccaggtcca 
tgatgagatt 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

681 



<210> 97 
<211> 224 
<212> PRT 
<213> SHRIMP 



<400> 97 
Met Ala Ser Val 
1 

Leu Thr Gly Lys 
20 

Pro Asn Ser Gly 
35 

Lys Cys Gly Ala 
50 

Ser Asn Asn Glu 
65 

Lys Ser Asn Cys 

Leu Glu Glu lie 
100 

Met Asp Gly Asp 
115 

Phe Thr He Val 
130 

He Phe Val Asp 
145 

Met Cys Asp Lys 



Phe 


Glu 


Asp 


Pro 


5 








Val 


Pro 


Trp 


Asn 


Lys 


Glu 


He 


Met 








40 


Phe 


Glu 


Ala 


Asp 






55 




Glu 


Asn 


Thr 


Lys 




70 






Leu 


Ser 


Phe 


Phe 


85 








Lys 


Asp 


Cys 


Gly 


Lys 


Glu 


He 


Phe 








120 


Ala 


Met 


Asp 


Trp 






135 




Gly 


Ser 


Asp 


Ser 




150 






Leu 


Lys 


Arg 


Met 



165 



Ala Asp Leu Phe 
10 

He Leu Phe Phe 

25 

Glu Asn Arg Leu 

Thr Gin Leu Ala 
60 

Lys Leu Asn He 
75 

He Arg Asp Phe 
90 

Pro Tyr Cys Thr 
105 

Gly Asn Ser Val 

He Asp Lys Gly 
140 

Met Glu Asn Met 
155 

Asn Ala Asn Val 
170 



Ala 


Asn 


Met 


Asp 






15 




Glu 


Gly 


Leu 


Leu 




30 






He 


His 


Lys 


Gly 


45 








Tyr 


Phe 


Phe 


Pro 


Gly 


Phe 


Gin 


He 








80 


Leu 


Asn 


Asp 


Trp 






95 




Phe 


Ser 


Gin 


Tyr 




110 






Phe 


Gly 


Gin 


Asp 


125 








Val 


Thr 


Phe 


Tyr 


Ala 


Ser 


Leu 


Trp 








160 


Val 


Lys 


Val 


Phe 



175 
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Val Asp Asn Ala Ser Lys Pro Lys Phe Ser Val Cys Lys Thr Cys Arg 

180 185 190 

Trp Glu Phe Pro Gly Pro Val Ser Tyr Val lie Glu Gly His Gly Met 

195 200 205 

Gly His Ser Asp Leu Ser Cys Asp Glu lie Ser Glu Phe Leu Val Gin 

210 215 220 



<210> 98 
<211> 747 
<212> DNA 
<213> SHRIMP 



<400> 98 

atgattgcca tcgccaatca caaacacgat gtctctgatg cactagttgg agcccatggg 60 

gcaaagatta atatgttgta tggtaaatct agtactttaa gtgtaactga agcagcatta 120 

ttgatgttca atgataccgc attaacacag tttgcccaga gaggatacga gcctagtata 180 

cccaccatat tgaaagctgc tttagatttc tctctccaag aagaagaacc ccttgttgct 240 

gccaccggtc tcgatgtcaa taaagcacct cgttcttggc caatactgaa ttgtcgcttg 300 

gggtatattg catcctcaaa ttatccttgg gctgaacaca taatttctgg ggataaggaa 360 

gaaattaaaa gggctctaga agaacatgag aagaatgcta atgtgcgttt cgacagcgat 420 

aattgtccag tgtgtttaga agatttcagt agtaccaata tcatcaggac gacacgctgt 480 

ggacattgta ttgatgaaaa atgttgggac agattggtgt tgagtacgca acgtggagaa 540 

attacccggt gtcctgtatg cagagaacgt acttccctaa gacctgacgc tgatcaagtt 600 

aaagagatgt tggttgaacc aatagtgtct tgtaaaagaa tggcagtgcc tgatgaacaa 660 

gtatcttgta aacgtagaag aatagggtat aatagatacc agttcctaat aaatgatgtg 720 
tggacagatg agtccgaaac tgtatga 747 

<210> 99 
<211> 248 
<212> PRT 
<213> SHRIMP 



<400> 99 



Met 


He 


Ala 


He 


Ala 


Asn 


His 


Lys 


His 


Asp 


Val 


Ser 


Asp 


Ala 


Leu 


Val 


1 








5 










10 










15 




Gly 


Ala 


His 


Gly 


Ala 


Lys 


He 


Asn 


Met 


Leu 


Tyr 


Gly 


Lys 


Ser 


Ser 


Thr 








20 










25 










30 






Leu 


Ser 


Val 


Thr 


Glu 


Ala 


Ala 


Leu 


Leu 


Met 


Phe 


Asn 


Asp 


Thr 


Ala 


Leu 






35 










40 










45 








Thr 


Gin 


Phe 


Ala 


Gin 


Arg 


Gly 


Tyr 


Glu 


Pro 


Ser 


He 


Pro 


Thr 


He 


Leu 




50 










55 










60 










Lys 


Ala 


Ala 


Leu 


Asp 


Phe 


Ser 


Leu 


Gin 


Glu 


Glu 


Glu 


Pro 


Leu 


Val 


Ala 


65 










70 










75 










80 


Ala 


Thr 


Gly 


Leu 


Asp 


Val 


Asn 


Lys 


Ala 


Pro 


Arg 


Ser 


Trp 


Pro 


He 


Leu 










85 










90 










95 




Asn 


Cys 


Arg 


Leu 


Gly 


Tyr 


He 


Ala 


Ser 


Ser 


Asn 


Tyr 


Pro 


Trp 


Ala 


Glu 








100 










105 










110 






His 


He 


He 


Ser 


Gly 


Asp 


Lys 


Glu 


Glu 


He 


Lys 


Arg 


Ala 


Leu 


Glu 


Glu 






115 










120 










125 








His 


Glu 


Lys 


Asn 


Ala 


Asn 


Val 


Arg 


Phe 


Asp 


Ser 


Asp 


Asn 


Cys 


Pro 


Val 




130 










135 










140 










Cys 


Leu 


Glu 


Asp 


Phe 


Ser 


Ser 


Thr 


Asn 


He 


He 


Arg 


Thr 


Thr 


Arg 


Cys 


145 










150 










155 






160 


Gly 


His 


Cys 


He 


Asp 


Glu 


Lys 


Cys 


Trp 


Asp 


Arg 


Leu 


Val 


Leu 


Ser 


Thr 










165 










170 










175 




Gin 


Arg 


Gly 


Glu 


He 


Thr 


Arg 


Cys 


Pro 


Val 


Cys 


Arg 


Glu 


Arg 


Thr 


Ser 








180 










185 










190 






Leu 


Arg 


Pro 


Asp 


Ala 


Asp 


Gin 


Val 


Lys 


Glu 


Met 


Leu 


Val 


Glu 


Pro 


He 






195 










200 










205 








Val 


Ser 


Cys 


Lys 


Arg 


Met 


Ala 


Val 


Pro 


Asp 


Glu 


Gin 


Val 


Ser 


Cys 


Lys 
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210 215 220 

Arg Arg Arg lie Gly Tyr Asn Arg Tyr Gin Phe Leu lie Asn Asp Val 
225 230 235 240 

Trp Thr Asp Glu Ser Glu Thr Val 
245 



<210> 100 
<211> 1788 
<212> DNA 
<213> SHRIMP 



<400> 100 

atggctgaag 

ccatatatgt 

attacctcca 

gaagagttta 

cccatctgcg 

ggcctttacg 

atggataatg 

agcagaactg 

atgagaagga 

gcaagaagtg 

ttttcaagaa 

cttgaacctt 

acaaaggcaa 

ctagaaacat 

agatctatag 

attatcaagt 

actacaaaaa 

gaaacaatcc 

gccgaaatga 

ttgtgtggtg 

aattacttca 

gataatctta 

gaagttaaag 

gcttttctta 

ctattgtgtt 

aaacttaaca 

atatcttccc 

ttttccttta 

tgcaattttc 

agtatagtga 



cagctccacg 
catttctgga 
gaaattctcc 
gagatattat 
agggaatgac 
aaaacatcca 
gatttttgag 
gattcaagt t 
tcaagaagtt 
gtattgtttt 
tcgtcaata t 
ttgacatatc 
aacgactgat 
caccatcatc 
ctagattgaa 
cttttctttc 
ctgttgctga 
agcattttca 
tggacaatat 
caatactgtt 
agatctgtt t 
ttgtgaaaag 
gctatgttag 
ccaagaatcc 
attgtatcat 
ttctgaagcc 
cctttaccac 
ttaatgaaaa 
ttgatgctct 
atgctagaag 



ataccgtcag 
cgttttcaca 
tcctgttcct 
ctacttcttt 
ttttatcaac 
cgacgacttt 
acgcgagacg 
tact tcaaag 
ggacggtgtt 
ctacgttgaa 
cacctctgat 
tttcggatac 
atccaatggc 
cgtgtcaact 
ttggaccaac 
ctcgatagaa 
aatggcagta 
aaccctttac 
tctaaaaaat 
gtggatgatt 
gagggagatt 
gatcgtatca 
attgtgtgta 
gtggctcaaa 
taacaaatat 
t tctgcgaaa 
aagagtaggt 
taatgaaaac 
tgctaaaaaa 
gcgagcgatg 



gtgcttgaag 
gaaagggaat 
tcaagtagtt 
atcaacaaca 
gcattgacaa 
ttgttcgtaa 
cacggcatca 
gaacaggctg 
ttggctagcc 
aaatgttcga 
tacgtggctg 
aagtattttg 
aattttatta 
gacttcagag 
gaagaaaggg 
gaagaaatgg 
gagtttgttg 
agtatatttg 
gttgcaataa 
aacagtaggc 
atggtaagaa 
cataaaaacg 
aaagtctcgt 
cacacttatt 
tcccatacta 
ggcatgtcac 
attgaaattc 
gtattccatg 
tacaagaata 
gtgcaacgcc 



aagtacttga 
tggcccttct 
tcaagaaatt 
ataccaagtc 
ccgtctgcaa 
aatattctct 
agtttggcac 
aggaagaaag 
tgaaaaagtc 
gtgttattag 
aaatgaaaaa 
tggacgaatc 
tcgggagacc 
aagaaatgaa 
ctagtgctta 
tagaagagta 
atgtgtttat 
acacgatgcc 
acgaagcagt 
cctttgaaga 
aaaagaccga 
ttgtgataac 
gtttcatgga 
ttgatgaaaa 
gcaaactagt 
ctctaatggt 
taacaacaaa 
ctgcagctgt 
taatatacga 
cctaatga 



aaacattgaa 
gaatgacata 
agacaataaa 
tgattcttcc 
aaccttcaga 
tctggtctct 
tggtgatgac 
agagaaggtt 
tactagttct 
attcaggctc 
atctgcacca 
tccttgtgtt 
tttttcttgc 
catggacgct 
caggagtgtg 
ctgtgagact 
agagaaggct 
aaaattctct 
gggtagtggt 
gattgattac 
caaactgtgc 
agaccctcat 
agatcttgaa 
ggggaatact 
caaacaagaa 
ttgcgctgca 
cagcttggcc 
tgcaacctcg 
tttcgataga 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1788 



<210> 101 
<211> 586 
<212> PRT 
<213> SHRIMP 



<400> 101 



Met 


Ala 


Glu 


Ala 


Ala 


Pro 


Arg 


Tyr 


Arg 


Gin 


Val 


Leu 


Glu 


Glu 


Val 


Leu 


1 








5 










10 










15 




Glu 


Asn 


lie 


Glu 


Pro 


Tyr 


Met 


Ser 


Phe 


Leu 


Asp 


Val 


Phe 


Thr 


Glu 


Arg 








20 










25 










30 




Glu 


Leu 


Ala 


Leu 


Leu 


Asn 


Asp 


lie 


He 


Thr 


Ser 


Arg 


Asn 


Ser 


Pro 


Pro 






35 










40 










4 5" 








Val 


Pro 


Ser 


Ser 


Ser 


Phe 


Lys 


Lys 


Leu 


Asp 


Asn 


Lys 


Glu 


Glu 


Phe 


Arg 




50 










55 










60 








Asp 


lie 


lie 


Tyr 


Phe 


Phe 


He 


Asn 


Asn 


Asn 


Thr 


Lys 


Ser 


Asp 


Ser 


Ser 


65 










70 










75 










80 


Pro 


lie 


Cys 


Glu 


Gly 


Met 


Thr 


Phe 


He 


Asn 


Ala 


Leu 


Thr 


Thr 


Val 


Cys 



wo 01/38351 

184 











85 










90 


Lys 


Thr 


Phe 


Arg 


Gly 


Glu 


Asn 


He 


His 


Asp 








100 










105 




Tyr 


Ser 


Leu 


Leu 


Val 


Ser 


Met 


Asp 


Asn 


Gly 






115 










120 






His 


Giy 


He 


Lys 


Phe 


Gly 


Thr 


Gly 


Asp 


Asp 




130 










135 








Phe 


Thr 


Ser 


Lys 


Glu 


Gin 


Ala 


Glu 


Glu 


Glu 


145 










150 










Arg 


He 


Lys 


Lys 


Leu 


Asp 


Gly 


Val 


Leu 


Ala 










165 










170 


Ser 


Ser 


Ala 


Arg 


Ser 


Gly 


He 


Val 


Phe 


Tyr 








180 










185 




Val 


He 


Arg 


Phe 


Arg 


Leu 


Phe 


Ser 


Arg 


He 






195 










200 






Tyr 


Val 


Ala 


Glu 


Met 


Lys 


Lys 


Ser 


Ala 


Pro 




210 










215 








Ser 


Phe 


Gly 


Tyr 


Lys 


Tyr 


Phe 


Val 


Asp 


Glu 


225 










230 










Ala 


Lys 


Arg 


Leu 


He 


Ser 


Asn 


Gly 


Asn 


Phe 










245 










250 


Ser 


Cys 


Leu 


Glu 


Thr 


Ser 


Pro 


Ser 


Ser 


Val 








260 










265 




Glu 


Met 


Asn 


Met 


Asp 


Ala 


Arg 


Ser 


He 


Ala 






275 










280 






Glu 


Glu 


Arg 


Ala 


Ser 


Ala 


Tyr 


Arg 


Ser 


Val 




290 










295 








Ser 


Ser 


He 


Glu 


Glu 


Glu 


Met 


Val 


Glu 


Glu 


305 










310 










Lys 


Thr 


Val 


Ala 


Glu 


Met 


Ala 


Val 


Glu 


Phe 










325 










330 


Lys 


Ala 


Glu 


Thr 


He 


Gin 


His 


Phe 


Gin 


Thr 








340 










345 




Pro 


Lys 


Phe 


Ser 


Ala 


Glu 


Met 


Met 


Asp 


Asn 






355 










360 






He 


Asn 


Glu 


Ala 


Val 


Gly 


Ser 


Gly 


Leu 


Cys 




370 










375 








Met 


He 


Asn 


Ser 


Arg 


Pro 


Phe 


Glu 


Glu 


He 


385 










390 










He 


Cys 


Leu 


Arg 


Glu 


He 


Met 


Val 


Arg 


Lys 










405 










410 


Asp 


Asn 


Leu 


He 


Val 


Lys 


Arg 


He 


Val 


Ser 








420 










425 




Thr 


Asp 


Phe 


Val 


Lys 


Gly 


Tyr 


Val 


Arg 


Leu 






435 










440 






Phe 


Met 


Glu 


Asp 


Leu 


Glu 


Ala 


Phe 


Leu 


Thr 




450 










455 








His 


Thr 


Tyr 


Phe 


Asp 


Glu 


Lys 


Gly 


Asn 


Thr 


465 










470 










He 


Asn 


Lys 


Tyr 


Ser 


His 


Thr 


Ser 


Lys 


Leu 










485 










4 90 


Asn 


He 


Leu 


Lys 


Pro 


Ser 


Ala 


Lys 


Gly 


Met 








500 










505 




Ala 


Ala 


He 


Ser 


Ser 


Pro 


Phe 


Thr 


Thr 


Arg 






515 










520 






Thr 


Thr 


Asn 


Ser 


Leu 


Ala 


Phe 


Ser 


Phe 


He 




530 










535 








Val 


Phe 


His 


Ala 


Ala 


Ala 


Val 


Ala 


Thr 


Ser 


545 










550 










Lys 


Lys 


Tyr 


Lys 


Asn 


He 


He 


Tyr 


Asp 


Phe 
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95 



Asp 


Phe 


Leu 


Phe 


Val 


Lys 








110 






Phe 


' Leu 


Arg 


Arg 


Glu 


Thr 






125 








Ser 


Arg 


Thr 


Gly 


Phe 


Lys 




140 










Arg 


Glu 


Lys 


Val 


Met 


Arg 


155 










160 


Ser 


Leu 


Lys 


Lys 


Ser 


Thr 










175 




Val 


Glu 


Lys 


Cys 


Ser 


Ser 








190 






Val 


Asn 


He 


Thr 


Ser 


Asp 






205 








Leu 


Glu 


Pro 


Phe 


Asp 


He 




220 










Ser 


Pro 


Cys 


Val 


Thr 


Lys 


235 










240 


He 


He 


Gly 


Arg 


Pro 


Phe 










255 




Ser 


Thr 


Asp 


Phe 


Arg 


Glu 








270 






Arg 


Leu 


Asn 


Trp 


Thr 


Asn 






285 








He 


He 


Lys 


Ser 


Phe 


Leu 




300 










Tyr 


Cys 


Glu 


Thr 


Thr 


Thr 


315 










320 


Val 


Asp 


Val 


Phe 


He 


Glu 










335 




Lys 


He 


Phe 


Asp 


Thr 


Met 








350 






He 


Leu 


Lys 


Asn 


Val 


Ala 






365 








Gly 


Ala 


He 


Leu 


Leu 


Trp 




380 










Asp 


Tyr 


Asn 


Tyr 


Phe 


Lys 


395 










400 


Lys 


Thr 


Asp 


Lys 


Leu 


Cys 










415 




His 


Lys 


Asn 


Val 


Val 


He 








430 






Cys 


Val 


Lys 


Val 


Ser 


Cys 






445 








Lys 


Asn 


Pro 


Trp 


Leu 


Lys 




460 










Leu 


Leu 


Cys 


Tyr 


Cys 


He 


475 










480 


Val 


Lys 


Gin 


Glu 


Lys 


Leu 










4 95 




Ser 


Pro 


Leu 


Met 


Val 


Cys 








510 






Val 


Gly 


He 


Glu 


He 


Leu 






525 








Asn 


Glu 


Asn 


Asn 


Glu 


Asn 




540 










Cys 


Asn 


Phe 


Leu 


Asp 


Ala 


555 










560 


Asp 


Arg 


Ser 


He 


Val 


Asn 



575 
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Ala Arg Arg Arg Ala Met Val Gin Arg Pro 
580 585 



<210> 102 

<211> 1197 

<212> DNA 

<213> SHRIMP 



<400> 102 

atggccggaa 

gggtgtatcg 

tcaatcaaaa 

aaggatatgg 

gaaaatgatg 

gaaataaagt 

ggatgtcttc 

tttgataaga 

actaaaatag 

gagggcaaaa 

actcctttac 

aatgtaaaaa 

tgctgggttc 

catcaggcag 

aatgaagcag 

ttggagttgc 

agaacaagga 

cttcacaccc 

gatagggtca 

aggattaatc 



ctgacatcat 
tcagcaagaa 
cttcctcttc 
ttaagaagtg 
ctctgagagc 
atggggaagg 
ttgcaaagaa 
atgggttcat 
ctaagaaggt 
taccttgcca 
cccccactcc 
ataagccagt 
acaaactctt 
gaaggcgtga 
aagacactga 
tagacacggc 
ttgtgaggga 
cctttgatgc 
gagacggcag 
agctccagca 



cagcagcagc 
aggaaagaca 
atcagaaatg 
tgccagctgt 
atctatagaa 
ggagataggg 
tgaaaagtcc 
gatgttgtct 
ttttggaggc 
ggagttccat 
tgctcctgtc 
ttttgtgcct 
ctctgatgac 
cgccctcatg 
gaggtttgtg 
taatatgccc 
ctatgatgct 
accaaatgtt 
acgcaacact 
ccagtttctg 



agcagcggca 
ataaaaggga 
atgaagaaac 
aagaaggttg 
agtacctgct 
gatg tt t tga 
ttttataagt 
gaaggtatga 
tgtaaggccg 
aaacctagtt 
aaggtgttgc 
gacttggctg 
cctgaggaaa 
gaagattatg 
tctaacgcac 
cctcccgcat 
agtcctgtac 
aatttgaacc 
tcacggagga 
tactatagct 



gcagcagcag 
agaatatcgt 
a taagaaaag 
attatagctc 
cagcactcaa 
gtgcaattag 
ttttcttgag 
aaaggattga 
cccctctaaa 
catataaggg 
cccctcttct 
taggggaggc 
gaaaacggct 
gtgttattcc 
ttgaatacca 
ccacacctgt 
cttctcctta 
caggttcagg 
catctgccgt 
ctgatagtga 



caagaaggga 
ctttaagaca 
aatggaaata 
atcaaccttg 
taggtttcct 
gctaatggcg 
gggttttcag 
gaaaatgcat 
agaggacagg 
tgaatatacc 
tccctataag 
taagaaacca 
ctttgagagg 
taataatgac 
ggcacagatg 
tagaagggga 
ttcttcacct 
taggatggtt 
catggccagg 
tttttaa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1197 



<210> 103 

<211> 396 

<212> PRT 

<213> SHRIMP 



<400> 103 
Met Ala Gly Thr 
1 

Ser Lys Lys Gly 
20 

Gly Lys Asn lie 
35 

Glu Met Met Lys 
50 

Lys Lys Cys Ala 
65 

Glu Asn Asp Ala 

Asn Arg Phe Pro 
100 

Leu Ser Ala lie 
115 

Lys Ser Phe Tyr 
130 

Gly Phe Met Met 
145 

Thr Lys lie Ala 

Lys Glu Asp Arg 
180 



Asp 

5 
Gly 

Val 

Lys 

Ser 

Leu 
85 

Glu 

Arg 

Lys 

Leu 

Lys 
165 
Glu 



lie lie 

Cys lie 

Phe Lys 

His Lys 

55 
Cys Lys 
70 

Arg Ala 

lie Lys 

Leu Met 

Phe Phe 
135 
Ser Glu 
150 

Lys Val 



Ser 

Val 

Thr 
40 

Lys 

Lys 

Ser 

Tyr 

Ala 
120 
Leu 



Gly 
Phe 
Gly Lys lie 



Ser 


Ser 


Ser 


Ser 


Gly 


Ser 


Ser 


Ser 




10 










15 




Ser 


Lys 


Lys 


Gly 


Lys 


Thr 


He 


Lys 


25 










30 






Ser 


He 


Lys 


Thr 


Ser 


Ser 


Ser 


Ser 










45 








Arg 


Met 


Glu 


He 


Lys 


Asp 


Met 


Val 








60 










Val 


Asp 


Tyr 


Ser 


Ser 


Ser 


Thr 


Leu 






75 










80 


He 


Glu 


Ser 


Thr 


Cys 


Ser 


Ala 


Leu 




90 










95 




Gly 


Glu 


Gly 


Glu 


He 


Gly 


Asp 


Val 


105 










110 






Gly 


Cys 


Leu 


Leu 


Ala 


Lys 


Asn 


Glu 










125 








Arg 


Gly 


Phe 


Gin 


Phe 


Asp 


Lys 


Asn 








140 










Met 


Lys 


Arg 


He 


Glu 


Lys 


Met 


His 






155 










160 


Gly 


Gly 


Cys 


Lys 


Ala 


Ala 


Pro 


Leu 




170 










175 




Pro 


Cys 


Gin 


Glu 


Phe 


His 


Lys 


Pro 


185 










190 
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Ser 


Ser 


Tyr 
195 


Lys 


Gly 


Glu 


Tyr 


Thr 
200 


Thr 


Pro 


Leu 


Pro 


Pro 
205 


Thr 


Pro 


Ala 


Pro 


Val 
210 


Lys 


Val 


Leu 


Pro 


Pro 
215 


Leu 


Leu 


Pro 


Tyr 


Lys 
220 


Asn 


Val 


Lys 


Asn 


Lys 


Pro 


Val 


Phe 


Val 


Pro 


Asp 


Leu 


Ala 


Val 


Gly 


Glu 


Ala 


Lys 


Lys 


Pro 


225 










230 










235 










240 


Cys 


Trp 


Val 


His 


Lys 
245 


Leu 


Phe 


Ser 


Asp 


Asp 
250 


Pro 


Glu 


Glu 


Arg 


Lys 
255 


Arg 


Leu 


Phe 


Glu 


Arg 
260 


His 


Gin 


Ala 


Gly 


Arg 
265 


Arg 


Asp 


Ala 


Leu 


Met 
270 


Glu 


Asp 


Tyr 


Gly 


Val 
275 


lie 


Pro 


Asn 


Asn 


Asp 
280 


Asn 


Glu 


Ala 


Glu 


Asp 
285 


Thr 


Glu 


Arg 


Phe 


Val 
290 


Ser 


Asn 


Ala 


Leu 


Glu 

295 


Tyr 


Gin 


Ala 


Gin 


Met 
300 


Leu 


Glu 


Leu 


Leu 


Asp 


Thr 


Ala 


Asn 


Met 


Pro 


Pro 


Pro 


Ala 


Ser 


Thr 


Pro 


Val 


Arg 


Arg 


Gly 


305 










310 










315 










320 


Arg 


Thr 


Arg 


He 


Val 
325 


Arg 


Asp 


Tyr 


Asp 


Asp 
330 


Val 


Pro 


Ser 


Pro 


Tyr 
335 


Ser 


Ser 


Pro 


Leu 


His 
340 


Thr 


Pro 


Phe 


Asp 


Ala 

345 


Pro 


Asn 


Val 


Asn 


Leu 
350 


Asn 


Pro 


Gly 


Ser 


Gly 
355 


Arg 


Met 


Val 


Asp 


Arg 
360 


Val 


Arg 


Asp 


Gly 


Arg 
365 


Arg 


Asn 


Thr 


Ser 


Arg 
370 


Arg 


Thr 


Ser 


Ala 


Val 
375 


Met 


Ala 


Arg 


Arg 


He 
380 


Asn 


Gin 


Leu 


Gin 


His 


Gin 


Phe 


Leu 


Tyr 


Tyr 


Ser 


Ser 


Asp 


Ser 


Asp 


Phe 











385 390 395 



<210> 104 
<211> 3796 
<212> DNA 
<213> SHRIMP 



<400> 104 

a tggcgtcct 

caagaaaaac 

ccatctcaag 

cctcctccac 

gaacggctaa 

acaaacgagc 

aaggataatt 

a taggcgcac 

ctcgata taa 

agggaccatg 

accaaa tgcg 

aaactggtgg 

agaggcgcta 

aatgaaaaca 

atatcgcacg 

ggcaacactt 

atagggtgga 

gacgacaat t 

aaatt taaaa 

aagactcaag 

tttgcagaaa 

acaaaaagca 

gcctt ttatg 

ttctttaacc 

t tgtccacat 

tccgtctcag 

gtgggtgtga 



cggggggatt 
aagagaaaaa 
ctccagatcc 
ctaatcctct 
agacgtcaaa 
gtgttgatgc 
acttccaggt 
acgtgaaaca 
aaaataagtg 
agaaccttat 
tgggggcaga 
taaaaaaact 
ccagaaattt 
aggacattcc 
tcttttccac 
tttatcaacg 
cagaaagtga 
tattcgtgct 
acgtcctcta 
agacgctaat 
tcgacaagtg 
ctgtcataca 
ctctaaataa 
ttttctcctt 
tgacgttcgt 
ccaagcggct 
gggattttgg 



ctttacagga 
taaaccaact 
agtcccagac 
ccctcctcct 
aatacgttta 
gttgaaagag 
tttaaagtgc 
ggtggccaaa 
ctctttagtg 
actacaaatt 
aatatgtgtt 
ccccaatgca 
actggagaat 
tccttcagaa 
tctacacaga 
aaaaccaaca 
agcatcaaaa 
accccattct 
taaaaatagt 
aaatccccag 
tttggatgtt 
atatagagga 
ggcaaaacat 
catgtattgt 
ctatcaacac 
catggatatc 
ttcaccttca 



atagatgacc 
caagcaccag 
ccagttccta 
cctcctcctc 
aacaaagctc 
aaccaagcat 
tcgataacac 
agtagcgaaa 
tacaacgaaa 
gccgtccagt 
aaaggcaacg 
ccatcctcat 
aatttcaaca 
cgagccaacc 
ctggacacta 
ttcgataata 
caaaccacta 
ttcaataatt 
accgcacatc 
attgattcgg 
cttttggcca 
aagtttagaa 
tctcgcgcag 
catggtccgt 
atgtttttcc 
gattccgccc 
aaaacaagtc 



tt ttcaagac 
aaacagaacc 
aaacaccaac 
ctcctaagcc 
ttagtgatat 
taaatacaga 
cttctgtacc 
tcgaactggc 
atgagtcgtt 
tattctctag 
aaaaaaacaa 
cttcaactgt 
agggagaaaa 
tggacacgac 
aaaggaagct 
aattcaggtg 
aatcgctaga 
tggcagacca 
ccggcaaacg 
cgaaagagta 
tagggaagaa 
ggtatttaat 
tatcccctct 
ttctcca ttc 
ccatgggcac 
taatgaaagg 
tccatacaag 



agtgattcaa 
aaaaccaggg 
caatt tctgt 
ctcaagagaa 
tgttgaagcc 
atatgacaag 
aacagctatt 
eg tgaacgaa 
aaaatttttc 
gcacgataac 
gtttgttaac 
gctagaaatt 
taacactgtc 
caaggcagaa 
tttctttaaa 
gacagaagtt 
caagccaacg 
cttacgtttg 
aaattactac 
caagatggtc 
tgacaaatac 
attctgctac 
accatttaat 
cgccagtttt 
agccgcccca 
aggaaagggg 
aacattggtg 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 
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tctttcttag 
gaagtgcgtg 
gattcagtca 
aaagtatcac 
aaggtgaggg 
gaagaggcgg 
aaagccctcg 
tacaatttag 
gccgaaggag 
gacaaatact 
gaacaaaaca 
tacgaagaag 
ttgaatagag 
atactcgata 
gccgtgtggt 
ggcaactacg 
aattttggaa 
tttcacagag 
acatccctat 
cgaacccacg 
acaagagg tg 
gggcacacga 
caagagaatg 
ttgggaagag 
gctgcagcca 
gactcttcta 
tcggccgcca 
gcaccagcag 
ttcttttctg 
tgtcccaata 
gaaagtgaaa 
gaaatggtat 
aatcaagacg 
gatatggttg 
attgcagccc 
gaagaccctg 
tactgacgag 



gttttgctga 
tatctccagc 
tctttatata 
tcgaatcggc 
ccgccaagag 
gcctgacact 
gattacctgt 
gggattttgt 
aaaccgcagg 
acggcgaagg 
caccgtatga 
aaaaggatgt 
acgtggagga 
aaactaacca 
gcaaaaggaa 
aatggcttgg 
tattaggtga 
acagtgatag 
ctgatcgaaa 
taggaagagt 
ataaagacat 
attcagctaa 
gattgtgggg 
gacgtttagg 
ttgcaatttc 
aagtgagaaa 
gagaaaaggc 
ctggttcatc 
acccagacaa 
caggat ttct 
cctctatagt 
gcccttcaac 
ctgtactgac 
aagacgagga 
gtatcagtaa 
aattgcaaag 
atctaa 



aatggctatg 
tctccaacaa 
tttcaccttt 
gcttcgcctc 
atgccgaata 
ctcttatgcc 
ccctaagata 
aggcgtggaa 
atttctcggc 
agccttttca 
cacatcttca 
tccacagcag 
atatttaatg 
aaaagaacgt 
cgtactgaaa 
tgctaaaatg 
cttgaaaata 
attaccctca 
acaattggcc 
gactgcaaca 
gtttgccgcc 
ttttgttcca 
gtatactcgc 
gggcctgaat 
ttctgcctta 
aatagcctcc 
gagagaagct 
cagaaacagt 
gcgccagaag 
tcatgctgct 
gagattacgt 
agctcccgaa 
gttggatgat 
agcgcgtgaa 
agtgtgtgag 
tgtggatgaa 



ggaacaatga 
aggatatcta 
gttttattcc 
atcatggggc 
gaagcagcgg 
catctattgg 
aaccctctca 
caacttctaa 
atgtttgata 
gacgtggttg 
gcgttgatga 
gaagaaaatt 
gcttctccta 
ttcatgtctg 
aaggattgga 
tgcaaccatt 
accaataaac 
gttgcagatc 
cttgttcact 
tcatgggctg 
ctatcttcat 
tattttagta 
agaacctctg 
aaggtagggg 
gatatggggg 
acctgcttaa 
agtattaaac 
aacaggtttc 
cttataaagg 
gttcctgatt 
ttgagactga 
gctaataaga 
gaagataaca 
agattacgcc 
cggaaaaatc 
caattgatac 



cggctctctt 
aatccctaga 
acagattcag 
agacgcacgc 
aaatggaagg 
gtcttcctta 
tgacagcatc 
aggctaagag 
atctagtgaa 
aaaatgtaaa 
cacctatccc 
ctacacaaca 
tgaagatggt 
ttggggatat 
acgaatacgc 
tacttttagc 
ttgacacaaa 
agaaaaaatt 
cgtgcgttaa 
tcgatgcgct 
cgctggatat 
gaaattacct 
aaaaattggc 
tggctaaaac 
aagtagaagc 
atgttaatgc 
gtcttcttct 
tcctcaaaga 
gtgaagcagt 
ttgttattga 
ttaaacctga 
agaggaaatt 
tcgttaaata 
accaggacaa 
caaagaaaaa 
gggaactggc 



atctggtgta 
aagatggtgt 
tggtgcgaaa 
ccacacaaat 
tgtggaagaa 
ctctatacaa 
ttcttctcaa 
agagtttcca 
agattctatt 
acaaggcatg 
taaagcattc 
aagatataga 
gtttgtgtct 
tgcccttctg 
tatcgctaaa 
tgatttagtg 
taccgacacc 
tataaaaaac 
cgtgagcacc 
tcgtacctat 
gtaccatctt 
atgtaacgaa 
caaagaagaa 
agaactggct 
tgtaatggac 
agccaaggtc 
ggccactaa t 
tttgtggggg 
ttctgtacta 
gtattccttc 
aaaacaagac 
agtaaggaa t 
caacaaatat 
acaatcggtt 
acgtcgttta 
tgccattgcc 



1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3796 



<210> 105 
<211> 1255 
<212> PRT 
<213> SHRIMP 



<400> 105 



Met 


Ala 


Ser 


Ser 


Gly 


Gly 


Phe 


Phe 


Thr 


Gly 


He 


Asp 


Asp 


Leu 


Phe 


Lys 


1 








5 










10 










15 




Thr 


Val 


lie 


Gin 


Gin 


Glu 


Lys 


Gin 


Glu 


Lys 


Asn 


Lys 


Pro 


Thr 


Gin 


Ala 








20 










25 










30 






Pro 


Glu 


Thr 


Glu 


Pro 


Lys 


Pro 


Gly 


Pro 


Ser 


Gin 


Ala 


Pro 


Asp 


Pro 


Val 






35 










40 










45 






Pro 


Asp 


Pro 


Val 


Pro 


Lys 


Thr 


Pro 


Thr 


Asn 


Phe 


Cys 


Pro 


Pro 


Pro 


Pro 




50 










55 










60 










Asn 


Pro 


Leu 


Pro 


Pro 


Pro 


Pro 


Pro 


Pro 


Pro 


Pro 


Lys 


Pro 


Ser 


Arg 


Glu 


65 










70 










75 










80 


Glu 


Arg 


Leu 


Lys 


Thr 


Ser 


Lys 


He 


Arg 


Leu 


Asn 


Lys 


Ala 


Leu 


Ser 


Asp 










85 










90 










95 




lie 


Val 


Glu 


Ala 


Thr 


Asn 


Glu 


Arg 


Val 


Asp 


Ala 


Leu 


Lys 


Glu 


Asn 


Gin 








100 










105 










110 






Ala 


Leu 


Asn 


Thr 


Glu 


Tyr 


Asp 


Lys 


Lys 


Asp 


Asn 


Tyr 


Phe 


Gin 


Val 


Leu 






115 










120 










125 








Lys 


Cys 


Ser 


He 


Thr 


Pro 


Ser 


Val 


Pro 


Thr 


Ala 


lie 


He 


Gly 


Ala 


His 
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130 










135 










140 










Val 


Lys 


Gin 


Val 


Ala 


Lys 


Ser 


Ser 


Glu 


He 


Glu 


Leu 


Ala 


Val 


Asn 


Glu 


145 










150 










155 










160 


Leu 


Asp 


lie 


Lys 


Asn 


Lys 


Cys 


Ser 


Leu 


Val 


Tyr 


Asn 


Glu 


Asn 


Glu 


Ser 










165 










170 










175 




Leu 


Lys 


Phe 


Phe 


Arg 


Asp 


His 


Glu 


Asn 


Leu 


He 


Leu 


Gin 


He 


Ala 


Val 








180 










185 










190 






Gin 


Leu 


Phe 


Ser 


Arg 


His 


Asp 


Asn 


Thr 


Lys 


Cys 


Val 


Gly 


Ala 


Glu 


He 






195 










200 










205 








Cys 


Val 


Lys 


Gly 


Asn 


Glu 


Lys 


Asn 


Lys 


Phe 


Val 


Asn 


Lys 


Leu 


Val 


Val 




210 










215 










220 










Lys 


Lys 


Leu 


Pro 


Asn 


Ala 


Pro 


Ser 


Ser 


Ser 


Ser 


Thr 


Val 


Leu 


Glu 


He 


225 










230 










235 










240 


Arg 


Gly 


Ala 


Thr 


Arg 


Asn 


Leu 


Leu 


Glu 


Asn 


Asn 


Phe 


Asn 


Lys 


Gly 


Glu 










245 










250 










255 




Asn 


Asn 


Thr 


Val 


Asn 


Glu 


Asn 


Lys 


Asp 


He 


Pro 


Pro 


Ser 


Ala 


Asn 


Leu 








260 










265 










270 






Asp 


Thr 


Thr 


Lys 


Ala 


Glu 


lie 


Ser 


His 


Val 


Phe 


Ser 


Thr 


Leu 


His 


Arg 






275 










280 










285 








Leu 


Asp 


Thr 


Lys 


Arg 


Lys 


Leu 


Phe 


Phe 


Lys 


Gly 


Asn 


Thr 


Phe 


Tyr 


Gin 




290 










295 










300 










Arg 


Lys 


Pro 


Thr 


Phe 


Asp 


Asn 


Lys 


Phe 


Arg 


Trp 


Thr 


Glu 


Val 


He 


Gly 


305 










310 










315 










320 


Trp 


Thr 


Glu 


Ser 


Glu 


Ala 


Ser 


Lys 


Gin 


Thr 


Thr 


Lys 


Ser 


Leu 


Asp 


Lys 










325 










330 










335 




Pro 


Thr 


Asp 


Asp 


Asn 


Leu 


Phe 


Val 


Leu 


Pro 


His 


Ser 


Phe 


Asn 


Asn 


Leu 








340 










345 










350 






Ala 


Asp 


His 


Leu 


Arg 


Leu 


Lys 


Phe 


Lys 


Asn 


Val 


Leu 


Tyr 


Lys 


Asn 


Ser 






355 










360 










365 








Thr 


Ala 


His 


Pro 


Gly 


Lys 


Arg 


Asn 


Tyr 


Tyr 


Lys 


Thr 


Gin 


Glu 


Thr 


Leu 




370 










375 










380 










lie 


Asn 


Pro 


Gin 


lie 


Asp 


Ser 


Ala 


Lys 


Glu 


Tyr 


Lys 


Met 


Val 


Phe 


Ala 


385 










390 










395 










400 


Glu 


lie 


Asp 


Lys 


Cys 


Leu 


Asp 


Val 


Leu 


Leu 


Ala 


He 


Gly 


Lys 


Asn 


Asp 










405 










410 










415 




Lys 


Tyr 


Thr 


Lys 


Ser 


Thr 


Val 


lie 


Gin 


Tyr 


Arg 


Gly 


Lys 


Phe 


Arg 


Arg 








420 










425 










430 






Tyr 


Leu 


lie 


Phe 


Cys 


Tyr 


Ala 


Phe 


Tyr 


Ala 


Leu 


Asn 


Lys 


Ala 


Lys 


His 






435 










440 










445 








Ser 


Arg 


Ala 


Val 


Ser 


Pro 


Leu 


Pro 


Phe 


Asn 


Phe 


Phe 


Asn 


Leu 


Phe 


Ser 




450 










455 










460 










Phe 


Met 


Tyr 


Cys 


His 


Gly 


Pro 


Phe 


Leu 


His 


Ser 


Ala 


Ser 


Phe 


Leu 


Ser 


A65 










470 










475 










480 


Thr 


Leu 


Thr 


Phe 


Val 


Tyr 


Gin 


His 


Met 


Phe 


Phe 


Pro 


Met 


Gly 


Thr 


Ala 










485 










490 










495 




Ala 


Pro 


Ser 


Val 


Ser 


Ala 


Lys 


Arg 


Leu 


Met 


Asp 


He 


Asp 


Ser 


Ala 


Leu 








500 










505 










510 






Met 


Lys 


Gly 


Gly 


Lys 


Gly 


Val 


Gly 


Val 


Arg 


Asp 


Phe 


Gly 


Ser 


Pro 


Ser 






515 










520 










525 








Lys 


Thr 


Ser 


Leu 


His 


Thr 


Arg 


Thr 


Leu 


Val 


Ser 


Phe 


Leu 


Gly 


Phe 


Ala 




530 










535 










540 










Glu 


Met 


Ala 


Met 


Gly 


Thr 


Met 


Thr 


Ala 


Leu 


Leu 


Ser 


Gly 


Val 


Glu 


Val 


545 










550 










555 










560 


Arg 


Val 


Ser 


Pro 


Ala 


Leu 


Gin 


Gin 


Arg 


He 


Ser 


Lys 


Ser 


Leu 


Glu 


Arg 










565 










570 










575 




Trp 


Cys 


Asp 


Ser 


Val 


lie 


Phe 


He 


Tyr 


Phe 


Thr 


Phe 


Val 


Leu 


Phe 


His 








580 










585 










590 






Arg 


Phe 


Ser 


Gly 


Ala 


Lys 


Lys 


Val 


Ser 


Leu 


Glu 


Ser 


Ala 


Leu 


Arg 


Leu 






595 










600 










605 








lie 


Met 


Gly 


Gin 


Thr 


His 


Ala 


His 


Thr 


Asn 


Lys 


Val 


Arg 


Ala 


Ala 


Lys 




610 










615 










620 
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Arg Cys Arg lie Glu Ala Ala Glu Met Glu Gly Val Glu Glu Glu Glu 
625 630 635 640 

Ala Gly Leu Thr Leu Ser Tyr Ala His Leu Leu Gly Leu Pro Tyr Ser 

645 650 655 

lie Gin Lys Ala Leu Gly Leu Pro Val Pro Lys lie Asn Pro Leu Met 

660 665 670 

Thr Ala Ser Ser Ser Gin Tyr Asn Leu Gly Asp Phe Val Gly Val Glu 

675 680 685 

Gin Leu Leu Lys Ala Lys Arg Glu Phe Pro Ala Glu Gly Glu Thr Ala 

690 695 700 

Gly Phe Leu Gly Met Phe Asp Asn Leu Val Lys Asp Ser lie Asp Lys 
705 710 715 720 

Tyr Tyr Gly Glu Gly Ala Phe Ser Asp Val Val Glu Asn Val Lys Gin 

725 730 735 

Gly Met Glu Gin Asn Thr Pro Tyr Asp Thr Ser Ser Ala Leu Met Thr 

740 745 750 

Pro lie Pro Lys Ala Phe Tyr Glu Glu Glu Lys Asp Val Pro Gin Gin 

755 760 765 

Glu Glu Asn Ser Thr Gin Gin Arg Tyr Arg Leu Asn Arg Asp Val Glu 

770 775 780 

Glu Tyr Leu Met Asp Met Lys Met Val Phe Val Ser lie Leu Asp Lys 
785 790 795 800 

Thr Asn Gin Lys Glu Arg Phe Met Ser Val Gly Asp lie Ala Leu Leu 

805 810 815 

Ala Val Trp Cys Lys Arg Asn Val Leu Lys Lys Asp Trp Asn Glu Tyr 

820 825 830 

Ala lie Ala Lys Gly Asn Tyr Glu Trp Leu Gly Ala Lys Met Cys Asn 

835 840 845 

His Leu Leu Leu Ala Asp Leu Val Asn Phe Gly lie Leu Gly Asp Leu 

850 855 860 

Lys lie Thr Asn Lys Leu Asp Thr Asn Thr Asp Thr Phe His Arg Asp 
865 870 875 880 

Ser Asp Arg Leu Pro Ser Val Ala Asp Gin Lys Lys Phe lie Lys Asn 

885 890 895 

Thr Ser Leu Ser Asp Arg Lys Gin Leu Ala Leu Val His Ser Cys Val 

900 905 910 

Asn Val Ser Thr Arg Thr His Val Gly Arg Val Thr Ala Thr Ser Trp 

915 920 925 

Ala Val Asp Ala Leu Arg Thr Tyr Thr Arg Gly Asp Lys Asp Met Phe 

930 935 940 

Ala Ala Leu Ser Ser Ser Leu Asp Met Tyr His Leu Gly His Thr Asn 
945 950 955 960 

Ser Ala Asn Phe Val Pro Tyr Phe Ser Arg Asn Tyr Leu Cys Asn Glu 

965 970 975 

Gin Glu Asn Gly Leu Trp Gly Tyr Thr Arg Arg Thr Ser Glu Lys Leu 

980 985 990 

Ala Lys Glu Glu Leu Gly Arg Gly Arg Leu Gly Gin Lys Val Gly Val 

995 1000 1005 

Ala Lys Thr Glu Leu Ala Ala Ala Ala lie Ala lie Ser Ser Ala Leu 

1010 1015 1020 

Asp Met Gly Glu Val Glu Ala Val Met Asp Asp Ser Ser Lys Val Arg 
1025 1030 1035 1040 

Lys lie Ala Ser Thr Cys Leu Asn Val Asn Ala Ala Lys Val Ser Ala 

1045 1050 1055 

Ala Arg Glu Lys Ala Arg Glu Ala Ser lie Lys Arg Leu Leu Leu Ala 

1060 1065 1070 

Thr Asn Ala Pro Ala Ala Gly Ser Ser Arg Asn Ser Asn Arg Phe Leu 

1075 1080 1085 

Leu Lys Asp Leu Trp Gly Phe Phe Ser Asp Pro Asp Lys Arg Gin Lys 

1090 1095 1100 

Leu lie Lys Gly Glu Ala Val Ser Val Leu Cys Pro Asn Thr Gly Phe 
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1105 1110 1115 1120 

Leu His Ala Ala Val Pro Asp Phe Val lie Glu Tyr Ser Phe Glu Ser 

1125 1130 1135 

Glu Thr Ser lie Val Arg Leu Arg Leu Arg Leu He Lys Pro Glu Lys 

1140 1145 1150 

Gin Asp Glu Met Val Cys Pro Ser Thr Ala Pro Glu Ala Asn Lys Lys 

1155 1160 1165 

Arg Lys Leu Val Arg Asn Asn Gin Asp Ala Val Leu Thr Leu Asp Asp 

1170 1175 1180 

Glu Asp Asn He Val Lys Tyr Asn Lys Tyr Asp Met Val Glu Asp Glu 
1185 1190 1195 1200 

Glu Ala Arg Glu Arg Leu Arg His Gin Asp Lys Gin Ser Val He Ala 

1205 1210 1215 

Ala Arg He Ser Lys Val Cys Glu Arg Lys Asn Pro Lys Lys Lys Arg 

1220 1225 1230 

Arg Leu Glu Asp Pro Glu Leu Gin Ser Val Asp Glu Gin Leu He Arg 

1235 1240 1245 

Glu Leu Ala Ala He Ala Tyr 
1250 1255 



<210> 106 
<211> 1824 
<212> DNA 
<213> SHRIMP 



<400> 106 

atgagagacg 

tacgattgtt 

ggcgtggacg 

gataaatgca 

ctaactaaac 

agttgtaata 

aataatagt'a 

ggtgccgaat 

gagaaagaag 

ggctcggcag 

ggctcttgga 

gaagt tttag 

tccga tctgt 

gcaagaaat t 

gaagagaatg 

gt ttctgaga 

ga t tgttgtg 

caacccaaga 

aaaaccaatc 

cggaaaatga 

tccgctgctg 

aacaacccct 

tgccctcaaa 

gcagaaagtg 

actcaacaca 

gtccagcatg 

agaattgaaa 

cat tcggcta 

aggactgctt 

atacctttaa 

atgctagctg 



atacttttaa 
gttgcccatt 
ccacgtctgt 
taccctttag 
ccgatttagc 
acactcaagg 
aaaagtatga 
ctattcacaa 
ccttcgccga 
ccggaggagg 
cgtcgttgtt 
tcagtaacaa 
tatatgccct 
tgtgtgttat 
gacctttcga 
caaattcatc 
atgatgatga 
aacaaaccaa 
acgacaacat 
tggattcttt 
ctggtggtgg 
cttcagtgag 
tagaagaagt 
tagtcaaaca 
gaaaaaagta 
taggatttaa 
ttgttttttc 
ttctaaggct 
cggtgaaagg 
gcaacaagga 
gtttgggaaa 



ccaagaaact 
ggttaaccgc 
gcgaagccga 
gataacgtcc 
tgctgcagct 
caattttgat 
agttacatgc 
gaaacccatg 
aagaattgaa 
aggtatgtac 
gttccacaca 
gataaaacac 
atgctccacc 
tgaaggtggg 
ctccaaaaca 
ggcactggcc 
cgatgatgat 
gaaacaaaaa 
gttgatgaat 
gtcaggaaaa 
tgcttcatcc 
tcctactaac 
cactattttt 
ttctgttgtt 
tatcctgcct 
caaatttcgc 
cgaccaatct 
attgtcatac 
aatcgatttt 
aataagagaa 
atga 



gcggtgaaac 
gtggagcgcc 
ccggctcttt 
cttattgagg 
tttgatgtat 
gtctcttcaa 
ccgtcatgca 
tctcttct tg 
ctcaagaaat 
aaggacagct 
tctaaaaagg 
acatcaagat 
actaacaact 
gaatttttat 
gaccttcaat 
gcttcttctt 
gatgatgaag 
acaacaacat 
gtacttaaaa 
aagggccaac 
gacgttgttg 
aatagggata 
tcacaacaca 
attaatggaa 
cacgaaaata 
attttgactg 
gactctgtgg 
ataagagaaa 
gtcgtgaaat 
cggcaa ttat 



ttgtacgatg 
ttctaggatc 
atgaagaaga 
gtatactttt 
cagaaaagct 
tgacca tatg 
ctgtcgagaa 
ccttctttaa 
tgtacctctc 
cccaacaatc 
acaagactcg 
tgcagcctag 
ctgcatctta 
attttaaa ta 
cattagttaa 
cctct ttaga 
acgaaaaaac 
caacacttcc 
aaggagctgt 
actctaaaaa 
caggagaaaa 
gaaaagacta 
ggatgaacaa 
attgt ttaaa 
ttcttttttg 
gcgtttcttg 
tgttgagtaa 
actcattgaa 
cacaggacac 
gctcagcttc 



gtatacagag 
gttcggagga 
taagaaggga 
ggaaagggct 
ggtgtattgt 
gattgatggc 
aattagtgga 
caatctggta 
ct tactaacg 
ttccttcaac 
tttagaggct 
gtgcgtctgt 
cgcgtacaag 
cacaatcttt 
taacgagcct 
agacgacgat 
taagaagaaa 
acctatcagc 
taatggaaaa 
attgaaaacc 
tgaggaagag 
tgtgcttcca 
taacaagttg 
cttgtttgtt 
cccaccttta 
cttttttgat 
taatgctgcc 
gcgaagtgtg 
taatataggc 
aaccctgagt 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1824 



<210> 107 
<211> 603 
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<212> PRT 
<213> SHRIMP 



<400> 107 






Met 


Arg 


Asp 


Asp 


Thr 


1 








5 


Trp 


Tyr 


Thr 


Glu 


Tyr 








20 




Airg 


Leu 


Leu 


Gly 


Ser 






35 






Ser 


Arg 


Pro 


Ala 


Leu 




50 








Pro 


Phe 


Arg 


lie 


Thr 












Leu 


Thr 


Lys 


Pro 


Asp 










85 


Leu 


Val 


Tyr 


Cys 


Ser 








100 




Ser 


Met 


Thr 


lie 


Trp 






lie 

lib 






Thr 


Cys 


Pro 


Ser 


Cys 




UU 








lie 


His 


Lys 


Lys 


Pro 


1 4 O 










Glu 


Lys 


Glu 


Ala 


Phe 










165 


Ser 


Leu 


Leu 


Thr 


Gly 








180 




ber 


Ser 


Gin 


Gin 


Ser 






1 y b 






His 


Thr 


Ser 


Lys 


Lys 




210 








Ser 


Asn 


Lys 


lie 


Lys 












Ser 


Asp 


Leu 


Leu 


Tyr 










245 


Tyr 


Ala 


Tyr 


Lys 


Ala 








260 




Leu 


Tyr 


Phe 


Lys 


Tyr 






275 






Lys 


Thr 


Asp 


Leu 


Gin 




1 Q A 








Asn 


Ser 


Ser 


Ala 


Ala 


J w J 










Cys 


Asp 


Asp 


Asp 


Asp 










325 


Lys 


Lys 


Gin 


Pro 


Lys 








340 




Thr 


Leu 


Pro 


Pro 


He 






355 






Val 


Leu 


Lys 


Lys 


Gly 




370 








Leu 


Ser 


Gly 


Lys 


Lys 


385 










Ala 


Ala 


Gly 


Gly 


Gly 










405 


Glu 


Glu 


Asn 


Asn 


Pro 








420 




Lys 


Asp 


Tyr 


Val 


Leu 






435 






Ser 


Gin 


His 


Arg 


Met 



Phe Asn Gin Glu Thr 
10 

Asp Cys Cys Cys Pro 
25 

Phe Gly Gly Gly Val 
40 

Tyr Glu Glu Asp Lys 
55 

Ser Leu He Glu Gly 
70 

Leu Ala Ala Ala Ala 
90 

Cys Asn Asn Thr Gin 
105 

He Asp Gly Asn Asn 
120 

Thr Val Glu Lys He 
135 

Met Ser Leu Leu Ala 
150 

Ala Glu Arg He Glu 
170 

Ser Ala Ala Gly Gly 
185 

Ser Phe Asn Gly Ser 
200 

Asp Lys Thr Arg Leu 
215 

His Thr Ser Arg Leu 
230 

Ala Leu Cys Ser Thr 
250 

Arg Asn Leu Cys Val 
265 

Thr He Phe Glu Glu 
280 

Ser Leu Val Asn Asn 
295 

Ser Ser Ser Ser Leu 
310 

Asp Asp Asp Asp Asp 
330 

Lys Gin Thr Lys Lys 
345 

Ser Lys Thr Asn His 
360 

Ala Val Asn Gly Lys 
375 

Gly Gin His Ser Lys 
390 

Ala Ser Ser Asp Val 
410 

Ser Ser Val Ser Pro 
425 

Pro Cys Pro Gin He 
440 

Asn Asn Asn Lys Leu 



Ala 


Val 


Lys 


Leu 


Val 


Arg 










15 




Leu 


Val 


Asn 


Arg 


Val 


Glu 








30 






Asp 


Ala 


Thr 


Ser 


Val 


Arg 






45 








Lys 


Gly 


Asp 


Lys 


Cys 


He 




60 










He 


Leu 


Leu 


Glu 


Arg 


Ala 


75 










80 


Phe 


Asp 


Val 


Ser 


Glu 


Lys 










95 




Gly Asn 


Phe 


Asp 


Val 


Ser 








HO 






Ser 


Lys 


Lys 


Tyr 


Glu 


Val 






125 








Ser 


Gly 


Gly 


Ala 


Glu 


Ser 




140 










Phe 


Phe 


Asn 


Asn 


Leu 


Val 


155 










160 


Leu 


Lys 


Lys 


Leu 


Tyr 


Leu 










175 




Gly 


Gly 


Met 


Tyr 


Lys 


Asp 








190 






Trp 


Thr 


Ser 


Leu 


Leu 


Phe 






205 








Glu 


Ala 


Glu 


Val 


Leu 


Val 




220 










Gin 


Pro 


Arg 


Cys 


Val 


Cys 


235 










240 


Thr 


Asn 


Asn 


Ser 


Ala 


Ser 










255 




He 


Glu 


Gly 


Gly 


Glu 


Phe 








270 






Asn 


Gly 


Pro 


Phe 


Asp 


Ser 






285 








Glu 


Pro 


Val 


Ser 


Glu 


Thr 




300 










Glu 


Asp 


Asp 


Asp 


Asp 


Cys 


315 










320 


Glu 


Asp 


Glu 


Lys 


Thr 


Lys 










335 




Gin 


Lys 


Thr 


Thr 


Thr 


Ser 








350 






Asp 


Asn 


Met 


Leu 


Met 


Asn 






365 








Arg 


Lys 


Met 


Met 


Asp 


Ser 




380 










Lys 


Leu 


Lys 


Thr 


Ser 


Ala 


395 










400 


Val 


Ala 


Gly 


Glu 


Asn 


Glu 










415 




Thr 


Asn 


Asn 


Arg 


Asp 


Arg 








430 






Glu 


Glu 


Val 


Thr 


He 


Phe 






445 








Ala 


Glu 


Ser 


Val 


Val 


Lys 
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450 










455 










460 










His 


Ser 


Val 


Val 


He 


Asn 


Gly 


Asn 


Cys 


Leu 


Asn 


Leu 


Phe 


Val 


Thr 


Gin 


465 










470 










475 










480 


His 


Arg 


Lys 


Lys 


Tyr 
485 


He 


Leu 


Phe 


Asn 


He 
490 


Leu 


Phe 


Cys 


Pro 


Pro 
495 


Leu 


Val 


Gin 


His 


Val 
500 


Gly 


Phe 


Asn 


Lys 


Phe 
505 


Arg 


He 


Leu 


Thr 


Gly 
510 


Val 


Ser 


Cys 


Phe 


Phe 


Asp 


Arg 


He 


Glu 


He 


Val 


Phe 


Ser 


Asp 


Gin 


Ser 


Asp 


Ser 






515 










520 










525 






Val 


Val 


liBU 


Ser 


Asn 


Asn 


Ala 


Ala 


His 


Ser 


Ala 


He 


Leu 


Arg 


Leu 


Leu 




530 










535 










540 








Ser 


Tyr 


lie 


Arg 


Glu 


Asn 


Ser 


Leu 


Lys 


Arg 


Ser 


Val 


Arg 


Thr 


Ala 


Ser 


545 










550 










555 










560 


Val 


Lys 


Gly 


lie 


Asp 
565 


Phe 


Val 


Val 


Lys 


Ser 
570 


Gin 


Asp 


Thr 


Asn 


He 
575 


Gly 


lie 


Pro 


Leu 


Ser 
580 


Asn 


Lys 


Glu 


He 


Arg 
585 


Glu 


Arg 


Gin 


Leu 


Cys 
590 


Ser 


Ala 


Ser 


Thr 


Leu 
595 


Ser 


Met 


Leu 


Ala 


Gly 
600 


Leu 


Gly 


Lys 













<210> 108 
<211> 2403 
<212> DNA 
<213> SHRIMP 



<400> 108 

atgctcagct 

gtcattttac 

ggaataatga 

gtggaagaaa 

aaggaactaa 

accggttcgg 

attagatccc 

ttgcccacca 

acaagagaga 

atgcacctca 

atgtcagaga 

tggaaaatta 

agagtagaaa 

ggcatggatt 

tcttctactt 

acgggcattg 

ctatcggaat 

gtagaacctt 

tggttacctc 

gaagaagaaa 

actgattttt 

tgcaaaataa 

aaatatgatt 

gttgccctca 

caaacccctt 

actgacgtgt 

gaaaatgaaa 

ggtaattctg 

cagtacttta 

gccccgtcat 

agacctagaa 

aggtgcatat 

acagaacacg 

aaccaatacc 



tcaaccctga 
ccgtttccaa 
cactggaccc 
gactagctca 
ataatagtag 
caatcttcga 
ccgctttgaa 
taaccaaatc 
agatagtggt 
cagacggaga 
ccaatgcttc 
tgcctaacaa 
cattagtgga 
tagtggatat 
ccaataatac 
gatccttgtc 
ataggactgg 
caatagacgg 
ctctagttaa 
acggggacaa 
ggtctaatgt 
acgatcgggc 
ggacaaataa 
tgacctttag 
ctcagggaag 
acggttttga 
ccgggattga 
tcagggtcgt 
cagaaatggc 
ctgtaaagga 
aagataacaa 
accacactga 
tatctttcaa 
aaacaaacta 



gtatgctagc 
ggatgttgta 
tcacgatttg 
atacaatact 
aaatatgaaa 
agacaacatg 
ttctgctgta 
tctaaggaga 
ggattctttc 
gatgacgtac 
gtccatctac 
tttcaataaa 
gtttgcgtgc 
gcaaagaact 
tcctagagta 
cattcttgca 
attatccatt 
cgttcaagca 
aaggccagaa 
taaaccttct 
ccccg tcaca 
aa tgattagt 
aactacaaag 
aaaattccaa 
tgaatacgca 
atcgaatgga 
aagcatatac 
cagactggaa 
catacctcct 
gtacaagtat 
cgacggcggt 
acgttgttta 
caaggatttg 
caaaagca ta 



tggtttggga 
ctctttggat 
gatataaaaa 
ctacctatgg 
gagtcaatat 
ttcaacggag 
ttttcaagca 
agtcaagcta 
agcatactta 
gtccccgatg 
ttgatcataa 
acacttgaaa 
aagtgcgtcg 
ataaggtcta 
gcgataatga 
gaagatggat 
actgaaaata 
gagcatcctc 
gtggtagcag 
gataaagata 
cctctaatta 
agttggaaaa 
gttgattatt 
gacatactag 
gtgaccatgt 
aataagccat 
gtcctaaata 
aaggaaatga 
attaatgaaa 
ttttgtgatc 
gtggagcatt 
gtccatcttt 
aacatattag 
ttcgaaattg 



aaatgatcac 
cacgaggaca 
tcacatctaa 
attttacacg 
tcacgggaat 
gaggttcagc 
agaactacat 
gagataagca 
gtgctatagc 
ggcactgcgt 
acgaccccac 
tgagagacgg 
catcatcctt 
tggatttcct 
catctggaag 
caacacacca 
atagagaggt 
tatccccttc 
cagcagcagc 
acgaagacaa 
cacccaagaa 
ataatctagt 
ttgataagat 
cggataacta 
ctaacgtggc 
tgtttgccct 
tcataggaaa 
gtttcctctt 
aatgcaaatg 
taacagcacc 
ctgcgttgac 
actctgagcc 
aaattggaaa 
tggacgttcc 



cgacccgggg 
atcagatgtg 
acgcataggt 
ggcaatggaa 
attt ttagac 
tttgcgctta 
catcaaacaa 
agtggataaa 
tgctcaagta 
taatgttgtc 
tggttcggga 
tgtaatagat 
gattaaaagg 
ccctccagct 
tagtactact 
ccaaatcaag 
gtcttttacg 
tattcttcag 
agcagtagta 
gtacagtgat 
atggagagcg 
gaaactccac 
ggctgccttt 
tgttcctcct 
tacactcttt 
agaacagcta 
ttcccctgat 
gaaggcgaag 
gacagataag 
catttcaaag 
ttatacacct 
agaaaaaata 
aaatattacc 
cataattgtc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 
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gcctctatgt catcaacaaa aacaatgact gtaaacaact acataatttc aacaccttct 2100 

gccacgacca agtttgttca ggatccgcca aaaacaggga aacaacttct ggcagttgaa 2160 

gaggtcagaa actttaaact caaatctgtt cttgttcctc ctccttattt tagggacaat 2220 

aagcgcaaca caactctttg ttcacaaata actgaacaaa attgcccgtc atcttctgaa 2280 

ggtgggcgtt tttcatgtcc atcagagtca cttattctca agtactctaa tctctctaaa 2340 

aagcgcgcac tggaagaaat tgccccagag actgagacta gcattttgtc actagccatg 2400 

taa 2403 

<210> 109 
<211> 794 
<212> PRT 
<213> SHRIMP 

<400> 109 

Met Leu Ser Phe Asn Pro Glu Tyr Ala Ser Trp Phe Gly Lys Met lie 

15 10 15 

Thr Asp Pro Gly Val He Leu Pro Val Ser Lys Asp Val Val Leu Phe 

20 25 30 

Gly Ser Arg Gly Gin Ser Asp Val Gly He Met Thr Leu Asp Pro His 

35 40 45 

Asp Leu Asp He Lys lie Thr Ser Lys Arg He Gly Val Glu Glu Arg 

50 55 60 

Leu Ala Gin Tyr Asn Thr Leu Pro Met Asp Phe Thr Arg Ala Met Glu 
65 70 75 80 

Lys Glu Leu Asn Asn Ser Arg Asn Met Lys Glu Ser He Phe Thr Gly 

85 90 95 

He Phe Leu Asp Thr Gly Ser Ala He Phe Glu Asp Asn Met Phe Asn 

100 105 110 

Gly Gly Gly Ser Ala Leu Arg Leu He Arg Ser Pro Ala Leu Asn Ser 

115 120 125 

Ala Val Phe Ser Ser Lys Asn Tyr He He Lys Gin Leu Pro Thr He 

130 135 140 

Thr Lys Ser Leu Arg Arg Ser Gin Ala Arg Asp Lys Gin Val Asp Lys 
145 150 155 160 

Thr Arg Glu Lys He Val Val Asp Ser Phe Ser He Leu Ser Ala He 

165 170 175 

Ala Ala Gin Val Met His Leu Thr Asp Gly Glu Met Thr Tyr Val Pro 

180 185 190 

Asp Gly His Cys Val Asn Val Val Met Ser Glu Thr Asn Ala Ser Ser 

195 200 205 

He Tyr Leu He He Asn Asp Pro Thr Gly Ser Gly Trp Lys He Met 

210 215 220 

Pro Asn Asn Phe Asn Lys Thr Leu Glu Met Arg Asp Gly Val He Asp 
225 230 235 240 

Arg Val Glu Thr Leu Val Glu Phe Ala Cys Lys Cys Val Ala Ser Ser 

245 250 255 

Leu He Lys Arg Gly Met Asp Leu Val Asp Met Gin Arg Thr He Arg 

260 265 270 

Ser Met Asp Phe Leu Pro Pro Ala Ser Ser Thr Ser Asn Asn Thr Pro 

275 280 285 

Arg Val Ala He Met Thr Ser Gly Ser Ser Thr Thr Thr Gly He Gly 

290 295 300 

Ser Leu Ser He Leu Ala Glu Asp Gly Ser Thr His His Gin He Lys 
305 310 315 320 

Leu Ser Glu Tyr Arg Thr Gly Leu Ser He Thr Glu Asn Asn Arg Glu 

325 330 335 

Val Ser Phe Thr Val Glu Pro Ser He Asp Gly Val Gin Ala Glu His 

340 345 350 

Pro Leu Ser Pro Ser He Leu Glh Trp Leu Pro Pro Leu Val Lys Arg 

355 360 365 

Pro Glu Val Val Ala Ala Ala Ala Ala Ala Val Val Glu Glu Glu Asn 
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370 










375 










380 










Gly 


Asp 


Asn 


Lys 


Pro 


Ser 


Asp 


Lys 


Asp 


Asn 


Glu 


Asp 


Lys 


Tyr 


Ser 


Asp 


385 










390 










395 










400 


Thr 


Asp 


Phe 


Trp 


Ser 


Asn 


Val 


Pro 


Val 


Thr 


Pro 


Leu 


He 


Thr 


Pro 


Lys 










405 










410 










415 




Lys 


Trp 


Arg 


Ala 


Cys 


Lys 


He 


Asn 


Asp 


Arg 


Ala 


Met 


He 


Ser 


Ser 


Trp 








420 










425 










430 






Lys 


Asn 


Asn 


Leu 


Val 


Lys 


Leu 


His 


Lys 


Tyr 


Asp 


Trp 


Thr 


Asn 




Thr 






435 










440 










445 








Thr 


Lys 


Val 


Asp 


Tyr 


Phe 


Asp 


Lys 


Met 


Ala 


Ala 


Phe 


Val 


Met 


Thr 


Phe 




450 










455 










460 










Arg 


Lys 


Phe 


Gin 


Asp 


He 


Leu 


Ala 


Asp 


Asn 


Tyr 


Val 


Pro 


Pro 


Gin 


Thr 


4 65 










470 










475 










480 


Pro 


Ser 


Gin 


Gly 


Ser 


Glu 


Tyr 


Ala 


Val 


Thr 


Met 


Ser 


Asn 


Val 


Ala 


Thr 










485 










490 










495 




Leu 


Phe 


Thr 


Asp 


Val 


Tyr 


Gly 


Phe 


Glu 


Ser 


Asn 


Gly 


Asn 


Lys 


Pro 


Leu 








500 










505 










510 






Phe 


Ala 


Leu 


Glu 


Gin 


Leu 


Glu 


Asn 


Glu 


Thr 


Gly 


He 


Glu 


Ser 


He 


Tyr 






515 










520 










525 








val 


Leu 


Asn 


He 


He 


Gly 


Asn 


Ser 


Pro 


Asp 


Gly 


Asn 


Ser 


Val 


Arg 


Val 




530 










535 










540 










Val 


Arg 


Leu 


Glu 


Lys 


Glu 


Met 


Ser 


Phe 


Leu 


Leu 


Lys 


Ala 


Lys 


Gin 


Tyr 


545 










550 










555 










560 


Phe 


Thr 


Glu 


Met 


Ala 


He 


Pro 


Pro 


He 


Asn 


Glu 


Lys 


Cys 


Lys 


Trp 


Thr 










565 










570 










575 




Asp 


Lys 


Ala 


Pro 


Ser 


Ser 


Val 


Lys 


Glu 


Tyr 


Lys 


Tyr 


Phe 


Cys 


Asp 


Leu 








580 










585 










590 






Thr 


Ala 


Pro 


He 


Ser 


Lys 


Arg 


Pro 


Arg 


Lys 


Asp 


Asn 


Asn 


Asp 


Gly 


Gly 






595 










600 










605 








Val 


Glu 


His 


Ser 


Ala 


Leu 


Thr 


Tyr 


Thr 


Pro 


Arg 


Cys 


He 


Tyr 


His 


Thr 




610 










615 










620 










Glu 


Arg 


Cys 


Leu 


Val 


His 


Lys 


Glu 


Pro 


Glu 


Lys 


He 


Thr 


Glu 


His 


Val 


625 










630 










635 










640 


Ser 


Phe 


Asn 


Lys 


Asp 


Leu 


Asn 


He 


He 


Gly 


Lys 


Asn 


He 


Thr 


Asn 


Gin 










645 










650 










655 




Tyr 


Gin 


Thr 


Asn 


Tyr 


Lys 


Ser 


He 


Phe 


Glu 


He 


Val 


Asp 


Val 


Pro 


He 








660 










665 










670 






lie 


Val 


Ala 


Ser 


Met 


Ser 


Ser 


Thr 


Lys 


Thr 


Met 


Thr 


Val 


Asn 


Asn 


Tyr 






675 










680 










685 








He 


He 


Ser 


Thr 


Pro 


Ser 


Ala 


Thr 


Thr 


Lys 


Phe 


Val 


Gin 


Asp 


Pro 


Pro 




690 










695 










700 










Lys 


Thr 


Gly 


Lys 


Gin 


Leu 


Leu 


Ala 


Val 


Glu 


Glu 


Val 


Arg 


Asn 


Phe 


Lys 


705 










710 










715 










720 


Leu 


Lys 


Ser 


Val 


Leu 


Val 


Pro 


Pro 


Pro 


Tyr 


Phe 


Arg 


Asp 


Asn 


Lys 


Arg 










725 










730 










735 




Asn 


Thr 


Thr 


Leu 


Cys 


Ser 


Gin 


He 


Thr 


Glu 


Gin 


Asn 


Cys 


Pro 


Ser 


Ser 








740 










745 










750 






Ser 


Glu 


Gly 


Gly Arg 


Phe 


Ser 


Cys 


Pro 


Ser 


Glu 


Ser 


Leu 


He 


Leu 


Lys 






755 










760 










765 








Tyr 


Ser 


Asn 


Leu 


Ser 


Lys 


Lys 


Arg 


Ala 


Leu 


Glu 


Glu 


He 


Ala 


Pro 


Glu 




770 










775 










780 










Thr 


Glu 


Thr 


Ser 


He 


Leu 


Ser 


Leu 


Ala 


Met 















785 790 



<210> 110 
<211> 1581 
<212> DNA 
<213> SHRIMP 



<400> 110 
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atgggagtac 
gttgttacac 
tcttcctatt 
agtggaacca 
aattccatat 
cctttgggct 
catgaaacaa 
attctaccta 
cagaagtgtg 
caaggggaat 
gcctaccttt 
gaagataata 
tattttgtca 
tcggacatcc 
gacctcctaa 
aatcccggtg 
gcgctacctt 
acaacaacaa 
gattttgatt 
ttcattagat 
tccatcgcca 
tgggttgggg 
tttacttctg 
gaaactctta 
aagaaaattg 
ggtattgttt 
ataaaaa tgc 



agaagaatat 
ttctaggaac 
cctttactcc 
agaaggacac 
tcagacatga 
tgaaccttat 
cagaacctga 
aaaaatggtg 
ggtgcgggca 
ggcttgctgc 
tcataagcct 
tgatgatggg 
acgatttttc 
tgaacaagag 
ctaaacatca 
ttgaggagga 
acattaagaa 
ccactactac 
acgagtcttt 
ttcttgatca 
actttaaaat 
atgaacatac 
cacctattat 
gaagcaaaat 
gggaagattg 
gtataacagt 
ctaaattcta 



cctggtcggt 
agtaacagaa 
tgaatcgagt 
cttaatcaac 
ctgttctgaa 
tttctgttca 
cgatttattg 
gtctgatgtt 
ggcatttgtt 
tcacactaac 
tcaacgaact 
aagaatgtca 
agacgatata 
ggaagaatgg 
actgggggag 
agaagaagag 
aagtggaaaa 
tactactact 
taatgaacca 
aaaaaattgg 
tgaacaagag 
ccctaaattc 
tgtccagaat 
tgttgatcgg 
ggctaataaa 
aacagttatt 
a 



ggtggtggtg 
ggagcaccgg 
gttttctggg 
gttctgggga 
actcgttcta 
tgtgcgagtc 
tgggacggat 
gtgtggacat 
tcttccttta 
ggaaagacat 
acactcaagc 
ggcacaccca 
ggaagtactc 
atagctgttt 
cgggaatatg 
agagtggaag 
cttatcggac 
actaatccta 
gaaatctttg 
agacttggta 
tcatcaaaag 
agacttagtg 
gtaggttatt 
tcaagggacc 
aaacaaactg 
tctatatttt 



gtgtatcatt 
cagtcccacc 
tcgaaggaaa 
agaagattcc 
ttcaatggcc 
atgaacatca 
caagaaaaac 
ctttatggag 
ct tctacgca 
ctgagggtga 
ctatcatcac 
tgaaccctaa 
ctcagtgtct 
ggggtgttgc 
ggagcgaagg 
aagaagaaga 
ctcgtagaag 
ttgttagaga 
gcagtaactc 
tcatgagcag 
ccttattttg 
tatggaagaa 
cctctgatgt 
tgatagaaac 
tagttgctat 
caattgtaat 



gttattgggt 
tttttcatcg 
tcgtgtttta 
ttattatgct 
agaaacttcc 
acaccgtact 
taccaccata 
ggataacgac 
gaaagaagtc 
tactaattca 
tgacgtcaca 
ggatatgaca 
tgtatcgaat 
agactctaaa 
aagaagaaga 
agtagaagtt 
acctttgaca 
ggttgtggaa 
aaaacttccc 
agtttcttct 
tttggcagtc 
ctggaagcct 
tttctggcat 
aaaagtgaca 
gtttatttca 
atattacaaa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1581 



<210> 111 
<211> 522 
<212> PRT 
<213> SHRIMP 



<400> 111 



Met 


Gly 


Val 


Gin 


Lys 


Asn 


He 


Leu 


Val 


Gly 


Gly 


Gly 


Gly 


Gly 


Val 


Ser 


1 








5 










10 










15 




Leu 


Leu 


Leu 


Gly 


Val 


Val 


Thr 


Leu 


Leu 


Gly 


Thr 


Val 


Thr 


Glu 


Gly 


Ala 








20 










25 










30 






Pro 


Ala 


Val 


Pro 


Pro 


Phe 


Ser 


Ser 


Ser 


Ser 


Tyr 


Ser 


Phe 


Thr 


Pro 


Glu 






35 










40 










45 








Ser 


Ser 


Val 


Phe 


Trp 


Val 


Glu 


Gly 


Asn 


Arg 


Val 


Leu 


Ser 


Gly 


Thr 


Lys 




50 










55 










60 










Lys 


Asp 


Thr 


Leu 


He 


Asn 


Val 


Leu 


Gly 


Lys 


Lys 


He 


Pro 


Tyr 


Tyr 


Ala 


65 










70 










75 










80 


Asn 


Ser 


He 


Phe 


Arg 


His 


Asp 


Cys 


Ser 


Glu 


Thr 


Arg 


Ser 


He 


Gin 


Trp 










85 










90 










95 




Pro 


Glu 


Thr 


Ser 


Pro 


Leu 


Gin 


Leu 


He 


Phe 


Cys 


Ser 


Cys 


Ala 


Ser 


His 








100 










105 










110 






Glu 


His 


Gin 


His 


Arg 


Thr 


His 


Glu 


Thr 


Thr 


Glu 


Pro 


Asp 


Asp 


Leu 


Leu 






115 










120 










125 








Trp 


Asp 


Gly 


Ser 


Arg 


Lys 


Thr 


Thr 


Thr 


He 


He 


Leu 


Pro 


Lys 


Lys 


Trp 




130 










135 










140 










Trp 


Ser 


Asp 


Val 


Val 


Trp 


Thr 


Ser 


Leu 


Trp 


Arg 


Asp 


Asn 


Asp 


Gin 


Lys 


145 










150 










155 










160 


Cys 


Gly 


Cys 


Gly 


Gin 


Ala 


Phe 


Val 


Ser 


Ser 


Phe 


Thr 


Ser 


Thr 


Gin 


Lys 










165 










170 










175 




Glu 


Val 


Gin 


Gly 


Glu 


Trp 


Leu 


Ala 


Ala 


His 


Thr 


Asn 


Gly 


Lys 


Thr 


Ser 








180 










185 










190 






Glu 


Gly 


Asp 


Thr 


Asn 


Ser 


Ala 


Tyr 


Leu 


Phe 


He 


Ser 


Leu 


Gin 


Arg 


Thr 






195 










200 










205 








Thr 


Leu 


Lys 


Pro 


He 


He 


Thr 


Asp 


Val 


Thr 


Glu 


Asp 


Asn 


Met 


Met 


Met 
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210 










215 










220 










Gly 


Arg 


Met 


Ser 


Gly 


Thr 


Pro 


Met 


Asn 


Pro 


Lys 


Asp 


Met 


Thr 


Tyr 


Phe 


225 










230 










235 










240 


Val 


Asn 


Asp 


Phe 


Ser 


Asp 


Asp 


He 


Gly 


Ser 


Thr 


Pro 


Gin 


Cys 


Leu 


Val 










245 










250 










255 




Ser 


Asn 


Ser 


Asp 


He 


Leu 


Asn 


Lys 


Arg 


Glu 


Glu 


Trp 


He 


Ala 


Val 


Trp 








260 










265 










270 






Gly 


Val 


Ala 


Asp 


Ser 


Lys 


Asp 


Leu 


Leu 


Thr 


Lys 


His 


Gin 


Leu 


Gly 


Glu 






275 










280 










285 








Arg 


Glu 


Tyr 


Gly 


Ser 


Glu 


Gly 


Arg 


Arg 


Arg 


Asn 


Pro 


Gly 


Val 


Glu 


Glu 




290 










295 










300 










Glu 


Glu 


Glu 


Glu 


Arg 


Val 


Glu 


Glu 


Glu 


Glu 


Glu 


Val 


Glu 


Val 


Pro 


Tyr 


305 










310 










315 










320 


lie 


Lys 


Lys 


Ser 


Gly 


Lys 


Leu 


He 


Gly 


Pro 


Arg 


Arg 


Arg 


Pro 


Leu 


Thr 










325 










330 










335 




Thr 


Thr 


Thr 


Thr 


Thr 


Thr 


Thr 


Thr 


Thr 


Thr 


Thr 


Asn 


Pro 


He 


Val 


Arg 








340 










345 










350 






Glu 


Val 


Val 


Glu 


Asp 


Phe 


Asp 


Tyr 


Glu 


Ser 


Phe 


Asn 


Glu 


Pro 


Glu 


He 






355 










360 










365 








Phe 


Gly 


Ser 


Asn 


Ser 


Lys 


Leu 


Pro 


Phe 


He 


Arg 


Phe 


Leu 


Asp 


Gin 


Lys 




370 










375 










380 










Asn 


Trp 


Arg 


Leu 


Gly 


He 


Met 


Ser 


Arg 


Val 


Ser 


Ser 


Ser 


He 


Ala 


Asn 


385 










390 










395 










400 


Phe 


Lys 


lie 


Glu 


Gin 


Glu 


Ser 


Ser 


Lys 


Ala 


Leu 


Phe 


Cys 


Leu 


Ala 


Val 










405 










410 










415 




Trp 


Val 


Gly 


Asp 


Glu 


His 


Thr 


Pro 


Lys 


Phe 


Arg 


Leu 


Ser 


Val 


Trp 


Lys 








420 










425 










430 






Asn 


Trp 


Lys 


Pro 


Phe 


Thr 


Ser 


Ala 


Pro 


He 


He 


Val 


Gin 


Asn 


Val 


Gly 






435 










440 










445 








Tyr 


Ser 


Ser 


Asp 


Val 


Phe 


Trp 


His 


Glu 


Thr 


Leu 


Arg 


Ser 


Lys 


He 


Val 




450 










455 










460 










Asp 


Arg 


Ser 


Arg 


Asp 


Leu 


He 


Glu 


Thr 


Lys 


Val 


Thr 


Lys 


Lys 


He 


Gly 


4 65 










470 










475 










480 


Glu 


Asp 


Trp 


Ala 


Asn 


Lys 


Lys 


Gin 


Thr 


Val 


Val 


Ala 


Met 


Phe 


He 


Ser 










485 










4 90 










495 




Gly 


lie 


Val 


Cys 


He 


Thr 


Val 


Thr 


Val 


He 


Ser 


He 


Phe 


Ser 


He 


Val 








500 










505 










510 






lie 


Tyr 


Tyr 


Lys 


lie 


Lys 


Met 


Pro 


Lys 


Phe 















515 520 



<210> 112 
<211> 627 
<212> DNA 
<213> SHRIMP 



<400> 112 

atgtatatct 

atagatacag 

tactacaact 

agagtggtgg 

gagcaaggaa 

tacatcgcag 

gaagctgtca 

tggatggctg 

agagtagtct 

tataactttg 

aggcatagga 



tcgtcgaagg 
gatcatgtgg 
ggcctgccga 
atggaatgtt 
aagagagtt t 
aatggttgag 
agagtggatt 
aaattattag 
tcaatggcaa 
gaacaaacta 
ttgtacattt 



ttcccccctc 
aatgtctttc 
aatcgggaca 
tgaacctgtc 
gaaggaatat 
acaaaggcca 
cttgaacgag 
agagaagagg 
tgtttgtagt 
taacaatgtt 
cttataa 



acagggaaga 
ctcaattttc 
gaacatctcc 
ctaaagacct 
ctggactaca 
ctagccttcc 
gaggatctag 
ggcaatattc 
gcatgtttct 
gtacattgtg 



gttcatggat 
ttcgtatgaa 
agttaggttt 
ttgtcgactc 
acggccaagt 
acgtgtttac 
atatggatac 
aagaaataaa 
ctaacactaa 
atttgttgtg 



gtccaagttg 
cacttctgac 
cagagaaacc 
gtggaagaaa 
catggagatc 
ctatacagat 
tgcaaccaag 
agtgacccct 
gagaaacttg 
cccttttgca 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

627 



<210> 113 
<211> 208 
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<212> PRT 
<213> SHRIMP 



<400> 113 



Met 


Tyr 


He 


Phe 


Val 


Glu 


Gly 


Ser 


Pro 


Leu 


Thr 


Gly 


Lys 


Ser 


Ser 


Trp 


1 








5 










10 










15 




Met 


Ser 


Lys 


Leu 


He 


Asp 


Thr 


Gly 


Ser 


Cys 


Gly 


Met 


Ser 


Phe 


Leu 


Asn 








20 










25 










30 






Phe 


Leu 


Arg 


Met 


Asn 


Thr 


Ser 


Asp 


Tyr 


Tyr 


Asn 


Trp 


Pro 


Ala 


Glu 


He 






35 










40 










45 








Gly 


Thr 


Glu 


His 


Leu 


Gin 


Leu 


Gly 


Phe 


Arg 


Glu 


Thr 


Arg 


Val 


Val 


Asp 




50 










55 










60 










Gly 


Met 


Phe 


Glu 


Pro 


Val 


Leu 


Lys 


Thr 


Phe 


Val 


Asp 


Ser 


Trp 


Lys 


Lys 


65 










70 










75 










80 


Glu 


Gin 


Gly 


Lys 


Glu 


Ser 


Leu 


Lys 


Glu 


Tyr 


Leu 


Asp 


Tyr 


Asn 


Gly 


Gin 










85 










90 










95 




Val 


Met 


Glu 


He 


Tyr 


He 


Ala 


Glu 


Trp 


Leu 


Arg 


Gin 


Arg 


Pro 


Leu 


Ala 








100 










105 










110 






Phe 


His 


Val 


Phe 


Thr 


Tyr 


Thr 


Asp 


Glu 


Ala 


Val 


Lys 


Ser 


Gly 


Phe 


Leu 






115 










120 










125 








Asn 


Glu 


Glu 


Asp 


Leu 


Asp 


Met 


Asp 


Thr 


Ala 


Thr 


Lys 


Trp 


Met 


Ala 


Glu 




130 










135 










140 










He 


He 


Arg 


Glu 


Lys 


Arg 


Gly 


Asn 


He 


Gin 


Glu 


He 


Lys 


Val 


Thr 


Pro 


145 










150 










155 










160 


Arg 


Val 


Val 


Phe 


Asn 


Gly 


Asn 


val 


Cys 


Ser 


Ala 


Cys 


Phe 


Ser 


Asn 


Thr 










165 










170 










175 




Lys 


Arg 


Asn 


Leu 


Tyr 


Asn 


Phe 


Gly 


Thr 


Asn 


Tyr 


Asn 


Asn 


Val 


Val 


His 








180 










185 










190 






Cys 


Asp 


Leu 


Leu 


Cys 


Pro 


Phe 


Ala 


Arg 


His 


Arg 


He 


Val 


His 


Phe 


Leu 






195 










200 










205 









<210> 114 
<211> 375 
<212> DNA 
<213> SHRIMP 



<400> 114 

atgtggagat cttgtattag caacattaga gaaatggggg acaataaaga ttatgaaaca 60 
agactgattc aaagaattaa tgatttggaa tcagaaattg aaaataaaac tgaattatgt 120 
gaaaaaatca atgagcagat gaaaaataca caactaaaat atgataaatg tttcgtagag 180 
gaggagacgg aaaaattccg caagatggag gaaagagtta aatacctcaa agagcaggga 240 
atccctctag acccagaaga aagacgtaca atgttggctg aaattgacaa gagtaacaaa 300 
gagttagatg cccttcttga ggaaaatgaa cgtataataa agctcattga tgaagagttg 360 
gaaagtatga aataa 375 



<210> 115 
<211> 124 
<212> PRT 
<213> SHRIMP 



<400> 115 




























Met 


Trp 


Arg 


Ser 


Cys 


He 


Ser 


Asn 


He 


Arg 


Glu 


Met 


Gly 


Asp 


Asn 


Lys 


1 








5 










10 










15 




Asp 


Tyr 


Glu 


Thr 


Arg 


Leu 


He 


Gin 


Arg 


He 


Asn 


Asp 


Leu 


Glu 


Ser 


Glu 








20 










25 










30 






He 


Glu 


Asn 


Lys 


Thr 


Glu 


Leu 


Cys 


Glu 


Lys 


He 


Asn 


Glu 


Gin 


Met 


Lys 






35 










40 










45 








Asn 


Thr 


Gin 


Leu 


Lys 


Tyr 


Asp 


Lys 


Cys 


Phe 


Val 


Glu 


Glu 


Glu 


Thr 


Glu 




50 










55 










60 










Lys 


Phe 


Arg 


Lys 


Met 


Glu 


Glu 


Arg 


Val 


Lys 


Tyr 


Leu 


Lys 


Glu 


Gin 


Gly 
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65 70 75 80 

He Pro Leu Asp Pro Glu Glu Arg Arg Thr Met Leu Ala Glu He Asp 

85 90 95 

Lys Ser Asn Lys Glu Leu Asp Ala Leu Leu Glu Glu Asn Glu Arg He 

100 105 110 

He Lys Leu He Asp Glu Glu Leu Glu Ser Met Lys 
115 120 



<210> 116 
<211> 252 
<212> DMA 
<213> SHRIMP 



<400> 116 

atgcaaaaaa aatatgataa attatttgaa gatgataaaa ggttccgaga aatagaggaa 60 
cgaatccttc aacaaaaaga gaagggaaac cctctagacc cagaagaaag acttgtattg 120 
tcggctgata ttgataggag tatgaaagag attgatgatt gtctcgagga aataaaccat 180 
atagaattat ccattgatac attattggat gaatgtgaaa acttgcatta tggtcttcaa 240 
acaactaaat aa 252 

<210> 117 
<211> 83 
<212> PRT 
<213> SHRIMP 



<4Q0> 117 




























Met 


Gin 


Lys 


Lys 


Tyr 


Asp 


Lys 


Leu 


Phe 


Glu 


Asp 


Asp 


Lys 


Arg 


Phe 


Arg 


1 








5 










10 










15 




Glu 


He 


Glu 


Glu 


Arg 


He 


Leu 


Gin 


Gin 


Lys 


Glu 


Lys 


Gly 


Asn 


Pro 


Leu 








20 










25 










30 






Asp 


Pro 


Glu 


Glu 


Arg 


Leu 


Val 


Leu 


Ser 


Ala 


Asp 


He 


Asp 


Arg 


Ser 


Met 






35 










40 










45 








Lys 


Glu 


He 


Asp 


Asp 


Cys 


Leu 


Glu 


Glu 


He 


Asn 


His 


He 


Glu 


Leu 


Ser 




50 










55 










60 










He 


Asp 


Thr 


Leu 


Leu 


Asp 


Glu 


Cys 


Glu 


Asn 


Leu 


His 


Tyr 


Gly 


Leu 


Gin 


65 










70 










75 






80 


Thr 


Thr 


Lys 





























<210> 118 
<211> 2253 
<212> DNA 
<213> SHRIMP 



<400> 118 

atggaaaaaa 

aaaaggtcca 

tacctttctt 

aggtttgagg 

aggttagtag 

tggccagata 

aagaaattga 

gtccaagaaa 

atggatttca 

ctacgcaccc 

gggttcgata 

agtgattggg 

actcttcata 

tcacctgaag 



agactgagac 
gaaataaaga 
ccataatcaa 
agaggtgtaa 
tggacgagtg 
tgacgaagga 
aagaaggatc 
cattcctatc 
ataccatcac 
atttccggac 
aaaagtatac 
tggacagtgg 
ggaaacattt 
agaaatatcc 



ggctgcaaca 
acccaaaaca 
gagtgaaagt 
atccgtaagc 
tctgagacgt 
taacttttac 
tacacatcct 
ccagcaagga 
ctacgccgcc 
acgacaacag 
gaaactcgta 
tgatttggaa 
atgtgttatt 
gataccctca 



acagaaaaag 
acttctactg 
agtagaagta 
aagatgatgg 
tacaaccatc 
gtccaattgt 
gttgtagaag 
gaaggaaata 
aaacaactaa 
agggccatat 
caacactgga 
agggtaaaag 
agtgataaga 
atactaaatt 



acccagaacc 
tttacacttc 
atgtcacctc 
tcaaaggttc 
tagaagacga 
tgaggaaggg 
atgtttggaa 
atcccataaa 
aaacttgctt 
ctggatggtt 
taat tggatg 
aaggaacgaa 
agaatggtac 
attacaagtt 



gtctgtcagt 
tgtaaagtgt 
aaccaaagaa 
actgtttttg 
aatcgataaa 
tttagacaag 
ttcccccatc 
gagacatctc 
cgaaacaaac 
agctgaaaac 
tacctacaag 
aaatttcgtg 
aatttcctat 
tctacaaaca 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 
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gagtatccac 
atacactatt 
tggaaggata 
tacctattat 
atccagacgg 
actgtttcca 
tacccgccca 
tcctgttcgg 
aaacaatact 
aagaacaata 
gaaattataa 
ttgaaggtag 
aggaaaaatg 
gaattaatcc 
tttgcctgca 
ataaaggaaa 
aaagtatgcc 
aataagaaaa 
gacgaagaaa 
agatgctcgt 
gatatctcta 
tcca ttacat 
attgttcata 
ccgttgaagt 



aaaacaagaa 
gtacgtttga 
tggaagaacg 
tcgacgttaa 
acggcgaagg 
agaaaagtaa 
ctaatgccaa 
tatttgatac 
atcaagaatc 
aagagtacaa 
atgatggtaa 
tgaacgagca 
ccatgaaagt 
ctaaaaggga 
cgggaagagg 
gagtcggagg 
atcgctgcaa 
gaaaaccgcc 
atggaaaatg 
ccgaaaagac 
tggaaacgcc 
gtgtcttggg 
aatttctagg 
atcactttat 



aatacagaaa 
ccaaacgacc 
acacaaacaa 
aaagattaag 
cgtctctgta 
gaaaaataaa 
gtacgtagtg 
ccgtcaaaaa 
ttggatgaca 
^gaggcgtta 
cggtgatact 
ctacaggtta 
atactctaga 
taaaattgaa 
tgagcaatac 
tgataagagg 
tcaaccttta 
gacaatagta 
gaagaaggct 
gcaattcggt 
agtaccttct 
aggaaaattc 
gttttgggat 
tcgagttgcc 



atgatagttg 
attcaaggaa 
tcagaagata 
aaattgcgtc 
ctattttcca 
aaacctagag 
ggtgtagatc 
sicrggtagtga 
gatagaagaa 
gaggaaataa 
tctacaccca 
ctgtggaacg 
aaacaaaagt 
gattaccaca 
gcatcacctg 
tttactttcg 
aatatgctgg 
acaaccacaa 
acacctctca 
tacagttcaa 
tccacttcct 
gtcgacaggg 
aaaaagttaa 
tga 



tcccaaaaca 
tt tgtaaaga 
tactttacaa 
caaactggaa 
gagaagtgga 
gagatgagga 
cgggaagaac 
gaaaacacag 
aggcaaacga 
ctaggtacga 
ctaaaaaatt 
aaaagggaaa 
gcatatctaa 
ttgcttttgg 
ccaggatttt 
tggacgagaa 
agaaggattg 
caacaacaac 
gagaaaatag 
accgaaaagt 
cttccttttg 
acttcaatgc 
tggaaaagaa 



caaactaaag 
tttgggagtg 
gcaaggatgg 
ctttcactct 
agaagtagag 
taggagaaat 
taatgtcgtt 
aatgactgcg 
aacgtacaag 
taatggcgaa 
cgaagcttac 
gaaaaagtac 
ctttatagat 
ggatgcgaaa 
cgccaagaag 
atatacgtca 
tttttcaccg 
aacagaagaa 
agataccaga 
atcgacagga 
tactcctact 
aagcaccaat 
agacaagatg 



900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2253 



<210> 119 
<211> 746 
<212> PRT 
<213> SHRIMP 



<400> 119 
Met Glu Lys Lys 
1 

Pro Ser Val Ser 
20 

Thr Val Tyr Thr 
35 

Glu Ser Ser Arg 
50 

Arg Cys Lys Ser 
65 

Arg Leu Val Val 

Glu lie Asp Lys 
100 

Leu Leu Arg Lys 
115 

His Pro Val Val 
130 

Phe Leu Ser Gin 
145 

Met Asp Phe Asn 

Phe Glu Thr Asn 
180 

He Ser Gly Trp 
195 

Leu Val Gin His 
210 

Asp Ser Gly Asp 
225 



Thr 


Glu 


Thr 


Ala 


5 








Lys 


Arg 


Ser 


Arg 


Ser 


Val 


Lys 


Cys 








40 


Ser 


Asn 


Val 


Thr 






55 




Val 


Ser 


Lys 


Met 




70 






Asp 


Glu 


Cys 


Leu 


85 








Trp 


Pro 


Asp 


Met 


Gly 


Leu 


Asp 


Lys 








120 


Glu 


Asp 


Val 


Trp 






135 




Gin 


Gly 


Glu 


Gly 




150 






Thr 


He 


Thr 


Tyr 


165 








Leu 


Arg 


Thr 


His 


Leu 


Ala 


Glu 


Asn 








200 


Trp 


He 


He 


Gly 






215 




Leu 


Glu 


Arg 


Val 




230 







Ala Thr Thr Glu 
10 

Asn Lys Glu Pro 
25 

Tyr Leu Ser Ser 

Ser Thr Lys Glu 
60 

Met Val Lys Gly 
75 

Arg Arg Tyr Asn 
90 

Thr Lys Asp Asn 
105 

Lys Lys Leu Lys 

Asn Ser Pro He 
140 

Asn Asn Pro He 
155 

Ala Ala Lys Gin 
170 

Phe Arg Thr Arg 
185 

Gly Phe Asp Lys 

Cys Thr Tyr Lys 
220 

Lys Glu Gly Thr 
235 



Lys Asp Pro Glu 
15 

Lys Thr Thr Ser 
30 

He He Lys Ser 
45 

Arg Phe Glu Glu 

Ser Leu Phe Leu 
80 

His Leu Glu Asp 
95 

Phe Tyr Val Gin 
110 

Glu Gly Ser Thr 
125 

Val Gin Glu Thr 

Lys Arg His Leu 
160 

Leu Lys Thr Cys 
175 

Gin Gin Arg Ala 
190 

Lys Tyr Thr Lys 
205 

Ser Asp Trp Val 

Lys Asn Phe Val 
240 
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Thr Leu His Arg 

Thr lie Ser Tyr 
260 

Asn Tyr Tyr Lys 
275 

Gin Lys Met lie 
290 

Thr Phe Asp Gin 
305 

Trp Lys Asp Met 

Lys Gin Gly Trp 
340 

Arg Pro Asn Trp 

355 

Ser Val Leu Phe 
370 

Lys Ser Lys Lys 
385 

Tyr Pro Pro Thr 

Thr Asn Val Val 
420 

Val Arg Lys His 
435 

Met Thr Asp Arg 
450 

Glu Tyr Lys Glu 
465 

Glu lie lie Asn 

Phe Glu Ala Tyr 
500 

Asn Glu Lys Gly 
515 

Ser Arg Lys Gin 
530 

Lys Arg Asp Lys 
545 

Phe Ala Cys Thr 

Lys Lys lie Lys 

580 

Asp Glu Lys Tyr 
595 

Asn Met Leu Glu 
610 

Pro Thr lie Val 
625 

Glu Asn Gly Lys 

Thr Arg Arg Cys 
660 

Arg Lys Val Ser 
675 

Ser Ser Ser Phe 
690 

Lys Phe Val Asp 
705 

Phe Leu Gly Phe 



Lys His Leu Cys 
245 

Ser Pro Glu Glu 

Phe Leu Gin Thr 
280 

Val Val Pro Lys 
295 

Thr Thr He Gin 
310 

Glu Glu Arg His 

325 

Tyr Leu Leu Phe 

Asn Phe His Ser 
360 

Ser Arg Glu Val 
375 

Asn Lys Lys Pro 
390 

Asn Ala Lys Tyr 
405 

Ser Cys Ser Val 

Arg Met Thr Ala 
440 

Arg Lys Ala Asn 
455 

Ala Leu Glu Glu 
470 

Asp Gly Asn Gly 
485 

Leu Lys Val Val 

Lys Lys Lys Tyr 
520 

Lys Cys He Ser 
535 

He Glu Asp Tyr 
550 

Gly Arg Gly Glu 
565 

Glu Arg Val Gly 

Thr Ser Lys Val 
600 

Lys Asp Cys Phe 
615 

Thr Thr Thr Thr 
630 

Trp Lys Lys Ala 
645 

Ser Ser Glu Lys 

Thr Gly Asp He 
680 

Cys Thr Pro Thr 
695 

Arg Asp Phe Asn 
710 

Trp Asp Lys Lys 



Val He Ser Asp 
250 

Lys Tyr Pro He 
265 

Glu Tyr Pro Gin 

His Lys Leu Lys 
300 

Gly He Cys Lys 
315 

Lys Gin Ser Glu 
330 

Asp Val Lys Lys 
345 

He Gin Thr Asp 

Glu Glu Val Glu 
380 

Arg Gly Asp Glu 

395 

Val Val Gly Val 
410 

Phe Asp Thr Arg 
425 

Lys Gin Tyr Tyr 

Glu Thr Tyr Lys 
460 

He Thr Arg Tyr 
475 

Asp Thr Ser Thr 
490 

Asn Glu His Tyr 
505 

Arg Lys Asn Ala 

Asn Phe He Asp 
540 

His He Ala Phe 
555 

Gin Tyr Asp Ala 
570 

Gly Asp Lys Arg 
585 

Cys His Arg Cys 

Ser Pro Asn Lys 
620 

Thr Thr Thr Thr 
635 

Thr Pro Leu Arg 
650 

Thr Gin Phe Gly 
665 

Ser Met Pro Val 

Ser He Thr Cys 
700 

Ala Ser Thr Asn 
715 

Leu Met Glu Lys 



Lys Lys Asn Gly 
255 

Pro Ser He Leu 
270 

Asn Lys Lys He 
285 

He His Tyr Cys 

Asp Leu Gly Val 
320 

Asp He Leu Tyr 
335 

He Lys Lys Leu 
350 

Gly Glu Gly Val 

365 

Thr Val Ser Lys 

Asp Arg Arg Asn 
400 

Asp Pro Gly Arg 
415 

Gin Lys Arg Val 
430 

Gin Glu Ser Trp 
445 

Lys Asn Asn Lys 

Asp Asn Gly Glu 
480 

Pro Thr Lys Lys 
495 

Arg Leu Leu Trp 
510 

Met Lys Val Tyr 

525 

Glu Leu He Pro 

Gly Asp Ala Lys 
560 

Arg He Phe Ala 
575 

Phe Thr Phe Val 
590 

Asn Gin Pro Leu 
605 

Lys Arg Lys Pro 

Glu Glu Asp Glu 
640 

Glu Asn Arg Asp 
655 

Tyr Ser Ser Asn 
670 

Pro Ser Ser Thr 
685 

Val Leu Gly Gly 

He Val His Lys 
720 

Lys Asp Lys Met 
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725 730 735 

Pro Leu Lys Tyr His Phe He Arg Val Ala 
740 745 



<210> 120 
<211> 411 
<212> DNA 
<213> SHRIMP 



<400> 120 

atggggaata 

aaaaggtctc 

aagcagatcg 

ctcgacaaaa 

gacccagaag 

ggtttcatcg 

gaagaaaagg 



gtgaatctcg 
ataaaactct 
aagagttaca 
acgaagagcg 
aaagacgtgc 
aggaaagtga 
aggttgaaga 



<210> 121 
<211> 134 
<212> PRT 
<213> SHRIMP 



ttctagtggt 
ttacttgtca 
tcacaaaaca 
aaagaagcat 
aatattggct 
acgtctaggt 
ggaacatctt 



attgaaattg 
aatagaactg 
aataagcaat 
cagcaacagc 
gaaatcgata 
ttacttgtag 
ttgaaacaaa 



tacataaaaa 
aaagacatgc 
ttgaacaggc 
aacaaataat 
aacacatgaa 
atgcagaaat 
aagaagacta 



tggtgcacca 60 
ccagatacaa 120 
tcaaaaggtc 180 
aattcctcta 240 
agagattgat 300 
caataacttg 360 
a 411 



<400> 121 



Met 


Gly 


Asn 


Ser 


Glu 


Ser 


Arg 


Ser 


Ser 


Gly 


He 


Glu 


He 


Val 


His 


Lys 


1 








5 










10 










15 


Asn 


Gly 


Ala 


Pro 


Lys 


Arg 


Ser 


His 


Lys 


Thr 


Leu 


Tyr 


Leu 


Ser 


Asn 


Arg 








20 










25 










30 




Thr 


Glu 


Arg 

35 


His 


Ala 


Gin 


He 


Gin 
40 


Lys 


Gin 


He 


Glu 


Glu 
45 


Leu 


His 


His 


Lys 


Thr 


Asn 


Lys 


Gin 


Phe 


Glu 


Gin 


Ala 


Gin 


Lys 


Val 


Leu 


Asp 


Lys 


Asn 




50 










55 










60 






Glu 


Glu 


Arg 


Lys 


Lys 


His 


Gin 


Gin 


Gin 


Gin 


Gin 


He 


He 


He 


Pro 


Leu 


65 










70 










75 










80 


Asp 


Pro 


Glu 


Glu 


Arg 


Arg 


Ala 


He 


Leu 


Ala 


Glu 


He 


Asp 


Lys 


His 


Met 










85 










90 








95 




Lys 


Glu 


He 


Asp 


Gly 


Phe 


He 


Glu 


Glu 


Ser 


Leu 


Gly 


Leu 


Leu 


Val 


Asp 








100 










105 










110 




Ala 


Glu 


He 
115 


Asn 


Asn 


Leu 


Glu 


Glu 
120 


Lys 


Glu 


Val 


Glu 


Glu 
125 


Glu 


His 


Leu 


Leu 


Lys 


Gin 


Lys 


Glu 


Asp 























130 



<210> 122 

<211> 2772 

<212> DNA 

<213> SHRIMP 



<400> 122 

atgtcctcct 

aaggctctgg 

agaaaccaga 

agaaatggag 

agaatagtgg 

tattttgtca 

cagggcgttc 

gctcccctga 

cttcccccca 

aatgaattca 



cctcgtcgtc 
ctcacccaga 
agtgccccat 
gaaatctgtt 
gaagtcttca 
aggatctgta 
tggacaagaa 
tcccaataac 
gatctaaagc 
gtaacaactg 



gtcgt tctcg 
tctggtggac 
ccagt ttctg 
ccctttgcac 
cgggagaaca 
taatggagta 
aatatctgga 
tgatgtggat 
gtaccgagtc 
gcctcctaca 



ttccgcatct 
aagattaccc 
gccgacatct 
ccgttcaaga 
ttggacaatg 
ttttcctatg 
tctggatctg 
ttgctgtaca 
atcactgaag 
aatatcaaag 



ccacctacca 
agaaatgtga 
cgcacctga t 
accaaccaca 
acattgaaga 
tgaacggcgt 
gagaatcctc 
ttttcggtac 
ctgttctagc 
gagcatacg t 



gacctttctc 
cgagacagga 
ccaaggagaa 
tctggaacca 
atcatactgt 
caaggagtta 
ctcttctaga 
tttggtagtt 
actccccttc 
gtctagagat 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 
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ttcaggatgt ttaatctgtt ggccggttta 
agtgaatggg aatccataca cgcatctgtc 
aaagctgaga agaaacctcc atcaacatct 
gttaatgatg cagtgacaaa gatccctata 
ctcgccggta ttctccagaa agtgtactct 
tcttcaaatt ctatagacat caagcaagga 
gcaagaaaat tgggattcta tttactccaa 
caagatgttg ccgactgcat agtttcaatc 
gaaaaaatac gacagtgcat tgaggagaaa 
tgtgaaattg acaagacgcc cgatttttct 
ttccccgtcc acgcattctc ctcatcacat 
tccgaatgta tattcccgta catcatttcc 
gtaaaagttg tagatttgtt ccagtgccct 
ggaagatgtt acgagttttc aaatttgtgt 
cattgttctt ctctctttga tgctaccata 
gagtttctcc agtatgattt tgaaactgcg 
ccccatgttt acaataaggt agtaaagaat 
tactgttaca aatgcgtatc tcctgtcgta 
gaaatggtgg atgtaactcc ccctctaatt 
tatgatgatg gtcccggaaa tcatatgtgg 
ggttccagtg gagaaacacc caccattagt 
agagcaccga ggaagaaaat ggcaggatgg 
atagcgctag ccaactggag aaagagtggg 
gaaggggtaa atgccgtact ttatagaggc 
ccctgtgtta ttggtagatc catgagccct 
aaaattgctt taataaaggc cttcttccac 
aagttacttg aatgggcaga actattagtc 
acgccagaat gtgtcataca ccgcgcccat 
gacgaattgg ttcacatgcg tccagatgat 
aatgcggcca gacagaatgc ggctgctgca 
gaatttgtcc cctgcaaaga aaggacgatt 
gttagagttg taaattgtcc ttcatgtaaa 
aatgtgtttt gtgaatgtgg aacaaacatg 
gctgattcta atcattgtgt ggagaaacac 
aaatatgcct tagaaagtat gtacgggttt 
ggagttaaaa attattatgt aatggagaat 
gctaagaaat aa 

<210> 123 
<211> 919 
<212> PRT 
<213> SHRIMP 



gatcatatag aaggagaagt tgggggagaa 660 
gtcaagcgaa tggtcaccat tatgcgcaac 720 
agaattttta gagtgtatgt ggctgaacca 780 
cgtgtactca gtaaattatt cggttcaaga 840 
tattcaatgc taaatcttcc atatctcctg 900 
gtaaaaggaa ttacattatc tataccatct 960 
aaggatacaa cattacaatc ttccctatca 1020 
aacgctggta ttattggtga tgatttctct 1080 
aacaagccag aaaactgttg tatgtgcttt 1140 
tatagtgaac atgtggcaag gcacaatttc 1200 
gatgacaagt gttgtggagc aaagatttgt 1260 
ctgtatgaga aaatgactgg tgtggcaggt 1320 
ggttgtaaaa gtggcatgct caatctaaag 1380 
aagagaatga tactaccata cacatcgact 1440 
aatcgcgcag aggcttgttt ttactcccta 1500 
aggagaattg cgcatggagc taaagacatt 1560 
gtaaaagatt tggatagact gtgtgccttg 1620 
tgtgatgagc cgaatgaaag tacggactat 1680 
aatcttaccg agattgttga ttcggaagag 1740 
ccagcaaaat ttacctgtaa ctttattgcg 1800 
acgtgcagag atgctgtaac ttttctagga 1860 
gatgatcaat cggcagtggg gcaagccatt 1920 
gaattgccca aaaatatgtt tgatttacta 1980 
gacagtttct tgttacgtgc gataaactac 2040 
agtctggaac tcgttaaaag aaaggtgaat 2100 
gagaaaaggg tgcgtccaga cgcatctaaa 2160 
aaaagttatc tcatggaagt tttacttcag 2220 
tcatttgtag gcaaaactct cctcattact 2280 
gccacaagaa acgcctatat ccagaaccta 2340 
gcctcatttt ctgggtcgct ccccaaacct 2400 
gaatggatgt atgaaaagga caatgatgat 2460 
aaggctatcc agaaatatgg aggttgtgtg 2520 
tgctggatat gtgaagagaa ggtttctcct 2580 
aggattgttt atagtaactg tgttagggtt 2640 
gagatttgta ccatgaaaaa tgtagaagaa 2700 
ggatttttct ttgatgtaca agaaatggtt 2760 

2772 



<400> 123 



Met 


Ser 


Ser 


Ser 


Ser 


Ser 


Ser 


Ser 


Phe 


Ser 


Phe 


Arg 


lie 


Ser 


Thr 


Tyr 


1 








5 










10 










15 


Gin 


Thr 


Phe 


Leu 


Lys 


Ala 


His 


Pro 


Asp 


Leu 


Val 


Asp 


Lys 


He 


Thr 


Gin 








20 










25 










30 






Lys 


Cys 


Asp 


Glu 


Thr 


Gly 


Arg 


Asn 


Gin 


Lys 


Cys 


Pro 


He 


Gin 


Phe 


Leu 






35 










40 










45 








Ala 


Asp 


lie 


Ser 


His 


Leu 


lie 


Gin 


Gly 


Glu 


Arg 


Asn 


Gly 


Gly 


Asn 


Leu 




50 










55 










60 








Phe 


Pro 


Leu 


His 


Pro 


Phe 


Lys 


Asn 


Gin 


Pro 


His 


Leu 


Glu 


Pro 


Arg 


He 


65 










70 










75 








80 


Val 


Gly 


Ser 


Leu 


His 


Gly 


Arg 


Thr 


Leu 


Asp 


Asn 


Asp 


He 


Glu 


Glu 


Ser 










85 










90 










95 




Tyr 


Cys 


Tyr 


Phe 


Val 


Lys 


Asp 


Leu 


Tyr 


Asn 


Gly 


Val 


Phe 


Ser 


Tyr 


Val 








100 










105 










110 






Asn 


Gly 


Val 


Lys 


Glu 


Leu 


Gin 


Gly 


Val 


Leu 


Asp 


Lys 


Lys 


He 


Ser 


Gly 






115 










120 










125 








Ser 


Gly 


Ser 


Gly 


Glu 


Ser 


Ser 


Ser 


Ser 


Arg 


Ala 


Pro 


Leu 


He 


Pro 


He 
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130 










135 








Thr 


Asp 


Val 


Asp 


Leu 


Leu 


Tyr 


lie 


Phe 


Gly 


145 










150 










Pro 


Arg 


Ser 


Lys 


Ala 


Tyr 


Arg 


Val 


He 


Thr 










165 










170 


Pro 


Phe 


Asn 


Glu 


Phe 


Ser 


Asn 


Asn 


Trp 


Pro 








180 










185 




Ala 


Tyr 


Val 


Ser 


Arg 


Asp 


Phe 


Arg 


Met 


Phe 






195 










200 






Asp 


His 


lie 


Glu 


Gly 


Glu 


Val 


Gly 


Gly 


Glu 




210 










215 








His 


Ala 


Ser 


Val 


Val 


Lys 


Arg 


Met 


Val 


Thr 


225 










230 










Glu 


Lys 


Lys 


Pro 


Pro 


Ser 


Thr 


Ser 


Arg 


He 










245 










250 


Glu 


Pro 


Val 


Asn 


Asp 


Ala 


Val 


Thr 


Lys 


He 








260 










265 




Lys 


Leu 


Phe 


Gly 


Ser 


Arg 


Leu 


Ala 


Gly 


He 






275 










280 






Tyr 


Ser 


Met 


Leu 


Asn 


Leu 


Pro 


Tyr 


Leu 


Leu 




290 










2 95 








lie 


Lys 


Gin 


Gly 


Val 


Lys 


Gly 


lie 


Thr 


Leu 


305 










310 










Lys 


Leu 


Gly 


Phe 


Tyr 


Leu 


Leu 


Gin 


Lys 


Asp 










325 










330 


Leu 


Ser 


Gin 


Asp 


Val 


Ala 


Asp 


Cys 


He 


Val 








340 










345 




lie 


Gly 


Asp 


Asp 


Phe 


Ser 


Glu 


Lys 


He 


Arg 






355 










360 






Asn 


Lys 


Pro 


Glu 


Asn 


Cys 


Cys 


Met 


Cys 


Phe 




370 










375 








Pro 


Asp 


Phe 


Ser 


Tyr 


Ser 


Glu 


His 


Val 


Ala 


385 










390 










Val 


His 


Ala 


Phe 


Ser 


Ser 


Ser 


His 


Asp 


Asp 










405 










410 


lie 


Cys 


Ser 


Glu 


Cys 


lie 


Phe 


Pro 


Tyr 


He 








420 










425 




Met 


Thr 


Gly 


Val 


Ala 


Gly 


Val 


Lys 


Val 


Val 






435 










440 






Gly 


Cys 


Lys 


Ser 


Gly 


Met 


Leu 


Asn 


Leu 


Lys 




4 50 










455 








Ser 


Asn 


Leu 


Cys 


Lys 


Arg 


Met 


lie 


Leu 


Pro 


4 65 










470 










Ser 


Ser 


Leu 


Phe 


Asp 


Ala 


Thr 


lie 


Asn 


Arg 










485 










490 


Ser 


Leu 


Glu 


Phe 


Leu 


Gin 


Tyr 


Asp 


Phe 


Glu 








500 










505 




His 


Gly 


Ala 


Lys 


Asp 


lie 


Pro 


His 


Val 


Tyr 






515 










520 






Val 


Lys 


Asp 


Leu 


Asp 


Arg 


Leu 


Cys 


Ala 


Leu 




530 










535 








Ser 


Pro 


Val 


Val 


Cys 


Asp 


Glu 


Pro 


Asn 


Glu 


545 










550 










Val 


Asp 


Val 


Thr 


Pro 


Pro 


Leu 


He 


Asn 


Leu 










565 










570 


Glu 


Glu 


Tyr 


Asp 


Asp 


Gly 


Pro 


Gly 


Asn 


His 








580 










585 




Thr 


Cys 


Asn 


Phe 


lie 


Ala 


Gly 


Ser 


Ser 


Gly 






595 










600 






Thr 


Cys 


Arg 


Asp 


Ala 


Val 


Thr 


Phe 


Leu 


Gly 




610 










615 
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140 



Thr 


Leu 


Val 


Val 


Leu 


Pro 


155 










160 


Glu 


Ala 


Val 


Leu 


Ala 


Leu 










175 




Pro 


Thr 


Asn 


He 


Lys 


Gly 








190 






Asn 


Leu 


Leu 


Ala 


Gly 


Leu 






205 








Ser 


Glu 


Trp 


Glu 


Ser 


He 




220 










He 


Met 


Arg 


Asn 


Lys 


Ala 


235 










240 


Phe 


Arg 


Val 


Tyr 


Val 


Ala 










255 




Pro 


He 


Arg 


Val 


Leu 


Ser 








270 






Leu 


Gin 


Lys 


Val 


Tyr 


Ser 






285 








Ser 


Ser 


Asn 


Ser 


He 


Asp 




300 










Ser 


He 


Pro 


Ser 


Ala 


Arg 


315 










320 


Thr 


Thr 


Leu 


Gin 


Ser 


Ser 










335 




Ser 


He 


Asn 


Ala 


Gly 


He 








350 






Gin 


Cys 


He 


Glu 


Glu 


Lys 






365 








Cys 


Glu 


He 


Asp 


Lys 


Thr 




380 










Arg 


His 


Asn 


Phe 


Phe 


Pro 


395 










400 


Lys 


Cys 


Cys 


Gly 


Ala 


Lys 










415 




He 


Ser 


Leu 


Tyr 


Glu 


Lys 








430 






Asp 


Leu 


Phe 


Gin 


Cys 


Pro 






445 








Gly 


Arg 


Cys 


Tyr 


Glu 


Phe 




460 










Tyr 


Thr 


Ser 


Thr 


His 


Cys 


475 










480 


Ala 


Glu 


Ala 


Cys 


Phe 


Tyr 










495 




Thr 


Ala 


Arg 


Arg 


He 


Ala 








510 






Asn 


Lys 


Val 


Val 


Lys 


Asn 






525 








Tyr 


Cys 


Tyr 


Lys 


Cys 


Val 




540 










Ser 


Thr 


Asp 


Tyr 


Glu 


Met 


555 










560 


Thr 


Glu 


He 


Val 


Asp 


Ser 










575 




Met 


Trp 


Pro 


Ala 


Lys 


Phe 








590 






Glu 


Thr 


Pro 


Thr 


He 


Ser 






605 








Arg 


Ala 


Pro 


Arg 


Lys 


Lys 



620 
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Met 


Ala 


Gly 


Trp 


Asp 


Asp 


Gin 


Ser 


Ala 


Val 


Gly 


Gin 


Ala 


He 


He 


Ala 


625 










630 










635 










640 


Asn 


Trp 


Arg 


Lys 


Ser 


Gly 


Glu 


Leu 


Pro 


Lys 


Asn 


Met 


Phe 


Asp 


Leu 


Leu 










645 










650 










655 




Glu 


Gly 


Val 


Asn 


Ala 


Val 


Leu 


Tyr 


Arg 


Gly 


Asp 


Ser 


Phe 


Leu 


Leu 


Arg 








660 










665 










670 






Ala 


He 


Asn 


Tyr 


Pro 


Cys 


Val 


He 


Gly 


Arg 


Ser 


Met 


Ser 


Pro 


Ser 


Leu 






675 










680 










685 








Glu 


Leu 


Val 


Lys 


Arg 


Lys 


Val 


Asn 


Lys 


He 


Ala 


Leu 


He 


Lys 


Ala 


Phe 




690 










695 










700 










Phe 


His 


Glu 


Lys 


Arg 


Val 


Arg 


Pro 


Asp 


Ala 


Ser 


Lys 


Lys 


Leu 


Leu 


Glu 


705 










710 










715 










720 


Trp 


Ala 


Glu 


Leu 


Leu 


Val 


Lys 


Ser 


Tyr 


Leu 


Met 


Glu 


Val 


Leu 


Leu 


Gin 










725 










730 










735 




Thr 


Pro 


Glu 


Cys 


Val 


He 


His 


Arg 


Ala 


His 


Ser 


Phe 


Val 


Gly 


Lys 


Thr 








740 










745 










750 






Leu 


Leu 


He 


Thr 


Asp 


Glu 


Leu 


Val 


His 


Met 


Arg 


Pro 


Asp 


Asp 


Ala 


Thr 






755 










760 










765 








Arg 


Asn 


Ala 


Tyr 


He 


Gin 


Asn 


Leu 


Asn 


Ala 


Ala 


Arg 


Gin 


Asn 


Ala 


Ala 




770 










775 










780 










Ala 


Ala 


Ala 


Ser 


Phe 


Ser 


Gly 


Ser 


Leu 


Pro 


Lys 


Pro 


Glu 


Phe 


Val 


Pro 


785 










790 










795 










800 


Cys 


Lys 


Glu 


Arg 


Thr 


He 


Glu 


Trp 


Met 


Tyr 


Glu 


Lys 


Asp 


Asn 


Asp 


Asp 










805 










810 










815 




Val 


Arg 


Val 


Val 


Asn 


Cys 


Pro 


Ser 


Cys 


Lys 


Lys 


Ala 


He 


Gin 


Lys 


Tyr 








820 










825 










830 






Gly 


Gly 


Cys 


Val 


Asn 


Val 


Phe 


Cys 


Glu 


Cys 


Gly 


Thr 


Asn 


Met 


Cys 


Trp 






835 










840 










845 








He 


Cys 


Glu 


Glu 


Lys 


Val 


Ser 


Pro 


Ala 


Asp 


Ser 


Asn 


His 


Cys 


Val 


Glu 




850 










855 










860 










Lys 


His 


Arg 


He 


Val 


Tyr 


Ser 


Asn 


Cys 


Val 


Arg 


Val 


Lys 


Tyr 


Ala 


Leu 


865 










870 










875 










880 


Glu 


Ser 


Met 


Tyr 


Gly 


Phe 


Glu 


He 


Cys 


Thr 


Met 


Lys 


Asn 


Val 


Glu 


Glu 










885 










890 










895 




Gly 


Val 


Lys 


Asn 


Tyr 


Tyr 


Val 


Met 


Glu 


Asn 


Gly 


Phe 


Phe 


Phe 


Asp 


Val 








900 










905 










910 




Gin 


Glu 


Met 


Val 


Ala 


Lys 


Lys 





















915 



<210> 124 
<2H> 1536 
<212> DNA 
<213> SHRIMP 



<400> 124 

atggctgaaa 

gattcggaca 

gtccacctgg 

ctgttatgtc 

gaagaggtgt 

gatgaaggtg 

gtccccaatg 

aagttgagtt 

acctttgaag 

gaatatgaag 

ttgagtaatg 

ggtgtgacat 

gaggtagatg 

aacaatatta 

gatgatgatg 



ccgtcgccgt 
cggattgcta 
agtgtctttt 
ccatgtgtag 
acaagtatgc 
taaaggatac 
ccaaaggtgt 
tcatgcttga 
ataatcatga 
gtgaatggct 
atgatgatga 
ccttcaattt 
ataatgatga 
cctttgatac 
atgatgacac 



tgatgaggtg 
taactggtca 
cacctggaga 
ggcctatata 
ctcttttcac 
ccttaataag 
taatgagaat 
aactctatct 
ggcagcttta 
taagtatgta 
tgatgaaggt 
cattgaagat 
ctcagagtca 
tattagtggt 
aatgtctttg 



cctacctgtc 
aatggaggga 
tttgaagagc 
ccccctgtgt 
tcatttttgt 
atgtcaacta 
gaggatgttt 
agacaggaaa 
atgggtaaat 
gctcccaata 
gataataatg 
gatgaggata 
cttcctgact 
attagtagtt 
cctgacttga 



ccatctgtat 
tgccgtgttg 
atatggtgaa 
ggttcttccg 
tgtctgctga 
ttctagcacc 
atatggagag 
tgcatgcatt 
tcaaggatat 
ctattgacat 
tctcccctag 
ctgtagtgtt 
tgactgttcc 
cactttatga 
atatgcctag 



gggtgattac 
caggaaaagt 
tgaaaatcac 
taaagtgtat 
ctatgttaa t 
tactttcttt 
ggcttatacc 
cagtgaagag 
CGccccttat 
tactcaatgc 
tttgttgtct 
tg tcccccca 
ccctagaagt 
tgt taatgat 
tgcttctacg 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 
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tcatccgccc 
tgttttaatg 
aatcagccct 
aggtatgggc 
gtagatgtca 
gccaataggt 
aagcttggag 
ggtaagagtt 
gctaataagc 
gatgacgaaa 
gaagctgatc 



ctacctcatc 
ttgattcaga 
ctacttcctc 
gtgacgagga 
agaggtatat 
atctggctga 
attgtatatc 
tgtgctctaa 
agctaattgg 
ttgcacatgt 
aaatatcttc 



cgcccctacc 
ctctgatgac 
aggaagtagt 
caggatgtct 
gtgtagactc 
actttctgct 
tagaggtaac 
gaaattgaag 
ttcctgtctc 
acacagttca 
cctgcccaat 



tcaacctctc 
gaagaagtaa 
agtagtagta 
aatatttcct 
gataatattg 
cttagagaaa 
ctgttccata 
gtgaagcgta 
attaagtcag 
ttgttgaatg 
ctataa 



ttaacattaa 
taccttcttc 
gtaacagtag 
ctgagagtaa 
atgaggagta 
ggagacagga 
caactgtcaa 
agtacgcttc 
cctctaataa 
ggtttgatac 



tgttaacctt 
gtcatcagtg 
aaagaggcca 
gagactgtgt 
taatgagatt 
aactgagaat 
tgatgtaatt 
aaagtggagt 
tgctaggttg 
tgacccttcg 



960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1536 



<210> 125 
<211> 509 
<212> PRT 
<213> SHRIMP 



<400> 125 
Met Ala Glu Thr 
1 

Met Gly Asp Tyr 
20 

Gly Met Pro Cys 

35 

Trp Arg Phe Glu 
50 

Met Cys Arg Ala 
65 

Glu Glu Val Tyr 

Asp Tyr Val Asn 
100 

Thr lie Leu Ala 
115 

Glu Asn Glu Asp 
130 

Met Leu Glu Thr 
145 

Thr Phe Glu Asp 

lie Pro Pro Tyr 
180 

Asn Thr lie Asp 
195 

Glu Gly Asp Asn 
210 

Phe Asn Phe lie 
225 

Glu Val Asp Asp 

Pro Pro Arg Ser 
260 

Ser Ser Leu Tyr 
275 

Ser Leu Pro Asp 
290 

Thr Ser Ser Ala 
305 

Cys Phe Asn Val 
Ser Ser Ser Val 



Val Ala Val Asp 

5 

Asp Ser Asp Thr 

Cys Arg Lys Ser 
40 

Glu His Met Val 
55 

Tyr He Pro Pro 
70 

Lys Tyr Ala Ser 
85 

Asp Glu Gly Val 

Pro Thr Phe Phe 
120 

Val Tyr Met Glu 
135 

Leu Ser Arg Gin 
150 

Asn His Glu Ala 
165 

Glu Tyr Glu Gly 

He Thr Gin Cys 
200 

Asn Val Ser Pro 
215 

Glu Asp Asp Glu 
230 

Asn Asp Asp Ser 
245 

Asn Asn He Thr 

Asp Val Asn Asp 
280 

Leu Asn Met Pro 
295 

Pro Thr Ser Thr 
310 

Asp Ser Asp Ser 
325 

Asn Gin Pro Ser 



Glu Val Pro Thr 
10 

Asp Cys Tyr Asn 
25 

Val His Leu Glu 

Asn Glu Asn His 
60 

Val Trp Phe Phe 
75 

Phe His Ser Phe 
90 

Lys Asp Thr Leu 
105 

Val Pro Asn Ala 

Arg Ala Tyr Thr 
140 

Glu Met His Ala 
155 

Ala Leu Met Gly 
170 

Glu Trp Leu Lys 
185 

Leu Ser. Asn Asp 

Ser Leu Leu Ser 
220 

Asp Thr Val Val 
235 

Glu Ser Leu Pro 
250 

Phe Asp Thr He 
265 

Asp Asp Asp Asp 

Ser Ala Ser Thr 
300 

Ser Leu Asn He 
315 

Asp Asp Glu Glu 
330 

Thr Ser Ser Gly 



Cys Pro He Cys 
15 

Trp Ser Asn Gly 
30 

Cys Leu Phe Thr 
45 

Leu Leu Cys Pro 

Arg Lys Val Tyr 
80 

Leu Leu Ser Ala 
95 

Asn Lys Met Ser 
110 

Lys Gly Val Asn 
125 

Lys Leu Ser Phe 

Phe Ser Glu Glu 
160 

Lys Phe Lys Asp 
175 

Tyr Val Ala Pro 
190 

Asp Asp Asp Asp 
205 

Gly Val Thr Ser 

Phe Val Pro Pro 
240 

Asp Leu Thr Val 
255 

Ser Gly He Ser 
270 

Asp Asp Thr Met 
285 

Ser Ser Ala Pro 

Asn Val Asn Leu 
320 

Val He Pro Ser 
335 

Ser Ser Ser Ser 
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340 










345 










350 






Ser 


Ser 


Asn 


Ser 


Arg 


Lys 


Arg 


Pro 


Arg 


Tyr 


Gly 


Arg 


Asp 


Glu 


Asp 


Arg 






355 










360 










365 








Met 


Ser 


Asn 


lie 


Ser 


Ser 


Glu 


Ser 


Lys 


Arg 


Leu 


Cys 


Val 


Asp 


Val 


Lys 




370 










375 










380 










Arg 


Tyr 


Met 


Cys 


Arg 


Leu 


Asp 


Asn 


He 


Asp 


Glu 


Glu 


Tyr 


Asn 


Glu 


He 


385 










390 










395 










400 


Ala 


Asn 


Arg 


Tyr 


Leu 


Ala 


Glu 


Leu 


Ser 


Ala 


Leu 


Arg 


Glu 


Arg 


Arg 


Gin 










405 










410 










415 




Glu 


Thr 


Glu 


Asn 


Lys 


Leu 


Gly 


Asp 


Cys 


He 


Ser 


Arg 


Gly 


Asn 


Leu 


Phe 








420 










425 










430 






His 


Thr 


Thr 


Val 


Asn 


Asp 


Val 


He 


Gly 


Lys 


Ser 


Leu 


Cys 


Ser 


Lys 


Lys 






435 










440 










445 








Leu 


Lys 


Val 


Lys 


Arg 


Lys 


Tyr 


Ala 


Ser 


Lys 


Trp 


Ser 


Ala 


Asn 


Lys 


Gin 




450 










455 










460 










Leu 


lie 


Gly 


Ser 


Cys 


Leu 


He 


Lys 


Ser 


Asn 


Asn 


Ala 


Arg 


Leu 


Asp 


Asp 


465 










470 










475 










480 


Glu 


lie 


Ala 


His 


Val 


His 


Ser 


Ser 


Leu 


Leu 


Asn 


Gly 


Phe 


Asp 


Thr 


Asp 










485 










4 90 










495 




Pro 


Ser 


Glu 


Ala 


Asp 


Gin 


He 


Ser 


Ser 


Leu 


Pro 


Asn 


Leu 









500 505 



<210> 126 

<211> 1746 

<212> DNA 

<213> SHRIMP 



<400> 126 

atggggggac 

cagtatgttt 

tacgcgtctg 

aacatcacta 

aaacgtttca 

gtgtgtgcgt 

gattcccctt 

acctgtcaac 

gatacctgtt 

gatgaagaga 

attccgtcct 

tcaagtgcct 

cagtgtgaga 

gaagtgcctg 

gagattaaat 

gatatgaact 

aatattaagg 

ccatttgcag 

ggggagtatt 

atgggacgtt 

attgcccgaa 

ttggctgaga 

ataaaggctg 

ggactctcat 

ctaatcaggc 

tattcaatga 

tgttctaaca 

ttgcaaagcg 

cttgccatga 

aaatga 



ccactgtaat 
accatcaggg 
gttcaacctc 
tcaagacttg 
agaagtctga 
ccaaggtgat 
actactttaa 
agttgagaat 
tctatggaaa 
tgagattttt 
cccagattat 
gtgagcgaat 
actgggaaaa 
aagatgtgaa 
atggggtcta 
atgaagagtt 
gggtagttgg 
gta tcagtat 
atgagtcaag 
taccagaggt 
^ggttgtaag 
cttgtaatgt 
ataactttgt 
accctctagg 
agtacattca 
cgtacagttt 
tggaaggccc 
gaacagacac 
agcacatatt 



tactactacc 
gaataaaaaa 
cgaagccaca 
ggatgacgtc 
agttgctccc 
taggcagtgt 
aggcattgag 
tggccatagg 
ggaccctaaa 
tgacactaga 
ggacataatg 
tagacaaaag 
cttccgtaag 
ggactttaac 
ttacattgtg 
tgtttttgag 
ggtgactggt 
tgatgatgtc 
ggacgcacgc 
aaagggggtt 
aagtatgttt 
aattacaaga 
tatagatagc 
ttattgttac 
cacacctccc 
cagtgtaatg 
tgcctttaat 
tgatttctat 
cccttttaag 



atcaatactg 
cggcctgtgg 
actgttcccg 
atcaacctta 
tctcctccca 
aagaggcagt 
aagagttgta 
tcaattgtta 
gttggcttct 
cacattctta 
tatatggctg 
actagcacgc 
actgctcgtg 
actttcatct 
aataggggga 
tgtgttaatg 
gtgtgtcctc 
attaggtgtg 
ttgttgtatg 
gatacagtcg 
gaaaaactaa 
gttgcaaatg 
atatctggcc 
aacgatgaat 
gagttcttta 
gcttccagta 
ttgatgtcaa 
caaaatcgcc 
gggactgtaa 



gtggagacca 
aagaatataa 
cttacaacaa 
gcatcacgcc 
ctactcgcac 
ataatgagtg 
gtcttgagga 
agtctagcaa 
attggcccac 

^gg^gttgtc 

tagaggtgtt 
taattaagga 
gttgtttgtc 
gtccctggga 
ctgttgtcaa 
gcctttctgt 
aggggttatg 
tcaaggatag 
gggttgtcat 
caccaataac 
aggtgaacat 
agggaattat 
aacctaaaat 
attttaggaa 
ggggacactg 
tactggaaga 
acatgcactt 
cttcaatcac 
tgaacctgtt 



ccaccaccag 
caacaacaac 
caacaacaac 
ccctccccct 
cttttcaaac 
gattgaacgt 
caattatgat 
gtatgtccat 
ctcttcttgc 
tagtcgtaat 
ccaattgcct 
agtttctgac 
tgatttggtc 
gacctttttt 
gtttatgaag 
atacagaaag 
tttagagatg 
tttagatggt 
gcttcaaagg 
agactctttc 
gccttttgtt 
taatgtcgat 
gattgacttg 
cacggaagaa 
tctaggtgcc 
tgttgttgct 
tttgatgttg 
agaatatgcc 
taaagtaaag 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1746 



<210> 127 
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<211> 575 
<212> PRT 
<213> SHRIMP 



<400> 127 



Met 


Gly 


Gly 


Pro 


Thr 


Val 


He 


Thr 


Thr 


Thr 


He 


Asn 


Thr 


Gly 


Gly 


Asp 


1 








5 










10 










15 




His 


His 


His 


Gin 


Gin 


Tyr 


Val 


Tyr 


His 


Gin 


Gly 


Asn 


Lys 


Lys 


Arg 


Pro 








20 










25 










30 






Val 


Glu 


Glu 


Tyr 


Asn 


Asn 


Asn 


Asn 


Tyr 


Ala 


Ser 


Gly 


Ser 


Thr 


Ser 


Glu 






35 










40 










45 








Ala 


Thr 


Thr 


Val 


Pro 


Ala 


Tyr 


Asn 


Asn 


Asn 


Asn 


Asn 


Asn 


He 


Thr 


He 




50 










55 










60 










Lys 


Thr 


Trp 


Asp 


Asp 


Val 


He 


Asn 


Leu 


Ser 


He 


Thr 


Pro 


Pro 


Pro 


Pro 


65 










70 










75 










80 


Lys 


Arg 


Phe 


Lys 


Lys 


Ser 


Glu 


Val 


Ala 


Pro 


Ser 


Pro 


Pro 


Thr 


Thr 


Arg 










85 










90 










95 




Thr 


Phe 


Ser 


Asn 


Val 


Cys 


Ala 


Ser 


Lys 


Val 


He 


Arg 


Gin 


Cys 


Lys 


Arg 








100 










105 










110 






Gin 


Tyr 


Asn 


Glu 


Trp 


He 


Glu 


Arg 


Asp 


Ser 


Pro 


Tyr 


Tyr 


Phe 


Lys 


Gly 






115 










120 










125 








He 


Glu 


Lys 


Ser 


Cys 


Ser 


Leu 


Glu 


Asp 


Asn 


Tyr 


Asp 


Thr 


Cys 


Gin 


Gin 




130 










135 










140 










Leu 


Arg 


He 


Gly 


His 


Arg 


Ser 


He 


Val 


Lys 


Ser 


Ser 


Lys 


Tyr 


Val 


His 


145 










150 










155 










160 


Asp 


Thr 


Cys 


Phe 


Tyr 


Gly 


Lys 


Asp 


Pro 


Lys 


Val 


Gly 


Phe 


Tyr 


Trp 


Pro 










165 










170 










175 




Thr 


Ser 


Ser 


Cys 


Asp 


Glu 


Glu 


Met 


Arg 


Phe 


Phe 


Asp 


Thr 


Arg 


His 


He 








180 










185 










190 






Leu 


Lys 


Glu 


Leu 


Ser 


Ser 


Arg 


Asn 


He 


Pro 


Ser 


Ser 


Gin 


He 


Met 


Asp 






195 










200 










205 








He 


Met 


Tyr 


Met 


Ala 


Val 


Glu 


Val 


Phe 


Gin 


Leu 


Pro 


Ser 


Ser 


Ala 


Cys 




210 










215 










220 








Glu 


Arg 


He 


Arg 


Gin 


Lys 


Thr 


Ser 


Thr 


Leu 


He 


Lys 


Glu 


Val 


Ser 


Asp 


225 










230 










235 










240 


Gin 


Cys 


Glu 


Asn 


Trp 


Glu 


Asn 


Phe 


Arg 


Lys 


Thr 


Arg 


Cys 


Leu 


Ser 


Asp 










245 










250 










255 




Leu 


Val 


Glu 


Val 


Pro 


Glu 


Asp 


Val 


Lys 


Asp 


Phe 


Asn 


Thr 


Phe 


He 


Cys 








260 










265 










270 






Pro 


Trp 


Glu 


Thr 


Phe 


Phe 


Glu 


He 


Lys 


Tyr 


Gly 


Val 


Tyr 


Tyr 


He 


Val 






275 










280 










285 








Asn 


Arg 


Gly 


Thr 


Val 


Val 


Lys 


Phe 


Met 


Lys 


Asp 


Met 


Asn 


Tyr 


Glu 


Glu 




290 










295 










300 










Phe 


Val 


Phe 


Glu 


Cys 


Val 


Asn 


Gly 


Leu 


Ser 


Val 


Tyr 


Arg 


Lys 


Asn 


He 


305 










310 










315 










320 


Lys 


Gly 


Val 


Val 


Gly 


Val 


Thr 


Gly 


Val 


Cys 


Pro 


Gin 


Gly 


Leu 


Cys 


Leu 










325 










330 










335 




Glu 


Met 


Pro 


Phe 


Ala 


Gly 


He 


Ser 


He 


Asp 


Asp 


Val 


He 


Arg 


Cys 


Val 








340 










345 










350 






Lys 


Asp 


Ser 


Leu 


Asp 


Gly 


Gly 


Glu 


Tyr 


Tyr 


Glu 


Ser 


Arg 


Asp 


Ala 


Arg 






355 










360 










365 








Leu 


Leu 


Tyr 


Gly 


Val 


Val 


Met 


Leu 


Gin 


Arg 


Met 


Gly 


Arg 


Leu 


Pro 


Glu 




370 










37 5 










380 










Val 


Lys 


Gly 


Val 


Asp 


Thr 


Val 


Ala 


Pro 


He 


Thr 


Asp 


Ser 


Phe 


He 


Ala 


385 










390 










395 










400 


Arg 


Lys 


Val 


Val 


Arg 


Ser 


Met 


Phe 


Glu 


Lys 


Leu 


Lys^ Val 


Asn 


Met 


Pro 










405 










410 










415 




Phe 


Val 


Leu 


Ala 


Glu 


Thr 


Cys 


Asn 


Val 


He 


Thr 


Arg 


Val 


Ala 


Asn 


Glu 








420 










425 










430 






Gly 


He 


He 


Asn 


Val 


Asp 


He 


Lys 


Ala 


Asp 


Asn 


Phe 


Val 


He 


Asp 


Ser 






435 










440 










445 
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lie 


Ser 


Gly 


Gin 


Pro 


Lys 


Met 


lie 


Asp 


Leu 


Gly 


Leu 


Ser 


Tyr 


Pro 


Leu 




450 










455 










460 










Gly 


Tyr 


Cys 


Tyr 


Asn 


Asp 


Glu 


Tyr 


Phe 


Arg 


Asn 


Thr 


Glu 


Glu 


Leu 


He 


465 










470 










475 










480 


Arg 


Gin 


Tyr 


lie 


His 


Thr 


Pro 


Pro 


Glu 


Phe 


Phe 


Arg 


Gly 


His 


Cys 


Leu 










485 










490 










495 




Gly 


Ala 


Tyr 


Ser 


Met 


Thr 


Tyr 


Ser 


Phe 


Ser 


Val 


Met 


Ala 


Ser 


Ser 


He 








500 










505 










510 






Asp 


Val 


Val 


Ala 


Cys 


Ser 


Asn 


Met 


Glu 


Gly 


Pro 


Ala 


Phe 


Asn 


Leu 


Met 






515 










520 










525 








Ser 


Asn 


Met 


His 


Phe 


Leu 


Met 


Leu 


Leu 


Gin 


Ser 


Gly 


Thr 


Asp 


Thr 


Asp 




530 










535 










540 










Phe 


Tyr 


Gin 


Asn 


Arg 


Pro 


Ser 


He 


Thr 


Glu 


Tyr 


Ala 


Met 


Lys 


His 


He 


545 










550 










555 










560 


Phe 


Pro 


Phe 


Lys 


Gly 


Thr 


Val 


Met 


Asn 


Leu 


Phe 


Lys 


Val 


Lys 


Lys 





565 570 575 



<210> 128 
<211> 1200 
<212> DNA 
<213> SHRIMP 



<400> 128 

atgcttagca 

attcactttt 

gaagaattaa 

ccaatcactg 

aagagtagtc 

aaaggaggac 

ttaattgaaa 

gcgaattgcc 

gatgaaaagt 

acggctggtg 

tctgatacta 

tctgatatta 

aaaatgagac 

aaagaaggag 

tcgggggaag 

tctctagtga 

tcacctatgc 

ccttttgccc 

cctggctcag 

ggggcaagag 



cgtgtgatct 
ctgaatcaat 
acaaggattt 
aaacgttgaa 
tcctgactct 
aggaaatggc 
ctgcaaagga 
gggatagtaa 
atgttaagca 
caaataaacc 
attatacagc 
catatgaagt 
tagcaacagg 
gaagagtaga 
aaatgcagat 
cagagggtgt 
cagctagtcc 
cttcagctga 
aatgggttca 
ggtctggtat 



aaagcaccct 
tccagctaat 
actggatagt 
aaattgtata 
acaagaaaaa 
actgtgtgcg 
ggcctcaatg 
gtgttcggcc 
aaactcttca 
acccaaagag 
catgagtgat 
aacaaagcaa 
cgtggaaatg 
agtagaggta 
ggaaattatg 
ggatgattat 
acttccttct 
tattgttgtt 
ccagagagac 
aaatactgga 



tcttcaacag 
gatataattt 
gtaagaaatc 
actactcaaa 
tttaatgaat 
agccttaaaa 
gataataaaa 
agtttagtta 
tcagcctcgg 
tttataacaa 
aaccttattt 
gctaaagaac 
gtaatgaagg 
gaaggagtag 
ttgcctacac 
cctgtgtttt 
aatgggaata 
gataaaacat 
aggaatagta 
agatatagaa 



acggaaatgt 
ctttcccttc 
agatcaaatt 
cacttttaca 
ggggatcaat 
tcatgggcca 
agaagaataa 
cattatttaa 
ctcttctggc 
aagataatgc 
gtccaggcaa 
gcataaaaaa 
aactagaagc 
aacaacagca 
ctcctccccc 
caccactccc 
gtgcactgga 
cagaaattat 
aaatggagat 
gaaataacac 



acttaaaaat 
ctctgacacg 
cgggttcgac 
ctcttttctc 
tcaactagag 
aatatcagct 
taatgcatgt 
taaaactatc 
aaatactttc 
acatggcaat 
atactactca 
taacaataag 
agaaaataat 
accgtccacg 
ggatctagaa 
ttcattattg 
agatgggggg 
gggcagaaca 
acgaaactat 
tgttctatag 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 



<210> 129 
<211> 395 
<212> PRT 
<213> SHRIMP 



<400> 129 

Met Leu Ser Thr Cys Asp Leu Lys His Pro Ser Ser Thr Asp Gly Asn 

15 10 15 

Val Leu Lys Asn He His Phe Ser Glu Ser He Pro Ala Asn Asp He 

20 25 30 

He Ser Phe Pro Ser Ser Asp Thr Glu Glu Leu Asn Lys Asp Leu Leu 

35 40 45 

Asp Ser Val Arg Asn Gin He Lys Phe Gly Phe Asp Pro He Thr Glu 

50 55 60 

Thr Leu Lys Asn Cys He Thr Thr Gin Thr Leu Leu His Ser Phe Leu 
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65 










70 










75 










80 


Lys 


Ser 


Ser 


Leu 


Leu 


Thr 


Leu 


Gin 


Glu 


Lys 


Phe 


Asn 


Glu 


Trp 


Gly 


Ser 










85 










90 










95 




lie 


Gin 


Leu 


Glu 


Lys 


Gly 


Gly 


Gin 


Glu 


Met 


Ala 


Leu 


Cys 


Ala 


Ser 


Leu 








100 










105 










110 






Lys 


He 


Met 


Gly 


Gin 


He 


Ser 


Ala 


Leu 


He 


Glu 


Thr 


Ala 


Lys 


Glu 


Ala 






115 










120 










125 








Ser 


Met 


Asp 


Asn 


Lys 


Lys 


Lys 


Asn 


Asn 


Asn 


Ala 


Cys 


Ala 


Asn 


Cys 


Arg 




130 










135 










140 










Asp 


Ser 


Lys 


Cys 


Ser 


Ala 


Ser 


Leu 


Val 


Thr 


Leu 


Phe 


Asn 


Lys 


Thr 


He 


145 










150 










155 










160 


Asp 


Glu 


Lys 


Tyr 


Val 


Lys 


Gin 


Asn 


Ser 


Ser 


Ser 


Ala 


Ser 


Ala 


Leu 


Leu 










165 










170 










175 




Ala 


Asn 


Thr 


Phe 


Thr 


Ala 


Gly 


Ala 


Asn 


Lys 


Pro 


Pro 


Lys 


Glu 


Phe 


He 








180 










185 










190 






Thr 


Lys 


Asp 


Asn 


Ala 


His 


Gly Asn 


Ser 


Asp 


Thr 


Asn 


Tyr 


Thr 


Ala 


Met 






195 










200 










205 








Ser 


Asp 


Asn 


Leu 


He 


Cys 


Pro 


Gly 


Lys 


Tyr 


Tyr 


Ser 


Ser 


Asp 


He 


Thr 




210 










215 










220 










Tyr 


Glu 


Val 


Thr 


Lys 


Gin 


Ala 


Lys 


Glu 


Arg 


He 


Lys 


Asn 


Asn 


Asn 


Lys 


225 










230 










235 










240 


Lys 


Met 


Arg 


Leu 


Ala 


Thr 


Gly Val 


Glu 


Met 


Val 


Met 


Lys 


Glu 


Leu 


Glu 










245 










250 










255 




Ala 


Glu 


Asn 


Asn 


Lys 


Glu 


Gly 


Gly 


Arg 


Val 


Glu 


Val 


Glu 


Val 


Glu 


Gly 








260 










265 










270 






Val 


Glu 


Gin 


Gin 


Gin 


Pro 


Ser 


Thr 


Ser 


Gly 


Glu 


Glu 


Met 


Gin 


Met 


Glu 






275 










280 










285 








He 


Met 


Leu 


Pro 


Thr 


Pro 


Pro 


Pro 


Pro 


Asp 


Leu 


Glu 


Ser 


Leu 


Val 


Thr 




290 










295 










300 










Glu 


Gly 


Val 


Asp 


Asp 


Tyr 


Pro 


Val 


Phe 


Ser 


Pro 


Leu 


Pro 


Ser 


Leu 


Leu 


305 










310 










315 










320 


Ser 


Pro 


Met 


Pro 


Asp 


Leu 


Pro 


Ser 


Asn 


Gly 


Asn 


Ser 


Ala 


Leu 


Glu 


Asp 










325 










330 










335 




Gly 


Gly 


Pro 


Phe 


Ala 


Pro 


Ser 


Ala 


Asp 


He 


Val 


Val 


Asp 


Lys 


Thr 


Ser 








340 










345 










350 






Glu 


He 


Met 


Gly 


Arg 


Thr 


Pro 


Gly 


Ser 


Glu 


Trp 


Val 


His 


Gin 


Arg 


Asp 






355 










360 










365 








Arg 


Asn 


Ser 


Lys 


Met 


Glu 


He 


Arg 


Asn 


Tyr 


Gly 


Arg 


Ser 


Gly 


He 


Asn 




370 










375 










380 










Thr 


Gly 


Arg 


Tyr 


Arg 


Arg 


Asn 


Asn 


Thr 


Val 


Leu 













385 390 395 



<210> 130 
<211> 1233 
<212> DNA 
<213> SHRIMP 



<400> 130 

atggaagaac 

tttagtgata 

gacaagaaat 

caactctcca 

aaagcgttaa 

atttgcctct 

aatatgatgt 

ctagggagag 

ggctacaaca 

actcaaccac 

atgtttgaag 

caagttgat t 



atctatcctt 
atacttctat 
tttcctccaa 
ccaaagtaaa 
agattgttgt 
atccagaaaa 
taatggacaa 
ggtttaatag 
tattgaatgg 
aggtgcaaag 
tggtgaaaca 
ttaaaggtga 



caacaaaccc 
gtctaacatg 
ggctctactc 
caaggcaata 
tactggactt 
acacaccttg 
ggcagaagac 
agaggctttc 
ataccaccca 
aagatgcgca 
attttcaatc 
agaaatgaaa 



tctccagaaa 
gtagacaata 
ttggcctcta 
ttttctcata 
tatgtagatg 
aatgggattc 
gcagacgaaa 
aagtttgtgg 
gataatggcc 
gtcaagcaaa 
aaacacaaca 
atgttctttg 



atggagtagt 
tccggcacag 
cgcccattcc 
gagaaactat 
gggaatatgt 
tgagatatgt 
ttcgttgtgg 
acccagttat 
accaaatctc 
tgtacaaaca 
atagaatctt 
ctctctacag 



cttctttgac 
acttcctatg 
cagtgacgaa 
tgttttatct 
ggatgatgtt 
tgtgcatcta 
acttattcct 
cccttgtgcc 
cccttcatct 
aatcaatggc 
cacta taaat 
cgaagagt ta 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 
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cttcctttct 
tctcagttgc 
aatactctga 
ggggataatt 
ggagatgatg 
agaaatgttg 
gagggtgttg 
agggaagaaa 
tatgtaataa 



attcagaaac 
ctccacatgt 
tgaaaacaga 
ttgagttgtt 
cactcgattt 
aactcgctga 
ttggtgaatg 
ttgtattcgg 
aacaaccagt 



gggaaaatta 
gaccatttct 
ttttgggagt 
tgggatgaac 
gcgtagacga 
caacagactg 
ggtgagcaga 
aggtacaaaa 
tgattatgta 



ttatctgaga 
gttttctatt 
tgttttgcac 
aacaacattc 
ataatggaac 
aatcctcaca 
ttcgctccgt 
ttcatttttg 
tga 



aacacgtctc 
tgcgaaatat 
cagctatcaa 
tcgtatccaa 
acatttcaga 
tcactcatgg 
gcaatttcct 
gaagggtcag 



caaatcattt 
ggaagaatac 
aatcgacact 
agtatgtgtt 
tgcaataggg 
gaagat taat 
ttgtaaaccg 
taatggaaac 



780 

840 

900 

960 

1020 

1080 

1140 

1200 

1233 



<210> 131 
<211> 404 
<212> PRT 
<213> SHRIMP 



<400> 131 
Met Glu Glu His 
1 

Val Phe Phe Asp 
20 

Asn lie Arg His 

35 

Leu Leu Leu Ala 
50 

Lys Val Asn Lys 
65 

Lys Ala Leu Lys 

Asp Val lie Cys 
100 

Arg Tyr Val Val 
115 

Ala Asp Glu lie 
130 

Arg Glu Ala Phe 
145 

Asn lie Leu Asn 

Ser Ser Thr Gin 
180 

Tyr Lys Gin lie 
195 

Lys His Asn Asn 
210 

Glu Glu Met Lys 
225 

Ser Glu Thr Gly 

Ser Gin Leu Pro 
260 

Met Glu Glu Tyr 
275 

Ala Pro Ala lie 
290 

Met Asn Asn Asn 
305 

Leu Asp Leu Arg 

Val Glu Leu Ala 
340 

lie Asn Glu Gly 



Leu Ser Phe Asn 
5 

Phe Ser Asp Asn 

Arg Leu Pro Met 
40 

Ser Thr Pro lie 
55 

Ala lie Phe Ser 
70 

lie Val Val Thr 
85 

Leu Tyr Pro Glu 

His Leu Asn Met 
120 

Arg Cys Gly Leu 

135 

Lys Phe Val Asp 
150 

Gly Tyr His Pro 
165 

Pro Gin Val Gin 

Asn Gly Met Phe 
200 

Arg lie Phe Thr 
215 

Met Phe Phe Ala 
230 

Lys Leu Leu Ser 
245 

Pro His Val Thr 

Asn Thr Leu Met 
280 

Lys He Asp Thr 
295 

He Leu Val Ser 
310 

Arg Arg He Met 

325 

Asp Asn Arg Leu 

Val Val Gly Glu 



Lys 


Pro 


Ser 


Pro 




10 






Thr 


Ser 


Met 


Ser 


25 








Asp 


Lys 


Lys 


Phe 


Pro 


Ser 


Asp 


Glu 








60 


His 


Arg 


Glu 


Thr 






75 




Gly 


Val 


Asp 


Gly 




90 






Lys 


His 


Thr 


Leu 


105 








Met 


Leu 


Met 


Asp 


He 


Pro 


Leu 


Gly 








140 


Pro 


Val 


He 


Pro 






155 




Asp 


Asn 


Gly 


His 




170 






Arg 


Arg 


Cys 


Ala 


185 








Glu 


Val 


Val 


Lys 


He 


Asn 


Gin 


Val 








220 


Lys 


Glu 


Glu 


Leu 






235 




Glu 


Lys 


His 


Val 




250 






He 


Ser 


Val 


Phe 


265 








Lys 


Thr 


Asp 


Phe 


Gly 


Asp 


Asn 


Phe 








300 


Lys 


Val 


Cys 


Val 






315 




Glu 


His 


Asp 


Ala 




330 






Asn 


Pro 


His 


He 


345 








Trp 


Val 


Ser 


Arg 



Glu 


Asn 


Gly 


Val 






15 




Asn 


Met 


Val 


Asp 




30 






Ser 


Ser 


Lys 


Ala 


45 








Gin 


Leu 


Ser 


Thr 


He 


Val 


Leu 


Ser 








80 


Glu 


Tyr 


Val 


Asp 






95 




Asn 


Gly 


He 


Leu 




110 






Lys 


Ala 


Glu 


Asp 


125 








Arg 


Gly 


Phe 


Asn 


Cys 


Ala 


Gly 


Tyr 








160 


Gin 


He 


Ser 


Pro 






175 




Val 


Lys 


Gin 


Met 




190 






Gin 


Phe 


Ser 


He 


205 








Asp 


Phe 


Lys 


Gly 


Leu 


Pro 


Phe 


Tyr 








240 


Ser 


Lys 


Ser 


Phe 






255 




Tyr 


Leu 


Arg 


Asn 




270 






Gly 


Ser 


Cys 


Phe 


285 








Glu 


Leu 


Phe 


Gly 


Gly 


Asp 


Asp 


Ala 








320 


He 


Gly 


Arg 


Asn 






335 




Thr 


His 


Gly 


Lys 




350 






Phe 


Ala 


Pro 


Cys 
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355 360 
Asn Phe Leu Cys Lys Pro Arg Glu 

370 375 
Phe lie Phe Gly Arg Val Ser Asn 
385 390 
Val Asp Tyr Val 



365 

Glu lie Val Phe Gly Gly Thr Lys 
380 

Gly Asn Tyr Val lie Lys Gin Pro 
395 AOO 



<210> 132 
<211> 2839 
<212> DNA 
<213> SHRIMP 



<400> 132 

atggattcaa 

aatgaaaacc 

agaactgaca 

agtttccttc 

caatcagccc 

aggatagact 

tactttcttg 

aagaaaatat 

tacaatgaag 

ggtttggtgc 

atgacggatt 

aacaagttgt 

gtgcctcgta 

tttagattcc 

tttggaggtg 

agtgcatgta 

aacgacaaca 

ccctgtgaac 

attgttaaag 

aggacaccta 

atttctaaac 

tgggttgctc 

gagatccttc 

atggttgttc 

gcagttcatt 

gacatccatg 

ctgtttaaga 

gaggacaatg 

aggagaagaa 

acgttcagat 

acccaagtgt 

gctgcggtta 

aaaagacatc 

atatctctag 

ataatttcaa 

atgaccaagg 

tactcgttgt 

ctacgtatat 

tcaaaatcat 

cctatacctt 

actgcaaata 

tttaaagagg 

gttggagaag 

ctagtgtgta 

aagcctagaa 

agcacgactg 



cttctactac 
tgacatggga 
tgagactgaa 
aagatttctt 
tgtaccacat 
actacaacaa 
tcgatgtgaa 
ggcaacgttt 
tgagcaaatt 
atctttcgga 
ttgactttta 
cgggtatgtt 
gattggaaag 
ctagcaacca 
ccttttcatg 
gagaaaacgt 
aaactaaaaa 
tccaatatgc 
cagtgaaatt 
tcgccgaaca 
ttttgggacg 
acaagtataa 
tgaagaaatt 
cagtaatagg 
gcattgagaa 
aagaccctaa 
acaacgacgt 
tacaagactt 
aaatccacga 
ttggcgtctg 
ctcttcagct 
attggaccag 
aaaattcact 
cctctaaatt 
aattcaatgc 
ctggtttggt 
gtggaggatt 
cagctgaggc 
acaacgaaga 
tatttggcgt 
aatgtaatga 
cattagtttc 
atgaggatta 
cactgtatga 
agcatgatcc 
atttttcccc 



aactatagaa 
attcgtagac 
aacatcatcc 
cactgctcgt 
gttggatgga 
gaagatggag 
gacaatagaa 
atatgctgaa 
attgcccgat 
tactcttaaa 
tgaaaaatac 
tagtgagaat 
gtactttaac 
catcagggga 
tgtacaactt 
actggatact 
ttcatcctcc 
aagaataaac 
tgtagcagat 
cgaa tcagat 
aaaggtgtct 
acagagtttt 
gacagttgct 
ctttggttca 
tgttataggg 
aaagttggaa 
gatgaaacga 
cttggaagaa 
agaggaatac 
tggctt tcaa 
actaaaacaa 
attacttcaa 
tccttggaat 
cataaagaga 
agatattctt 
catagaggat 
taataaccag 
attaaaaaat 
caggccattt 
ggaaccttta 
tagtgtatca 
tcatacaatt 
ttgtgataac 
tatgatgggg 
tacacca tgg 
caactctgtt 



gcagaaaagg 
agagttatac 
agaagactgt 
gatggagtaa 
gttgcttcca 
agattggcct 
tctagacacg 
aaaattgctc 
gaagcaatgg 
aatgcaaaaa 
ataaggtcag 
tttgaaatcc 
gttgaaacga 
ttaatatttg 
tacctgctcg 
cccttctcaa 
aatgaagata 
tctaatgaca 
agagtggaga 
ggctacatgg 
gcttcttatg 
acgaacgatg 
tgtgggctca 
gggatgacaa 
tcttttatct 
gaaatgtccc 
gggcaagatg 
ttaaagacaa 
acaaaatctc 
caccctcttc 
cgtcaaacat 
tttttgttcc 
aggctaggtt 
tctgtgcact 
cctctcggta 
catgcgaggg 
acaacagagc 
gcaagaaatt 
ttacctcgca 
caccctttac 
gatttctggt 
ctaacagatt 
aataagagtg 
agagccaact 
agcagtaaaa 
attgttctat 



cactcctaaa 
ggcacgagaa 
tttcgttcat 
atagtgacga 
taatatcttg 
gcaccagtat 
ttgaactact 
cagaaaaagt 
caaactacaa 
agcccccaac 
atattgtatt 
ttccagacat 
attatagtct 
cttatttcat 
gtttcactct 
aattgaaaca 
atgacgggga 
aaaatgcgtg 
aagcatccgt 
cagactggct 
ccttgttatt 
ttaatggaag 
catacaacca 
acagaaaatt 
cttcggggaa 
tgatgcagct 
ggaaggtaac 
aggagtttgt 
tacacaccaa 
cagctagtag 
tcgtccagag 
cttctgacga 
ccaatttaaa 
gtgaacgtgt 
aagacccaga 
agaatatcga 
aaaagttaaa 
gtgttcttgc 
cggatgaggc 
tcaattcatt 
tagaagagag 
ccagtgtgta 
gaaaacgcat 
ataatggtct 
atactggcca 
tagacactga 



agaatacgta 
acttatgcag 
ctctatatac 
gtggtgtacc 
cttcaggaag 
acgtgaagga 
agatcctgat 
tgtcgatgca 
ctacagaact 
tgatttaacg 
gggaaaatca 
aaacattaaa 
ggagcacaac 
tggtaatatt 
ttctgcggct 
gtatataaag 
agaatattac 
cagaaaaagt 
cacaatgatg 
ctcccttcaa 
tatagtgaac 
tgaaaaatac 
caagtgcggc 
gagacagtat 
gaggaagaag 
ttctgcgcgc 
gttcgcgaat 
gcccaatgaa 
tctaaagatg 
tgacaagcca 
agaaactgca 
gagggataat 
tagacatttc 
tgtcaatgat 
ccattttcta 
taatgccatg 
cagtatacgt 
aacaaccttt 
aaagtttgtg 
tattgataat 
tgacgatatt 
ctcaactctt 
tggtaataca 
acattcagac 
gtctggacgt 
gaatgttgca 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 
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gatgactatg aagatgagga ggaagattat gaagctttga aacaatcaga gagggataat 2820 
gtcatcacac taaataacg 2839 

<210> 133 
<211> 945 
<212> PRT 
<213> SHRIMP 

<220> 

<221> VARIANT 

<222> (1) , . . (945) 

<223> Xaa = Any Amino Acid 



<400> 133 



Met 


Asp 


Ser 


Thr 


Ser 


Thr 


Thr 


Thr 


He 


Glu 


Ala 


Glu 


Lys 


Ala 


Leu 


Leu 


1 








5 










10 










15 




Lys 


Glu 


Tyr 


Val 


Asn 


Glu 


Asn 


Leu 


Thr 


Trp 


Glu 


Phe 


Val 


Asp 


Arg 


Val 








20 










25 










30 






He 


Arg 


His 


Glu 


Lys 


Leu 


Met 


Gin 


Arg 


Thr 


Asp 


Met 


Arg 


Leu 


Lys 


Thr 






35 










40 










45 








Ser 


Ser 


Arg 


Arg 


Leu 


Phe 


Ser 


Phe 


He 


Ser 


He 


Tyr 


Ser 


Phe 


Leu 


Gin 




50 










55 










60 










Asp 


Phe 


Phe 


Thr 


Ala 


Arg 


Asp 


Gly 


Val 


Asn 


Ser 


Asp 


Glu 


Trp 


Cys 


Thr 


65 










70 










75 










80 


Gin 


Ser 


Ala 


Leu 


Tyr 


His 


Met 


Leu 


Asp 


Gly 


Val 


Ala 


Ser 


He 


He 


Ser 










85 










90 










95 




Cys 


Phe 


Arg 


Lys 


Arg 


He 


Asp 


Tyr 


Tyr 


Asn 


Lys 


Lys 


Met 


Glu 


Arg 


Leu 








100 










105 










110 






Ala 


Cys 


Thr 


Ser 


He 


Arg 


Glu 


Gly 


Tyr 


Phe 


Leu 


Val 


Asp 


Val 


Lys 


Thr 






115 










120 










125 








He 


Glu 


Ser 


Arg 


His 


Val 


Glu 


Leu 


Leu 


Asp 


Pro 


Asp 


Lys 


Lys 


He 


Trp 




130 










135 










140 










Gin 


Arg 


Leu 


Tyr 


Ala 


Glu 


Lys 


He 


Ala 


Pro 


Glu 


Lys 


Val 


Val 


Asp 


Ala 


145 










150 










155 










160 


Tyr 


Asn 


Glu 


Val 


Ser 


Lys 


Leu 


Leu 


Pro 


Asp 


Glu 


Ala 


Met 


Ala 


Asn 


Tyr 










165 










170 










175 




Asn 


Tyr 


Arg 


Thr 


Gly 


Leu 


Val 


His 


Leu 


Ser 


Asp 


Thr 


Leu 


Lys 


Asn 


Ala 








180 










185 










190 






Lys 


Lys 


Pro 


Pro 


Thr 


Asp 


Leu 


Thr 


Met 


Thr 


Asp 


Phe 


Asp 


Phe 


Tyr 


Glu 






195 










200 










205 








Lys 


Tyr 


He 


Arg 


Ser 


Asp 


He 


Val 


Leu 


Gly 


Lys 


Ser 


Asn 


Lys 


Leu 


Ser 




210 










215 










220 










Gly 


Met 


Phe 


Ser 


Glu 


Asn 


Phe 


Glu 


He 


Leu 


Pro 


Asp 


He 


Asn 


He 


Lys 


225 










230 










235 










240 


Val 


Pro 


Arg 


Arg 


Leu 


Glu 


Arg 


Tyr 


Phe 


Asn 


Val 


Glu 


Thr 


Asn 


Tyr 


Ser 










245 










250 










255 




Leu 


Glu 


His 


Asn 


Phe 


Arg 


Phe 


Pro 


Ser 


Asn 


His 


He 


Arg 


Gly 


Leu 


He 








260 










265 










270 






Phe 


Ala 


Tyr 


Phe 


He 


Gly 


Asn 


He 


Phe 


Gly 


Gly 


Ala 


Phe 


Ser 


Cys 


Val 






275 










280 










285 








Gin 


Leu 


Tyr 


Leu 


Leu 


Gly 


Phe 


Thr 


Leu 


Ser 


Ala 


Ala 


Ser 


Ala 


Cys 


Arg 




290 










295 










300 










Glu 


Asn 


Val 


Leu 


Asp 


Thr 


Pro 


Phe 


Ser 


Lys 


Leu 


Lys 


Gin 


Tyr 


He 


Lys 


305 










310 










315 










320 


Asn 


Asp 


Asn 


Lys 


Thr 


Lys 


Asn 


Ser 


Ser 


Ser 


Asn 


Glu 


Asp 


Asn 


Asp 


Gly 










325 










330 










335 




Glu 


Glu 


Tyr 


Tyr 


Pro 


Cys 


Glu 


Leu 


Gin 


Tyr 


Ala 


Arg 


He 


Asn 


Ser 


Asn 








340 










345 










350 






Asp 


Lys 


Asn 


Ala 


Cys 


Arg 


Lys 


Ser 


He 


Val 


Lys 


Ala 


Val 


Lys 


Phe 


Val 






355 










360 










365 








Ala 


Asp 


Arg 


Val 


Glu 


Lys 


Ala 


Ser 


Val 


Thr 


Met 


Met 


Arg 


Thr 


Pro 


He 
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370 










375 








Ala 


Glu 


His 


Glu 


Ser 


Asp 


Gly 


Tyr 


Met 


Ala 


385 










390 










He 


Ser 


Lys 


Leu 


Leu 


Gly Arg 


Lys 


Val 


Ser 










405 










410 


Phe 


He 


Val 


Asn 


Trp 


Val 


Ala 


His 


Lys 


Tyr 








420 










425 




Asp 


Val 


Asn 


Gly 


Ser 


Glu 


Lys 


Tyr 


Glu 


He 






435 










440 






val 


Ala 


Cys 


Gly 


Leu 


Thr 


Tyr 


Asn 


His 


Lys 




450 










455 








val 


lie 


Gly 


Phe 


Gly 


Ser 


Gly 


Met 


Thr 


Asn 


465 










470 










Ala 


Val 


His 


Cys 


He 


Glu 


Asn 


Val 


He 


Gly 










485 










490 


Lys 


Arg 


Lys 


Lys 


Asp 


He 


His 


Glu 


Asp 


Pro 








500 










505 




Ser 


Leu 


Met 


Gin 


Leu 


Ser 


Ala 


Arg 


Leu 


Phe 






515 










520 






Lys 


Arg 


Gly 


Gin 


Asp 


Gly 


Lys 


Val 


Thr 


Phe 




530 










535 








Gin 


Asp 


Phe 


Leu 


Glu 


Glu 


Leu 


Lys 


Thr 


Lys 


545 










550 










Arg 


Arg 


Arg 


Lys 


He 


His 


Glu 


Glu 


Glu 


Tyr 










565 










570 


Asn 


Leu 


Lys 


Met 


Thr 


Phe 


Arg 


Phe 


Gly 


Val 








580 










585 




Leu 


Pro 


Ala 


Ser 


Ser 


Asp 


Lys 


Pro 


Thr 


Gin 






595 










600 






Lys 


Gin 


Arg 


Gin 


Thr 


Phe 


Val 


Gin 


Arg 


Glu 




610 










615 








Trp 


Thr 


Arg 


Leu 


Leu 


Gin 


Phe 


Leu 


Phe 


Pro 


625 










630 










Lys 


Arg 


His 


Gin 


Asn 


Ser 


Leu 


Pro 


Trp 


Asn 










645 










650 


Asn 


Arg 


His 


Phe 


He 


Ser 


Leu 


Ala 


Ser 


Lys 








660 










665 




His 


Cys 


Glu 


Arg 


Val 


Val 


Asn 


Asp 


He 


He 






675 










680 






He 


Leu 


Pro 


Leu 


Gly 


Lys 


Asp 


Pro 


Asp 


His 




690 










695 








Gly 


Leu 


Val 


He 


Glu 


Asp 


His 


Ala 


Arg 


Glu 


705 










710 










Tyr 


Ser 


Leu 


Cys 


Gly 


Gly 


Phe 


Asn 


Asn 


Gin 










725 










730 


Asn 


Ser 


He 


Arg 


Leu 


Arg 


He 


Ser 


Ala 


Glu 








740 










745 




Asn 


Cys 


Val 


Leu 


Ala 


Thr 


Thr 


Phe 


Ser 


Lys 






755 










760 






Pro 


Phe 


Leu 


Pro 


Arg 


Thr 


Asp 


Glu 


Ala 


Lys 




770 










775 








Phe 


Gly 


Val 


Glu 


Pro 


Leu 


His 


Pro 


Leu 


Leu 


785 










790 










Thr 


Ala 


Asn 


Lys 


Cys 


Asn 


Asp 


Ser 


Val 


Ser 










805 










810 


Ser 


Asp 


Asp 


He 


Phe 


Lys 


Glu 


Ala 


Leu 


Val 








820 










825 




Asp 


Ser 


Ser 


Val 


Tyr 


Ser 


Thr 


Leu 


Val 


Gly 






835 










840 






Asp 


Asn 


Asn 


Lys 


Ser 


Gly 


Lys 


Arg 


He 


Gly 




850 










855 
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380 



Asp 


Trp 


Leu 


Ser 


Leu 


Gin 


395 










400 


Ala 


Ser 


Tyr 


Ala 


Leu 


Leu 










415 




Lys 


Gin 


Ser 


Phe 


Thr 


Asn 








430 






Leu 


Leu 


Lys 


Lys 


Leu 


Thr 






445 








Cys 


Gly 


Met 


Val 


Val 


Pro 




460 










Arg 


Lys 


Leu 


Arg 


Gin 


Tyr 


475 










480 


Ser 


Phe 


He 


Ser 


Ser 


Gly 










495 




Lys 


Lys 


Leu 


Glu 


Glu 


Met 








510 






Lys 


Asn 


Asn 


Asp 


Val 


Met 






525 








Ala 


Asn 


Glu 


Asp 


Asn 


Val 




540 










Glu 


Phe 


Val 


Pro 


Asn 


Glu 


555 










560 


Thr 


Lys 


Ser 


Leu 


His 


Thr 










575 




Cys 


Gly 


Phe 


Gin 


His 


Pro 








590 






Val 


Ser 


Leu 


Gin 


Leu 


Leu 






605 








Thr 


Ala 


Ala 


Ala 


Val 


Asn 




620 










Ser 


Asp 


Glu 


Arg 


Asp 


Asn 


635 










640 


Arg 


Leu 


Gly 


Ser 


Asn 


Leu 










655 




Phe 


He 


Lys 


Arg 


Ser 


Val 








670 






Ser 


Lys 


Phe 


Asn 


Ala 


Asp 






685 








Phe 


Leu 


Met 


Thr 


Lys 


Ala 




700 










Asn 


He 


Asp 


Asn 


Ala 


Met 


715 










720 


Thr 


Thr 


Glu 


Gin 


Lys 


Leu 










735 




Ala 


Leu 


Lys 


Asn 


Ala 


Arg 








750 






Ser 


Tyr 


Asn 


Glu 


Asp 


Arg 






765 








Phe 


Val 


Pro 


He 


Pro 


Leu 




780 










Asn 


Ser 


Phe 


He 


Asp 


Asn 


795 










800 


Asp 


Phe 


Trp 


Leu 


Glu 


Glu 










815 




Ser 


His 


Thr 


He 


Leu 


Thr 








830 






Glu 


Asp 


Glu 


Asp 


Tyr 


Cys 






845 








Asn 


Thr 


Leu 


Val 


Cys 


Thr 



860 
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Leu 


Tyr 


Asp 


Met 


Met 


Gly 


Arg 


Ala 


Asn 


Tyr 


Asn 


Gly 


Leu 


His 


Ser 


Asp 


865 










870 










875 










880 


Lys 


Pro 


Arg 


Lys 


His 
885 


Asp 


Pro 


Thr 


Pro 


Trp 
890 


Ser 


Ser 


Lys 


Asn 


Thr 
895 


Gly 


Gin 


Ser 


Gly 


Arg 
900 


Ser 


Thr 


Thr 


Asp 


Phe 
905 


Ser 


Pro 


Asn 


Ser 


Val 
910 


He 


Val 


Leu 


Leu 


Asp 
915 


Thr 


Glu 


Asn 


Val 


Ala 
920 


Asp 


Asp 


Tyr 


Glu 


Asp 
925 


Glu 


Glu 


Glu 


Asp 


Tyr 
930 


Glu 


Ala 


Leu 


Lys 


Gin 
935 


Ser 


Asp 


Asn 


Val 


He 
940 


Thr 


Leu 


Asn 


Asn 



Xaa 



945 



<210> 134 
<211> 4821 
<212> DNA 
<213> SHRIMP 



<400> 134 

atgtctcgaa 

cctttcgacc 

gacatcatag 

aacacatttg 

tgcattagct 

actgaatacg 

actatttttt 

gcctacagaa 

ggaggagtca 

tgtgtagatt 

gagcccattc 

atacacccta 

agtatgactt 

gccaccaacg 

ctaggctgcc 

ctttgtgatg 

gcggcaaaaa 

acagaaaggt 

aacgatacga 

cctgaaatta 

gccatggacg 

ctcacagtca 

gctcacatct 

aattttactg 

cccggtttag 

atcttccccg 

agggatgcgc 

gggtacgatt 

atatgcgcaa 

gctaaaaatg 

aagcgaatgg 

gaaagagacc 

aatagttt tg 

gaaggaaaag 

aatgccaaat 

aagaaaagtt 

ataattgatc 

gcatatgaaa 

tttttctcgg 

agggtcgcaa 

gatggagttt 



actctttacg 
ccttatatgt 
ggaaaggttt 
actgttttaa 
gtcgttctct 
gtatagaaga 
cagcctcttc 
acgtcaaccc 
taactggagg 
cgtctgtggt 
ttgggtgcac 
caacattctc 
ctactatgga 
ccccaaaatg 
ctgtcatgtt 
aaagtacaca 
cagattatgc 
acgaaacttt 
ggatgtcggc 
cagcgtttaa 
ccgccggcaa 
gtgaatctgt 
ggaacggaga 
aacatccaaa 
gaatcgatag 
agacggttca 
acaacgatac 
cgtacaaaga 
cagcatacgt 
ataaaacagt 
cccacactcc 
aatcttttgc 
acgacgttac 
agaggtctgg 
tcaaagaaat 
ccacctttgc 
at tacaaggc 
tgtttgctca 
gaagaggact 
aaacgttatc 
taa tgaa tgg 



cgtcaaagga 
cgatacggac 
tgtagaatct 
aactcctaaa 
cgggggagga 
tga tgatgaa 
tgcgttcgac 
ttttactact 
aggaaaaaag 
gcgaaaggat 
agatattgta 
aattattgat 
ttgtttgtgc 
tgagaagaaa 
caggtatggg 
aatcagactg 
tcaacccttt 
aggtggtagt 
tctcggctat 
cggaggccat 
ttggcactca 
aggaggtgta 
tgcactaaac 
cgcatcgctc 
catagatcat 
catgagagca 
gaccatagaa 
taacccctta 
cgttccttca 
aaataagata 
catcgagaca 
gaacagtaca 
aaatttactt 
cctgcccatc 
tggaggtatt 
aagattcagt 
tggttcatct 
ccctccaact 
aaataatgga 
aaaaaagccc 
tgcatcttca 



ttaaaagaaa 
gcccccttcg 
ctattaccag 
aaatgtcgcg 
acttgcgtca 
tatgacggag 
aagcacgacg 
gtcgaggagg 
ggaacgactt 
ccttcccttc 
ctgtgtggag 
gatgtggacg 
gaacctggct 
gaaggtggta 
gttgtaggag 
gaagaagtgg 
gtagagggtg 
actaaggacg 
tcttatgccg 
ctgattacgg 
agaattgaag 
gtgccctaca 
gataatgggc 
agggtcgtgc 
gctgtgggca 
ggcgatcctt 
acatccttcc 
cagaaactga 
ctccatagag 
ttgcccttgg 
attttcaagc 
ttaaattcta 
ctggactatt 
aatacaagat 
attctgcaac 
gagaaaatat 
gcagtgttta 
gcgtggcgca 
atagaaggta 
gatatttttg 
cctcttgtaa 



atgggggaat 
gtatggctgg 
gggaaattag 
taggaggcaa 
aatccagtag 
tttgtgtccc 
atgatgttgc 
cttatttgca 
atatcaccaa 
tcaataaaga 
gaaaaggagt 
atatcgattt 
actcccagca 
tacaggaaaa 
atactggaac 
ccggtataga 
caaaattatt 
cgtgcctccc 
cttctctttt 
gaggattatt 
attctgacga 
gtggaacagg 
tggttggagc 
ctttacccca 
taatagcgtc 
caggagtaaa 
ttaaagattc 
gaaaatctca 
taataaagga 
tacactatag 
actctctttt 
tgatggttac 
ttttccccaa 
caatatacaa 
ctgttacagc 
tgtctacaaa 
agaaagtggg 
tagcttctaa 
cgggtatgcg 
caggagcaat 
gacctatgga 



tat tcccaac 
agtcaagtca 
ttcccattat 
cgattttgaa 
agaattgaaa 
tctcgccgat 
tacaga tgca 
ctacgaa tct 
aaaaagaggg 
tccaagatta 
ggggcgacct 
cgacataagc 
gagagaccct 
ggaacaaggt 
aaagggctgt 
tcttcccgat 
gctgcaaata 
tagaccagga 
tggaagagca 
gcgcgaatct 
acaaggaaaa 
aagtgtggcc 
tggaggggga 
tagcaatatt 
ccagggtaaa 
gactgacaga 
tgacaaggca 
cgactctggc 
aaaaccttct 
acctaccgct 
gacggcacaa 
aaattctagt 
cttgaacggc 
tgaaccaaac 
gcagggtgct 
ttctcccaaa 
ggaaaaagaa 
tgaagggaca 
cgaggccgaa 
cctcactgga 
aa ttcctgca 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 
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tcttctctcc 
cctggttcgg 
gacattagag 
ccagcacgac 
gctacatcat 
attcaaaaga 
gtccctctaa 
gaaggagccc 
aggactcttt 
gattggggtc 
tgtgtcaggg 
tacactacaa 
ttttctataa 
aagagtattt 
ttgggagaaa 
gggggtagga 
cgttttcccc 
agagctcaat 
agaaagataa 
ttggccccta 
agacaagaaa 
gcagaattgt 
gattatgaac 
gatgcagcaa 
tcttctcatg 
ggtgtcaagg 
acaatatcgg 
agtgaagttg 
ggtagtacaa 
gaagttttca 
tcatcatcaa 
aaaagagtcc 
ggaggattcg 
ataacctatg 
ggggtaagag 
gtagggggac 
atgagcagat 
gatagattcc 
actatacaac 
aggacaattt 



ccgaacacac 
ccgacaactt 
ctattctaaa 
ccttttctaa 
ttttgggagt 
gtgtccaaga 
aattccatga 
tcaatttcat 
cctctgaaac 
tgagccccca 
aagaaactgg 
tgatggtccc 
agagttatgt 
ttgagatgca 
aggagaactg 
acaaaactga 
tcatgactac 
ccggagacgc 
acctaaaagc 
ataaattttg 
aggtacctgc 
taggggatgt 
gcttgccggg 
aaacgggtca 
gagttactgc 
atatagtgag 
ccgattctgc 
tggtggaagt 
agaagaggag 
caatcaataa 
agtatgtaga 
ctagagtcat 
aatcttgcgc 
cagatttcat 
ttaaaggccc 
ctaatctaag 
attattatac 
cctttgtttg 
ctttagcact 
caatggagta 



ctggttcgaa 
gacagccatc 
t tcgacaaca 
gcctctcgct 
tttaggggga 
atcagtctcg 
aggggatgaa 
ccctcctcct 
gtttattagg 
cacggccggc 
ccaatctttc 
caaacccctg 

^gagg^scaa 

aga tcctaat 
taattgcacc 
ccctatagcg 
cctccctaaa 
tagaatacta 
tgcaacagaa 
tgccatgcgt 
tgctgttttg 
tggtatgaca 
cgtgcctcct 
agagggtgca 
ctctacattt 
cctgtatgca 
gacaataaat 
agttaacagc 
attatctatt 
gcaaggaaag 
gggaggacaa 
gagaagagtc 
acctacagct 
gagaccttca 
agaaccattt 
gaaatttgca 
aagagaaaaa 
tcaatcagat 
tgtggacatg 



cgcagatcgc 
aacaacactt 
gacattaaaa 
cctcctgcgg 
tttcctttac 
aacggcacgg 
ttatggcaac 
atggctctct 
ccagaattga 
cactacttga 
ggatacccgt 
ggcccgcaat 
acacggctac 
agtaaaaaca 
aacggtttat 
gcaacaccta 
aacgatgttc 
aatactatcg 
aatatatggg 
aggagtacgg 
gacgaaagga 
gatattgtat 
gcggaggctg 
aaggctagaa 
aacgtgggaa 
acgccttgct 
gaaggcgcaa 
aatatggaaa 
'tcagattatg 
gctacagaaa 
aaagacatgg 
catgtacttc 
gctcagagtg 
gatcttagtg 
gatgatcttt 
caccatcacc 
actgtttctg 
agaggaccat 
ggagtgttgc 



cggtcaacgc 
atgatcgagt 
cctccttcaa 
gagtgtctgc 
ccataccttc 
tcgggtccat 
aatgtgaagt 
ttgagtcgct 
tccctaatcg 
acggcgtgta 
gctcgggtgc 
cacattcttc 
ttcctgccaa 
tattcgataa 
tctgtcccaa 
gcagaggaaa 
atctctctgc 
gagaaactaa 
atatatcttc 
cttacactcc 
agggtacgtt 
ctaatgacat 
aaattttcca 
ggatagtgga 
cattttcccc 
ttacagacat 
gcattgaacc 
tgttaggagg 
tggatttgga 
atctccgcgt 
tcggtttcta 
ctgtattgac 
cttgtacaag 
gaacaaaaac 
caaccctcta 
attttggata 
tttcagaggg 
tcccaccaaa 
cagaaggcgc 



gagaggagac 
aacaaaggga 
ttcgtacgct 
ggctgctcaa 
cagttttctc 
gcatggaata 
caaggaaact 
actcagggtt 
gttcagagca 
ctccccgcct 
cctctctcag 
gttatctaaa 
tattggagaa 
aatcggggaa 
agtgaatgga 
taggcattca 
agcactcctg 
aactaatgga 
caatgttatg 
t tattccaca 
cgataggaac 
tt taatggag 
cattataagg 
ctttttcgag 
ctacgttgag 
agattcgcca 
aactgatggg 
aagcactgcc 
agaagatgcg 
tagaaca tea 
tgaagcgagt 
accctaccat 
aggtgtggaa 
aacattggag 
ctttagaagc 
tgaaggatta 
tgatttgaag 
gagggatgga 
actcacaagg 



2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4821 



<210> 135 
<211> 1600 
<212> PRT 
<213> SHRIMP 



<400> 135 



Met 


Ser 


Arg 


Asn 


Ser 


Leu 


Arg 


Val 


1 








5 








lie 


lie 


Pro 


Asn 
20 


Pro 


Phe 


Asp 


Pro 


Phe 


Gly 


Met 
35 


Ala 


Gly 


Val 


Lys 


Ser 
40 


Glu 


Ser 
50 


Leu 


Leu 


Pro 


Gly 


Giu 

55 


He 


Cys 


Phe 


Lys 


Thr 


Pro 


Lys 


Lys 


Cys 


65 










70 






Cys 


lie 


Ser 


Cys 


Arg 

85 


Ser 


Leu 


Gly 


Arg 


Glu 


Leu 


Lys 
100 


Thr 


Glu 


Tyr 


Gly 



Lys 


Gly 


Leu 


Lys 


Glu 


Asn 


Gly 


Gly 




10 










15 




Leu 


Tyr 


Val 


Asp 


Thr 


Asp 


Ala 


Pro 


25 










30 






Asp 


He 


He 


Gly 


Lys 


Gly 


Phe 


Val 










45 








Ser 


Ser 


His 


Tyr 


Asn 


Thr 


Phe 


Asp 








60 










Arg 


Val 


Gly 


Gly 


Asn 


Asp 


Phe 


Glu 






75 










80 


Gly 


Gly 


Thr 


Cys 


Val 


Lys 


Ser 


Ser 




90 










95 




He 


Glu 


Asp 


Asp 


Asp 


Glu 


Tyr 


Asp 


105 










110 
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Gly 


Val 


Cys 


Val 


Pro 


Leu 


Ala 


Asp 


Thr 


He 


Phe 


Ser 


Ala 


Ser 


Ser 


Ala 






115 










120 










125 








Phe 


Asp 


Lys 


His 


Asp 


Asp 


Asp 


Val 


Ala 


Thr 


Asp 


Ala 


Ala 


Tyr 


Arg 


Asn 




130 










135 










140 










Val 


Asn 


Pro 


Phe 


Thr 


Thr 


Val 


Glu 


Glu 


Ala 


Tyr 


Leu 


His 


Tyr 


Glu 


Ser 


145 










150 










155 










160 


Gly 


Gly 


Val 


lie 


Thr 


Gly 


Gly 


Gly 


Lys 


Lys 


Gly 


Thr 


Thr 


Tyr 


He 


Thr 










165 










170 










175 




Lys 


^ys 


Arg 


Gly 


Cys 


Val 


Asp 


Ser 


Ser 


Val 


Val 


Arg 


Lys 


Asp 


Pro 


Ser 








180 










185 










190 






Leu 


Leu 


Asn 


Lys 


Asp 


Pro 


Arg 


Leu 


Glu 


Pro 


He 


Leu 


Gly 


Cys 


Thr 


Asp 






195 










200 










205 








lie 


Val 


Leu 


Cys 


Gly 


Gly 


Lys 


Gly 


Val 


Gly 


Arg 


Pro 


He 


His 


Pro 


Thr 




210 










215 










220 










Thr 


Phe 


Ser 


He 


He 


Asp 


Asp 


Val 


Asp 


Asp 


He 


Asp 


Phe 


Asp 


He 


Ser 


225 










230 










235 










240 


Ser 


Met 


Thr 


Ser 


Thr 


Met 


Asp 


Cys 


Leu 


Cys 


Glu 


Pro 


Gly 


Tyr 


Ser 


Gin 










245 










250 










255 




Gin 


Arg 


Asp 


Pro 


Ala 


Thr 


Asn 


Ala 


Pro 


Lys 


Cys 


Glu 


Lys 


Lys 


Glu 


Gly 








260 










265 










270 






Gly 


He 


Gin 


Glu 


Lys 


Glu 


Gin 


Gly 


Leu 


Gly 


Cys 


Pro 


Val 


Met 


Phe 


Arg 






275 










280 










285 








Tyr 


Giy 


Val 


Val 


Gly 


Asp 


Thr 


Gly 


Thr 


Lys 


Gly 


Cys 


Leu 


Cys 


Asp 


Glu 




290 










295 










300 










Ser 


Thr 


Gin 


He 


Arg 


Leu 


Glu 


Glu 


Val 


Ala 


Gly 


He 


Asp 


Leu 


Pro 


Asp 


305 










310 










315 










320 


Ala 


Ala 


Lys 


Thr 


Asp 


Tyr 


Ala 


Gin 


Pro 


Phe 


Val 


Glu 


Gly 


Ala 


Lys 


Leu 










325 










330 










335 




Leu 


Leu 


Gin 


He 


Thr 


Glu 


Arg 


Tyr 


Glu 


Thr 


Leu 


Gly 


Gly 


Ser 


Thr 


Lys 








340 










345 










350 






Asp 


Ala 


Cys 


Leu 


Pro 


Arg 


Pro 


Giy 


Asn 


Asp 


Thr 


Arg 


Met 


Ser 


Ala 


Leu 






355 










360 










365 








Gly 


Tyr 


Ser 


Tyr 


Ala 


Ala 


Ser 


Leu 


Phe 


Gly 


Arg 


Ala 


Pro 


Glu 


He 


Thr 




370 










375 










380 










Ala 


Phe 


Asn 


Gly 


Gly 


His 


Leu 


He 


Thr 


Gly 


Gly 


Leu 


Leu 


Arg 


Glu 


Ser 


385 










390 










395 










400 


Ala 


Met 


Asp 


Ala 


Ala 


Gly 


Asn 


Trp 


His 


Ser 


Arg 


He 


Glu 


Asp 


Ser 


Asp 










405 










410 










415 




Glu 


Gin 


Gly 


Lys 


Leu 


Thr 


Val 


Ser 


Glu 


Ser 


Val 


Gly 


Gly 


Val 


Val 


Pro 








420 










425 










430 






Tyr 


Ser 


Gly 


Thr 


Gly 


Ser 


Val 


Ala 


Ala 


His 


He 


Trp 


Asn 


Gly 


Asp 


Ala 






435 










440 










445 








Leu 


Asn 


Asp 


Asn 


Gly 


Leu 


Val 


Gly 


Ala 


Gly 


Gly 


Gly 


Asn 


Phe 


Thr 


Glu 




450 










455 










460 










His 


Pro 


Asn 


Ala 


Ser 


Leu 


Arg 


Val 


Val 


Pro 


Leu 


Pro 


His 


Ser 


Asn 


He 


465 










470 










475 










480 


Pro 


Gly 


Leu 


Gly 


He 


Asp 


Ser 


He 


Asp 


His 


Ala 


Val 


Gly 


He 


He 


Ala 










485 










490 










495 




Ser 


Gin 


Gly 


Lys 


He 


Phe 


Pro 


Glu 


Thr 


Val 


His 


Met 


Arg 


Ala 


Gly 


Asp 








500 










505 










510 






Pro 


Ser 


Gly 


Val 


Lys 


Thr 


Asp 


Arg 


Arg 


Asp 


Ala 


His 


Asn 


Asp 


Thr 


Thr 






515 










520 










525 








He 


Glu 


Thr 


Ser 


Phe 


Leu 


Lys 


Asp 


Ser 


Asp 


Lys 


Ala 


Gly 


Tyr 


Asp 


Ser 




530 










535 










540 










Tyr 


Lys 


Asp 


Asn 


Pro 


Leu 


Gin 


Lys 


Leu 


Arg 


Lys 


Ser 


His 


Asp 


Ser 


Gly 


545 










550 










555 










560 


He 


Cys 


Ala 


Thr 


Ala 


Tyr 


Val 


Val 


Pro 


Ser 


Leu 


His 


Arg 


Val 


He 


Lys 










565 










570 










575 




Glu 


Lys 


Pro 


Ser 


Ala 


Lys 


Asn 


Asp 


Lys 


Thr 


Val 


Asn 


Lys 


He 


Leu 


Pro 








580 










585 










590 






Leu 


Val 


His 


-Tyr 


Arg 


Pro 


Thr 


Ala 


Lys 


Arg 


Met 


Ala 


His 


Thr 


Pro 


He 
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595 










600 










605 








Glu 


Thr 


He 


Phe 


Lys 


His 


Ser 


Leu 


Leu 


Thr 


Ala 


Gin 


Glu 


Arg 


Asp 


Gin 




610 










615 










620 










Ser 


Phe 


Ala 


Asn 


Ser 


Thr 


Leu 


Asn 


Ser 


Met 


Met 


Val 


Thr 


Asn 


Ser 


Ser 


625 










630 










635 










640 


Asn 


Ser 


Phe 


Asp 


Asp 


Val 


Thr 


Asn 


Leu 


Leu 


Leu 


Asp 


Tyr 


Phe 


Phe 


Pro 










645 










650 










655 




Asn 


Leu 


Asn 


Gly 


Glu 


Gly 


Lys 


Glu 


Arg 


Ser 


Gly 


Leu 


Pro 


He 


Asn 


Thr 








660 










665 










670 






Arg 


Ser 


He 


Tyr 


Asn 


Glu 


Pro 


Asn 


Asn 


Ala 


Lys 


Phe 


Lys 


Glu 


He 


Gly 






675 










680 










685 








Gly 


He 


He 


Leu 


Gin 


Pro 


Val 


Thr 


Ala 


Gin 


Gly 


Ala 


Lys 


Lys 


Ser 


Ser 




690 










695 










700 










Thr 


Phe 


Ala 


Arg 


Phe 


Ser 


Glu 


Lys 


He 


Leu 


Ser 


Thr 


Asn 


Ser 


Pro 


Lys 


705 










710 










715 










720 


He 


He 


Asp 


His 


Tyr 


Lys 


Ala 


Gly 


Ser 


Ser 


Ala 


Val 


Phe 


Lys 


Lys 


Val 










725 










730 










735 




Gly 


Glu 


Lys 


Glu 


Ala 


Tyr 


Glu 


Met 


Phe 


Ala 


His 


Pro 


Pro 


Thr 


Ala 


Trp 








740 










745 










750 






Arg 


He 


Asn 


Glu 


Gly 


Thr 


Phe 


Phe 


Ser 


Gly 


Arg 


Gin 


Asn 


Gly 


He 


Glu 






755 










760 










765 








Gly 


Thr 


Gly 


Met 


Arg 


Glu 


Ala 


Glu 


Arg 


Val 


Ala 


Lys 


Thr 


Leu 


Ser 


Lys 




770 










775 










780 










Lys 


Pro 


Asp 


He 


Phe 


Ala 


Gly 


Ala 


He 


Leu 


Thr 


Gly 


Asp 


Gly 


Val 


Leu 


785 










790 










795 










800 


Met 


Asn 


Gly 


Ala 


Ser 


Ser 


Pro 


Leu 


Val 


Arg 


Pro 


Met 


Glu 


He 


Pro 


Ala 










805 










810 










815 




Ser 


Ser 


Leu 


Pro 


Glu 


His 


Thr 


Trp 


Phe 


Glu 


Arg 


Arg 


Ser 


Pro 


Val 


Asn 








820 










825 










830 






Arg 


Asp 


Pro 


Gly 


Ser 


Ala 


Asp 


Asn 


Leu 


Thr 


Ala 


He 


Asn 


Asn 


Thr 


Tyr 






835 










840 










845 








Asp 


Arg 


Val 


Thr 


Lys 


Gly 


Asp 


He 


Arg 


Ala 


He 


Leu 


Asn 


Ser 


Thr 


Thr 




850 










855 










860 










Asp 


He 


Lys 


Thr 


Ser 


Phe 


Asn 


Ser 


Tyr 


Ala 


Pro 


Ala 


Arg 


Pro 


Phe 


Ser 


865 










870 










875 










880 


Lys 


Pro 


Leu 


Ala 


Pro 


Pro 


Ala 


Gly 


Val 


Ser 


Ala 


Ala 


Ala 


Gin 


Ala 


Thr 










885 










890 










895 




Ser 


Phe 


Leu 


Gly 


Val 


Leu 


Gly 


Gly 


Phe 


Pro 


Leu 


Pro 


He 


Pro 


Ser 


Ser 








900 










905 










910 






Phe 


Leu 


He 


Gin 


Lys 


Ser 


Val 


Gin 


Glu 


Ser 


Val 


Ser 


Asn 


Gly 


Thr 


Val 






915 










920 










925 








Gly 


Ser 


Met 


His 


Gly 


He 


Val 


Pro 


Leu 


Lys 


Phe 


His 


Glu 


Gly 


Asp 


Glu 




930 










935 










940 










Leu 


Trp 


Gin 


Gin 


Cys 


Glu 


Val 


Lys 


Glu 


Thr 


Glu 


Gly 


Ala 


Leu 


Asn 


Phe 


945 










950 










955 










960 


He 


Pro 


Pro 


Pro 


Met 


Ala 


Leu 


Phe 


Glu 


Ser 


Leu 


Leu 


Arg 


Val 


Arg 


Thr 










965 










970 










975 




Leu 


Ser 


Ser 


Glu 


Thr 


Phe 


He 


Arg 


Pro 


Glu 


Leu 


He 


Pro 


Asn 


Arg 


Phe 








980 










985 










990 






Arg 


Ala 


Asp 


Trp 


Gly 


Leu 


Ser 


Pro 


His 


Thr 


Ala 


Gly 


His 


Tyr 


Leu 


Asn 






995 










1000 








1005 






Gly 


Val 


Tyr 


Ser 


Pro 


Pro 


Cys 


Val 


Arg 


Glu 


Glu 


Thr 


Gly 


Gin 


Ser 


Phe 




1010 








1015 








1020 








Gly 


Tyr 


Pro 


Cys 


Ser 


Gly 


Ala 


Leu 


Ser 


Gin 


Tyr 


Thr 


Thr 


Met 


Met 


Val 


1025 








1030 








1035 








1041 


Pro 


Lys 


Pro 


Leu 


Gly 


Pro 


Gin 


Ser 


His 


Ser 


Ser 


Leu 


Ser 


Lys 


Phe 


Ser 










1045 








1050 








1055 


He 


Lys 


Ser 


Tyr 


Val 


Glu 


Glu 


Gin 


Thr 


Arg 


Leu 


Leu 


Pro 


Ala 


Asn 


He 








1060 








1065 








1070 




Gly 


Glu 


Lys 


Ser 


He 


Phe 


Glu 


Met 


Gin 


Asp 


Pro 


Asn 


Ser 


Lys 


Asn 


He 



1075 1080 1085 
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Phe Asp Lys He Gly Glu Leu Gly Glu Lys Glu Asn Cys Asn Cys Thr 

1090 1095 1100 

Asn Gly Leu Phe Cys Pro Lys Val Asn Gly Gly Gly Arg Asn Lys Thr 
1105 1110 1115 1120 

Asp Pro He Ala Ala Thr Pro Ser Arg Gly Asn Arg His Ser Arg Phe 

1125 1130 1135 

Pro Leu Met Thr Thr Leu Pro Lys Asn Asp Val His Leu Ser Ala Ala 

1140 1145 1150 

Leu Leu Arg Ala Gin Ser Gly Asp Ala Arg He Leu Asn Thr He Gly 

1155 1160 1165 

Glu Thr Lys Thr Asn Gly Arg Lys He Asn Leu Lys Ala Ala Thr Glu 

1170 1175 1180 

Asn He Trp Asp He Ser Ser Asn Val Met Leu Ala Pro Asn Lys Phe 
1185 1190 1195 1200 

Cys Ala Met Arg Arg Ser Thr Ala Tyr Thr Pro Tyr Ser Thr Arg Gin 

1205 1210 1215 

Glu Lys Val Pro Ala Ala Val Leu Asp Glu Arg Lys Gly Thr Phe Asp 

1220 1225 1230 

Arg Asn Ala Glu Leu Leu Gly Asp Val Gly Met Thr Asp He Val Ser 

1235 1240 1245 

Asn Asp He Leu Met Glu Asp Tyr Glu Arg Leu Pro Gly Val Pro Pro 

1250 1255 1260 

Ala Glu Ala Glu He Phe His He He Arg Asp Ala Ala Lys Thr Gly 
1265 1270 1275 1280 

Gin Glu Gly Ala Lys Ala Arg Arg He Val Asp Phe Phe Glu Ser Ser 

1285 1290 1295 

His Gly Val Thr Ala Ser Thr Phe Asn Val Gly Thr Phe Ser Pro Tyr 

1300 1305 1310 

Val Glu Gly Val Lys Asp He Val Ser Leu Tyr Ala Thr Pro Cys Phe 

1315 1320 1325 

Thr Asp He Asp Ser Pro Thr He Ser Ala Asp Ser Ala Thr He Asn 

1330 1335 1340 

Glu Gly Ala Ser He Glu Pro Thr Asp Gly Ser Glu Val Val Val Glu 
1345 1350 1355 1360 

Val Val Asn Ser Asn Met Glu Met Leu Gly Gly Ser Thr Ala Gly Ser 

1365 1370 1375 

Thr Lys Lys Arg Arg Leu Ser He Ser Asp Tyr Val Asp Leu Glu Glu 

1380 1385 1390 

Asp Ala Glu Val Phe Thr He Asn Lys Gin Gly Lys Ala Thr Glu Asn 

1395 1400 1405 

Leu Arg Val Arg Thr Ser Ser Ser Ser Lys Tyr Val Glu Gly Gly Gin 

1410 1415 1420 

Lys Asp Met Val Gly Phe Tyr Glu Ala Ser Lys Arg Val Pro Arg Val 
1425 1430 1435 1440 

Met Arg Arg Val His Val Leu Pro Val Leu Thr Pro Tyr His Gly Gly 

1445 1450 1455 

Phe Glu Ser Cys Ala Pro Thr Ala Ala Gin Ser Ala Cys Thr Arg Gly 

1460 1465 1470 

Val Glu He Thr Tyr Ala Asp Phe Met Arg Pro Ser Asp Leu Ser Gly 

1475 1480 1485 

Thr Lys Thr Thr Leu Glu Gly Val Arg Val Lys Gly Pro Glu Pro Phe 

1490 1495 1500 

Asp Asp Leu Ser Thr Leu Tyr Phe Arg Ser Val Gly Gly Pro Asn Leu 
1505 1510 1515 1520 

Arg Lys Phe Ala His His His His Phe Gly Tyr Glu Gly Leu Met Ser 

1525 1530 1535 

Arg Tyr Tyr Tyr Thr Arg Glu Lys Thr Val Ser Val Ser Glu Gly Asp 

1540 1545 1550 

Leu Lys Asp Arg Phe Pro Phe Val Cys Gin Ser Asp Arg Gly Pro Phe 

1555 1560 1565 

Pro Pro Lys Arg Asp Gly Thr He Gin Pro Leu Ala Leu Val Asp Met 
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1570 1575 1580 

Gly Val Leu Pro Glu Gly Ala Leu Thr Arg Arg Thr lie Ser Met Glu 
1585 1590 1595 1600 



<210> 136 

<211> 3483 

<212> DNA 

<213> SHRIMP 



<400> 136 

atgaaaattg 

aaacctgcga 

gctgctgtta 

tcaaaaaaac 

gatgtgcagg 

tatgatattg 

tcatcattgc 

ggaaaagata 

cttcccttta 

gggggtggca 

gcaggttggc 
caacaagccg 
acaaagggaa 
aaaaacaaga 
acggaagttt 
aactatgata 
cctgctaatt 
ctacctcgcg 
aaaacaaaca 
aatatggagg 
cttagaaatc 
atagaagcgc 
ttaaccacat 
aaattgttct 
aatacacctt 
actgcaccag 
ccatgtatca 
tcttcccccg 
aacgtttcag 
actgcattac 
tacgctgttt 
ttccaacGca 
gttatcgtca 
gtctccgatc 
accagccgcc 
gaagtatttg 
gaaggacagt 
tcaaccttta 
gtctactgtc 
aaacagacta 

gggggsgsaa 

acaaacattg 
caaatcgata 
acacaggggg 
ttagaccaaa 
aaggaaaatc 
ggtat tgttg 
tggagaaaca 
gcccctctac 
gatggacaat 



tccagaacaa 
aaatagaaaa 
gaaaagaccc 
tcgctaaaaa 
atccaaaatt 
gtgatgatat 
tgccaattcg 
ccatcgtgag 
aggcagagat 
gcagcgatag 
ctcctatcac 
tattcaaaca 
gtattttcaa 
ccgtgttatt 
ttgcttgtag 
atgtggaata 
ttccccgtct 
aaaatcaact 
ctgttaacac 
gaaatgaatc 
catctaaccc 
aatggaaaat 
cttctgggta 
cggttataag 
cgagagtcgg 
attggcctgg 
tcttgaaaac 
ccacagaata 
ttgg tccat t 
tgggaggtga 
tccctgtttt 
ttcacagtat 
atgatgatga 
ccgtgtctgg 
ctatatttat 
gagg tgacga 
tttcagacat 
cggcacctgt 
gcactacact 
gtct tgtccc 
taaagtaccc 
tcaatacaga 
aggaacaaga 
agaaagtgta 
ttgtgcacac 
ttctggtaaa 
tttttggagc 
ctggccgcct 
agactcggct 
ttctggtacg 



ttttacgccc 
agctgtattc 
gaaaaagaaa 
tattgaaagg 
gttacacagt 
ttctcctcaa 
aatgattaat 
gatacatatt 
aaaagttgat 
tggtccgtct 
caatgacccg 
cgtggaagta 
cactgggaat 
cgattctcag 
aaatgtagcc 
cactcctgat 
ttcccgacaa 
ttcagatatt 
accagtatta 
cgagggttac 
agctcggacc 
gttgggctgg 
caatattgta 
tactgatgtg 
gtgcttcggt 
acccacgaac 
tgataatgat 
tgccggtgaa 
gactgaagta 
acagcctaca 
aga t tcgcat 
cacgtcaagg 
tgatgatgtt 
ttctatactt 
tgacggtgca 
taaaggcata 
gtctgtacca 
aagggacaca 
tcctcccatt 
cctacctaca 
tcggcacat t 
tactcaaact 
acgtcatcga 
cgctaaaatg 
ttcaaatact 
taaagaagac 
aaataaatac 
acgcgcagtg 
agccacatgg 
tctagaccct 



gatgaaagat 
gggaatatag 
aggaacttga 
atatcctctg 
ataatgaaac 
tcggctcctg 
atccgtacag 
cttgacggaa 
ttagtggatg 
ctattcgaaa 
aatgcgttct 
aattctctag 
agactgaaaa 
gtaacaatat 
ctcatgaggc 
acattgaatg 
gctgaaatag 
aacaaacctt 
aacaaagtac 
aagattctaa 
tttatatgtg 
attgtgggct 
tttccagcat 
aatgctaaca 
tctagcgtaa 
gggccagatt 
aatagagaac 
cgcacaacat 
cgaggtggag 
tttaaggcac 
aacggcctag 
aacaaaacta 
gtccttgaag 
gcaaaagaaa 
cgagacgata 
cagatgccct 
tctaacgaat 
cccgccaccg 
tctaatagag 
agcatacctg 
ttcatttctt 
gaattttcaa 
cagttattgc 
tacatgggag 
acatacgtgg 
aagaaacttg 
cctgacgaac 
ggttcttatt 
cctagcggcg 
agatgtggag 



ctggtgtgat 
cagcggctat 
agaatgggtt 
aggaattaaa 
gcactgcacg 
atagagatgg 
aagaattatt 
tattaccaga 
aaaaatatga 
ctttccctga 
ctcgcaacaa 
ctgacggtat 
tttccattgt 
cttcacccat 
ttgatatgcc 
aaaaatatgt 
cgtccaattt 
cagtgtcgtt 
tatacaatga 
acgccacaga 
tgtctgtacc 
tcaaaacgtc 
caaaagttac 
cgaacaaagt 
actttagagt 
tcttctccta 
ctcgaataac 
ctctactgcc 
atattattac 
ccgcagaacc 
tgaaagcgag 
ctgtcttgac 
ataaaagcta 
acgttctttc 
gggttttcag 
ttacgatgga 
tggctatatg 
acataaccaa 
ggataaggga 
aatgggcatt 
ccataagaac 
tagaaaactg 
cagcacctag 
acggcgtttc 
tagacgagtc 
cggctatttt 
ctgcagacag 
ctcaattgag 
accctgttac 
gaattggtag 



acacataaga 
cgatgattcc 
agaaccggcg 
acgtgtcact 
gcagatcggt 
ttcatcatca 
agaaaaaggt 
taatgtccct 
aggagaagac 
atttgtccca 
tggcaataaa 
aacactatca 
aacagaggat 
accaaaaata 
taaagcaata 
ttcagattac 
agcagctaaa 
cgtgtactcg 
aacgttaaag 
gataacccac 
tgaaagtgaa 
gtctgacgtt 
tcaatctgat 
tgttgtacat 
agatgctgca 
ccaactaaga 
ggctgtgctt 
tcgtgcttta 
ccccgttcaa 
tactaaactg 
tgataaccca 
cgtatctgat 
ccacatcaca 
ttcgagaata 
tgtaaaga tg 
cgggcatttt 
gaatgatccc 
caagggcata 
ccctttcatg 
cgctgattac 
gaatgatact 
gttgagggaa 
tgaagcgtat 
tgaagaaaca 
tggaaccaaa 
aggaaaatgg 
atacacgaac 
acaaccagtt 
tcggttagcc 
tgcaaacggg 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 
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ttctacaata 
gggaatcaag 
gctagaaatg 
aattcagcac 
ttatgtgtta 
gtactagtag 
tacaagaaat 
tcatcttata 
taa 



ataatggtgc 
acaaggtggt 
aaggaaagaa 
tacatcgccg 
aaatccctca 
accctctggt 
tcatagacac 
atatatctgt 



aaacaatgaa 
gtcgtatgca 
gcatttgaaa 
tctctggaac 
aaatgtcatg 
atttgccaat 
ttcttcaggc 
acccttaaac 



tttacttcct 
gaaagagtaa 
aatgacgatg 
aggacgacat 
tctaaaatag 
gtagcgtcaa 
cctgttgtta 
ttttatacca 



cgttactatt 
gattctacat 
gtctagtttt 
tcgaccatga 
aacccggcac 
gtacagacag 
ttgaccgtgc 
catgtggatt 



tgctatagtt 
gaaaattgta 
agtggataga 
tgatattgta 
tagctcaggt 
ggaagaattc 
atctgttaca 
tatagttggt 



3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3483 



<210> 137 
<211> 1156 
<212> PRT 
<213> SHRIMP 



<400> 137 



Met 


Lys 


He 


Val 


Gin 


Asn 


Asn 


Phe 


Thr 


Pro 


Asp 


Glu 


Arg 


Ser 


Gly 


Val 


1 








5 










10 










15 




He 


His 


He 


Arg 


Lys 


Pro 


Ala 


Lys 


He 


Glu 


Lys 


Ala 


Val 


Phe 


Gly 


Asn 








20 










25 










30 






He 


Ala 


Ala 


Ala 


He 


Asp 


Asp 


Ser 


Ala 


Ala 


Val 


Arg 


Lys 


Asp 


Pro 


Lys 






35 










40 










45 








Lys 


Lys 


Arg 


Asn 


Leu 


Lys 


Asn 


Gly 


Leu 


Glu 


Pro 


Ala 


Ser 


Lys 


Lys 


Leu 




50 










55 










60 










Ala 


Lys 


Asn 


He 


Glu 


Arg 


He 


Ser 


Ser 


Glu 


Glu 


Leu 


Lys 


Arg 


Val 


Thr 


65 










70 










75 










80 


Asp 


Val 


Gin 


Asp 


Pro 


Lys 


Leu 


Leu 


His 


Ser 


He 


Met 


Lys 


Arg 


Thr 


Ala 










85 










90 










95 




Arg 


Gin 


He 


Gly 


Tyr 


Asp 


He 


Gly 


Asp 


Asp 


He 


Ser 


Pro 


Gin 


Ser 


Ala 








100 










105 










110 






Pro 


Asp 


Arg 


Asp 


Gly 


Ser 


Ser 


Ser 


Ser 


Ser 


Leu 


Leu 


Pro 


He 


Arg 


Met 






115 










120 










125 








He 


Asn 


He 


Arg 


Thr 


Glu 


Glu 


Leu 


Leu 


Glu 


Lys 


Gly 


Gly 


Lys 


Asp 


Thr 




130 










135 










140 










He 


Val 


Arg 


He 


His 


He 


Leu 


Asp 


Gly 


He 


Leu 


Pro 


Asp 


Asn 


Val 


Pro 


145 










150 










155 










160 


Leu 


Pro 


Phe 


Lys 


Ala 


Glu 


He 


Lys 


Val 


Asp 


Leu 


Val 


Asp 


Glu 


Lys 


Tyr 










165 










170 










175 




Glu 


Gly 


Glu 


Asp 


Gly 


Gly 


Gly 


Ser 


Ser 


Asp 


Ser 


Gly 


Pro 


Ser 


Leu 


Phe 








180 










185 










190 






Glu 


Thr 


Phe 


Pro 


Glu 


Phe 


Val 


Pro 


Ala 


Gly 


Trp 


Pro 


Pro 


He 


Thr 


Asn 






195 










200 










205 








Asp 


Pro 


Asn 


Ala 


Phe 


Ser 


Arg 


Asn 


Asn 


Gly 


Asn 


Lys 


Gin 


Gin 


Ala 


Val 




210 










215 










220 










Phe 


Lys 


His 


Val 


Glu 


Val 


Asn 


Ser 


Leu 


Ala 


Asp 


Gly 


He 


Thr 


Leu 


Ser 


225 










230 










235 










240 


Thr 


Lys 


Gly 


Ser 


He 


Phe 


Asn 


Thr 


Gly 


Asn 


Arg 


Leu 


Lys 


He 


Ser 


He 










245 










250 










255 




Val 


Thr 


Glu 


Asp 


Lys 


Asn 


Lys 


Thr 


Val 


Leu 


Phe 


Asp 


Ser 


Gin 


Val 


Thr 








260 










265 










270 






He 


Ser 


Ser 


Pro 


He 


Pro 


Lys 


He 


Thr 


Glu 


Val 


Phe 


Ala 


Cys 


Arg 


Asn 






275 










280 










285 








Val 


Met 


Arg 


Leu 


Asp 


Met 


Pro 


Lys 


Ala 


He 


Asn 


Tyr 


Asp 


Asn 


Val 


Glu 




290 










295 










300 










Tyr 


Thr 


Pro 


Asp 


Thr 


Leu 


Asn 


Glu 


Lys 


Tyr 


Val 


Ser 


Asp 


Tyr 


Pro 


Ala 


305 










310 










315 










320 


Asn 


Phe 


Pro 


Arg 


Leu 


Ser 


Arg 


Gin 


Ala 


Glu 


He 


Asn 


Leu 


Ala 


Ala 


Lys 










325 










330 










335 




Leu 


Pro 


Arg 


Glu 


Asn 


Gin 


Leu 


Ser 


Asp 


He 


Asn 


Lys 


Pro 


Ser 


Val 


Ser 








340 










345 










350 






Phe 


Val 


Tyr 


Ser 


Lys 


Thr 


Asn 


Thr 


Val 


Asn 


Thr 


Pro 


Val 


Leu 


Asn 


Lys 
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355 










360 










365 








Val 


Leu 


Tyr 


Asn 


Glu 


Thr 


Leu 


Lys 


Asn 


Met 


Glu 


Gly 


Asn 


Glu 


Ser 


Glu 




370 










375 










380 










Gly 


Tyr 


Lys 


He 


Leu 


Asn 


Ala 


Thr 


Glu 


He 


Thr 


His 


Leu 


Arg 


Asn 


Pro 


385 










390 










395 










400 


Ser 


Asn 


Pro 


Ala 


Arg 


Thr 


Phe 


He 


Cys 


Val 


Ser 


Val 


Pro 


Glu 


Ser 


Glu 










405 










410 










415 




lie 


Glu 


Ala 


Gin 


Trp 


Lys 


Met 


Leu 


Gly 


Trp 


He 


Val 


Gly 


Phe 


Lys 


Thr 








420 










425 










430 






Ser 


Ser 


Asp 


Val 


Leu 


Thr 


Thr 


Ser 


Ser 


Gly 


Tyr 


Asn 


He 


Val 


Phe 


Pro 






435 










440 










445 








Ala 


Ser 


Lys 


Val 


Thr 


Gin 


Ser 


Asp 


Lys 


Leu 


Phe 


Ser 


Val 


He 


Ser 


Thr 




450 










455 










460 










Asp 


Val 


Asn 


Ala 


Asn 


Thr 


Asn 


Lys 


Val 


val 


Val 


His 


Asn 


Thr 


Pro 


Ser 


465 










470 










475 










480 


Arg 


Val 


Gly 


Cys 


Phe 


Gly 


Ser 


Ser 


Val 


Asn 


Phe 


Arg 


Val 


Asp 


Ala 


Ala 










485 










490 










495 




Thr 


Ala 


Pro 


Asp 


Trp 


Pro 


Gly 


Pro 


Thr 


Asn 


Gly 


Pro 


Asp 


Phe 


Phe 


Ser 








500 










505 










510 






Tyr 


Gin 


Leu 


Arg 


Pro 


Cys 


He 


He 


Leu 


Lys 


Thr 


Asp 


Asn 


Asp 


Asn 


Arg 






515 










520 










525 








Glu 


Pro 


Arg 


He 


Thr 


Ala 


Val 


Leu 


Ser 


Ser 


Pro 


Ala 


Thr 


Glu 


Tyr 


Ala 




530 










535 










540 










Gly 


Glu 


Arg 


Thr 


Thr 


Ser 


Leu 


Leu 


Pro 


Arg 


Ala 


Leu 


Asn 


Val 


Ser 


Val 


545 










550 










555 










560 


Gly 


Pro 


Leu 


Thr 


Glu 


Val 


Arg 


Gly 


Gly 


Asp 


He 


He 


Thr 


Pro 


Val 


Gin 










565 










570 










575 




Thr 


Ala 


Leu 


Leu 


Gly 


Gly 


Glu 


Gin 


Pro 


Thr 


Phe 


Lys 


Ala 


Pro 


Ala 


Glu 








580 










585 










590 






Pro 


Thr 


Lys 


Leu 


Tyr 


Ala 


Val 


Phe 


Pro 


Val 


Leu 


Asp 


Ser 


His 


Asn 


Gly 






595 










600 










605 








Leu 


Val 


Lys 


Ala 


Ser 


Asp 


Asn 


Pro 


Phe 


Gin 


Pro 


He 


His 


Ser 


He 


Thr 




610 










615 










620 










Ser 


Arg 


Asn 


Lys 


Thr 


Thr 


Val 


Leu 


Thr 


Val 


Ser 


Asp 


Val 


He 


Val 


Asn 


625 










630 










635 










640 


Asp 


Asp 


Asp 


Asp 


Asp 


Val 


Val 


Leu 


Glu 


Asp 


Lys 


Ser 


Tyr 


His 


He 


Thr 










645 










650 










655 




Val 


Ser 


Asp 


Pro 


Val 


Ser 


Gly 


Ser 


He 


Leu 


Ala 


Lys 


Glu 


Asn 


Val 


Leu 








660 










665 










670 






Ser 


Ser 


Arg 


He 


Thr 


Ser 


Arg 


Pro 


He 


Phe 


He 


Asp 


Gly 


Ala 


Arg 


Asp 






675 










680 










685 








Asp 


Arg 


Val 


Phe 


Ser 


Val 


Lys 


Met 


Glu 


Val 


Phe 


Gly 


Gly 


Asp 


Asp 


Lys 




690 










695 










700 










Gly 


lie 


Gin 


Met 


Pro 


Phe 


Thr 


Met 


Asp 


Gly 


His 


Phe 


Glu 


Gly 


Gin 


Phe 


705 










710 










715 










720 


Ser 


Asp 


Met 


Ser 


Val 


Pro 


Ser 


Asn 


Glu 


Leu 


Ala 


He 


Trp 


Asn 


Asp 


Pro 










725 










730 










735 




Ser 


Thr 


Phe 


Thr 


Ala 


Pro 


Val 


Arg 


Asp 


Thr 


Pro 


Ala 


Thr 


Asp 


He 


Thr 








740 










745 










750 






Asn 


Lys 


Gly 


He 


Val 


Tyr 


Cys 


Arg 


Thr 


Thr 


Leu 


Pro 


Pro 


He 


Ser 


Asn 






755 










760 










765 








Arg 


Gly 


He 


Arg 


Asp 


Pro 


Phe 


Met 


Lys 


Gin 


Thr 


Ser 


Leu 


Val 


Pro 


Leu 




770 










775 










780 










Pro 


Thr 


Ser 


He 


Pro 


Glu 


Trp 


Ala 


Phe 


Ala 


Asp 


Tyr 


Gly 


Gly 


Glu 


He 


785 










790 










795 










800 


Lys 


Tyr 


Pro 


Arg 


His 


He 


Phe 


He 


Ser 


Ser 


He 


Arg 


Thr 


Asn 


Asp 


Thr 










805 










810 










815 




Thr 


Asn 


He 


Val 


Asn 


Thr 


Asp 


Thr 


Gin 


Thr 


Glu 


Phe 


Ser 


He 


Glu 


Asn 








820 










825 










830 






Trp 


Leu 


Arg 


Glu 


Gin 


He 


Asp 


Lys 


Glu 


Gin 


Glu 


Arg 


His 


Arg 


Gin 


Leu 






835 










840 










845 
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Leu 


Pro 


Ala 


Pro 


Ser 


Glu 


Ala 


Tyr 


Thr 


Gin 


Gly 


Glu 


Lys 


Val 


Tyr 


Ala 




850 










855 










860 










Lys 


Met 


Tyr 


Met 


Gly 


Asp 


Gly 


Val 


Ser 


Glu 


Glu 


Thr 


Leu 


Asp 


Gin 


He 


865 










870 










875 










880 


Val 


His 


Thr 


Ser 


Asn 


Thr 


Thr 


Tyr 


Val 


Val 


Asp 


Glu 


Ser 


Gly 


Thr 


Lys 










885 










890 










895 




Lys 


Glu 


Asn 


Leu 


Leu 


Val 


Asn 


Lys 


Glu 


Asp 


Lys 


Lys 


Leu 


Ala 


Ala 


He 








900 










905 










910 






Leu 


Gly 


Lys 


Trp 


Gly 


He 


Val 


Val 


Phe 


Gly 


Ala 


Asn 


Lys 


Tyr 


Pro 


Asp 






915 










920 










925 








Glu 


Pro 


Ala 


Asp 


Arg 


Tyr 


Thr 


Asn 


Trp 


Arg 


Asn 


Thr 


Gly 


Arg 


Leu 


Arg 




930 










935 










940 










Ala 


Val 


Gly 


Ser 


Tyr 


Ser 


Gin 


Leu 


Arg 


Gin 


Pro 


Val 


Ala 


Pro 


Leu 


Gin 


945 










950 










955 










960 


Thr 


Arg 


Leu 


Ala 


Thr 


Trp 


Pro 


Ser 


Gly 


Asp 


Pro 


Val 


Thr 


Arg 


Leu 


Ala 










965 










970 










975 




Asp 


Gly 


Gin 


Phe 


Leu 


Val 


Arg 


Leu 


Asp 


Pro 


Arg 


Cys 


Gly 


Gly 


He 


Gly 








980 










985 










990 






Ser 


Ala 


Asn 


Gly 


Phe 


Tyr 


Asn 


Asn 


Asn 


Gly 


Ala 


Asn 


Asn 


Glu 


Phe 


Thr 






995 










1000 








1005 






Ser 


Ser 


Leu 


Leu 


Phe 


Ala 


He 


Val 


Gly 


Asn 


Gin 


Asp 


Lys 


Val 


Val 


Ser 




1010 








1015 








1020 








Tyr 


Ala 


Glu 


Arg 


Val 


Arg 


Phe 


Tyr 


Met 


Lys 


He 


Val 


Ala 


Arg 


Asn 


Glu 


1025 








1030 








1035 








1041 


Gly 


Lys 


Lys 


His 


Leu 


Lys 


Asn 


Asp 


Asp 


Gly 


Leu 


Val 


Leu 


Val 


Asp 


Arg 










1045 








1050 








1055 


Asn 


Ser 


Ala 


Leu 


His 


Arg 


Arg 


Leu 


Trp 


Asn 


Arg 


Thr 


Thr 


Phe 


Asp 


His 








1060 








1065 








1070 




Asp 


Asp 


He 


Val 


Leu 


Cys 


Val 


Lys 


He 


Pro 


Gin 


Asn 


Val 


Met 


Ser 


Lys 






1075 








1080 








1085 






He 


Glu 


Pro 


Gly 


Thr 


Ser 


Ser 


Gly 


Val 


Leu 


Val 


Asp 


Pro 


Leu 


Val 


Phe 




1090 








1095 








1100 








Ala 


Asn 


Val 


Ala 


Ser 


Ser 


Thr 


Asp 


Arg 


Glu 


Glu 


Phe 


Tyr 


Lys 


Lys 


Phe 


1105 








1110 








1115 








1121 


He 


Asp 


Thr 


Ser 


Ser 


Gly 


Pro 


Val 


Val 


He 


Asp 


Arg 


Ala 


Ser 


Val 


Thr 










1125 








1130 








1135 


Ser 


Ser 


Tyr 


Asn 


He 


Ser 


Val 


Pro 


Leu 


Asn 


Phe 


Tyr 


Thr 


Thr 


Cys 


Gly 



1140 1145 1150 



Phe He Val Gly 
1155 



<210> 138 

<211> 615 

<212> DNA 

<213> SHRIMP 



<400> 138 

atggaa t ttg 

atcattgctc 

agcgtcgtcg 

atgtcaattc 

ggtctctccg 

gccccaagga 

ctcaacaaca 

gcaggaaaca 

tcttcatctt 

ttcgaagcta 

atcaagaaga 



gcaacctaac 
taatcgttat 
ctaattatga 
gtggagagag 
ataaggacat 
ctgatcccgc 
ctggcgtcga 
ttaagggaaa 
caaaaattac 
caatgaacat 
ag taa 



aaacctggac 
catggttata 
tcagatgatg 
gtcctacaat 
gaaggatgtt 
tggcactggg 
tctcttgatc 
tactatgtcg 
cctcattgac 
tggattcacc 



gttgcaatta 
atgattgtat 
cgagtcccaa 
actcctcttg 
tctgctgatc 
gccgagaact 
aacgacatta 
aacacttact 
gtgtgcagca 
tccaagaatg 



ttgcaatctt 
tcaacacacg 
ttcaaagaag 
gaaaggtggc 
ttgtcatctc 
ctaacatgac 
ctgttcggcc 
tctcgagcaa 
aatttgaaga 
tgatcgatat 



gtccattgca 
tgttggaaga 
ggcaaaggta 
ca tgaagaat 
taccgtcaca 
tttgaagatc 
aactgttatt 
ggacattaaa 
cggcgcagcc 
caaggacgaa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

615 



<21Q> 139 
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<211> 204 
<212> PRT 
<213> SHRIMP 



<400> 139 



Met 


Glu 


Phe 


Gly 


Asn 


Leu 


Thr 


Asn 


Leu 


Asp 


Val 


Ala 


He 


He 


Ala 


He 


1 








5 










10 










15 




Leu 


Ser 


He 


Ala 


He 


He 


Ala 


Leu 


He 


Val 


He 


Met 


Val 


He 


Met 


He 








20 










25 










30 






Val 


Phe 


Asn 


Thr 


Arg 


Val 


Gly 


Arg 


Ser 


Val 


Val 


Ala 


Asn 


Tyr 


Asp 


Gin 






35 










40 










45 








Met 


Met 


Arg 


Val 


Pro 


He 


Gin 


Arg 


Arg 


Ala 


Lys 


Val 


Met 


Ser 


He 


Arg 




50 










55 










60 










Gly 


Glu 


Arg 


Ser 


Tyr 


Asn 


Thr 


Pro 


Leu 


Gly 


Lys 


Val 


Ala 


Met 


Lys 


Asn 


65 










70 










75 










80 


Gly 


Leu 


Ser 


Asp 


Lys 


Asp 


Met 


Lys 


Asp 


Val 


Ser 


Ala 


Asp 


Leu 


Val 


He 










85 










90 










95 




Ser 


Thr 


Val 


Thr 


Ala 


Pro 


Arg 


Thr 


Asp 


Pro 


Ala 


Gly 


Thr 


Gly 


Ala 


Glu 








100 










105 










110 






Asn 


Ser 


Asn 


Met 


Thr 


Leu 


Lys 


He 


Leu 


Asn 


Asn 


Thr 


Gly 


Val 


Asp 


Leu 






115 










120 










125 








Leu 


lie 


Asn 


Asp 


He 


Thr 


Val 


Arg 


Pro 


Thr 


Val 


He 


Ala 


Gly 


Asn 


He 




130 










135 










140 










Lys 


Gly 


Asn 


Thr 


Met 


Ser 


Asn 


Thr 


Tyr 


Phe 


Ser 


Ser 


Lys 


Asp 


He 


Lys 


145 










150 










155 










160 


Ser 


Ser 


Ser 


Ser 


Lys 


He 


Thr 


Leu 


He 


Asp 


Val 


Cys 


Ser 


Lys 


Phe 


Glu 










165 










170 










175 




Asp 


Gly 


Ala 


Ala 


Phe 


Glu 


Ala 


Thr 


Met 


Asn 


He 


Gly 


Phe 


Thr 


Ser 


Lys 








180 










185 










190 






Asn 


Val 


He 


Asp 


He 


Lys 


Asp 


Glu 


He 


Lys 


Lys 


Lys 











195 200 



<210> 140 

<211> 2571 

<212> DNA 

<213> SHRIMP 



<400> 140 

atgacagagt 

cgactaaatt 

gaaaaacaaa 

acgacagaat 

gtcatgagga 

tggagattga 

ccattattga 

gctaatctag 

tcagaaatgt 

aaaagagcca 

tctatcgcct 

ctagagtacg 

tcctttagtg 

atatcgactg 

gtaggttctt 

atcgacgcag 

acaactaaac 

acagaaaggg 

actgctgaga 

gggcctaaac 

aagtttaacg 

gttatgttct 



caaaggacta 
atgtatcgga 
ggaaggttgt 
tttttaattc 
gtgtctacag 
ccgaaactgc 
accttttgaa 
ttctttattc 
tggccaacag 
ttgctctatt 
ctagaa tgaa 
gacgcgctgc 
atggagttac 
ttt taaccaa 
tccatgaagc 
agtacagggc 
tggataaata 
aaaaagaatg 
atctctttag 
gcacacctac 
atattgtttc 
ccaactactc 



tgttctggca 
agggctagtt 
tatttcttct 
gggactaaga 
agatatagaa 
agcggaagaa 
tgcctttgag 
cactagaggt 
aaacaattca 
tctaggatat 
agagaggggt 
tgacatgatt 
agtcagtgaa 
catggtttct 
tcttttaggg 
agtgctgtcc 
tgtgaacaat 
ggtcgagagg 
ggaccaccca 
cgcccttcaa 
accagaaact 
tgttacccgc 



ttagtagcag 
gctgccattt 
gacgtattcg 
ttagccaaag 
ggtgttagag 
ctatgtgact 
aatattatgg 
gatactaatg 
acggtggccg 
actctttgtg 
ttagatccct 
gaaaaaagga 
gaagatggac 
gtaatacagg 
agggaaatca 
aggaccagaa 
ccccatctgc 
gaaagagaac 
tatcttccaa 
gctcttcaac 
ttggagtatt 
gttcttgatc 



aaactaaaac 
caaatttaca 
gacccacgtg 
gacacctatc 
agcatataat 
ttactttttt 
acggtgtgtt 
aaccttcatg 
atctggcaat 
atattctaag 
ttgctgccat 
taaaagactt 
aatcttatgt 
aaggattcta 
tggtactatt 
acgctaaacc 
agatgccatc 
gaattaaaac 
aggcgataga 
gtgaatacaa 
ttcttgtcaa 
ccgactcagc 



cgatgaaaaa 
aaatacacct 
gttcaataaa 
taaagatgcc 
tgacccctct 
gaaacaagca 
tagatcagca 
ggtaattgat 
gggacgagca 
atggaagcaa 
gcctcctcac 
tattgaaggg 
tgttccaact 
cccccccaag 
atcagcagcc 
aaatcctt tg 
cgaaagtgtc 
tacggata tg 
cggaatttta 
gagatgtaac 
taatagacaa 
tgcccgat tt 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 
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tcaatgtatg 
aatagctctg 
ttattcgtct 
atcagtttaa 
aaagcagtac 
aatgggttag 
ggagggttag 
gaagtgcgta 
gtccatgtat 
acctccaatc 
tggggagatt 
acatcatcat 
agaaatgtga 
tctgcatttt 
gaaaatgtta 
gagatatgga 
caagacttga 
cctatttcat 
atgggaagca 
cctctaagtt 
gattcttata 



tgctatggaa 
cagtgaaaag 
gccagcgttg 
aggaacaaag 
tcgatgcttt 
gggagttgat 
gtggtgatat 
acatgcacct 
ctaaagggtt 
cttcaataac 
tgaagtttcc 
tattatcctt 
ttgatgcgtc 
catccaagat 
acagagaaaa 
aaactagtgg 
tacggaacgc 
ccatagcctt 
ataataagta 
atactggaaa 
ctacaatgaa 



tgcattattt 
caggttaatt 
tgaatctggt 
taaaccgtct 
aggtggtggt 
ggttgagata 
aggttttgaa 
agtggataag 
tgctgccctt 
gaacaatttc 
tacacataac 
tgcaagcttt 
aaacactagt 
gttagtggat 
attgatctgg 
gagtaaagcc 
atctatagga 
a ttcaggtct 
ccgtctcgcc 
gattgtat tc 
agtatttcct 



ttatgctctg 
cttcaagtgt 
ttcataacaa 
atgggagaac 
ggtggtaata 
ctctctgcag 
ggaaaaatga 
aaagggtatg 
tctttctacc 
gacagagcag 
tta tggggaa 
tgggctttaa 
tttgtaccag 
aacatgttga 
aaagcattta 
gcatccgaca 
agacttattg 
atgaagagga 
agggacacga 
tctttggatg 
ctaacacctt 



gagggctaac 
ttttgaaaga 
aatctctaga 
aggaactaga 
ataaaggagc 
attcgggtct 
aacaaaaaag 
tgtttgaagc 
tgttgtacgc 
tatatttact 
atgttccaac 
ggaatgctgt 
ggaggccctt 
agaataatta 
gagagatgca 
ggaatgtgaa 
tcgaacctgt 
gtagaagcga 
aaacagctac 
atttgaaaaa 
tagatggata 



acagaaaact 
tatgcattct 
cacttttacc 
gacgtactgg 
agaaaatgtc 
attaagaggg 
cigaggacgaa 
tgcaaaatat 
ggctgcagcc 
attggcaaga 
agatgaaaat 
tagagcacga 
acctctctta 
tgtcaaagtg 
aacggaatca 
gaaggccaag 
aggaaagacc 
agatttaaag 
accaaggaac 
tttcagtaag 

g 



1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2571 



<210> 141 
<211> 852 
<212> PRT 
<213> SHRIMP 



<400> 141 
Met Thr Glu Ser 
1 

Thr Asp Glu Lys 
20 

lie Ser Asn Leu 

35 

Ser Ser Asp Val 
50 

Phe Asn Ser Gly 
65 

Val Met Arg Ser 

lie Asp Pro Ser 
100 

Asp Phe Thr Phe 
115 

Phe Glu Asn lie 
130 

Lys Thr Arg Gly 
145 

Met Leu Ala Asn 

Arg Ala Lys Arg 
180 

lie Leu Arg Trp 
195 

Leu Asp Pro Phe 
210 

Ala Asp Met lie 
225 

Ser Asp Gly Val 
Pro Thr lie Ser 



Lys Asp Tyr Val 
5 

Arg Leu Asn Tyr 

Gin Asn Thr Pro 
40 

Phe Gly Pro Thr 
55 

Leu Arg Leu Ala 
70 

Val Tyr Arg Asp 
85 

Trp Arg Leu Thr 

Leu Lys Gin Ala 
120 

Met Asp Gly Val 
135 

Asp Thr Asn Glu 
150 

Arg Asn Asn Ser 
165 

Ala lie Ala Leu 

Lys Gin Ser lie 
200 

Ala Ala Met Pro 
215 

Glu Lys Arg lie 
230 

Thr Val Ser Glu 
245 

Thr Val Leu Thr 



Leu Ala Leu Val 
10 

Val Ser Glu Gly 
25 

Glu Lys Gin Arg 

Trp Phe Asn Lys 
60 

Lys Gly His Leu 
75 

lie Glu Gly Val 
90 

Glu Thr Ala Ala 
105 

Pro Leu Leu Asn 

Phe Arg Ser Ala 
140 

Pro Ser Trp Val 
155 

Thr Val Ala Asp 
170 

Phe Leu Gly Tyr 
185 

Ala Ser Arg Met 

Pro His Leu Glu 
220 

Lys Asp Phe lie 
235 

Glu Asp Gly Gin 
250 

Asn Met Val Ser 



Ala 


Glu 


Thr 


Lys 






15 




Leu 


Val 


Ala 


Ala 




30 






Lys 


Val 


Val 


He 


45 








Thr 


Thr 


Glu 


Phe 


Ser 


Lys 


Asp 


Ala 








80 


Arg 


Glu 


His 


He 






95 




Glu 


Glu 


Leu 


Cys 




110 






Leu 


Leu 


Asn 


Ala 


125 








Ala 


Asn 


Leu 


Val 


lie 


Asp 


Ser 


Glu 








160 


Leu 


Ala 


Met 


Gly 






175 




Thr 


Leu 


Cys 


Asp 




190 






Lys 


Glu 


Arg 


Gly 


205 








Tyr 


Gly 


Arg 


Ala 


Glu 


Gly 


Ser 


Phe 








240 


Ser 


Tyr 


Val 


Val 






255 




Val 


He 


Gin 


Glu 
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260 265 270 



Gly 


Phe 


Tyr 


Pro 


Pro 


Lys 


Val 




Ser 


Phe 


His 


Glu 


Ala 


Leu 


Leu 


Gly 






275 










280 










285 








Arg 


Glu 


He 


Met 


Val 


Leu 


Leu 


Ser 


Ala 


Ala 


He 


Asp 


Ala 


Glu 


Tyr 


Arg 




290 










295 










300 










Ala 


Val 


Leu 


Ser 


Arg 


Thr 


Arg 


Asn 


Ala 


Lys 


Pro 


Asn 


Pro 


Leu 


Thr 


Thr 


305 










310 










315 










320 


Lys 


Leu 


Asp 


Lys 


Tyr 


Val 


Asn 


Asn 


Pro 


His 


Leu 


Gin 


Met 


Pro 


Ser 


Glu 










325 










330 










335 




Ser 


Val 


Thr 


Glu 


Arg 


Glu 


Lys 


Glu 


Trp 


Val 


Glu 


Arg 


Glu 


Arg 


Glu 


Arg 








340 










345 










350 






He 


Lys 


Thr 


Thr 


Asp 


Met 


Thr 


Ala 


Glu 


Asn 


Leu 


Phe 


Arg 


Asp 


His 


Pro 






355 










360 










365 








Tyr 


Leu 


Pro 


Lys 


Ala 


He 


Asp 


Gly 


He 


Leu 


Gly 


Pro 


Lys 


Arg 


Thr 


Pro 




370 










375 










380 










Thr 


Ala 


Leu 


Gin 


Ala 


Leu 


Gin 


Arg 


Glu 


Tyr 


Lys 


Arg 


Cys 


Asn 


Lys 


Phe 


385 










390 










395 










400 


Asn 


Asp 


He 


Val 


Ser 


Pro 


Glu 


Thr 


Leu 


Glu 


Tyr 


Phe 


Leu 


Val 


Asn 


Asn 










405 










410 










415 




Arg 


Gin 


Val 


Met 


Phe 


Ser 


Asn 


Tyr 


Ser 


Val 


Thr 


Arg 


Val 


Leu 


Asp 


Pro 








420 










425 










430 






Asp 


Ser 


Ala 


Ala 


Arg 


Phe 


Ser 


Met 


Tyr 


Val 


Leu 


Trp 


Asn 


Ala 


Leu 


Phe 






435 










440 










445 








Leu 


Cys 


Ser 


Gly 


Gly 


Leu 


Thr 


Gin 


Lys 


Thr 


Asn 


Ser 


Ser 


Ala 


Val 


Lys 




450 










455 










460 










Ser 


Arg 


Leu 


He 


Leu 


Gin 


Val 


Phe 


Leu 


Lys 


Asp 


Met 


His 


Ser 


Leu 


Phe 


465 










470 










475 










480 


Val 


Cys 


Gin 


Arg 


Cys 


Glu 


Ser 


Gly 


Phe 


He 


Thr 


Lys 


Ser 


Leu 


Asp 


Thr 










485 










490 










495 




Phe 


Thr 


He 


Ser 


Leu 


Lys 


Glu 


Gin 


Ser 


Lys 


Pro 


Ser 


Met 


Gly 


Glu 


Gin 








500 










505 










510 






Glu 


Leu 


Glu 


Thr 


Tyr 


Trp 


Lys 


Ala 


Val 


Leu 


Asp 


Ala 


Leu 


Gly 


Gly 


Gly 






515 










520 










525 








Gly 


Gly 


Asn 


Asn 


Lys 


Gly 


Ala 


Glu 


Asn 


Val 


Asn 


Gly 


Leu 


Gly 


Glu 


Leu 




530 










535 










540 










Met 


Val 


Glu 


He 


Leu 


Ser 


Ala 


Asp 


Ser 


Gly 


Leu 


Leu 


Arg 


Gly 


Gly 


Gly 


545 










550 










555 










560 


Leu 


Gly 


Gly 


Asp 


He 


Gly 


Phe 


Glu 


Gly 


Lys 


Met 


Lys 


Gin 


Lys 


Arg 


Glu 










565 










570 










575 




Asp 


Glu 


Glu 


Val 


Arg 


Asn 


Met 


His 


Leu 


Val 


Asp 


Lys 


Lys 


Gly 


Tyr 


Val 








580 










585 










590 






Phe 


Glu 


Ala 


Ala 


Lys 


Tyr 


Val 


His 


Val 


Ser 


Lys 


Gly 


Phe 


Ala 


Ala 


Leu 






595 










600 










605 








Ser 


Phe 


Tyr 


Leu 


Leu 


Tyr 


Ala 


Ala 


Ala 


Ala 


Thr 


Ser 


Asn 


Pro 


Ser 


He 




610 










615 










620 










Thr 


Asn 


Asn 


Phe 


Asp 


Arg 


Ala 


Val 


Tyr 


Leu 


Leu 


Leu 


Ala 


Arg 


Trp 


Gly 


625 










630 










635 










640 


Asp 


Leu 


Lys 


Phe 


Pro 


Thr 


His 


Asn 


Leu 


Trp 


Gly 


Asn 


Val 


Pro 


Thr 


Asp 










645 










650 










655 




Glu 


Asn 


Thr 


Ser 


Ser 


Leu 


Leu 


Ser 


Phe 


Ala 


Ser 


Phe 


Trp 


Ala 


Leu 


Arg 








660 










665 










670 






Asn 


Ala 


Val 


Arg 


Ala 


Arg 


Arg 


Asn 


Val 


He 


Asp 


Asn 


Thr 


Ser 


Phe 


Val 






675 










680 










685 








Pro 


Gly 


Arg 


Pro 


Leu 


Pro 


Leu 


Leu 


Ser 


Ala 


Phe 


Ser 


Ser 


Lys 


Met 


Leu 




690 










695 










700 










Val 


Asp 


Asn 


Met 


Leu 


Lys 


Asn 


Asn 


Tyr 


Val 


Lys 


Val 


Glu 


Asn 


Val 


Asn 


705 










710 










715 










720 


Arg 


Glu 


Lys 


Leu 


He 


Trp 


Lys 


Ala 


Phe 


Arg 


Glu 


Met 


Gin 


Thr 


Glu 


Ser 










725 










730 










735 




Glu 


He 


Trp 


Lys 


Thr 


Ser 


Gly 


Ser 


Lys 


Ala 


Ala 


Ser 


Asp 


Arg 


Asn 


Val 








740 










745 










750 
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Lys 


Lys 


Ala 


Lys 


Gin 


Asp 


Leu 


He 


Arg 


Asn 


Ala 


Ser 


He 


Gly Arg 


Leu 






755 










760 










765 








He 


Val 
770 


Glu 


Pro 


Val 


Gly 


Lys 
775 


Thr 


Pro 


He 


Ser 


Ser 
780 


He 


Ala 


Leu 


Phe 


Arg 


Ser 


Met 


Lys 


Arg 


Ser 


Arg 


Ser 


Glu 


Asp 


Leu 


Lys 


Met 


Gly 


Ser 


Asn 


785 










790 










795 










800 


Asn 


Lys 


Tyr 


Arg 


Leu 
805 


Ala 


Arg 


Asp 


Thr 


Lys 
810 


Thr 


Ala 


Thr 


Pro 


Arg 
815 


Asn 


Pro 


Leu 


Ser 


Tyr 
820 


Thr 


Gly 


Lys 


He 


Val 
825 


Phe 


Ser 


Leu 


Asp 


Asp 
830 


Leu 


Lys 


Asn 


Phe 


Ser 
835 


Lys 


Asp 


Ser 


Tyr 


Thr 
840 


Thr 


Met 


Lys 


Val 


Phe 
845 


Pro 


Leu 


Thr 


Pro 


Leu 
850 


Asp 


Gly 



























<210> 142 

<211> 591 

<212> DNA 

<213> SHRIMP 



<400> 142 

atggacgttt 

ctccagcgcg 

aaggagggaa 

ccagccactt 

cgcaagtcaa 

cccgtccgtt 

ccctacgacg 

ccaccatacg 

cgtcaagtcc 

agggagaaga 



cttcctataa 
ccacctacaa 
ggtacgtccg 
gctacacttg 
tattcagaga 
tagaagagct 
acggatcttc 
acccatcaga 
tcagttctat 
aggccctggg 



<210> 143 
<211> 194 
<212> PRT 
<213> SHRIMP 



gagcactatt 
ggatcgtatg 
atcgttggcc 
cggctacacc 
gccttgcgct 
tgtcgacgtg 
tacatcgtct 
aaatccacag 
gacggatcac 
tcttggccgt 



gactaccaca 
gagacggaat 
accatggacg 
tttattagac 
tacccagaac 
ccagaaggag 
tcacaagccg 
aggtcccaag 
gccagggcca 
cgcaacaact 



acattgaaga 
tggtcctcga 
aattggaggt 
gcagggcacc 
ttctccccga 
cgagtttttt 
aatgtgaaga 
tgtgtgatta 
acctcataaa 
ttagctacta 



tatggacgat 
gatggctaag 
acctgaagaa 
cccaccaaaa 
tgcaccatcc 
cacctaccct 
tgattatcct 
ttgtaccaca 
aaatctgaag 

g 



60 

120 

180 

240 

300 

360 

420 

480 

540 

591 



<400> 143 



Met 


Asp 


Val 


Ser 


Ser 


Tyr 


Lys 


Ser 


Thr 


He 


Asp 


Tyr 


His 


Asn 


He 


Glu 


1 








5 










10 










15 




Asp 


Met 


Asp 


Asp 


Leu 


Gin 


Arg 


Ala 


Thr 


Tyr 


Lys 


Asp 


Arg 


Met 


Glu 


Leu 








20 










25 










30 






Val 


Leu 


Glu 


Met 


Ala 


Lys 


Lys 


Glu 


Gly 


Arg 


Tyr 


Val 


Arg 


Ser 


Leu 


Ala 






35 










40 










45 








Thr 


Met 


Asp 


Glu 


Leu 


Glu 


Val 


Pro 


Glu 


Glu 


Pro 


Ala 


Thr 


Cys 


Tyr 


Thr 




50 










55 










60 










Cys 


Gly 


Tyr 


Thr 


Phe 


He 


Arg 


Arg 


Arg 


Ala 


Pro 


Pro 


Pro 


Lys 


Arg 


Lys 


65 










70 










75 










80 


Ser 


He 


Phe 


Arg 


Glu 


Pro 


Cys 


Ala 


Tyr 


Pro 


Glu 


Leu 


Leu 


Pro 


Asp 


Ala 










85 










90 










95 




Pro 


Ser 


Pro 


Val 


Arg 


Leu 


Glu 


Glu 


Leu 


Val 


Asp 


Val 


Pro 


Glu 


Gly 


Ala 








100 










105 










110 






Ser 


Phe 


Phe 


Thr 


Tyr 


Pro 


Pro 


Tyr 


Asp 


Asp 


Gly 


Ser 


Ser 


Thr 


Ser 


Ser 






115 










120 










125 








Ser 


Gin 


Ala 


Glu 


Cys 


Glu 


Asp 


Asp 


Tyr 


Pro 


Pro 


Pro 


Tyr 


Asp 


Pro 


Ser 




130 










135 










140 










Glu 


Asn 


Pro 


Gin 


Arg 


Ser 


Gin 


Val 


Cys 


Asp 


Tyr 


Cys 


Thr 


Thr 


Arg 


Gin 


145 










150 










155 










160 


Val 


Leu 


Ser 


Ser 


Met 


Thr 


Asp 


His 


Ala 


Arg 


Ala 


Asn 


Leu 


He 


Lys 


Asn 



165 



170 



175 
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Leu Lys Arg Glu Lys Lys Ala Leu Gly Leu Gly Arg Arg Asn Asn Phe 
180 185 190 

Ser Tyr 



<210> 144 
<211> 627 
<212> DNA 
<213> SHRIMP 



<400> 144 

atggccccaa 

gaaagggatg 

atctgcctca 

gtatacaaag 

cgaggattgg 

atggaagaga 

tacaatgaag 

aaaacaattg 

caagctctag 

ttcaaggagg 

caccctaaaa 



actccttcca 
aacatgacga 
ctgcaaactg 
tactacgcta 
ccctctcaat 
ctgaagaaaa 
gcgatggcat 
aaggaggaac 
agacattgat 
atatggagga 
aggtgtctgt 



gaaatttgct 
ccctttacgg 
tgtttcgagg 
cgggaaccca 
ggatcaaggt 
ccccatcgac 
gacccacctt 
aagaacgtgg 
gatgtcccat 
aggatttgtg 
tttgtga 



ccagttatca 
cagattgatt 
aagagaaaag 
tacaagtacc 
gaagtaggaa 
aagtgcggag 
tacaacgacg 
gtaaagaaga 
tctataaaac 
cacaaactcc 



agactgagaa 
ttagagatag 
ctggatctgc 
gtcgccccaa 
catgcctccc 
tggcgttcct 
aagagtatat 
accgccaaga 
aatattccaa 
acacatttat 



gaaggaagaa 
aaagacatta 
acatgatcga 
tagaacacat 
tctgcgaccc 
gtactccaac 
aaagaagtgc 
atacttcaga 
ttttattttt 
taatatggta 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

627 



<210> 145 
<211> 204 
<212> PRT 
<213> SHRIMP 



<400> 145 
Met Ala Pro Asn 
1 

Lys Lys Glu Glu 
20 

Asp Phe Arg Asp 
35 

Ser Arg Lys Arg 
50 

Leu Arg Tyr Gly 
65 

Leu Ala Leu Ser 

Arg Pro Met Glu 
100 

Ala Phe Lys Asn 
115 

Asp Glu Glu Tyr 
130 

Thr Trp Val Lys 
145 

Thr Leu Met Met 

Phe Lys Glu Asp 
180 

lie Asn Met Val 
195 



Ser 


Phe 


Gin 


Lys 


5 








Glu 


Arg 


Asp 


Glu 


Arg 


Lys 


Thr 


Leu 








40 


Lys 


Ala 


Gly 


Ser 






55 




Asn 


Pro 


Tyr 


Lys 




70 






Met 


Asp 


Gin 


Gly 


85 








Glu 


Thr 


Glu 


Glu 


Tyr 


Asn 


Glu 


Gly 








120 


He 


Lys 


Lys 


Cys 






135 




Lys 


Asn 


Arg 


Gin 




150 






Ser 


His 


Ser 


lie 


165 








Met 


Glu 


Glu 


Gly 


His 


Pro 


Lys 


Lys 



200 



Phe Ala Pro Val 
10 

His Asp Asp Pro 
25 

He Cys Leu Thr 

Ala His Asp Arg 
60 

Tyr Arg Arg Pro 
75 

Glu Val Gly Thr 
90 

Asn Pro He Asp 
105 

Asp Gly Met Thr 

Lys Thr He Glu 
140 

Glu Tyr Phe Arg 
155 

Lys Gin Tyr Ser 
170 

Phe Val His Lys 
185 

Val Ser Val Leu 



He Lys Thr Glu 
15 

Leu Arg Gin He 
30 

Ala Asn Cys Val 
45 

Val Tyr Lys Val 

Asn Arg Thr Gly 
80 

Cys Leu Pro Leu 
95 

Lys Cys Gly Val 
110 

His Leu Tyr Asn 
125 

Gly Gly Thr Arg 

Gin Ala Leu Glu 
160 

Asn Phe He Phe 

175 

Leu His Thr Phe 
190 



<210> 146 
<211> 870 
<212> DNA 
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<213> SHRIMP 



<400> 146 

atggagggag 

aaggacgata 

cttcgagacg 

gaagaactct 

atacacatct 

agagcagagg 

tatgatacct 

aagaccctga 

gatcttcacc 

ggagaattgt 

aatattgcat 

ggagagttta 

aagaagcagt 

gaaaaaattg 

ggaaacttgc 



aacatcaata 
gaactggaac 
acactattcc 
tgtggttcat 
ggaacgctaa 
gagatttggg 
gttcttccga 
gagaaaatcc 
ttatggctct 
cgtgccagtt 
catactctct 
tcctcactct 
tgtgccgcgt 
aggactttac 
agatgaaaat 



tttgaaccta 
aggaactcta 
agttctcact 
caggggcaat 
tgggtcgcgg 
acccgtatac 
ttatactgga 
agatgataga 
tcctccatgc 
gtatcagcga 
tctgactcat 
tggtgacgca 
ccctagacca 
tatcgacatg 
ggctgtttga 



gtcagggaga 
tccatttttg 
accaagaaaa 
acagacgcca 
gaatttttgg 
ggattccaat 
aagggtattg 
aggatgatta 
cacatgactg 
agcggagatg 
ctgatggcca 
cacatttata 
ttccctaagt 
ttttatcttg 



tcctagaaag 
gaccccaaat 
ttttctggag 
aagaattggc 
acagtagagg 
ggcgtcattt 
atcaattggc 
tgacggcatg 
ctcaatttta 
tcgggttggg 
gtatggtggg 
atacccacat 
tgaggatttt 
aggggtatca 



aggagtgaag 
gaggttctct 
aggagttgtg 
caagaagaag 
gttatacgat 
tggtgctgaa 
caatatacta 
gaatcctatg 
tgtggctaat 
cgtgcccttc 
tctaaaaccg 
tgaggtgtta 
aatggctcca 
accacacagt 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

870 



<210> 147 
<211> 285 
<212> PRT 
<213> SHRIMP 



<400> 147 



Met 


Glu 


Gly 


Glu 


His 


Gin 


Tyr 


Leu 


Asn 


Leu 


Val 


Arg 


Glu 


He 


Arg 


Gly 


1 








5 










10 










15 




Val 


Lys 


Lys 


Asp 


Asp 


Arg 


Thr 


Gly 


Thr 


Gly 


Thr 


Leu 


Ser 


He 


Phe 


Gly 








20 










25 










30 






Pro 


Gin 


Met 


Arg 


Phe 


Ser 


Leu 


Arg 


Asp 


Asp 


Thr 


He 


Pro 


Val 


Leu 


Thr 






35 










40 










45 








Thr 


Lys 


Lys 


He 


Phe 


Trp 


Arg 


Gly 


Val 


Val 


Glu 


Glu 


Leu 


Leu 


Trp 


Phe 




50 










55 










60 










He 


Arg 


Gly 


Asn 


Thr 


Asp 


Ala 


Lys 


Glu 


Leu 


Ala 


Lys 


Lys 


Lys 


He 


His 


65 










70 










75 










80 


He 


Trp 


Asn 


Ala 


Asn 


Gly 


Ser 


Arg 


Glu 


Phe 


Leu 


Asp 


Ser 


Arg 


Gly 


Asp 










85 










90 










95 




Arg 


Ala 


Glu 


Gly 


Asp 


Leu 


Gly 


Pro 


Val 


Tyr 


Gly 


Phe 


Gin 


Trp 


Arg 


His 








100 










105 










110 






Phe 


Gly 


Ala 


Glu 


Tyr 


Asp 


Thr 


Cys 


Ser 


Ser 


Asp 


Tyr 


Thr 


Gly 


Lys 


Gly 






115 










120 










125 








He 


Asp 


Gin 


Leu 


Ala 


Asn 


He 


Leu 


Lys 


Thr 


Leu 


Arg 


Glu 


Asn 


Pro 


Asp 




130 










135 










140 










Asp 


Arg 


Arg 


Met 


He 


Met 


Thr 


Ala 


Trp 


Asn 


Pro 


Met 


Asp 


Leu 


His 


Leu 


145 










150 










155 










160 


Met 


Ala 


Leu 


Pro 


Pro 


Cys 


His 


Met 


Thr 


Ala 


Gin 


Phe 


Tyr 


Val 


Ala 


Asn 










165 










170 










175 




Gly 


Glu 


Leu 


Ser 


Cys 


Gin 


Leu 


Tyr 


Gin 


Arg 


Ser 


Gly 


Asp 


Val 


Gly 


Leu 








180 










185 










190 






Gly 


Val 


Pro 


Phe 


Asn 


He 


Ala 


Ser 


Tyr 


Ser 


Leu 


Leu 


Thr 


His 


Leu 


Met 






195 










200 










205 








Ala 


Ser 


Met 


Val 


Gly 


Leu 


Lys 


Pro 


Gly 


Glu 


Phe 


He 


Leu 


Thr 


Leu 


Gly 




210 










215 










220 










Asp 


Ala 


His 


He 


Tyr 


Asn 


Thr 


His 


He 


Glu 


Val 


Leu 


Lys 


Lys 


Gin 


Leu 


225 










230 










235 










240 


Cys 


Arg 


Val 


Pro 


Arg 


Pro 


Phe 


Pro 


Lys 


Leu 


Arg 


He 


Leu 


Met 


Ala 


Pro 










245 










250 










255 




Glu 


Lys 


He 


Glu 


Asp 


Phe 


Thr 


He 


Asp 


Met 


Phe 


Tyr 


Leu 


Glu 


Gly 


Tyr 








260 










265 










270 






Gin 


Pro 


His 


Ser 


Gly 


Asn 


Leu 


Gin 


Met 


Lys 


Met 


Ala 


Val 












275 










280 










285 
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<210> 148 
<211> 678 
<212> DNA 
<213> SHRIMP 



<400> 148 

atggccttta 

acaacaacag 

ggtattgagg 

agtggaggtg 

aacagtggtt 

aactggattg 

ggtggggatg 

gatgtgactt 

tggatgtacc 

gcatcaaacc 

tcatacgtta 

ggattctttg 



attttgaaga 
accctacccg 
tgatgaagag 
ctatgtcgta 
cccatgtcaa 
aagagactga 
gttcacgtgg 
tctttgtgtt 
aaaaactact 
caatgttttc 
ttgaaggcca 
tataataa 



ctctacaaat 
ccccaatatc 
gcgtctcgta 
tttctggctc 
gacaaactgc 
tcgacatgga 
gggatatttt 
tgttgatagg 
agcaattggt 
tgtatgtaat 
tggagtgggt 



ctctttgcca 
atattctttg 
cggcaaggaa 
gaagataatg 
ttggcattat 
cagtactgta 
acttcgctag 
aataatactg 
gcaaaggtag 
gcgtgtaggt 
ca ttctgatt 



atatggactt 
aaagtctact 
agtgtgggaa 
cagaagatat 
tccttcaaga 
cttttcccca 
ccatgaaatg 
tagaaaatgc 
taaaggtgat 
gcaagtaccc 
tgacatgtga 



gacggctggc 
ccccaactct 
ttttgaagca 
ggagaatctc 
gtttatcagc 
atacatggac 
gatggctagg 
ggcatccata 
tgttgacaat 
aggcccagtg 
tgagatttct 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

678 



<210> 149 
<211> 220 
<212> PRT 
<213> SHRIMP 



<400> 149 



Met 


Ala 


Phe 


Asn 


Phe 


Glu 


Asp 


Ser 


Thr 


Asn 


Leu 


Phe 


Ala 


Asn 


Met 


Asp 


1 








5 










10 










15 




Leu 


Thr 


Ala 


Gly 
20 


Thr 


Thr 


Thr 


Asp 


Pro 
25 


Trp 


Asn 


He 


He 


Phe 
30 


Phe 


Glu 


Ser 


Leu 


Leu 
35 


Pro 


Asn 


Ser 


Gly 


He 
40 


Glu 


Val 


Met 


Lys 


Arg 
45 


Arg 


Leu 


Val 


Arg 


Gin 
50 


Gly 


Lys 


Cys 


Gly 


Asn 
55 


Phe 


Glu 


Ala 


Ser 


Gly 
60 


Gly 


Ala 


Met 


Ser 


Tyr 


Phe 


Trp 


Leu 


Glu 


Asp 


Asn 


Ala 


Glu 


Asp 


Met 


Glu 


Asn 


Leu 


Asn 


Ser 


65 










70 










75 










80 


Gly 


Ser 


His 


Val 


Lys 
85 


Thr 


Asn 


Cys 


Leu 


Ala 
90 


Leu 


Phe 


Leu 


Gin 


Glu 
95 


Phe 


He 


Ser 


Asn 


Trp 
100 


He 


Glu 


Glu 


Thr 


Asp 
105 


Arg 


His 


Gly 


Gin 


Tyr 
110 


Cys 


Thr 


Phe 


Pro 


Gin 
115 


Tyr 


Met 


Asp 


Gly 


Gly 
120 


Asp 


Gly 


Ser 


Arg 


Gly 
125 


Gly 


Tyr 


Phe 


Thr 


Ser 
130 


Leu 


Ala 


Met 


Lys 


Trp 
135 


Met 


Ala 


Arg 


Asp 


Val 
140 


Thr 


Phe 


Phe 


Val 


Phe 


Val 


Asp 


Arg 


Asn 


Asn 


Thr 


Val 


Glu 


Asn 


Ala 


Ala 


Ser 


He 


Trp 


Met 


145 










150 










155 










160 


Tyr 


Gin 


Lys 


Leu 


Leu 
165 


Ala 


He 


Gly 


Ala 


Lys 
170 


Val 


Val 


Lys 


Val 


He 
175 


Val 


Asp 


Asn 


Asn 


Pro 
180 


Met 


Phe 


Ser 


Val 


Cys 
185 


Asn 


Ala 


Cys 


Arg 


Cys 
190 


Lys 


Tyr 


Pro 


Gly 


Pro 
195 


Val 


Ser 


Tyr 


Val 


He 
200 


Glu 


Gly 


His 


Gly 


Val 
205 


Gly 


His 


Ser 


Asp 


Leu 
210 


Thr 


Cys 


Asp 


Glu 


He 
215 


Ser 


Gly 


Phe 


Phe 


Val 
220 











<210> 150 
<211> 1269 
<212> DNA 
<213> SHRIMP 
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<400> 150 

atggattcta atacttctat 
gttttaggga ttataataac 
agggtaggta aacgcaaata 
gataatgggg tagaaggagg 
ggtggagatg gatacgtaga 
gcaaacgtca tctttcaaga 
aatgattcaa ctgataagat 
ccattcagag aagcaaccgt 
atcctttcaa ctctagattt 
aaaaatcaat tagaaaattt 
ccagagactg attatttatg 
gaaaaggtat tagggttctc 
aaaggtaaaa aatatgaggt 
gaaacaataa aagagaaggc 
gacgaacaga cctacaaatc 
gtaaataagg agaaaaggga 
gccaatatta ctcctctaac 
actagcacag caaaggaaaa 
tttgtttgtg gtctgagtaa 
agtgatactg ttttcccttc 
gtagaaattt taggtaaaag 
actcagtga 



tttaccgcca agcaaacggc 
ggtagcatta atagcttccg 
ttacccttct tcatcctctt 
aggaggaaca acaacgacac 
tctttctcct caaaagaagg 
agtgtcaaag gatcaaggag 
aatggaagaa actgaggcga 
agaacgtgaa gtgtttaagg 
aacagaggaa caatttaagg 
tgactatgaa gacatgaccc 
gacaactcat ttcgatccga 
agatataaat agtatagaaa 
aactactgga aatgtagctg 
aggaaatagt ctcatccgta 
tttcttccct gcattcaatc 
agttactgta tccatcaata 
tacgccagtt ggagcagctt 
gacctatcta tttgtaatag 
caagtcaact cctctcatgg 
atttgacgaa gcagaaagac 
actaggtgta ggaagataca 



cagggttaaa tctgttacag 60 
tttcatcctt tatattttat 120 
cctcagaatt atctgatgta 180 
caactcaacc ttcacctgac 240 
ctgaactaag aactagagtt 300 
tggcctttag acgggcaatg 360 
gaatcaataa cttttcagag 420 
atgacacaga caaaaacttt 480 
acattgttat ggctgaagtg 540 
gtctcatctt tgataacatc 600 
aaaaatatga cacgtactct 660 
gaatatcctc tacattttat 720 
tcctcgttga ttttgaatct 780 
atgtcgagtt tattgttgtg 840 
aagttttctt ctcctttaaa 900 
atggatgtgt aggtatagtg 960 
ccggacacta catctatggc 1020 
acaagtacga taccactgaa 1080 
ctctaaatat tctctttatg 1140 
ctctgacgga tgccaaggca 1200 
caaacgccaa catcagaaat 1260 

1269 



<210> 151 
<211> 418 
<212> PRT 
<213> SHRIMP 



<400> 151 

Met Asp Ser Asn Thr Ser lie Leu Pro Pro Ser Lys Arg Pro Gin Leu 

15 10 15 

Leu Gin Val Leu Gly lie lie lie Thr Val lie Ala Ser Val Ser Ser 

20 25 30 

Phe lie Phe Tyr Arg Val Gly Lys Arg Lys Tyr Tyr Pro Ser Ser Ser 

35 40 45 

Ser Ser Ser Glu Leu Ser Asp Val Asp Asn Gly Val Glu Gly Gly Gly 

50 55 60 

Gly Thr Thr Thr Thr Pro Thr Gin Pro Ser Pro Asp Gly Gly Asp Gly 
65 70 75 80 

Tyr Val Asp Leu Ser Pro Gin Lys Lys Ala Glu Leu Arg Thr Arg Val 

85 90 95 

Ala Asn Val lie Phe Gin Glu Val Ser Lys Asp Gin Gly Val Ala Phe 

100 105 110 

Arg Arg Ala Met Asn Asp Ser Thr Asp Lys lie Met Glu Glu Thr Glu 

115 120 125 

Ala Arg lie Asn Asn Phe Ser Glu Pro Phe Arg Glu Ala Thr Val Glu 

130 135 140 

Arg Glu Val Phe Lys Asp Asp Thr Asp Lys Asn Phe lie Leu Ser Thr 
145 150 155 160 

Leu Asp Leu Thr Glu Glu Gin Phe Lys Asp lie Val Met Ala Glu Val 

165 170 175 

Lys Asn Gin Leu Glu Asn Phe Asp Tyr Glu Asp Met Thr Arg Leu lie 

180 185 190 

Phe Asp Asn lie Pro Glu Thr Asp Tyr Leu Trp Thr Thr His Phe Asp 

195 200 205 

Pro Lys Lys Tyr Asp Thr Tyr Ser Glu Lys Val Leu Gly Phe Ser Asp 

210 215 220 

lie Asn Ser lie Glu Arg lie Ser Ser Thr Phe Tyr Lys Gly Lys Lys 
225 230 235 240 
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Tyr 


Glu 


Val 


Thr 


Thr 


Gly 


Asn 


Val 


Ala 


Val 


Leu 


Val 


Asp 


Phe 


Glu 


Ser 










245 










250 










255 




Glu 


Thr 


He 


Lys 


Glu 


Lys 


Ala 


Gly 


Asn 


Ser 


Leu 


He 


Arg 


Asn 


Val 


Glu 








260 










265 










270 






Phe 


lie 


Val 


Val 


Asp 


Glu 


Gin 


Thr 


Tyr 


Lys 


Ser 


Phe 


Phe 


Pro 


Ala 


Phe 






275 










280 










285 








Asn 


Gin 


Val 


Phe 


Phe 


Ser 


Phe 


Lys 


Val 


Asn 


Lys 


Glu 


Lys 


Arg 


Glu 


Val 




290 










295 










300 










Thr 


Val 


Ser 


He 


Asn 


Asn 


Gly 


Cys 


Val 


Gly 


He 


Val 


Ala 


Asn 


He 


Thr 


305 










310 










315 










320 


Pro 


Leu 


Thr 


Thr 


Pro 


Val 


Gly 


Ala 


Ala 


Ser 


Gly 


His 


Tyr 


He 


Tyr 


Gly 










325 










330 










335 




Thr 


Ser 


Thr 


Ala 


Lys 


Glu 


Lys 


Thr 


Tyr 


Leu 


Phe 


Val 


He 


Asp 


Lys 


Tyr 








340 










345 










350 






Asp 


Thr 


Thr 


Glu 


Phe 


Val 


Cys 


Gly 


Leu 


Ser 


Asn 


Lys 


Ser 


Thr 


Pro 


Leu 






355 










360 










365 








Met 


Ala 


Leu 


Asn 


He 


Leu 


Phe 


Met 


Ser 


Asp 


Thr 


Val 


Phe 


Pro 


Ser 


Phe 




370 










375 










380 










Asp 


Glu 


Ala 


Glu 


Arg 


Pro 


Leu 


Thr 


Asp 


Ala 


Lys 


Ala 


Val 


Glu 


He 


Leu 


385 










390 










395 










400 


Gly 


Lys 


Arg 


Leu 


Gly 


Val 


Gly 


Arg 


Tyr 


Thr 


Asn 


Ala 


Asn 


He 


Arg 


Asn 










405 










410 










415 




Thr 


Gin 































<210> 152 
<2H> 831 
<212> DNA 
<213> SHRIMP 



<400> 152 

atggaagggg 

gggtctattg 

aaaataaatg 

actataagaa 

gaaaaaat ta 

tttggaaaaa 

tttgaaagtg 

gtatctatga 

ccttcaccta 

aatgcaggtt 

gctgaaaatc 

tttatagact 

gttcagaaaa 

cctgatgaac 



ttattttgga 
atgtgggtac 
aagaaaccac 
gtatagtggc 
gagcaatgac 
tttcaactgg 
ctttaaaggc 
acagtttacg 
tcatgggtat 
cgaaagaatt 
tcattattag 
acgaaacaaa 
caaaggctct 
cagaagggag 



taagatagaa 
ggctattttg 
aatggagaag 
taatgttatc 
agataaggaa 
agatggagga 
agaatataaa 
tatcgatgcg 
tagagaatac 
ggtttttatg 
agttgccaac 
gaagggggtg 
taaagggcgt 
tatcataata 



acaattgcaa 
cgccgtcaat 
attatgggca 
aaagagaata 
ttaaatgaca 
accttccatc 
aacgttccag 
attaatggaa 
ggcaccattc 
accaagattg 
cagcagtata 
tccaaggagg 
agtacttact 
cacgcactag 



aaagggcctc 
tcatggaaaa 
caaaggaaga 
ctgttaaaga 
atagggaatt 
tctttgaaga 
gagcaactac 
aaatcgaaga 
gcaggggcag 
aaaaaagacc 
atgttatgag 
aaatgtttat 
tttcattcgt 
ggttttattg 



tcctagttat 
.aattaggggt 
aagagaggac 
aaatgtaacc 
tatgcatgat 
tacaccgggt 
tccaaaatac 
ggtttataat 
gtacgaagaa 
caataatgta 
gatggtgttt 
accatataat 
aaggaaaatt 
a 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

831 



<210> 153 
<211> 274 
<212> PRT 
<213> SHRIMP 



<400> 153 

Met Glu Gly Val He Leu Asp Lys He Glu Thr He Ala Lys Arg Asp 

15 10 15 

Ser Tyr Gly Ser He Asp Val Gly Thr Ala He Leu Arg Arg Gin Phe 

20 25 30 

Met Glu Lys He Arg Gly Lys He Asn Glu Glu Thr Thr Met Glu Lys 

35 40 45 

He Met Gly Thr Lys Glu Glu Arg Glu Asp Thr He Arg Ser He Val 
50 55 60 
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Ala 


Asn 


Val 


He 


Lys 


Glu 


Asn 


Thr 


Val 


Lys 


Glu 


Asn 


Val 


Thr 


Glu 


Lys 


65 










70 










75 










80 


lie 


Arg 


Ala 


Met 


Thr 


Asp 


Lys 


Glu 


Leu 


Asn 


Asp 


Asn 


Arg 


Glu 


Phe 


Met 










85 










90 










95 




His 


Asp 


Phe 


Gly 


Lys 


He 


Ser 


Thr 


Gly 


Asp 


Gly 


Gly 


Thr 


Phe 


His 


Leu 








100 










105 










110 






Phe 


Glu 


Asp 


Thr 


Pro 


Gly 


Phe 


Glu 


Ser 


Ala 


Leu 


Lys 


Ala 


Glu 


Tyr 


Lys 






115 










120 










125 








Asn 


Val 


Pro 


Gly 


Ala 


Thr 


Thr 


Pro 


Lys 


Tyr 


Val 


Ser 


Met 


Asn 


Ser 


Leu 




130 










135 










140 










Arg 


lie 


Asp 


Ala 


He 


Asn 


Gly 


Lys 


He 


Glu 


Glu 


Val 


Tyr 


Asn 


Pro 


Ser 


145 










150 










155 










160 


Pro 


lie 


Met 


Gly 


He 


Arg 


Glu 


Tyr 


Gly 


Thr 


He 


Arg 


Arg 


Gly 


Arg 


Tyr 










165 










170 










175 




Glu 


Glu 


Asn 


Ala 


Gly 


Ser 


Lys 


Glu 


Leu 


Val 


Phe 


Met 


Thr 


Lys 


He 


Glu 








180 










185 










190 






Lys 


Arg 


Pro 


Asn 


Asn 


Val 


Ala 


Glu 


Asn 


Leu 


He 


He 


Arg 


Val 


Ala 


Asn 






195 










200 










205 








Gin 


Gin 


Tyr 


Asn 


Val 


Met 


Arg 


Met 


Val 


Phe 


Phe 


He 


Asp 


Tyr 


Glu 


Thr 




210 










215 










220 








Lys 


Lys 


Gly 


Val 


Ser 


Lys 


Glu 


Glu 


Met 


Phe 


He 


Pro 


Tyr 


Asn 


Val 


Gin 


225 










230 










235 










240 


Lys 


Thr 


Lys 


Ala 


Leu 


Lys 


Gly 


Arg 


Ser 


Thr 


Tyr 


Phe 


Ser 


Phe 


Val 


Arg 










245 










250 










255 




Lys 


He 


Pro 


Asp 


Glu 


Pro 


Glu 


Gly 


Ser 


He 


He 


He 


His 


Ala 


Leu 


Gly 








260 










265 










270 







Phe Tyr 



<210> 154 
<211> 486 
<212> DNA 
<213> SHRIMP 



<400> 154 

atgacatctc cagctccatc accctcttcc acccccaaat ccagttgtac cactattgta 60 

aaccgatgtg gtttcctcct tgacaacaac aaggaagtgg tcatctacga caccaattcc 120 

aaattcaagt gtgaacccaa aaatctggaa ctaattggtg tactttctgg agtctctgat 180 

aatgttgtta cccagatatc ccccgaccag atatttgtgg gaacatatat ggtcaaatat 240 

aactggtcta aatctggtca tgaacgcttc agtgacatga gtaacaactg tctggacaat 300 

attacacgcc cttcagaagt gattgaaagt gtgataaaga aaacgtccag cgactttaaa 360 

atgaagtaca cacgttcctt gatggaccac accgagaaat actatttttc tggtgaccaa 420 

aaattgagca aaattagtag ttggtgtaca acccctatac gacagtgggt atgcaactcc 480 

gtctag 486 

<210> 155 
<211> 159 
<212> PRT 
<213> SHRIMP 



<400> 155 



Met 


Thr 


Ser 


Pro 


Ala 


Pro 


Ser 


Pro 


Ser 


Ser 


Thr 


Pro 


Lys 


Ser 


Ser 


Cys 


1 








5 










10 










15 


Thr 


Thr 


He 


Val 


Asn 


Arg 


Cys 


Gly 


Phe 


Leu 


Leu 


Asp 


Asn 


Asn 


Lys 


Glu 








20 










25 










30 






Val 


Val 


He 


Tyr 


Asp 


Thr 


Asn 


Ser 


Lys 


Phe 


Lys 


Cys 


Glu 


Pro 


Lys 


Asn 






35 










40 










45 








Leu 


Glu 


Leu 


He 


Gly 


Val 


Leu 


Ser 


Gly 


Val 


Ser 


Asp 


Asn 


Val 


Val 


Thr 




50 










55 










60 










Gin 


He 


Ser 


Pro 


Asp 


Gin 


He 


Phe 


Val 


Gly 


Thr 


Tyr 


Met 


Val 


Lys 


Tyr 
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65 










70 










75 










80 


Asn 


Trp 


Ser 


Lys 


Ser 
85 


Gly 


His 


Glu 


Arg 


Phe 
90 


Ser 


Asp 


Met 


Ser 


Asn 
95 


Asn 


Cys 


Leu 


Asp 


Asn 
100 


He 


Trp 


Ser 


Glu 


Val 
105 


He 


Glu 


Ser 


Val 


He 
110 


Lys 


Lys 


Thr 


Ser 


Ser 
115 


Asp 


Phe 


Lys 


Met 


Lys 
120 


Tyr 


Thr 


Arg 


Ser 


Leu 
125 


Met 


Asp 


His 


Thr 


Glu 
130 


Lys 


Tyr 


Tyr 


Phe 


Ser 
135 


Gly 


Asp 


Gin 


Lys 


Leu 
140 


Ser 


Lys 


He 


Ser 


Ser 
145 


Trp 


Cys 


Thr 


Thr 


Pro 

150 


He 


Arg 


Gin 


Trp 


Val 
155 


Cys 


Asn 


Ser 


Val 





<210> 156 
<211> 894 
<212> DNA 
<213> SHRIMP 



<400> 156 

atggcattac 

atgtactcac 

gttgaatcat 

acatctgaca 

ttcgaagata 

gactttattg 

ttatctagac 

gctataatgg 

acactcgata 

tattgtaaaa 

gaacatacag 

aacaacctaa 

gcagatgtgc 

agtagcctct 

atcgaaggag 



aggaaaagga 
ccacacatat 
cttcagaaga 
tatctgaaac 
ccttagaagg 
atggccgtaa 
agcgagattt 
gttgtggtat 
atgatgatga 
tggctataaa 
ttgaaggaag 
taacaatcat 
aggcgccatt 
acatggatga 
ctgatgcgaa 



tataacta ta 
gcagcatcac 
gaaaataaga 
agtcaataat 
tatggtgaaa 
aaagctcaat 
tgttgcagaa 
tataaggtat 
caaggcagca 
ggactatgaa 
aaagatgacc 
ccct tcagtg 
actt ttaaac 
gaaaaaaaca 
tgcagtctat 



<210> 157 
<211> 297 
<212> PRT 
<213> SHRIMP 



gggaatgttt 
gataagctaa 
caaattgtgg 
gtcacaacta 
aaaaatatag 
gacatgaaga 
agtataacag 
actgtatttg 
acctattata 
ataccagatt 
tttaatataa 
acaggttatc 
tgcaacaact 
gaggcatcat 
gaaatatgta 



ctgctgccct 
acacattcct 
ataaaatacg 
atgggactgc 
gtgataacct 
gtctagctac 
gaacaaagga 
tcaataacct 
acacccctat 
cgtacagcaa 
aatggagagg 
ttgcttccat 
gttttataga 
ttaccctcaa 
tagtagtagt 



acgagagttg 
ggacagaaat 
atcccaaaca 
attttccctt 
tcagagtggg 
tggagccatc 
ctggctcaag 
tgcaagatca 
atatggcggg 
ggtcgaagcg 
cgataccata 
ctctgaagac 
ggcagatatg 
cttaccggaa 
ttga 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

894 



<400> 157 

Met Ala Leu Gin Glu Lys 

1 5 
Leu Arg Glu Leu Met Tyr 
20 

Leu Asn Thr Phe Leu Asp 
35 

He Arg Gin He Val Asp 
50 

Ser Glu Thr Val Asn Asn 
65 70 
Phe Glu Asp Thr Leu Glu 
85 



Asp He Thr 
Ser 
Arg 



Lys 

55 

Val 

Gly 



Pro Thr 

25 
Asn Val 
40 

He Arg 



Thr Thr 
Met Val 



Leu 


Gin 


Ser 


Gly 


Asp 


Phe 


He 


Asp 


Gly 








100 










105 


Lys 


Ser 


Leu 


Ala 


Thr 


Gly 


Ala 


He 


Leu 






115 










120 




Ala 


Glu 


Ser 


He 


Thr 


Gly 


Thr 


Lys 


Asp 




130 










135 






Cys 


Gly 


He 


He 


Arg 


Tyr 


Thr 


Val 


Phe 


145 










150 









He Gly 
10 

His Met 

Glu Ser 

Ser Gin 

Asn Gly 

75 
Lys Lys 
90 

Arg Lys 

Ser Arg 

Trp Leu 

Val Asn 
155 



Val 


Ser 


Ala 


Ala 






15 




His 


His 


Asp 


Lys 




30 






Ser 


Glu 


Glu 


Lys 


45 








Thr 


Ser 


Asp 


He 



60 

Thr Ala Phe Ser 



Asn He 

Lys Leu 

Gin Arg 
125 
Lys Ala 
140 

Asn Leu 



Gly Asp 
95 

Asn Asp 
110 

Asp Phe 
He Met 
Ala Arg 



Leu 

80 

Asn 

Met 

Val 

Gly 

Ser 
160 
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Thr Leu Asp Asn Asp 
165 

lie Tyr Gly Gly Tyr 
180 

Asp Ser Tyr Ser Lys 
195 

Met Thr Phe Asn lie 
210 

Thr He He Pro Ser 

225 

Ala Asp Val Gin Ala 
245 

Glu Ala Asp Met Ser 
260 

Ser Phe Thr Leu Asn 
275 

Val Tyr Glu He Cys 
290 



Asp Asp Lys Ala Ala Thr 
170 

Cys Lys Met Ala He Lys 
185 

Val Glu Ala Glu His Thr 
200 

Lys Trp Arg Gly Asp Thr 
215 

Val Thr Gly Tyr Leu Ala 
230 235 
Pro Leu Leu Leu Asn Cys 
250 

Ser Leu Tyr Met Asp Glu 

265 

Leu Pro Glu He Glu Gly 
280 

He Val Val Val 
295 



Tyr Tyr Asn Thr Pro 
175 

Asp Tyr Glu He Pro 
190 

Val Glu Gly Arg Lys 
205 

He Asn Asn Leu He 
220 

Ser He Ser Glu Asp 
240 

Asn Asn Cys Phe He 
255 

Lys Lys Thr Glu Ala 
270 

Ala Asp Ala Asn Ala 
285 



<210> 158 
<211> 6588 
<212> DNA 
<213> SHRIMP 



<400> 158 

atgcaactcc 

ccttctggaa 

aatgagatgg 

gaaattaacg 

agatggtgcc 

tacaagtgca 

aagaggaaga 

ggaaaggaag 

ggtgttgatt 

ggaaactatg 

gatgaatctt 

gatgaacgac 

ttccatatgc 

ccacttggac 

catgaatttg 

ccccgattca 

gtgtttattc 

cgaggttctg 

tttcaagtaa 

gaaactctcc 

aagggattct 

cagagtgtca 

actggtttca 

ggctgtagat 

atgaggatcc 

atcatctttg 

atagggcagc 

aaaaaatcag 

ggaggagaga 

gagactataa 

caagtgattg 

gataaagtga 

gtctttactc 

gcagatgcat 

gaaactccat 



gtctagaaaa 
tgactggatt 
acgccctaat 
ccaatatgga 
accacgagaa 
ggagaaaaat 
atgttggtgg 
ccctagaagc 
caacaccatg 
gatgtattgc 
ccttcatatc 
tacgaaacaa 
tggctgatat 
ccaatggcag 
ttacccattc 
attttgatat 
aaaattgtca 
cttctgggaa 
caaaatgtaa 
ctcgtctacc 
gtagagggga 
agaatttttg 
tctcttcaga 
ctaatcgtgc 
taaagtcacg 
aaaacgttaa 
acaagatact 
tcatcacatc 
gaggagtatt 
taaattcgta 
aaattttcac 
agcctcacgt 
gtatcgtcaa 
ctacagcaga 
atagattgag 



ctttgtcaag 
caacatattt 
ttttatggcg 
cttgtacacg 
gagtttttct 
catgcaagcc 
agcacctgcg 
tagtttcgat 
cccccattct 
ctgcttcgca 
aacggatgcc 
tgaaaatgga 
tacccaaaag 
gcagtggaat 
tctagtaaac 
cttgtacaac 
cattttaact 
gtggtggact 
agttgaaaag 
taatccagga 
aaatctaggg 
tctcatggtt 
atcgatagtg 
ccaagatact 
tgaaggtgct 
ccacgatacg 
gtactatgat 
tattgggttc 
tggactggtt 
cgatcctgaa 
caacgagttt 
gattagtggg 
acatttgagt 
atctgtcctt 
cacccctcaa 



gaagaacatg 
aatagttccc 
gctttcttga 
tttgattatg 
gtcgtccgag 
ctggacaata 
ttcacattta 
gtgattgggg 
tcagccatgc 
tcaatgttct 
tctaaaattg 
ggagaagaaa 
gctcatgaaa 
tggcctactc 
acattgaaaa 
ttgcttaatc 
ggacataaaa 
attaattttg 
gatagaaaaa 
agcactaccg 
agtgtaggtg 
gaaaatagga 
tctgatccct 
attaataatg 
cgtgtatggt 
gccatctcta 
attgaaacaa 
tgtttgtgta 
gcacctggat 
gaaaaggaag 
gaaatgttgc 
tggaacaatg 
gatcacacca 
cctagagcaa 
gaaagaatac 



aaactgtcgt 
ccgtgtattt 
agcacaatag 
cgggtgcttt 
cacaacttat 
actacaacaa 
tgagcgggga 
gaacaagagg 
aactaaaact 
ttgtattgga 
gacaagcgca 
a taatgtctt 
ctgcctattc 
acactgtgga 
atctagggga 
catttggaaa 
acaatgaaaa 
tgggtgtgaa 
tatccgattt 
tcgatgacag 
aagtcgtatc 
aatttagtgt 
tcttttcact 
gccgagttag 
tggccaagga 
cggacgctat 
cagataaaga 
cgggaggcga 
ccgacgtgga 
acattatgaa 
t tggttttgg 
tagcttttga 
aagacatgtc 
cagaaggagg 
aactagcaag 



ggtgcacaac 
tgaagtgcac 
tttatgggga 
tctggacgaa 
caactcgtat 
caagaataag 
cggagaggga 
aggaagattt 
ggacaatgaa 
gaacccaggt 
agcatggata 
taaaaagacc 
caataccatc 
acctattgcc 
tagaaaactt 
aatgttgcta 
tgtggtgcct 
catgtggact 
ggcctgtatg 
aatagttttt 
cgacattaca 
ggataaagaa 
agaagtgact 
tgctcgtgta 
tgaaaatgcc 
ggagcgagct 
tttcaccgac 
tatgacacat 
aaaggtgaaa 
acagtgccct 
aaagtatata 
cgacccct tt 
ttattgtgta 
aggaggagga 
cactggtatt 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 



wo 01/38351 



235 



PCT/USOO/28888 



ttcaataaat 
gcagatttat 
ttatcctcca 
agtgctattc 
gaatttaatt 
gtaaaaga tg 
caggatcagg 
tctacctcca 
gaggcggtcg 
agaaatatgg 
gccacagaca 
acgattgaca 
acctttgtga 
aaaaacaaaa 
gtgttgtccg 
aatagacaaa 
ttggatgccg 
gtcactgccg 
gcggccaagg 
gtattgccca 
tgtcgtacat 
gaggaaatta 
agtcctgtga 
tatgaaggca 
gttggcgcat 
aatgttggaa 
cctcaaatgc 
tttgccaaga 
aagctcggta 
gagtttgcgt 
ggaattggag 
tgtgtatttg 
tacgggga ta 
ccagaata tg 
aaaatggctg 
gagcgccgag 
ggtcttttgt 
ttcaatgaca 
aagagaccta 
gcggccgaag 
gacgagaaga 
tctgttatca 
gacgccatga 
gcgaatgcag 
accaccaaac 
aagaaaagct 
acaaatcctg 
cagagtagac 
atttcattgg 
gaggaagggg 
aacatgaagg 
atcacatctg 
aaaaatgtag 
gcattgtcat 
tctctaaata 
gacgacgacg 
aacaaaggtg 
ggtgaagaaa 
aagggaaaga 
gatgtcattc 
agaaccagag 



tgggaaaatt 
tggccggggc 
gtaataaagg 
ggttggatct 
tgaacgccgt 
aagtagacct 
caaaagtaca 
tcaacaagga 
tgacagccaa 
gagaagaaag 
caaagggagg 
tgacaagttt 
cccatcgaca 
ccaccgactc 
agtacagacc 
ctccaattac 
agaagacaaa 
caggtatgga 
tgaatgatga 
taatgttaat 
tggaggatat 
taagaaccca 
cccaaatggc 
atgaaaaatt 
ttgattcggc 
tgttggttag 
aagccactta 
tgggagacat 
tgaattccat 
ctggatctgc 
gaggcacaag 
gcaatattgg 
cagattctgt 
atgaacaaac 
caattattcc 
acgctggtgt 
ttgccaagaa 
tga taaaatt 
gccacgcaga 
gacccgctgg 
gtatggagga 
acaacttgta 
cttcccgccc 
ctttcacccc 
taaagggtct 
gttatttgga 
ctatgatgat 
ctaatcctct 
ggcagaaatt 
aagtccctgc 
gaagtgttcc 
tatacaagat 
aagtgttgtg 
ttaatacagg 
ataaacaaat 
acgacgaaga 
actgtgtcac 
ctaaaacaag 
cggaggaaat 
ttgaccgttt 
tcttacaaca 



tgtagacaag 
agaaagtcag 
atcagcagga 
catgaaagtg 
gctcgccaaa 
acactttcat 
cgtctattgt 
aggggagatt 
cctggccact 
ggcagataga 
tatggtgagt 
gtacccgatg 
aattatgcaa 
tttattgttg 
cattgatatt 
tcgcatagag 
taataaaacg 
ttacttcccc 
tatgcatata 
cgacagaccg 
tctctcagaa 
ttggacattt 
tcgtcacatt 
cgagaggttg 
caatgatcca 
gacatggaac 
cagagccgat 
gaaaagagct 
gtatggtcat 
caatactgcc 
gcactcggtg 
ttgtggatta 
attctgcgtg 
tggcaaatat 
catcctagtc 
gggcatgatg 
aacataccat 
gaaatcaacc 
tgggtatgtt 
taagaaattg 
atggtttacc 
tgcctcacaa 
catagaagcg 
ttacaaaaag 
ccaatcattg 
tgtgatgaag 
tactagttcc 
agctctagcg 
taagacggtg 
tggttatttt 
tgcattttca 
ggttgagagc 
ttctacatct 
cgtcattgtt 
gttattgttt 
agcagaagaa 
ggaaaaaaag 
cgagaaaacg 
tgctagttcg 
actaaaagca 
atatagcaat 



aaaactggca 
gccaatacca 
tggttccaga 
tgcgaaaagg 
gtgagtagtg 
ctattgggat 
tgcaaggatg 
tttaggctgt 
cctctatgta 
agaggtgtgg 
caacctatcg 
accatgtgtc 
ctgagggata 
ttggacgtta 
gcagtcgcat 
gaaagtttgg 
tggtgcacca 
gagattgtgt 
gccccagcaa 
tacattggcg 
ctcgaaaagg 
aagggtcaaa 
attgaatcta 
gttagcttat 
gctgttagac 
gtaaaaactg 
cgagttgtga 
ggtctaaaca 
ttggccctaa 
tcaagtattt 
acggccaatc 
cagatggctc 
ca taatattg 
tattatgtga 
aactcgttaa 
aatatcgccc 
atgcttcact 
gataacaata 
gttccccata 
aaaagctttt 
tcttcaccta 
attgtagggg 
ggtacagaaa 
ggagcctttg 
attgcaagat 
aatcatgtgg 
cgagtcaaca 
ataaataacc 
acatcagttt 
aacgctggta 
gtcaagaatt 
gataagacgg 
gccaatactg 
acaaaggacg 
gttggagggg 
gaggacgaag 
atccctggac 
gagggagaaa 
ttgagtaaat 
acacattctt 
tgtacattat 



tgttgaaacc 
agtttaagga 
aaat tattgg 
cctataaaga 
tcggcgacaa 
tcttgaagct 
cctacttgac 
gtatggactc 
taggagaagg 
gagtaagaag 
tcaatcatgt 
agaataatct 
gattggtact 
ttgacgagtg 
catggaagaa 
gtctaagatt 
atacatctcc 
gtgacattaa 
gtttagagta 
cacacataac 
acttttctgt 
aacaatacga 
cgggaagaaa 
cagacagaat 
tgtggtcttc 
acattcttaa 
tgcagaacaa 
aagttggaca 
gagcacgttc 
ccaacatgtc 
agattacaga 
ttcctggtac 
taggtgatgg 
tggatattgc 
caaagggcat 
atgaacgtc t 
ttaatgaaaa 
ataagtttgc 
atccttcatt 
tggaagagga 
cgtggatggc 
tggagaaggg 
tgatggaggc 
tgaagaaggg 
ttttaccaaa 
agaattttgc 
agtttgatac 
acctgaaccc 
cttcttggag 
gcgtgcgttg 
tatctgttgt 
caataaaatc 
gattttctct 
tggcta tggc 
gaaagga tta 
aaaatggtga 
gaagcactaa 
gaaagggctc 
gtgggaagaa 
cttgcaccaa 
cttcctatat 



tgaaatgact 
acgcaacaag 
cggtatgtgc 
atccctctct 
ggttaaaaat 
gaagaaggcc 
tggtatagtt 
tgctttaacc 
agcaatctgt 
acactctatt 
tccctatcaa 
gtgcaccact 
tgaaaaaatg 
caa tcagatt 
tagcaactct 
catagaaaat 
caatatgaat 
tatgcagttt 
tatgcttcaa 
agctggaaaa 
tgaaaaagat 
tttctgtcat 
tatccgtgat 
ttatcgccgc 
tcgtctaatc 
gggaatcatc 
ggccaaggag 
aaatattatg 
gagccgtaaa 
agccaccgga 
aaatgctcga 
taagcagacg 
aggaatgata 
tctaaaaaat 
ccagtttgta 
agctgtcgct 
tagtgcagcg 
atcctttatc 
gattcttaga 
aggaattcat 
catggatgct 
taactggatt 
ggtgacgcaa 
aa ttacaccc 
aatagaggaa 
atcccatata 
gtcaaaagaa 
ttcttcagaa 
tctttcggca 
ggatgccacc 
gcccaacgcc 
ca tgattgct 
gagaagagga 
ttgtatacga 
cggtgaagac 
aaacgaagag 
caaggatgtt 
taagacggca 
agatgcgaga 
caatgaagag 
aacttcagtc 



2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 
5400 
5460 
5520 
5580 
5640 
5700 
5760 
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atgaaattgg 
atacgtaatc 
gacgccatgg 
ttgactcaag 
ccattaatgg 
acagatacat 
gctttaaaat 
ttgatggctg 
gtaaaagatg 
gacacaaata 
ggtatgagtg 
ggtactgtgt 
ctttgtaacg 
acggcagcag 



accaaagagt 
tctccaacaa 
ttgctgcagt 
accat tggaa 
gtgtgccttt 
gcgacatggc 
ttgagaacga 
agatgttgtt 
ctcaagatag 
caattatatt 
ttagtgaaat 
ggaattatac 
aacgtgtcaa 
tagaattggc 



agcagaccaa 
gaagaggcaa 
taaggttaag 
aaagtgcccc 
tgaagttgca 
ttgttgtcaa 
aagattggcc 
cggaggggat 
aaagttggtg 
tttgtttgag 
aaaagacgct 
tgatgaaaga 
tggaaattgt 
tgctgcatgt 



atggaaaatt 
gaaaagggag 
tttttcccag 
gtgtccatcc 
ctcaattctc 
tcattgtatt 
cggcaaattg 
aaactattgg 
aaatctttat 
tctttaaggt 
gttagaggtc 
ttttttggac 
aaattgtcct 
ctatcttgtg 



taatatctca 
ggccttttaa 
ttttagatgc 
cagaaacgcg 
taataggaaa 
ttgtcctctt 
gcctagatga 
cccaggaagt 
tgcctttaaa 
ttgctcagaa 
tggccttttc 
cattgtataa 
ttataactgg 
ttttgtaa 



attggatcaa 
gtctgaattg 
gtctagaaaa 
tgaagaaaaa 
acacaagtgc 
gtacacgcta 
ctctgtagat 
gttaaaaagg 
ttataaccat 
acctgtagct 
taccactaca 
catggatgaa 
tatttatcat 



5820 
5880 
5940 
6000 
6060 
6120 
6180 
6240 
6300 
6360 
6420 
6480 
6540 
6588 



<210> 159 
<211> 2183 
<212> PRT 
<213> SHRIMP 



<400> 159 



Met 


Gin 


Leu 


Arg 


Leu 


Glu 


Asn 


Phe 


Val 


Lys 


Glu 


Glu 


His 


Glu 


Thr 


Val 


1 








5 










10 










15 




Val 


Val 


His 


Asn 


Pro 


Ser 


Gly 


Met 


Thr 


Gly 


Phe 


Asn 


He 


Phe 


Asn 


Ser 








20 










25 










30 






Ser 


Pro 


Val 


Tyr 


Phe 


Glu 


Val 


His 


Asn 


Glu 


Met 


Asp 


Ala 


Leu 


He 


Phe 






35 










40 










45 








Met 


Ala 


Ala 


Phe 


Leu 


Lys 


His 


Asn 


Ser 


Leu 


Trp 


Gly 


Glu 


He 


Asn 


Ala 




50 










55 










60 










Asn 


Met 


Asp 


Leu 


Tyr 


Thr 


Phe 


Asp 


Tyr 


Ala 


Gly 


Ala 


Phe 


Leu 


Asp 


Glu 


65 










70 










75 










80 


Arg 


Trp 


Cys 


His 


His 


Glu 


Lys 


Ser 


Phe 


Ser 


Val 


Val 


Arg 


Ala 


Gin 


Leu 










85 










90 










95 




lie 


Asn 


Ser 


Tyr 


Tyr 


Lys 


Cys 


Arg 


Arg 


Lys 


He 


Met 


Gin 


Ala 


Leu 


Asp 








100 










105 










110 






Asn 


Asn 


Tyr 


Asn 


Asn 


Lys 


Asn 


Lys 


Lys 


Arg 


Lys 


Asn 


Val 


Gly 


Gly 


Ala 






115 










120 










125 








Pro 


Ala 


Phe 


Thr 


Phe 


Met 


Ser 


Gly 


Asp 


Gly 


Glu 


Gly 


Gly 


Lys 


Glu 


Ala 




130 










135 










140 










Leu 


Glu 


Ala 


Ser 


Phe 


Asp 


Val 


He 


Gly 


Gly 


Thr 


Arg 


Gly 


Gly 


Arg 


Phe 


145 










150 










155 










160 


Gly 


Val 


Asp 


Ser 


Thr 


Pro 


Cys 


Pro 


His 


Ser 


Ser 


Ala 


Met 


Gin 


Leu 


Lys 










165 










170 










175 




Leu 


Asp 


Asn 


Glu 


Gly 


Asn 


Tyr 


Gly 


Cys 


He 


Ala 


Cys 


Phe 


Ala 


Ser 


Met 








180 










185 










190 






Phe 


Phe 


Val 


Leu 


Glu 


Asn 


Pro 


Gly 


Asp 


Glu 


Ser 


Ser 


Phe 


He 


Ser 


Thr 






195 










200 










205 








Asp 


Ala 


Ser 


Lys 


He 


Gly 


Gin 


Ala 


Gin 


Ala 


Trp 


He 


Asp 


Glu 


Arg 


Leu 




210 










215 










220 










Arg 


Asn 


Asn 


Glu 


Asn 


Gly 


Gly 


Glu 


Glu 


Asn 


Asn 


Val 


Phe 


Lys 


Lys 


Thr 


225 










230 










235 










240 


Phe 


His 


Met 


Leu 


Ala 


Asp 


He 


Thr 


Gin 


Lys 


Ala 


His 


Glu 


Thr 


Ala 


Tyr 










245 










250 










255 




Ser 


Asn 


Thr 


He 


Pro 


Leu 


Gly 


Pro 


Asn 


Gly 


Arg 


Gin 


Trp 


Asn 


Trp 


Pro 








260 










265 










270 






Thr 


His 


Thr 


Val 


Glu 


Pro 


He 


Ala 


His 


Glu 


Phe 


Val 


Thr 


His 


Ser 


Leu 






275 










280 










285 








Val 


Asn 


Thr 


Leu 


Lys 


Asn 


Leu 


Gly 


Asp 


Arg 


Lys 


Leu 


Pro 


Arg 


Phe 


Asn 




290 










295 










300 










Phe 


Asp 


He 


Leu 


Tyr 


Asn 


Leu 


Leu 


Asn 


Pro 


Phe 


Gly 


Lys 


Met 


Leu 


Leu 


305 










310 










315 










320 
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Val 


Phe 


He 


Gin 


Asn 


Cys 


His 


He 


Leu 


Thr 


Gly 


His 


Lys 


Asn 


Asn 


Glu 










325 










330 










335 




Asn 


Val 


Val 


Pro 


Arg 


Gly 


Ser 


Ala 


Ser 


Gly 


Lys 


Trp 


Trp 


Thr 


He 


Asn 








340 










345 










350 






Phe 


Val 


Gly 


Val 


Asn 


Met 


Trp 


Thr 


Phe 


Gin 


Val 


Thr 


Lys 


Cys 


Lys 


Val 






355 










360 










365 








Glu 


Lys 


Asp 


Arg 


Lys 


He 


Ser 


Asp 


Leu 


Ala 


Cys 


Met 


Leu 


Pro 


Arg 


Leu 




370 










375 










380 










Pro 


Asn 


Pro 


Gly 


Ser 


Thr 


Thr 


Val 


Asp 


Asp 


Arg 


He 


Val 


Phe 


Lys 


Gly 


385 










390 










395 










400 


Phe 


Cys 


Arg 


Gly 


Glu 


Asn 


Leu 


Gly 


Ser 


Val 


Gly 


Glu 


Val 


Val 


Ser 


Asp 










405 










410 










415 




He 


Thr 


Gin 


Ser 


Val 


Lys 


Asn 


Phe 


Cys 


Leu 


Met 


Val 


Glu 


Asn 


Arg 


Lys 








420 










425 










430 






Phe 


Ser 


Val 


Asp 


Lys 


Glu 


Thr 


Gly 


Phe 


He 


Ser 


Ser 


Glu 


Ser 


He 


Val 






435 










440 










445 








Ser 


Asp 


Pro 


Phe 


Phe 


Ser 


Leu 


Glu 


Val 


Thr 


Gly 


Cys 


Arg 


Ser 


Asn 


Arg 




450 










455 










460 










Ala 


Gin 


Asp 


Thr 


He 


Asn 


Asn 


Gly 


Arg 


Val 


Ser 


Ala 


Arg 


Val 


Met 


Arg 


465 










470 










475 










480 


He 


Leu 


Lys 


Ser 


Arg 


Glu 


Gly 


Ala 


Arg 


Val 


Trp 


Leu 


Ala 


Lys 


Asp 


Glu 










485 










490 










495 




Asn 


Ala 


He 


He 


Phe 


Glu 


Asn 


Val 


Asn 


His 


Asp 


Thr 


Ala 


He 


Ser 


Thr 








500 










505 










510 






Asp 


Ala 


Met 


Glu 


Arg 


Ala 


He 


Gly 


Gin 


His 


Lys 


He 


Leu 


Tyr 


Tyr 


Asp 






515 










520 










525 








He 


Glu 


Thr 


Thr 


Asp 


Lys 


Asp 


Phe 


Thr 


Asp 


Lys 


Lys 


Ser 


Val 


He 


Thr 




530 










535 










540 










Ser 


He 


Gly 


Phe 


Cys 


Leu 


Cys 


Thr 


Gly 


Gly 


Asp 


Met 


Thr 


His 


Gly 


Gly 


545 










550 










555 










560 


Glu 


Arg 


Gly 


Val 


Phe 


Gly 


Leu 


Val 


Ala 


Pro 


Gly 


Ser 


Asp 


Val 


Glu 


Lys 










565 










570 










575 




Val 


Lys 


Glu 


Thr 


He 


He 


Asn 


Ser 


Tyr 


Asp 


Pro 


Glu 


Glu 


Lys 


Glu 


Asp 








580 










585 










590 






He 


Met 


Lys 


Gin 


Cys 


Pro 


Gin 


Val 


He 


Glu 


He 


Phe 


Thr 


Asn 


Glu 


Phe 






595 










600 










605 








Glu 


Met 


•Leu 


Leu 


Gly 


Phe 


Gly 


Lys 


Tyr 


He 


Asp 


Lys 


Val 


Lys 


Pro 


His 




610 










615 










620 










Val 


He 


Ser 


Gly 


Trp 


Asn 


Asn 


Val 


Ala 


Phe 


Asp 


Asp 


Pro 


Phe 


Val 


Phe 


625 










630 










635 










640 


Thr 


Arg 


He 


Val 


Lys 


His 


Leu 


Ser 


Asp 


His 


Thr 


Lys 


Asp 


Met 


Ser 


Tyr 










645 










650 










655 




Cys 


Val 


Ala 


Asp 


Ala 


Ser 


Thr 


Ala 


Glu 


Ser 


Val 


Leu 


Pro 


Arg 


Ala 


Thr 








660 










665 










670 






Glu 


Gly 


Gly 


Gly 


Gly 


Gly 


Glu 


Thr 


Pro 


Tyr 


Arg 


Leu 


Ser 


Thr 


Pro 


Gin 






675 










680 










685 








Glu 


Arg 


He 


Gin 


Leu 


Ala 


Ser 


Thr 


Gly 


He 


Phe 


Asn 


Lys 


Leu 


Gly 


Lys 




690 










695 










700 










Phe 


Val 


Asp 


Lys 


Lys 


Thr 


Gly 


Met 


Leu 


Lys 


Pro 


Glu 


Met 


Thr 


Ala 


Asp 


705 










710 










715 










720 


Leu 


Leu 


Ala 


Gly Ala 


Glu 


Ser 


Gin 


Ala 


Asn 


Thr 


Lys 


Phe 


Lys 


Glu 


Arg 










725 










730 










735 




Asn 


Lys 


Leu 


Ser 


Ser 


Ser 


Asn 


Lys 


Gly 


Ser 


Ala 


Gly 


Trp 


Phe 


Gin 


Lys 








740 










745 










750 






He 


He 


Gly 


Gly 


Met 


Cys 


Ser 


Ala 


He 


Arg 


Leu 


Asp 


Leu 


Met 


Lys 


Val 






755 










760 










765 








Cys 


Glu 


Lys 


Ala 


Tyr 


Lys 


Glu 


Ser 


Leu 


Ser 


Glu 


Phe 


Asn 


Leu 


Asn 


Ala 




770 










775 










780 










Val 


Leu 


Ala 


Lys 


Val 


Ser 


Ser 


Val 


Gly 


Asp 


Lys 


Val 


Lys 


Asn 


Val 


Lys 


785 










790 










795 










800 


Asp 


Glu 


Val 


Asp 


Leu 


His 


Phe 


His 


Leu 


Leu 


Gly 


Phe 


Leu 


Lys 


Leu 


Lys 
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805 










810 










815 




Lys 


Ala 


Gin 


Asp 
820 


Gin 


Ala 


Lys 


Val 


His 
825 


Val 


Tyr 


Cys 


Cys 


Lys 
830 


Asp 


Ala 


Tyr 


Leu 


Thr 
835 


Gly 


He 


Val 


Ser 


Thr 
840 


Ser 


He 


Asn 


Lys 


Glu 
845 


Gly 


Glu 


He 


Phe 


Arg 
850 


Leu 


Cys 


Met 


Asp 


Ser 
855 


Ala 


Leu 


Thr 


Glu 


Ala 
860 


Val 


Val 


Thr 


Ala 


Asn 


Leu 


Ala 


Thr 


Pro 


Leu 


Cys 


He 


Gly 


Glu 


Gly 


Ala 


He 


Cys 


Arg 


Asn 


865 










870 










875 










880 


Met 


Gly 


Glu 


Glu 


Arg 
885 


Ala 


Asp 


Arg 


Arg 


Gly 
890 


Val 


Gly 


Val 


Arg 


Arg 
895 


His 


Ser 


lie 


Ala 


Thr 
900 


Asp 


Thr 


Lys 


Gly 


Gly 
905 


Met 


Val 


Ser 


Gin 


Pro 
910 


He 


Val 


Asn 


His 


Val 
915 


Pro 


Tyr 


Gin 


Thr 


He 
920 


Asp 


Met 


Thr 


Ser 


Leu 
925 


Tyr 


Pro 


Met 


Thr 


Met 
930 


Cys 


Gin 


Asn 


Asn 


Leu 
935 


Cys 


Thr 


Thr 


Thr 


Phe 
940 


Val 


Thr 


Gin 


He 


Met 


Gin 


Leu 


Arg 


Asp 


Arg 


Leu 


Val 


Leu 


Glu 


Lys 


Met 


Lys 


Asn 


Lys 


Thr 


945 










950 










955 










960 


Thr 


Asp 


Ser 


Leu 


Leu 
965 


Leu 


Leu 


Asp 


Val 


He 
970 


Asp 


Glu 


Cys 


Asn 


Gin 
975 


He 


Val 


Leu 


Ser 


Glu 
980 


Tyr 


Arg 


Pro 


He 


Asp 
985 


He 


Ala 


Val 


Ala 


Ser 
990 


Trp 


Lys 


Asn 


Ser 


Asn 


Ser 


Asn 


Arg 


Gin 


Thr 


Pro 


He 


Thr 


Arg 


He 


Glu 


Glu 


Ser 






995 










1000 








1005 






Leu 


Gly 


Leu 


Arg 


Phe 


He 


Glu 


Asn 


Leu 


Asp 


Ala 


Glu 


Lys 


Thr 


Asn 


Asn 




1010 








1015 








1020 








Lys 


Thr 


Trp 


Cys 


Thr 


Asn 


Thr 


Ser 


Pro 


Asn 


Met 


Asn 


Val 


Thr 


Ala 


Ala 


1025 








1030 








1035 








1040 


Gly 


Met 


Asp 


Tyr 


Phe 


Pro 


Glu 


He 


Val 


Cys 


Asp 


He 


Asn 


Met 


Gin 


Phe 










1045 








1050 








1055 


Ala 


Ala 


Lys 


Val 


Asn 


Asp 


Asp 


Met 


His 


He 


Ala 


Pro 


Ala 


Ser 


Leu 


Glu 








1060 








1065 








1070 




Tyr 


Met 


Leu 


Gin 


Val 


Leu 


Pro 


He 


Met 


Leu 


He 


Asp 


Arg 


Pro 


Tyr 


He 






1075 








1080 








1085 






Gly 


Ala 


His 


He 


Thr 


Ala 


Gly 


Lys 


Cys 


Arg 


Thr 


Leu 


Glu 


Asp 


He 


Leu 




1090 








1095 








1100 








Ser 


Glu 


Leu 


Glu 


Lys 


Asp 


Phe 


Ser 


Val 


Glu 


Lys 


Asp 


Glu 


Glu 


He 


He 


1105 








1110 








1115 








1120 


Arg 


Thr 


His 


Trp 


Thr 


Phe 


Lys 


Gly 


Gin 


Lys 


Gin 


Tyr 


Asp 


Phe 


Cys 


His 










1125 








1130 








1135 


Ser 


Pro 


Val 


Thr 


Gin 


Met 


Ala 


Arg 


His 


He 


He 


Glu 


Ser 


Thr 


Gly 


Arg 








1140 








1145 








1150 




Asn 


lie 


Arg 


Asp 


Tyr 


Glu 


Gly 


Asn 


Glu 


Lys 


Phe 


Glu 


Arg 


Leu 


Val 


Ser 






1155 








1160 








1165 






Leu 


Ser 


Asp 


Arg 


He 


Tyr 


Arg 


Arg 


Val 


Gly 


Ala 


Phe 


Asp 


Ser 


Ala 


Asn 




1170 








1175 








1180 








Asp 


Pro 


Ala 


Val 


Arg 


Leu 


Trp 


Ser 


Ser 


Arg 


Leu 


He 


Asn 


Val 


Gly 


Met 


1185 








1190 








1195 








1200 


Leu 


Val 


Arg 


Thr 


Trp 


Asn 


Val 


Lys 


Thr 


Asp 


He 


Leu 


Lys 


Gly 


He 


He 










1205 








1210 








1215 


Pro 


Gin 


Met 


Gin 


Ala 


Tyr 


Ala 


Asp 


Arg 


Val 


Val 


Met 


Gin 


Asn 


Lys 


Ala 








1220 








1225 








1230 




Lys 


Glu 


Phe 


Ala 


Lys 


Met 


Gly 


Asp 


Met 


Lys 


Arg 


Ala 


Gin 


Lys 


Val 


Gly 






1235 








1240 








1245 






Gin 


Asn 


lie 


Met 


Lys 


Leu 


Gly 


Met 


Asn 


Ser 


Met 


Tyr 


Gly 


His 


Leu 


Ala 




1250 








1255 








1260 








Leu 


Arg 


Ala 


Arg 


Ser 


Ser 


Arg 


Lys 


Glu 


Phe 


Ala 


Ser 


Gly 


Ser 


Ala 


Asn 


1265 








1270 








1275 








1280 


Thr 


Ala 


Ser 


Ser 


He 


Ser 


Asn 


Met 


Ser 


Ala 


Thr 


Gly 


Gly 


He 


Gly 


Gly 



1285 1290 1295 
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Gly Thr Arg His Ser Val Thr Ala Asn Gin lie Thr Glu Asn Ala Arg 

1300 1305 1310 

Cys Val Phe Gly Asn lie Gly Cys Gly Leu Gin Met Ala Leu Pro Gly 

1315 1320 1325 

Thr Lys Gin Thr Tyr Gly Asp Thr Asp Ser Val Phe Cys Val His Asn 

1330 1335 1340 

He Val Gly Asp Gly Gly Met He Pro Glu Tyr Asp Glu Gin Thr Gly 
1345 1350 1355 1360 

Lys Tyr Tyr Tyr Val Met Asp He Ala Leu Lys Asn Lys Met Ala Ala 

1365 1370 1375 

He He Pro He Leu Val Asn Ser Leu Thr Lys Gly He Gin Phe Val 

1380 1385 1390 

Glu Arg Arg Asp Ala Gly Val Gly Met Met Asn He Ala His Glu Arg 

1395 1400 1405 

Leu Ala Val Ala Gly Leu Leu Phe Ala Lys Lys Thr Tyr His Met Leu 

1410 1415 1420 

His Phe Asn Glu Asn Ser Ala Ala Phe Asn Asp Met He Lys Leu Lys 
1425 1430 1435 1440 

Ser Thr Asp Asn Asn Asn Lys Phe Ala Ser Phe He Lys Arg Pro Ser 

1445 1450 1455 

His Ala Asp Gly Tyr Val Val Pro His Asn Pro Ser Leu He Leu Arg 

1460 1465 1470 

Ala Ala Glu Gly Pro Ala Gly Lys Lys Leu Lys Ser Phe Leu Glu Glu 

1475 1480 1485 

Glu Gly He His Asp Glu Lys Ser Met Glu Glu Trp Phe Thr Ser Ser 

1490 1495 1500 

Pro Thr Trp Met Ala Met Asp Ala Ser Val He Asn Asn Leu Tyr Ala 

1505 1510 1515 ' 1520 

Ser Gin He Val Gly Val Glu Lys Gly Asn Trp He Asp Ala Met Thr 

1525 1530 1535 

Ser Arg Pro He Glu Ala Gly Thr Glu Met Met Glu Ala Val Thr Gin 

1540 1545 1550 

Ala Asn Ala Ala Phe Thr Pro Tyr Lys Lys Gly Ala Phe Val Lys Lys 

1555 1560 1565 

Gly He Thr Pro Thr Thr Lys Leu Lys Gly Leu Gin Ser Leu He Ala 

1570 1575 1580 

Arg Phe Leu Pro Lys He Glu Glu Lys Lys Ser Cys Tyr Leu Asp Val 
1585 1590 1595 1600 

Met Lys Asn His Val Glu Asn Phe Ala Ser His He Thr Asn Pro Ala 

1605 1610 1615 

Met Met He Thr Ser Ser Arg Val Asn Lys Phe Asp Thr Ser Lys Glu 

1620 1625 1630 

Gin Ser Arg Pro Asn Pro Leu Ala He Asn Asn His Leu Asn Pro Ser 

1635 1640 1645 

Ser Glu He Ser Leu Gly Gin Lys Phe Lys Thr Val Thr Ser Val Ser 

1650 1655 1660 

Ser Trp Ser Leu Ser Ala Glu Glu Gly Glu Val Pro Ala Gly Tyr Phe 
1665 1670 1675 1680 

Asn Ala Gly Ser Val Arg Trp Asp Ala Thr Asn Met Lys Gly Ser Val 

1685 1690 1695 

Pro Ala Phe Ser Val Lys Asn Leu Ser Val Val Pro Asn Ala He Thr 

1700 1705 1710 

Ser Val Tyr Lys Met Val Glu Ser Asp Lys Thr Ala He Lys Ser Met 

1715 1720 1725 

He Ala Lys Asn Val Glu Val Leu Cys Ser Thr Ser Ala Asn Thr Gly 

1730 1735 1740 

Phe Ser Leu Arg Arg Gly Ala Leu Ser Phe Asn Thr Gly Val He Val 
1745 1750 1755 1760 

Thr Lys Asp Val Ala Met Ala Cys He Arg Ser Leu Asn Asn Lys Gin 

1765 1770 1775 

Met Leu Leu Phe Val Gly Gly Gly Lys Asp Tyr Gly Glu Asp Asp Asp 



CJi *i:p j 
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1780 



1785 



1790 



Asp Asp Asp Glu Glu Ala Glu Glu Glu Asp Glu Glu Asn Gly Glu Asn 

1795 1800 1805 

Glu Glu Asn Lys Gly Asp Cys Val Thr Glu Lys Lys lie Pro Gly Arg 

1810 1815 1820 

Ser Thr Asn Lys Asp Val Gly Glu Glu Thr Lys Thr Ser Glu Lys Thr 
1825 1830 1835 184' 

Glu Gly Glu Arg Lys Gly Ser Lys Thr Ala Lys Gly Lys Thr Glu Glu 

1845 1850 1855 

lie Ala Ser Ser Leu Ser Lys Cys Gly Lys Lys Asp Ala Arg Asp Val 

1860 1865 1870 

lie Leu Asp Arg Leu Leu Lys Ala Thr His Ser Ser Cys Thr Asn Asn 

1875 1880 1885 

Glu Glu Arg Thr Arg Val Leu Gin Gin Tyr Ser Asn Cys Thr Leu Ser 

1890 1895 1900 

Ser Tyr lie Thr Ser Val Met Lys Leu Asp Gin Arg Val Ala Asp Gin 
1905 1910 1915 192i 

Met Glu Asn Leu lie Ser Gin Leu Asp Gin lie Arg Asn Leu Ser Asn 

1925 1930 1935 

Lys Lys Arg Gin Glu Lys Gly Gly Pro Phe Lys Ser Glu Leu Asp Ala 

1940 1945 1950 

Met Val Ala Ala Val Lys Val Lys Phe Phe Pro Val Leu Asp Ala Ser 

1955 1960 1965 

Arg Lys Leu Thr Gin Asp His Trp Lys Lys Cys Pro Val Ser lie Pro 

1970 1975 1980 

Glu Thr Arg Glu Glu Lys Pro Leu Met Gly Val Pro Phe Glu Val Asn 
1985 1990 1995 200' 

Ser Leu lie Gly Lys His Lys Cys Thr Asp Thr Cys Asp Met Ala Cys 

2005 2010 2015 

Cys Gin Ser Leu Tyr Phe Val Leu Leu Tyr Thr Leu Ala Leu Lys Phe 

2020 2025 2030 

Glu Asn Glu Arg Leu Ala Arg Gin lie Gly Leu Asp Asp Ser Val Asp 

2035 2040 2045 

Leu Met Ala Glu Met Leu Phe Gly Gly Asp Lys Leu Leu Ala Gin Glu 

2050 2055 2060 

Val Leu Lys Arg Val Lys Asp Ala Gin Asp Arg Lys Leu Val Lys Ser 
2065 2070 2075 208i 

Leu Leu Pro Leu Asn Tyr Asn His Asp Thr Asn Thr lie lie Phe Leu 

2085 2090 2095 

Phe Glu Ser Leu Arg Phe Ala Gin Lys Pro Val Ala Gly Met Ser Val 

2100 2105 2110 

Ser Glu lie Lys Asp Ala Val Arg Gly Leu Ala Phe Ser Thr Thr Thr 

2115 2120 2125 

Gly Thr Val Trp Asn Tyr Thr Asp Glu Arg Phe Phe Gly Pro Leu Tyr 

2130 2135 2140 

Asn Met Asp Glu Leu Cys Asn Glu Arg Val Asn Gly Asn Cys Lys Leu 
2145 2150 2155 216i 

Ser Phe lie Thr Gly lie Tyr His Thr Ala Ala Val Glu Leu Ala Ala 

2165 2170 2175 

Ala Cys Leu Ser Cys Val Leu 



2180 



<210> 160 
<211> 4172 
<212> DNA 



<213> SHRIMP 



<400> 160 

atgactgaac aaggggatca aggaataaaa gtaaggaaat tacatggccc gagaggagaa 60 
agaggagaaa ctggtccagc aggagcagtt ggccctgcag gccctcaagg agaaagagga 120 
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gcaattggac 
ggagcaattg 
agaggagaaa 
gcaattggac 
ggaccggcag 
aatggacgcc 
gcaattggac 
ggagcaacag 
agaggagaaa 
agaagaggag 
ggagaaaaag 
agaggagaga 
ttcccaggac 
ggccctcaag 
gctggcctcc 
gcaactggtc 
ggagcaacag 
agaggagcaa 
gaaagaggag 
ggagaaagag 
gctggtcctc 
cctataggcc 
ggtcctcaag 
gctggccctc 
cctgctggcc 
ggccctatag 
gatggagcaa 
cgcccaggaa 
ggattaactg 
actggtgcaa 
gcaacaggcc 
ggagaaaatg 
caaggagaaa 
gaaaagggag 
ggagaaaggg 
ccaggaaggg 
cgcacaggaa 
ggattagctg 
attggaccta 
gaggatggtt 
ggaccgcaag 
ccagggcgac 
gcacctggta 
ggccctattg 
gacggagaaa 
acaactggcg 
ggtgcacaag 
gacggtccac 
cctagaggtc 
ggacctcgag 
gaaggtcctc 
catcgaggag 
ggggaaaaag 
cctggcacgg 
gcacaagggg 
gggccgatgg 
agaggagaga 
aaagatgccg 
tgggaaggaa 
cctgtgatga 
atacaagtaa 



cggcaggaaa 
gaccggcagg 
atggacgccc 
cggcaggaaa 
gaaaggatgg 
caggaagaga 
cggcaggaag 
gtataccagg 
ttggacgccc 
caacaggacg 
gagaagctgg 
ctggaccacc 
caccaggcga 
ggaaaagagg 
caggaagaga 
taccaggaag 
gccgcgcagg 
caggtatacc 
aaactggacc 
gagaaaatgg 
aaggagaaaa 
ctagaggaga 
gaagaagagg 
caggagaaag 
ctcaaggaga 
gtcctgctgg 
ctggccctat 
gagatggagc 
gacgcccagg 
tgggaaagaa 
gcgcaggaaa 
gacgcccagg 
caggattagc 
aaaatggacg 
gagagactgg 
atggtgtgga 
gggatggggc 
gcctgccagg 
taggcccacg 
ccacaggccc 
gagaaagagg 
aaggaaggga 
atgatggtct 
gacctcaagg 
tgggaccagc 
ctaaaggacc 
gatctagagg 
ccggtgaaga 
ctagaggtat 
gtccccgagg 
gaggtcctag 
aaaagggtga 
gtccaagagg 
aaggggttag 
aattaggtcc 
ggcgtacagg 
aaggaggtag 
taaatatcat 
atgaaatcgg 
tagaaaa tag 
tgacaatgca 



ggatggagca 
aaaggatgga 
aggaagagat 
ggatggagca 
agcagttggc 
tggagcagtt 
agatggagca 
aagggatggc 
aggaagagat 
cgcaggaaag 
taaggacggt 
gggaagggac 
aactggacca 
agaaacagga 
tggagcaatt 
gaatggtgtg 
aaaggatggg 
aggaagggat 
agcaggaagg 
acgcccagga 
aggagaaaat 
aactggtgca 
agcaacaggc 
aggagaaact 
aacaggatta 
ccctcaagga 
aggtcctgct 
aactggccct 
aagagatgga 
tggtgtggac 
ggatggagca 
aagagatgga 
tgggctgcca 
cccaggaaag 
tcctataggt 
tggt tctgtt 
aattggccct 
gatagatgga 
aggagaaaga 
tatgggcccc 
at taaaggga 
tggaataatg 
agagggacct 
aataagagga 
cggcaaggac 
tagaggatta 
agaaaaaggc 
aggtcctcaa 
tcgtggccgt 
aacaaaagga 
aggtatacaa 
taaaggagac 
t ttacgagga 
aggtcctaga 
ccaaggacca 
agatactggt 
aggaagaaag 
acaaaaatat 
agaagcatac 
aa tagggatg 
ctcaataaca 



gttggccctg 
gcagttggcc 
ggagcagttg 
gttggccctc 
cctgcaggcc 
ggccctgcag 
gttggccctg 
gtggacggtt 
ggagcagttg 
gatggtgcag 
tctatagggc 
ggcactgcag 
ccaggaaagg 
cccgttggac 
ggccctgcag 
gatggttcta 
gcagttggcc 
ggtgtggacg 
gacggttcag 
agagatgggg 
ggacgcccag 
atgggaaaga 
cgcgcaggaa 
ggaccagcag 
actggcagcc 
gaaaaaggag 
ggccctcaag 
ataggtcctg 
gcaactggcc 
ggttctacgg 
gttggccctg 
gcaactggcc 
ggaagagatg 
gatggggcaa 
cctgctggcc 
ggccctcaag 
gtaggtcctg 
aaggacggtt 
ggagaaactg 
caaggactaa 
cggccaggaa 
ggtcctaggg 
gaaggaagag 
ttaaaaggta 
ggaatagaag 
agaggttttc 
gatagagggc 
ggtcttagag 
tcaggacctc 
agaacaggaa 
ggaaaggaag 
cgtggagaac 
attcgaggcc 
ggggtgagag 
acaggtcctc 
cccatgggcc 
ggaaaaaatg 
tcaatcaccc 
attggaagat 
acaaatgagg 
actagaggaa 



caggccctca 
ctcaaggccc 
gccctcaagg 
aaggagaaag 
ctcaaggaga 
gccctccagg 
caggccctcc 
ctgtgggccc 
gccctgcagg 
ttggtcctgc 
ctcaaggaat 
cagaaagagg 
atggtgtgga 
ctaggggtga 
gccctccagg 
tcggccccca 
ctgcaggccc 
gttctgtggg 
ttggccctgc 
caactggccc 
gaagagatgg 
atggcgtgga 
aggatggggc 
gaagggacgg 
caggaagaga 
aaaatggacg 
gagaaaaagg 
ctggccctca 
ctataggtcc 
gtcctcaagg 
ctggccctcc 
ctataggtcc 
gagcaattgg 
ctggccctat 
ctcaaggagc 
gaaaaagagg 
caggccctaa 
ccgtgggtcc 
gacgaccagg 
gaggagctac 
aagatggtga 
gtcttcgagg 
atggtgcacc 
tccagggacg 
gccctagagg 
aaggaagaac 
taacaggccc 
gagaaagggg 
aaggaagtaa 
tacaaggcct 
gaagaatggg 
aaggcatcgc 
ctattggtgc 
gtgttcctgg 
aagggccagc 
ctcctggagc 
gccctaaagg 
atgctcgtgc 
cttatggaac 
acaaaaaaaa 
gaacatcggg 



aggagaaaga 
tccaggagaa 
agaaagagga 
aggagcaatt 
aagaggagaa 
agaaagagga 
aggagaaaga 
tcaaggagaa 
ccctcaagga 
aggccctcaa 
acaaggccca 
agaaagaggc 
tggttctgag 
accaggtcta 
agaaagagga 
aggaagaaga 
tccaggagaa 
ccctccagga 
tggccctcat 
tataggtcct 
agcaactggc 
cggttctatg 
agttggccct 
ttcagttggc 
tggagcaact 
cccaggaaga 
agaaaatgga 
aggagaaaca 
tagaggagaa 
aagaagagga 
aggagaaaga 
tgctggccct 
tcctcaagga 
gggtcctcca 
aactggtctt 
attaataggg 
aggagaaaca 
tcaaggagca 
aagggacggt 
gggagctcca 
aacaggtcct 
agaaaaagga 
tggtcccgct 
accaggaaga 
tcaagatgga 
aggagaaact 
tcaaggaaga 
agcacctggc 
cggcgtgcaa 
cactggcata 
gaaaattgga 
tggagcagac 
tcctggtaag 
ctatcctggc 
aggtcctcaa 
agtgggacca 
agcggacgga 
agagataatg 
tgatacaatc 
cgaatattgt 
tgtttttgtg 



180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 
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gtaagcaata agacagatta tatcctttta gttactttac tgatgccaga aagtgtttcc 3840 
tgtagaacag atgtcagtac aaatgcgagg tcagagaggg tgaatgctgt tagagaaaga 3900 
gaaagcaaat cgtacagatt tattaggccg tctgaccaat ctataggtac tcattcacgt 3960 
tcaaaaattg ccgtggtaat gtatccagac gcaagcatga gttactcagt tgatacatta 4020 
gacgctgatg tggcgcgaag agaaacaacg tctgtgcttt tattagcaga aaccatacac 4080 
ggggaaaaag atagaggttt ctatgctgat agaggaactg tagggaggtt gatggtacct 4140 
cccactgaag aagagttatt ggtattgcaa gc 4172 



<210> 161 
<211> 1389 
<212> PRT 
<213> SHRIMP 

<220> 

<221> VARIANT 

<222> (1) . . - (1389) 

<223> Xaa = Any Amino Acid 



<400> 161 



Met 


Thr 


Glu 


Gin 


Gly 


Asp 


Gin 


Gly 


He 


Lys 


Val 


Arg 


Lys 


Leu 


His 


Gly 


1 








5 










10 










15 




Pro 


Arg 


Gly 


Glu 


Arg 


Gly 


Glu 


Thr 


Gly 


Pro 


Ala 


Gly 


Ala 


Val 


Gly 


Pro 








20 










25 










30 






Ala 


Gly 


Pro 


Gin 


Gly 


Glu 


Arg 


Gly 


Ala 


He 


Gly 


Pro 


Ala 


Gly 


Lys 


Asp 






35 










40 










45 








Gly 


Ala 


Val 


Gly 


Pro 


Ala 


Gly 


Pro 


Gin 


Gly 


Glu 


Arg 


Gly 


Ala 


He 


Gly 




50 










55 










60 










Pro 


Ala 


Gly 


Lys 


Asp 


Gly 


Ala 


Val 


Gly 


Pro 


Gin 


Gly 


Pro 


Pro 


Gly 


Glu 


65 










70 










75 










80 


Arg 


Gly 


Glu 


Asn 


Gly 


Arg 


Pro 


Gly 


Arg 


Asp 


Gly 


Ala 


Val 


Gly 


Pro 


Gin 










85 










90 










95 




Gly 


Glu 


Arg 


Gly 


Ala 


He 


Gly 


Pro 


Ala 


Gly 


Lys 


Asp 


Gly 


Ala 


Val 


Gly 








100 










105 










110 






Pro 


Gin 


Gly 


Glu 


Arg 


Gly 


Ala 


He 


Gly 


Pro 


Ala 


Gly 


Lys 


Asp 


Gly 


Ala 






115 










120 










125 








Val 


Gly 


Pro 


Ala 


Gly 


Pro 


Gin 


Gly 


Glu 


Arg 


Gly 


Glu 


Asn 


Gly 


Arg 


Pro 




130 










135 










140 










Gly 


Arg 


Asp 


Gly 


Ala 


Val 


Gly 


Pro 


Ala 


Gly 


Pro 


Pro 


Gly 


Glu 


Arg 


Gly 


145 










150 










155 










160 


Ala 


He 


Gly 


Pro 


Ala 


Gly 


Arg 


Asp 


Gly 


Ala 


Val 


Gly 


Pro 


Ala 


Gly 


Pro 










165 










170 










175 




Pro 


Gly 


Glu 


Arg 


Gly 


Ala 


Thr 


Gly 


He 


Pro 


Gly 


Arg 


Asp 


Gly 


Val 


Asp 








180 










185 










190 






Gly 


Ser 


Val 


Gly 


Pro 


Gin 


Gly 


Glu 


Arg 


Gly 


Glu 


He 


Gly 


Arg 


Pro 


Gly 






195 










200 










205 








Arg 


Asp 


Gly 


Ala 


Val 


Gly 


Pro 


Ala 


Gly 


Pro 


Gin 


Gly 


Arg 


Arg 


Gly Ala 




210 










215 










220 










Thr 


Gly 


Arg 


Ala 


Gly 


Lys 


Asp 


Gly 


Ala 


Val 


Gly 


Pro 


Ala 


Gly 


Pro 


Gin 


225 










230 










235 










240 


Gly 


Glu 


Lys 


Gly 


Glu 


Ala 


Gly 


Lys 


Asp 


Gly 


Ser 


He 


Gly 


Pro 


Gin 


Gly 










245 










250 










255 




He 


Gin 


Gly 


Pro 


Arg 


Gly 


Glu 


Thr 


Gly 


Pro 


Pro 


Gly 


Arg 


Asp 


Gly 


Thr 








260 










265 










270 






Ala 


Ala 


Glu 


Arg 


Gly 


Glu 


Arg 


Gly 


Phe 


Pro 


Gly 


Pro 


Pro 


Gly 


Glu 


Thr 






275 










280 










285 








Gly 


Pro 


Pro 


Gly 


Lys 


Asp 


Gly 


Val 


Asp 


Gly 


Ser 


Glu 


Gly 


Pro 


Gin 


Gly 




2 90 










295 










300 










Lys 


Arg 


Gly 


Glu 


Thr 


Gly 


Pro 


Val 


Gly 


Pro 


Arg 


Gly 


Glu 


Pro 


Gly 


Leu 


305 










310 










315 










320 


Ala 


Gly 


Leu 


Pro 


Gly 


Arg 


Asp 


Gly 


Ala 


He 


Gly 


Pro 


Ala 


Gly 


Pro 


Pro 



325 330 335 
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Gly 


Glu 


Arg 


Gly 


Ala 


Thr 


Gly 


Leu 








340 










Ser 


He 


Gly 


Pro 


Gin 


Gly 


Arg 


Arg 






355 










360 


Asp 


Gly 


Ala 


Val 


Gly 


Pro 


Ala 


Gly 




370 










375 




Gly 


He 


Pro 


Gly 


Arg 


Asp 


Gly 


Val 


385 










390 






Glu 


Arg 


Gly 


Glu 


Thr 


Gly 


Pro 


Ala 










405 








Ala 


Gly 


Pro 


His 


Gly 


Glu 


Arg 


Gly 








420 










Gly 


Ala 


Thr 


Gly 


Pro 


He 


Gly 


Pro 






435 










440 


Glu 


Asn 


Gly 


Arg 


Pro 


Gly 


Arg 


Asp 




450 










455 




Arg 


Gly 


Glu 


Thr 


Gly 


Ala 


Met 


Gly 


465 










470 






Gly 


Pro 


Gin 


Gly Arg 


Arg 


Gly 


Ala 










485 








Ala 


Val 


Gly 


Pro 


Ala 


Gly 


Pro 


Pro 








500 










Ala 


Gly 


Arg 


Asp 


Gly 


Ser 


Val 


Gly 






515 










520 


Gly 


Leu 


Thr 


Gly 


Ser 


Pro 


Gly 


Arg 




530 










535 




Pro 


Ala 


Gly 


Pro 


Gin 


Gly 


Glu 


Lys 


545 










550 






Asp 


Gly 


Ala 


Thr 


Gly 


Pro 


He 


Gly 










565 








Gly 


Glu 


Asn 


Gly Arg 


Pro 


Gly 


Arg 








580 










Pro 


Ala 


Gly 


Pro 


Gin 


Gly 


Glu 


Thr 






595 










600 


Asp 


Gly 


Ala 


Thr 


Gly 


Pro 


He 


Gly 




610 










615 




Gly 


Lys 


Asn 


Gly 


Val 


Asp 


Gly 


Ser 


625 










630 






Ala 


Thr 


Gly 


Arg 


Ala 


Gly 


Lys 


Asp 










645 








Pro 


Gly 


Glu 


Arg 


Gly 


Glu 


Asn 


Gly 








660 










Gly 


Pro 


He 


Gly 


Pro 


Ala 


Gly 


Pro 






675 










680 


Leu 


Pro 


Gly 


Arg 


Asp 


Gly 


Ala 


He 




690 










695 




Asn 


Gly 


Arg 


Pro 


Gly 


Lys 


Asp 


Gly 


705 










710 






Gly 


Glu 


Arg 


Gly 


Glu 


Thr 


Gly 


Pro 










725 








Ala 


Thr 


Gly 


Leu 


Pro 


Gly Arg 


Asp 








740 










Gin 


Gly 


Lys 


Arg 


Gly 


Leu 


He 


Gly 






755 










760 


Gly 


Pro 


Val 


Gly 


Pro 


Ala 


Gly 


Pro 




770 










775 




Leu 


Pro 


Gly 


He 


Asp 


Gly 


Lys 


Asp 


785 










790 






He 


Gly 


Pro 


He 


Gly 


Pro 


Arg 


Gly 










805 








Gly 


Arg 


Asp 


Gly 


Glu 


Asp 


Gly 


Ser 



Pro 


Gly Arg 


Asn 


Gly 


Val 


Asp 


Gly 


345 










350 






Gly 


Ala 


Thr 


Gly 


Arg 


Ala 


Gly 


Lys 










365 








Pro 


Pro 


Gly 


Glu 


Arg 


Gly 


Ala 


Thr 








380 










Asp 


Gly 


Ser 


Val 


Gly 


Pro 


Pro 


Gly 






395 










400 


Gly 


Arg 


Asp 


Gly 


Ser 


Val 


Gly 


Pro 




410 










415 




Glu 


Asn 


Gly 


Arg 


Pro 


Gly 


Arg 


Asp 


425 










430 






Ala 


Gly 


Pro 


Gin 


Gly 


Glu 


Lys 


Gly 










445 








Gly 


Ala 


Thr 


Gly 


Pro 


He 


Gly 


Pro 








460 










Lys 


Asn 


Gly 


Val 


Asp 


Gly 


Ser 


Met 






475 










480 


Thr 


Gly 


Arg 


Ala 


Gly 


Lys 


Asp 


Gly 




490 










495 




Gly 


Glu 


Arg 


Gly 


Glu 


Thr 


Gly 


Pro 


505 










510 






Pro 


Ala 


Gly 


Pro 


Gin 


Gly 


Glu 


Thr 










525 








Asp 


Gly 


Ala 


Thr 


Gly 


Pro 


He 


Gly 








540 










Gly 


Glu 


Asn 


Gly 


Arg 


Pro 


Gly 


Arg 






555 










560 


Pro 


Ala 


Gly 


Pro 


Gin 


Gly 


Glu 


Lys 




570 










575 




Asp 


Gly 


Ala 


Thr 


Gly 


Pro 


He 


Gly 


585 










590 






Gly 


Leu 


Thr 


Gly 


Arg 


Pro 


Gly 


Arg 










605 








Pro 


Arg 


Gly 


Glu 


Thr 


Gly 


Ala 


Met 








620 










Thr 


Gly 


Pro 


Gin 


Gly 


Arg 


Arg 


Gly 






635 










640 


Gly 


Ala 


Val 


Gly 


Pro 


Ala 


Gly 


Pro 




650 










655 




Arg 


Pro 


Gly 


Arg 


Asp 


Gly 


Ala 


Thr 


665 










670 






Gin 


Gly 


Glu 


Thr 


Gly 


Leu 


Ala 


Gly 










685 








Gly 


Pro 


Gin 


Gly 


Glu 


Lys 


Gly 


Glu 








700 










Ala 


Thr 


Gly 


Pro 


Met 


Gly 


Pro 


Pro 






715 










720 


He 


Gly 


Pro 


Ala 


Gly 


Pro 


Gin 


Gly 




730 










735 




Gly 


Val 


Asp 


Gly 


Ser 


Val 


Gly 


Pro 


745 










750 






Arg 


Thr 


Gly 


Arg 


Asp 


Gly 


Ala 


He 










765 








Lys 


Gly 


Glu 


Thr 


Gly 


Leu 


Ala 


Gly 








780 










Gly 


Ser 


Val 


Gly 


Pro 


Gin 


Gly 


Ala 






795 










800 


Glu 


Arg 


Gly 


Glu 


Thr 


Gly 


Arg 


Pro 




810 










815 




Thr 


Gly 


Pro 


Met 


Gly 


Pro 


Gin 


Gly 
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820 825 830 



Leu 


Arg 


Gly 
835 


Ala 


Thr 


Gly 


Ala 


Pro 
840 


Gly 


Pro 


Gin 


Gly 


Glu 
845 


Arg 


Gly 


Leu 


Lys 


Gly 
850 


Arg 


Pro 


Gly 


Lys 


Asp 
855 


Gly 


Glu 


Thr 


Gly 


Pro 
860 


Pro 


Gly 


Arg 


Gin 


Gly 


Arg 


Asp 


Gly 


He 


Met 


Gly 


Pro 


Arg 


Gly 


Leu 


Arg 


Gly 


Glu 


Lys 


Gly 


865 










870 










875 










880 


Ala 


Pro 


Gly 


Asn 


Asp 
885 


Gly 


Leu 


Glu 


Gly 


Pro 
890 


Glu 


Gly 


Arg 


Asp 


Gly 
895 


Ala 


Pro 


Gly 


Pro 


Ala 
900 


Gly 


Pro 


He 


Gly 


Pro 
905 


Gin 


Gly 


He 


Arg 


Gly 
910 


Leu 


Lys 


Gly 


He 


Gin 
915 


Gly 


Arg 


Pro 


Gly 


Arg 
920 


Asp 


Gly 


Glu 


Met 


Gly 
925 


Pro 


Ala 


Gly 


Lys 


Asp 
930 


Gly 


He 


Glu 


Gly 


Pro 

935 


Arg 


Gly 


Gin 


Asp 


Gly 
940 


Thr 


Thr 


Gly 


Ala 


Lys 


Gly 


Pro 


Arg 


Gly 


Leu 


Arg 


Gly 


Phe 


Gin 


Gly 


Arg 


Thr 


Gly 


Glu 


Thr 


945 










950 










955 










960 


Gly 


Ala 


Gin 


Gly 


Ser 
965 


Arg 


Gly 


Glu 


Lys 


Gly 
970 


Asp 


Arg 


Gly 


Leu 


Thr 

975 


Gly 


Pro 


Gin 


Gly 


Arg 
980 


Asp 


Gly 


Pro 


Pro 


Gly 
985 


Glu 


Glu 


Gly 


Pro 


Gin 
990 


Gly 


Leu 


Arg 


Gly 


Glu 


Arg 


Gly 


Ala 


Pro 


Gly 


Pro 


Arg 


Gly 


Pro 


Arg 


Gly 


He 


Arg 






995 










1000 








1005 






Gly 


Arg 


Ser 


Gly 


Pro 


Gin 


Gly 


Ser 


Asn 


Gly 


Val 


Gin 


Gly 


Pro 


Arg 


Gly 




1010 








1015 








1020 








Pro 


Arg 


Gly 


Thr 


Lys 


Gly 


Arg 


Thr 


Gly 


He 


Gin 


Gly 


Leu 


Thr 


Gly 


He 


1025 








1030 








1035 








1040 


Glu 


Gly 


Pro 


Arg 


Gly 


Pro 


Arg 


Gly 


He 


Gin 


Gly 


Lys 


Glu 


Gly 


Arg 


Met 










1045 








1050 








1055 


Gly 


Lys 


He 


Gly 


His 


Arg 


Gly 


Glu 


Lys 


Gly 


Asp 


Lys 


Gly 


Asp 


Arg 


Gly 








1060 








1065 








1070 




Glu 


Gin 


Gly 


He 


Ala 


Gly 


Ala 


Asp 


Gly 


Glu 


Lys 


Gly 


Pro 


Arg 


Gly 


Leu 






1075 








1080 








1085 






Arg 


Gly 


He 


Arg 


Gly 


Pro 


He 


Gly 


Ala 


Pro 


Gly 


Lys 


Pro 


Gly 


Thr 


Glu 




1090 








1095 








1100 








Gly 


Val 


Arg 


Gly 


Pro 


Arg 


Gly 


Val 


Arg 


Gly 


Val 


Pro 


Gly 


Tyr 


Pro 


Gly 


1105 








1110 








1115 








1120 


Ala 


Gin 


Gly 


Glu 


Leu 


Gly 


Pro 


Gin 


Gly 


Pro 


Thr 


Gly 


Pro 


Gin 


Gly 


Pro 










1125 








1130 








1135 


Ala 


Gly 


Pro 


Gin 


Gly 


Pro 


Met 


Gly 


Arg 


Thr 


Gly 


Asp 


Thr 


Gly 


Pro 


Met 








1140 








1145 








1150 




Gly 


Pro 


Pro 


Gly 


Ala 


Val 


Gly 


Pro 


Arg 


Gly 


Glu 


Lys 


Gly 


Gly 


Arg 


Gly 






1155 








1160 








1165 






Arg 


Lys 


Gly 


Lys 


Asn 


Gly 


Pro 


Lys 


Gly 


Ala 


Asp 


Gly 


Lys 


Asp 


Ala 


Val 




1170 








1175 








1180 








Asn 


He 


He 


Gin 


Lys 


Tyr 


Ser 


He 


Thr 


His 


Ala 


Arg 


Ala 


Glu 


He 


Met 


1185 








1190 








1195 








1200 


Trp 


Glu 


Gly 


Asn 


Glu 


He 


Gly 


Glu 


Ala 


Tyr 


He 


Gly 


Arg 


Ser 


Tyr 


Gly 










1205 








1210 








1215 


Thr 


Asp 


Thr 


He 


Pro 


Val 


Met 


He 


Glu 


Asn 


Arg 


He 


Gly 


Met 


Thr 


Asn 








1220 








1225 








1230 




Glu 


Asp 


Lys 


Lys 


Asn 


Glu 


Tyr 


Cys 


He 


Gin 


Val 


Met 


Thr 


Met 


His 


Ser 






1235 








1240 








1245 






He 


Thr 


Thr 


Arg 


Gly 


Arg 


Thr 


Ser 


Gly 


Val 


Phe 


Val 


Val 


Ser 


Asn 


Lys 




1250 








1255 








1260 








Thr 


Asp 


Tyr 


He 


Leu 


Leu 


Val 


Thr 


Leu 


Leu 


Met 


Pro 


Glu 


Ser 


Val 


Ser 


1265 








1270 








1275 








1280 


Cys 


Arg 


Thr 


Asp 


Val 


Ser 


Thr 


Asn 


Ala 


Arg 


Ser 


Val 


Asn 


Ala 


Val 


Arg 










1285 








1290 








1295 


Glu 


Arg 


Glu 


Ser 


Lys 


Ser 


Tyr 


Arg 


Phe 


He 


Arg 


Pro 


Ser 


Asp 


Gin 


Ser 



1300 1305 1310 
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lie Gly Thr His Ser Arg Ser Lys lie Ala Val Val Met Tyr Pro Asp 

1315 1320 1325 

Ala Ser Met Ser Tyr Ser Val Asp Thr Leu Asp Ala Asp Val Ala Arg 

1330 1335 1340 

Arg Glu Thr Thr Ser Val Leu Leu Leu Ala Glu Thr lie His Gly Glu 
1345 1350 1355 1360 

Lys Asp Arg Gly Phe Tyr Ala Asp Arg Gly Thr Val Gly Arg Leu Met 

1365 1370 1375 

Val Pro Pro Thr Glu Glu Glu Leu Leu Val Leu Gin Xaa 
1380 1385 



<210> 162 
<211> 201 
<212> DNA 
<213> SHRIMP 



<400> 162 

atggccgtaa cagagatacc ttgtggtact cggaacattg cagaagaaga cgtagaatta 60 
gagttgatac ttgtaacagc agaagcagaa gtccgagagg cgatggcagc agctctggcg 120 
gcagcaatca taggggctgt agtggtgcaa atcggcagag tgttggatga agtcgtggca 180 
gcagaggtag agttgatgtg a 201 



<210> 163 
<211> 64 
<212> PRT 
<213> SHRIMP 



<400> 163 
Met Ala Val Thr 
1 

Asp Val Glu Leu 

20 

Glu Ala Met Ala 
35 

lie Gly Arg Val 
50 



Glu lie Pro Cys 
5 

Glu Leu lie Leu 

Ala Ala Ala Ala 
40 

Leu Asp Glu Val 
55 



Gly Thr Arg Asn 
10 

Val Thr Ala Glu 
25 

lie lie Gly Ala 

Val Ala Ala Glu 
60 



lie Ala Glu Glu 
15 

Ala Glu Val Arg 
30 

Val Val Val Gin 
45 

Val Glu Leu Met 



<210> 164 
<211> 627 
<212> DNA 
<213> SHRIMP 



<400> 164 

atgcacatgt 

atatctatag 

gcctaccctg 

cactttaact 

ggccaaggaa 

ttacttggag 

ttgtatgcaa 

tctccaggtc 

aaaccaacag 

gaagacatcg 

gatctgttta 



ggggggttta 
ttgtaaccaa 
ttgaatctga 
ttgtaaacgg 
cgtccgattc 
acggagacca 
gagtatacaa 
taaatgctac 
ttttgaagaa 
atgatacaaa 
aggttgggga 



cgccgctata 
catagaactt 
aataataaac 
cacattacaa 
tgaactggta 
cacactaaat 
taatacaaag 
aggaagggaa 
aaataggatc 
taggcattat 
aaaataa 



ctggcgggtt 
aacaagaaat 
ttgaccatta 
accaggaact 
aaaaagaaag 
gtaaacaaag 
agggatataa 
ttttcagcta 
aacacacttg 
ctgttaagta 



tgacattgat 
tggacaagaa 
acggtgttgc 
atggaaaggt 
gagacataat 
ccgaatctaa 
cagtggactc 
acaaatttgt 
tgtttggagc 
tgcgattttc 



actcgtggtt 
ggataaagac 
tagaggaaac 
atatgtagct 
cctcacatct 
agaattagaa 
tgtttcactg 
attatatttc 
aacgtttgac 
tcctggcaat 



60 

120 

180 

240 

300 

360 

420 

480 

54 0 

600 

627 



<210> 165 
<211> 204 
<212> PRT 
<213> SHRIMP 
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<400> 165 

Met His Met Trp Gly Val Tyr Ala Ala lie Leu Ala Gly Leu Thr Leu 

15 10 15 

lie Leu Val Val He Ser He Val Val Thr Asn He Glu Leu Asn Lys 

20 25 30 

Lys Leu Asp Lys Lys Asp Lys Asp Ala Tyr Pro Val Glu Ser Glu He 

35 40 45 

He Asn Leu Thr He Asn Gly Val Arg Asn His Phe Asn Phe Val Asn 

50 55 60 

Gly Thr Leu Gin Thr Arg Asn Tyr Gly Lys Val Tyr Val Ala Gly Gin 
65 70 75 80 

Gly Thr Ser Asp Ser Glu Leu Val Lys Lys Lys Gly Asp He He Leu 

85 90 95 

Thr Ser Leu Leu Gly Asp Gly Asp His Thr Leu Asn Val Asn Lys Ala 

100 105 110 

Glu Ser Lys Glu Leu Glu Leu Tyr Ala Arg Val Tyr Asn Asn Thr Lys 

115 120 125 

Arg Asp He Thr Val Asp Ser Val Ser Leu Ser Pro Gin Ala Thr Gly 

130 135 140 

Arg Glu Phe Ser Ala Asn Lys Phe Val Leu Tyr Phe Lys Pro Thr Val 
145 150 155 160 

Leu Lys Lys Asn Arg He Asn Thr Leu Val Phe Gly Ala Thr Phe Asp 

165 170 175 

Glu Asp He Asp Asp Thr Asn Arg His Tyr Leu Leu Ser Met Arg Phe 

180 185 190 

Ser Pro Gly Asn Asp Leu Phe Lys Val Gly Glu Lys 
195 200 



<210> 166 
<211> 12543 
<212> DNA 
<213> SHRIMP 



<400> 166 

atgtttaagg 

gatcacttaa 

ctctccaaca 

ggggaacaaa 

atggggcatc 

cttctgtatg 

aaattgataa 

ggcatgtaca 

aatgagcgtt 

actggaaagc 

agtgcagaga 

gcaacagacg 

aaggtaatat 

gaaaatgtgg 

atgcccagaa 

attgatattg 

gcgttcccct 

aacgaaaaca 

tgcaaaacac 

atggccaata 

ccttacagaa 

aagtttctgg 

gagaagatgg 

aatgaggaag 

aacaagcctc 



caaatgtgct 
taaaatgtgc 
aactgcccga 
ttctagtcac 
tagctgtaca 
ctggatgtaa 
tgtacagtca 
aagtaaaaaa 
cttgccagaa 
ttctcaggga 
aactatgtgt 
tgccagtgtt 
tctacccaac 
tgttgttaaa 
tcgtcgaacg 
gcgggagtga 
atcaccatgc 
tggtgacttc 
tggaaagttt 
ccgacgattg 
acaaaagttt 
cagaaggagg 
cgtgtaaagg 
gaggatctgg 
ctccagcccc 



caacctagga 
caaccagatg 
gtacgacaac 
tgataacttg 
agatagagtc 
tgttgttgaa 
acctgcaact 
ggttccagaa 
tgaaaatagg 
acttattat t 
gactcctgcc 
taaggttgcc 
catccttgaa 
gtcaatctta 
tgttgaaagt 
taactggaa t 
gcagttggtg 
tctgggtgtc 
cgagaaaaag 
gagaagaaag 
gattatggat 
tggcttgtta 
attggacga t 
agggaaatct 
taaacaaatc 



ggaggcaagt 
aaagaggaac 
agacgtctac 
accaagaatg 
ggcggagacg 
tatgacacag 
ttaaaggatg 
attactggtg 
cgcatggatg 
atgatgaggc 
ttcagagagt 
ttcatcacca 
gaggaacgtc 
aaggtacaac 
atgatcaaga 
cttcccggac 
ggagaaaaga 
gtgaaggcag 
tgtctatatc 
atcctttttt 
caagatttct 
ttgaccaaga 
tctggagatg 
ggcggcggtt 
cctccccttg 



ttcttgaaag 
caacgactct 
ctcttctcct 
gcaaccctct 
gcagtgtcaa 
tgaacagagg 
ttgcaaaatc 
atcaatttct 
aagaaggacc 
tttatgagga 
ttctgggttg 
atgcctcttt 
ttgcagctgt 
ttgaattact 
aaactgttgc 
attgtaaagt 
agaacattct 
atagagccga 
tcgaaaacct 
ctgaacttgg 
gtacaattgg 
caaatgctac 
gagacgacga 
ctggagacga 
ctgccaatgt 



tgatgtgcga 
acgcatctgt 
tctcaacgaa 
ggtcaaacaa 
ccccaacaat 
aaacgatgga 
caagaagaaa 
ggacaagctg 
ccacgttgga 
agaaacttct 
tggaagaaca 
gatggggctc 
ttcagacaca 
gagtgaatgt 
ctgtttcaag 
gagcgacact 
ctccatcagc 
agagtggatg 
catgggttct 
gccagaaatg 
catgtgttac 
cctattaaag 
ggaagaagat 
gaacaaca tc 
ctataatagc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 
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atcatcaatg 
ttctttctga 
aaagccatga 
aggaatttga 
caaattgttg 
caacaaagac 
tggcaattta 
cctcctccac 
tcaacaacac 
gatgaaaacc 
acattcgcca 
gctgagcggt 
ataattcgtg 
gcaattaacc 
tgcagaatgg 
agcaaattgt 
acatttaaaa 
gcccccatgg 
ttggcagaca 
agaaaatgtg 
gaaagcgatg 
ggaggtgcaa 
ggcgaatttg 
aacgtacaag 
agtttatgtc 
ttgtgtgatg 
attaatagaa 
gaaattgtca 
cctccctttt 
aacaatacca 
ctgaaagatc 
agtgatgtta 
tctattttag 
ctatcttcag 
attaacacgt 
gtatcagata 
caagatttag 
gtgtttaatc 
gatgcagcaa 
aaaatgatgg 
gagttctttt 
ctgtggacaa 
ggaagtttag 
aatgtggacg 
gcattttttg 
aatgacgaca 
gacaacgaca 
aacgtaatta 
tgggttgaag 
atggctgcaa 
gtcgctatgt 
ttgcgtctat 
tggtgcagct 
ggtagatttt 
attggacaag 
tcgtttgatt 
tctcacgtcc 
aatttggtta 
gaaaaacgtg 
gactctgtgc 
gtcagtgtta 



acgatgataa 
gtgacattga 
gcaagaagaa 
ctggcgttaa 
tgagatatga 
tcaagaataa 
cccacaaaga 
ctactcaaag 
cagatatggt 
agatagcttc 
cgggtgctga 
tttgtgcgct 
aagattttga 
gagaacttag 
gagttgcagc 
gcaaaattaa 
cgtttgatta 
agactttcca 
agaagtcgga 
acaacgacga 
aagaaagtga 
caggtgatta 
gttctagtga 
atgatgtgtt 
atcgccaaaa 
tactaacatt 
gatctttgag 
agcctaatgt 
gttcttgtgc 
tgtctaacga 
tgactgtagc 
ctgcagcttt 
ataataagag 
cattaccttt 
ttagacctag 
agagtgttaa 
agtctgttgt 
aatcgctttc 
attatttcaa 
aagaatgtat 
caaatttgat 
ttctagagac 
ttaaggattg 
aatctggcgc 
gtaaagatct 
ataaggactg 
ttagatctgt 
agaaattgaa 
acgaatatta 
cagaagaggt 
atcgtatcat 
ctgtggataa 
gtacaggcaa 
acgatattga 
ttgcaagtaa 
ttaatggatt 
taatgggacg 
gtgtttcctc 
ttgggaaggt 
gtgcttcttc 
aatctttgga 



gcttgaccag 
caactctcct 
ttgtgttgaa 
caccatcaag 
gaacagaaac 
cacatgggat 
atcactaaaa 
tgaagaagaa 
ttctagtcta 
tttggaaaat 
caagaacgac 
gtacaacatc 
atttgaagga 
catatatgta 
atacgaaaat 
gtatgatgaa 
tgatgaatct 
acgacttttt 
gaagagggag 
cgtttctgct 
tgaagagagt 
ttatggtggt 
tgatgaaaat 
tagggatgtt 
gtatgtttca 
ggacaattct 
aatgaggaat 
caatagtgaa 
ttctcttaat 
aaagtgcata 
tctccgtttc 
aggatttgtg 
gtgggatgcc 
gggtatctat 
aagtttagcc 
taatggtttc 
gctgggttct 
tgagaaggaa 
ggaccataac 
catgcgtaca 
aacaagatac 
tgttagggag 
ggacggtagt 
cgtatttgag 
ggacacaaat 
gaaagcgcag 
tgaaaatttc 
gataaaaccc 
tccacgaaac 
tctcaagaca 
gatgcaggga 
aatggtacca 
gggagattct 
agaagatcct 
tgatattgtc 
tgatactagt 
gcgaagcagg 
tttggccgac 
aatgaaacgt 
aaatgatttg 
cgagttccgc 



attgtgtgtt 
cttctggcaa 
cgcgtgaaac 
tttgacacaa 
ctgaaattgc 
gcactatcaa 
cctcctacaa 
gctgctgctg 
gaagaaggtg 
atcaagaagc 
accatattcg 
acttctcatc 
ggttttgaga 
cccaaaagtg 
tccatggaga 
agcacaatgg 
gacaagtctt 
gccagcgtca 
aaactctatc 
cgccagatcc 
gacgatgagg 
gatgatgaag 
gtaccatctg 
aatttcatca 
actgtcattg 
gcagctgaat 
tgggtagtgc 
gacggaaccg 
aacttcaaga 
aaactactcc 
aacacaatgg 
aaaaaggaca 
ataaaacagt 
gaaaatgtga 
agattggcat 
ttttctggta 
acagttgttg 
cttaaagaaa 
gtttcaaaac 
gctttgaatg 
ggttctggaa 
tgct ttaata 
gacatgctaa 
ctatcagaat 
ttgga tgctg 
gtcgccaagg 
ataaattcaa 
acccctagca 
aagtctactt 
gaaatgtccc 
gaaagtgtgc 
ctcattcgat 
cccaaaaagg 
ctccactttt 
attgaagaaa 
aattggttgc 
cttctttcta 
aaaaatagcg 
attgcaaggc 
ttggattgtt 
gccaagactc 



tcttcaagag 
tggagtttct 
cagaaacaaa 
tcatgccttt 
ccagagatac 
agggcaaatt 
ttgaagaatt 
ctgctgctgc 
caacatccac 
tcttgtccat 
cgtggacagt 
cagaagagta 
agtttagaca 
tgcttgaaaa 
ggatcaagaa 
tctacgaatt 
ttggaccaat 
agtctgataa 
aacaaaaaca 
tgaacaatgt 
agaat tatgg 
atgattgtta 
acaatgctag 
aaacattcaa 
tggaggaaat 
ctggtgatat 
cttttactat 
ctaacagcaa 
gcgattctcc 
caattcctag 
catgtgaaag 
aagttaatgg 
gtaaattggc 
tcagtgaaga 
gttcatcggg 
tttgggcatt 
atcctctaaa 
aacgtcaaca 
ttaacatata 
gaaaaacaag 
ccaatagtcc 
acagccttcc 
at t taaaggc 
ttctgggcgt 
atacgtggga 
cctatgagtt 
gcaacttgct 
atgatgttag 
tgcgcagtcg 
tctcttgtgt 
gagaaattgc 
gctttaaaat 
ccgacgctag 
accgttttgc 
tgacccgaaa 
aat ttatcac 
ggcccctttc 
aaaaaagcaa 
ttgttttagt 
gtatt ttgga 
gccaagaact 



aaaacacggt 
tcttccccag 
gaacattatc 
cgccatcctt 
ggacattcta 
tgcagaaatg 
ggagagtatt 
atcaacatca 
ttcgtcatct 
catcacgtca 
cgttactctt 
ctatcagcaa 
catgtgcgac 
gcaatctgtc 
caagacaaac 
gaacaacgat 
gtatgaatgc 
ggaagctgtt 
agaatatcta 
ggccagtaac 
agctgccaag 
tggttttcta 
tagtattaat 
ctttagatcc 
ggagaaaaat 
tttgaaagaa 
gcctgtacga 
caacaatatc 
ccttagtagt 
ttctaaacat 
gagatacttt 
taatattagg 
aggaaaatgt 
taacaaactc 
aggtgatggt 
gtgtgcgaat 
accaactaaa 
gatgtgttta 
cgagtgcttt 
taatgattct 
agcttctaga 
tattgattgg 
aggtgtgtca 
tagcgcgagg 
atgtcttttg 
tgctctcaaa 
aacaaataac 
gcatcaaatt 
ggcagagtgg 
gctagcaatg 
aactgctcct 
cactagcaag 
cataaaagaa 
cgcatatgtt 
gattctcatg 
ctatagattt 
actagttaag 
tgatatgtat 
aaaagcggca 
cgttaatgat 
gcaagagact 
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agaatcgata 
gtagaaaagt 
caagctaaag 
aacaaggagg 
cgatcaaaga 
tctagtcggg 
cagatacaca 
ggctatgtac 
gtgaaagtaa 
gagtatgctt 
aggtggaa ta 
actgatactg 
gttggcgagg 
tctaataata 
gaaggaagtg 
aagaagcaag 
cgtcccttga 
acacgctatt 
tacattaaga 
ccctggaaga 
tcaattcaag 
gatttattgt 
aagattgtta 
tttgaccaga 
tataaaagtt 
gttttgctta 
cgtcacatta 
cact t tgaaa 
tcagtatacc 
ttccctctta 
gatccttctc 
gttttgtctt 
tgttcagtgt 
tcttgttatc 
tgtcccaact 
ttaagtacca 
cattctgg tc 
gatcaaggtg 
atgttaattt 
attaagaagg 
tccttcttta 
aagaagtgtt 
tttactgatt 
gagtctactc 
tttgacaatg 
tcagactaca 
aatccagtgt 
acaaatgcag 
gctagaacgt 
tgtattagcg 
cgccttgaag 
aaacacgt ta 
tccatggaag 
aaaaataccg 
gaagaaaccg 
gaagaaaaca 
agttct tctc 
attttatacg 
tcctcctcct 
aagaggaagc 
tgt tacatct 



ctaactacaa 
cctctagaat 
taatggacgc 
ataatagtgg 
aatctggcgg 
gatcatcatc 
tggacgccag 
ttaaccgaat 
tttctgacat 
tgtacggtgt 
aggcagttga 
gaagcgttta 
gaggaaatgg 
ataatgaagc 
attgctcgtt 
gaaagggcag 
gaacttttat 
tctccaatca 
atatggccct 
actttaacaa 
atttggagaa 
ctctttatga 
acgattctaa 
ctttctccca 
tgggtaagat 
aaaaattgct 
gcgttttgtg 
tgacaaagaa 
attctggttt 
tgaatctgca 
ccgaaaaaat 
catgtttaga 
tgtttacacc 
ggcaacaaga 
ttgtttcatc 
actctgatgg 
tatatgctgg 
gtacagataa 
ctatgcggtc 
ataattttaa 
actcccctgt 
gcgatttctt 
gggcctgtat 
tccaaattag 
agtcagaaca 
acaagcagaa 
gtcctgccaa 
gaactgctct 
ttattagtaa 
ttgataataa 
ggataaattt 
ttttgaattc 
gcactacaat 
actctgtcag 
atgaaaatga 
cggaaaatac 
taaaacctcc 
aacttgctaa 
cctcctcttc 
aaaaggatga 
catgtgttga 



tctggttagt 
catcaacact 
taacgaagaa 
tgcaaaacaa 
aggaggtttt 
cgtagttgat 
aaacgaagaa 
taagaacatg 
tgagaacttt 
cactgaaact 
ggaagaaact 
tgatatgaac 
tgggtgtggg 
taaccagacc 
ttggtttaac 
tgttttggca 
ccagtacaag 
tatttctgac 
gggagatgag 
taactctaat 
aaaagactct 
tgattctcta 
agacgcgtac 
gcaatacttt 
gtggaactgt 
gtttaaaaat 
taactgggga 
taatgctcca 
cctgagtgtg 
ccgcactttc 
ttctgcagct 
tcccgagggt 
agggacaaat 
caatgattac 
agaaattgaa 
ccataagcgc 
cattgatact 
aacttgcctt 
tgtcattaat 
ctttttagca 
gcagggtaaa 
attgaacaaa 
tgtttcatca 
aaacggggga 
aggcgagttg 
tattttttcg 
gaatatgctc 
catttctaaa 
tggtacagca 
caggtacttt 
ggtcacggac 
tttattctca 
gggtagagga 
taatatgaac 
ggatgatgat 
tagcgttgta 
atcaattttt 
agccactagc 
ttcttcaaaa 
tgttaattct 
ccaaaaaact 



aattcctgta 
aacatttctt 
gcatttatcg 
atgacaggaa 
aataacgcgg 
gaggactcga 
gacagagaaa 
atcacacaaa 
tttaagattt 
gcgttaagtg 
aacaagataa 
atcatatgtc 
ggtggatcat 
aatgaaattt 
ttctacaaga 
aatacaggcc 
gggaaaggtt 
tcgtactggt 
gacaagtcaa 
agcagtagta 
ttaaagaatg 
actacttcca 
gttatcttgg 
acccaccaaa 
aacaatggta 
ctaaacattt 
tgtattcatc 
tgtggcgtta 
gaagactatg 
tctgctaaat 
agtttggcta 
aatttctgta 
ataagacgag 
tgtttcattg 
attgtcagta 
gtgcttcgtg 
ggctgcgcag 
ctgcaagaag 
gggaaaagtc 
ggaacagata 
tttgtcgctc 
ggcacaggag 
tcaaagggta 
tgtttcagcg 
ttccacgata 
gaacaagcat 
aaaagagcca 
attatggctg 
attcccttca 
ttgattgatg 
atgtatacaa 
accgaa ttta 
ctttgtttaa 
aaaaactttt 
gacgaaaatg 
aagtacgaac 
attgctgatg 
gattgtgaaa 
catagtagta 
acaactacag 
ggcatgccca 



caactgccca 
tccacaaca t 
atccttctct 
agggtggaag 
gtggttttta 
gatcccgcac 
gtggcttgtt 
atcagataaa 
gtgttccttt 
ctggaatgga 
gaaaagagtg 
ctggcgat ta 
cctcatcagg 
ctgaagacca 
aagttgtcaa 
atgagggtag 
ttgctgaaac 
ctcaagttat 
aaaagaagtt 
attcatcggt 
ttcctatggg 
ctgaaaaatt 
gctcttctaa 
aaatatccaa 
tgtcccctaa 
tgtggattaa 
ccaa tagt tc 
ccgat tcaaa 
gccaactact 
ctaaagacaa 
aggctgttta 
cgtcttggat 
gaggagattt 
gtaaagaaga 
tccttaaaac 
tcatcaat ta 
acgatgaaga 
at tcaatgga 
tggacgaatc 
agggtttcta 
ctagaggaac 
gagta tttgc 
agaataacaa 
agcgattaac 
gatactgtcc 
acaaatgctc 
aaaatatacg 
aagttgaaaa 
ggttagctga 
gtacttatc t 
ggtgtaaat t 
tttctgctgc 
tcgaacacgt 
ggtctatggc 
aagaaga tga 
ctgtatcaaa 
aagattatat 
cggcttcttc 
gtagtagtag 
cccttcatgc 
gaatggatg t 



actcgctgct 
tcctgccggt 
agaagagatt 
taatagggga 
caatgatgat 
gggtttcagc 
ttcgtacgat 
taatgacatt 
ctcgaaaaaa 
tgccattgaa 
ccgcgatttg 
catgtcttct 
tcaccttctc 
attgaaacat 
ttcttcggaa 
aatagttggc 
aaaggtcctg 
gccaatttgt 
tggaaagaga 
gaaatatgtt 
ttatgatgaa 
agaaaacatt 
ccaatcttca 
tatcaacacc 
gaaccagata 
attgtacgag 
aaagaatagc 
ccctccactt 
gaaggacact 
taattcctct 
cgcccgtgaa 
cactaatagt 
tttcaacaaa 
aactaagaaa 
tgccgtgttt 
caa taaggac 
tgatgatgac 
tgctaagagg 
tagtctggcc 
cttggataat 
aaaaatcttt 
ccgcatattc 
gaaggcgatc 
tccttcaatg 
cgattttttg 
atttttagct 
tctttgtatt 
aatgggaaat 
aaacactgcc 
tctaggagga 
aaaggcagaa 
tctagcttcg 
atcttatatg 
tgaagatcag 
ggatgagaac 
aacagcattc 
atttttatca 
ctcctcctcc 
tag taataag 
gctaagaaaa 
cgtgtatctt 
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ttacgtggcc 
aaagcccacc 
gctgttgtat 
aagaggactt 
gattcttctt 
attattgacg 
gataaggacg 
atgaatagtg 
atcgttttta 
gagtacgatt 
acaaaaaaac 
gagagaaacg 
tctttctttg 
tttaaagaga 
aaagactgca 
aaacctatcg 
atattaggta 
gaaaactgcc 
tggaatgaga 
aaagttgatg 
tcatttgatc 
gagatgatca 
gataattgcc 
gcccctggtg 
tcaaaaacaa 
gatgatgctg 
attcgtaaca 
gcaaaactat 
tctttattcg 
ggttcagtgg 
aagcgtattg 
atggataaaa 
acgtcttctt 
tttggagaaa 
ggagacagta 
tctgcactcc 
gccatccccc 
tttccctcca 
attttccaac 
tacatcttct 
gatacaacat 
gatagaggaa 
aagcatggaa 
tttagtgggc 
cttccatcag 
aaatcagttt 
agtctttgtg 
actgaagtag 
ccagcaaatt 
aatcctaaat 
gctcaactaa 
acaagattct 
ccagccatca 
gtaggatatg 
tgtgatgcct 
atgttgtttg 
acactaaaga 
taccaattga 
agggcaaata 
tttctggcaa 
cttcctaatg 



taatgaactt 
acatggttca 
ttg tgggaac 
ggttcgaaag 
catctactca 
aagttggcca 
agtctgcttc 
ttgacatcaa 
gtaatattaa 
ttatttctga 
gtaaaagagg 
acggtaaaaa 
ccagaaaagc 
aaaatattgt 
cagattctcc 
aa tatgaagg 
aatttggatc 
aggttaataa 
aatttatgag 
cgattatgaa 
gcgacggttc 
cgtcatcttt 
tg tcgactaa 
tattgat tgc 
actacccagt 
ccgagaaggc 
tcaagactat 
tagtggatag 
atgtggt taa 
ttctgtcaac 
agaaggataa 
atttgctttc 
tacctaaaac 
ataggcaaca 
ataaagaggc 
aaaaacataa 
aaaagaa tga 
gtttattaaa 
agggcgcaaa 
cattctttgt 
ttgtctttga 
tgaagtatat 
taaaacagcg 
attccagtga 
ttttaatgaa 
cccgtttaga 
ctaaggctgc 
aagaagacaa 
acgagtccat 
ct ttgaatac 
atagcatgct 
ttgggagaga 
att ttgacaa 
ataaggaagg 
ggggt tcaat 
aaatggctct 
gtgataaaac 
tggttagtaa 
cagaggatat 
tggttgtgaa 
cttctcaaaa 



tggaggtatg 
aactctttgt 
aaagggaaac 
gtttacaaat 
agcaaatttg 
ccaaggttca 
cagaagtggt 
tgaagcaaga 
ccgtttaatg 
actcaaacac 
tggcgaaatt 
tgacgaaaat 
tcactggtgg 
gtataatatg 
ttggcttaac 
caaattctct 
caaaatatgc 
tcttcattcc 
caagaacaag 
cataatacag 
gcctgtttta 
gtctgcccag 
cgggggcgtt 
gtaccattgc 
tttacttccc 
tcttctgggc 
ttctcgcaag 
agatctggac 
gaaaccgtca 
ttccacttct 
gatgtcggct 
tgacgaaaag 
agacgataac 
aagttcacaa 
ttgtttcaat 
cagtgacgga 
tgtgtccagt 
gaatatgact 
aagcagcatt 
cgacgaagct 
gaataagaat 
ccattcatct 
tcaacacgaa 
caacaaaaag 
aaacttgatc 
ggagaatact 
aacaaattac 
tagttgtgat 
gtgcaataaa 
aatggctatg 
caattcagtt 
ttttaacatc 
ttgta ttggt 
ggaattaata 
ggatgttagg 
ccagtatact 
tggtgttgat 
tctgaaatac 
tgcggatgag 
taaaccgttg 
atctgaccat 



tgcacagcca 
tccgttgcct 
aatttaaaga 
attaacgtta 
gcttcctttg 
tttggctcaa 
aatgtcgatt 
aacgcgtacg 
accgaatcaa 
gaaaagaata 
gaagatgaag 
ggtgtccgca 
aattgttctt 
ctccatagag 
gaaaccgatg 
aaatccgaag 
gataacgaat 
tggaacgatt 
aaaaataagc 
aaaaacaatg 
gtctgcaacc 
gacatgctcg 
aaaaagtcag 
gtttacactg 
cctcccaaga 
cctactctgt 
ttatcgtcca 
tttatgaaca 
aaatatagtt 
gactgcgaaa 
aagaattctg 
gacgactctt 
agcagtgata 
ttttcctttg 
gtcgatacgc 
tcttcttcca 
tctataacca 
tcaccccaaa 
actgtgtctc 
atgtctaaaa 
gaggatgtta 
ttaatgatgg 
tttaatcatc 
aaggatagaa 
agaaacaagg 
aacacctttt 
ggagaatctt 
gctgaggaac 
ttaccatcac 
aatattgctc 
ttatttgtag 
aagatgcact 
atgagtttac 
ggagttggtt 
gatttgatgt 
gaatgtaaga 
tatgttgctg 
cctgtgtttt 
aatcaagttt 
catgctctta 
tctgatattg 



ttgcttcagg 
tgaatat tgc 
ccactcttgt 
cggctctcaa 
ctggcaaaaa 
agaagtcatc 
ttggcggcag 
gtgacggagg 
aactaaaagt 
ggaaaaatgc 
gggtagaatg 
taaaggatcc 
ctggtgttgt 
gcgccatgcc 
ctgtgtacag 
taaaaactgc 
catttgaatc 
gtaaagaaga 
agaatatgaa 
gccttttgaa 
ctgccactga 
aaattaaaaa 
ttattgatgg 
gaaaaatttc 
aacaacactt 
cgaatattaa 
taataaagga 
tgtacgaacg 
ttcctggatt 
atgtcctttc 
gatcctttat 
cctctaattc 
ttgcaaactt 
cttctaactc 
ctaaaagaag 
ttattaccga 
aacatatgct 
acagcgtcat 
ctattatgat 
gacttattgt 
agaaaattat 
aaagatgtat 
acaagaaagc 
ttagtgacgt 
tcctagaatt 
tccatttgta 
cttcttcttc 
aacaattgcg 
ctcttcaaat 
gttctagaca 
aaatgccctt 
ctcctgccac 
caaatcctga 
cttcactaac 
attcttgcca 
ggagactttc 
taatgttagc 
tgtcctcttc 
cttcactatc 
gacactctac 
tcaagtacat 



agatggagag 
cacaaaaact 
cgatttgtgt 
taatgctggg 
gggtatcgtg 
gagcgaagac 
tgggggagaa 
aaactctaag 
atgcgaccaa 
ttgtaatgat 
tgaggaaatt 
tatcaatatt 
ttcaactacc 
tttctctatt 
gcactgtaag 
cttgaagtgc 
cattattgat 
tatagatgaa 
aattgaagac 
atggaataca 
aaggttttcc 
atacctgggg 
taataca tec 
tgatgatctt 
tgtagctgtc 
tattgacagt 
cccagaagct 
ttatgacgca 
tacttctgat 
ttgcttaaag 
tcgcatgtgt 
atcaaaaaat 
tttaagtgtt 
tagtggtgga 
gcaa ttggtg 
aatcgctaag 
tccgggccaa 
gattagaggc 
gtccaacagt 
ttttccttgt 
cgggcttttg 
caaatttgga 
atggaatgac 
atcatccgtt 
gagagacgta 
cacttcaatg 
tgctactatc 
ccgtaagaaa 
gtgccagatt 
aggcgcatgg 
tgttaagacc 
aaagaatagg 
tatggatgtt 
aaaacatttg 
ccatttacac 
ctcattaaaa 
ctgtatggtg 
atcccacaag 
tgttcccatg 
caat ttagct 
tgtaatgaac 
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caatggggtt tgagacttaa ccctgattat ctgtgcccaa actgtgtaaa acatgttctt 12540 
tga 12543 

<210> 167 
<211> 4152 
<212> PRT 
<213> SHRIMP 
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280 










285 








Met 


He 


Lys 


Lys 


Thr 


Val 


Ala 


Cys 


Phe 


Lys 


He 


Asp 


He 


Gly 


Gly 


Ser 




290 










295 










300 










Asp 


Asn 


Trp 


Asn 


Leu 


Pro 


Gly 


His 


Cys 


Lys 


Val 


Ser 


Asp 


Thr 


Ala 


Phe 


305 










310 










315 










320 


Pro 


Tyr 


His 


His 


Ala 


Gin 


Leu 


Val 


Gly 


Glu 


Lys 


Lys 


Asn 


He 


Leu 


Ser 










325 










330 










335 




He 


Ser 


Asn 


Glu 


Asn 


Met 


Val 


Thr 


Ser 


Leu 


Gly 


Val 


Val 


Lys 


Ala 


Asp 








340 










345 










350 






Arg 


Ala 


Glu 


Glu 


Trp 


Met 


Cys 


Lys 


Thr 


Leu 


Glu 


Ser 


Phe 


Glu 


Lys 


Lys 






355 










360 










365 








Cys 


Leu 


Tyr 


Leu 


Glu 


Asn 


Leu 


Met 


Gly 


Ser 


Met 


Ala 


Asn 


Thr 


Asp 


Asp 




370 










375 










380 










Trp 


Arg 


Arg 


Lys 


He 


Leu 


Phe 


Ser 


Glu 


Leu 


Gly 


Pro 


Glu 


Met 


Pro 


Tyr 


385 










390 










395 










400 


Arg 


Asn 


Lys 


Ser 


Leu 


He 


Met 


Asp 


Gin 


Asp 


Phe 


Cys 


Thr 


He 


Gly 


Met 



405 410 415 
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Cys 


Tyr 


Lys 


Phe 


Leu 


Ala 


Glu 


Gly 


Gly 


Gly 


Leu 


Leu 


Leu 


Thr 


Lys 


Thr 








420 










425 










430 






Asn 


Ala 


Thr 


Leu 


Leu 


Lys 


Glu 


Lys 


Met 


Ala 


Cys 


Lys 


Gly 


Leu 


Asp 


Asp 






435 










440 










445 








Ser 


Gly 


Asp 


Gly 


Asp 


Asp 


Glu 


Glu 


Glu 


Asp 


Asn 


Glu 


Glu 


Gly 


Gly 


Ser 




450 










455 










460 










Gly 


Gly 


Lys 


Ser 


Gly 


Gly 


Gly 


Ser 


Gly 


Asp 


Glu 


Asn 


Asn 


He 


Asn 


Lys 


465 










470 










475 










480 


Pro 


Pro 


Pro 


Ala 


Pro 


Lys 


Gin 


He 


Pro 


Pro 


Leu 


Ala 


Ala 


Asn 


Val 


Tyr 










485 










490 










495 




Asn 


Ser 


lie 


He 


Asn 


Asp 


Asp 


Asp 


Lys 


Leu 


Asp 


Gin 


He 


Val 


Cys 


Phe 








500 










505 










510 






Phe 


Lys 


Arg 


Lys 


His 


Gly 


Phe 


Phe 


Leu 


Ser 


Asp 


He 


Asp 


Asn 


Ser 


Pro 






515 










520 










525 








Leu 


Leu 


Ala 


Met 


Glu 


Phe 


Leu 


Leu 


Pro 


Gin 


Lys 


Ala 


Met 


Ser 


Lys 


Lys 




530 










535 










540 










Asn 


Cys 


Val 


Glu 


Arg 


Val 


Lys 


Pro 


Glu 


Thr 


Lys 


Asn 


He 


He 


Arg 


Asn 


545 










550 










555 










560 


Leu 


Thr 


Gly Val 


Asn 


Thr 


He 


Lys 


Phe 


Asp 


Thr 


He 


Met 


Pro 


Phe 


Ala 










565 










570 










575 




lie 


Leu 


Gin 


He 


Val 


Val 


Arg 


Tyr 


Glu 


Asn 


Arg 


Asn 


Leu 


Lys 


Leu 


Pro 








580 










585 










590 






Arg 


Asp 


Thr 


Asp 


He 


Leu 


Gin 


Gin 


Arg 


Leu 


Lys 


Asn 


Asn 


Thr 


Trp 


Asp 






595 










600 










605 








Ala 


Leu 


Ser 


Lys 


Gly 


Lys 


Phe 


Ala 


Glu 


Met 


Trp 


Gin 


Phe 


Thr 


His 


Lys 




610 










615 










620 










Glu 


Ser 


Leu 


Lys 


Pro 


Pro 


Thr 


He 


Glu 


Glu 


Leu 


Glu 


Ser 


He 


Pro 


Pro 


625 










630 










635 










640 


Pro 


Pro 


Thr 


Gin 


Ser 


Glu 


Glu 


Glu 


Ala 


Ala 


Ala 


Ala 


Ala 


Ala 


Ala 


Ser 










645 










650 










655 




Thr 


Ser 


Ser 


Thr 


Thr 


Pro 


Asp 


Met 


Val 


Ser 


Ser 


Leu 


Glu 


Glu 


Gly 


Ala 








660 










665 










670 






Thr 


Ser 


Thr 


Ser 


Ser 


Ser 


Asp 


Glu 


Asn 


Gin 


He 


Ala 


Ser 


Leu 


Glu 


Asn 






675 










680 










685 








lie 


Lys 


Lys 


Leu 


Leu 


Ser 


He 


He 


Thr 


Ser 


Thr 


Phe 


Ala 


Thr 


Gly 


Ala 




690 










695 










700 










Asp 


Lys 


Asn 


Asp 


Thr 


He 


Phe 


Ala 


Trp 


Thr 


Val 


Val 


Thr 


Leu 


Ala 


Glu 


705 










710 










715 










720 


Arg 


Phe 


Cys 


Ala 


Leu 


Tyr 


Asn 


He 


Thr 


Ser 


His 


Pro 


Glu 


Glu 


Tyr 


Tyr 










725 










730 










735 




Gin 


Gin 


He 


He 


Arg 


Glu 


Asp 


Phe 


Glu 


Phe 


Glu 


Gly 


Gly 


Phe 


Glu 


Lys 








740 










745 










750 






Phe 


Arg 


His 


Met 


Cys 


Asp 


Ala 


He 


Asn 


Arg 


Glu 


Leu 


Ser 


He 


Tyr 


Val 






755 










760 










765 








Pro 


Lys 


Ser 


Val 


Leu 


Glu 


Lys 


Gin 


Ser 


Val 


Cys 


Arg 


Met 


Gly 


Val 


Ala 




770 










775 










780 










Ala 


Tyr 


Glu 


Asn 


Ser 


Met 


Glu 


Arg 


He 


Lys 


Asn 


Lys 


Thr 


Asn 


Ser 


Lys 


785 










7 90 










795 










800 


Leu 


Cys 


Lys 


He 


Lys 


Tyr 


Asp 


Glu 


Ser 


Thr 


Met 


Val 


Tyr 


Glu 


Leu 


Asn 










805 










810 










815 




Asn 


Asp 


Thr 


Phe 


Lys 


Thr 


Phe 


Asp 


Tyr 


Asp 


Glu 


Ser 


Asp 


Lys 


Ser 


Phe 








820 










825 










830 






Gly 


Pro 


Met 


Tyr 


Glu 


Cys 


Ala 


Pro 


Met 


Phe 


Gin 


Arg 


Leu 


Phe 


Ala 


Ser 






835 










840 










845 








Val 


Lys 


Ser 


Asp 


Lys 


Glu 


Ala 


Val 


Leu 


Ala 


Asp 


Lys 


Lys 


Ser 


Glu 


Lys 




850 










855 










860 










Arg 


Glu 


Lys 


Leu 


Tyr 


Gin 


Gin 


Lys 


Gin 


Glu 


Tyr 


Leu 


Arg 


Lys 


Cys 


Asp 


865 










870 










875 










880 


Asn 


Asp 


Asp 


Val 


Ser 


Ala 


Arg 


Gin 


He 


Leu 


Asn 


Asn 


Val 


Asn 


Glu 


Ser 










885 










890 










895 




Asp 


Glu 


Glu 


Ser 


Asp 


Glu 


Glu 


Ser 


Asp 


Asp 


Glu 


Glu 


Asn 


Tyr 


Gly 


Ala 
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900 










905 










910 






Ala 


Lys 


Gly 
915 


Gly 


Ala 


Thr 


Gly 


Asp 
920 


Tyr 


Tyr 


Gly 


Gly 


Asp 
925 


Asp 


Glu 


Asp 


Asp 


Cys 
930 


Tyr 


Gly 


Phe 


Leu 


Gly 
935 


Glu 


Phe 


Gly 


Ser 


Ser 
940 


Asp 


Asp 


Glu 


Asn 


Val 


Pro 


Ser 


Asp 


Asn 


Ala 


Ser 


Ser 


He 


Asn 


Asn 


Val 


Gin 


Asp 


Asp 


Val 


945 










950 










955 










960 


Phe 


Arg 


Asp 


Val 


Asn 
965 


Phe 


He 


Lys 


Thr 


Phe 
970 


Asn 


Phe 


Arg 


Ser 


Ser 
975 


Leu 


Cys 


His 


Arg 


Gin 
980 


Lys 


Tyr 


Val 


Ser 


Thr 
985 


Val 


He 


Val 


Glu 


Glu 
990 


Met 


Glu 


Lys 


Asn 


Leu 


Cys 


Asp 


Val 


Leu 


Thr 


Leu 


Asp 


Asn 


Ser 


Ala 


Ala 


Glu 


Ser 






995 










1000 








1005 






Gly 


Asp 


He 


Leu 


Lys 


Glu 


He 


Asn 


Arg 


Arg 


Ser 


Leu 


Arg 


Met 


Arg 


Asn 




1010 








1015 








1020 








Trp 


Val 


Val 


Pro 


Phe 


Thr 


Met 


Pro 


Val 


Arg 


Glu 


He 


Val 


Lys 


Pro 


Asn 


1025 








1030 








1035 








1040 


Val 


Asn 


Ser 


Glu 


Asp 


Gly 


Thr 


Ala 


Asn 


Ser 


Asn 


Asn 


Asn 


He 


Pro 


Pro 










1045 








1050 








1055 


Phe 


Cys 


Ser 


Cys 


Ala 


Ser 


Leu 


Asn 


Asn 


Phe 


Lys 


Ser 


Asp 


Ser 


Pro 


Leu 








1060 








1065 








1070 




Ser 


Ser 


Asn 


Asn 


Thr 


Met 


Ser 


Asn 


Glu 


Lys 


Cys 


He 


Lys 


Leu 


Leu 


Pro 






1075 








1080 








1085 






lie 


Pro 


Ser 


Ser 


Lys 


His 


Leu 


Lys 


Asp 


Leu 


Thr 


Val 


Arg 


Phe 


Asn 


Thr 




1090 








1095 








1100 








Met 


Ala 


Cys 


Glu 


Arg 


Arg 


Tyr 


Phe 


Ser 


Asp 


Val 


Thr 


Ala 


Ala 


Leu 


Gly 


1105 








1110 








1115 








1120 


Phe 


Val 


Lys 


Lys 


Asp 


Lys 


Val 


Asn 


Gly 


Asn 


He 


Arg 


Ser 


He 


Leu 


Asp 










1125 








1130 








1135 


Asn 


Lys 


Arg 


Trp 


Asp 


Ala 


He 


Lys 


Gin 


Cys 


Lys 


Leu 


Ala 


Gly 


Lys 


Cys 








1140 








1145 








1150 




Leu 


Ser 


Ser 


Ala 


Leu 


Pro 


Leu 


Gly 


He 


Tyr 


Glu 


Asn 


Val 


He 


Ser 


Glu 






1155 








1160 








1165 






Asp 


Asn 


Lys 


Leu 


He 


Asn 


Thr 


Phe 


Arg 


Pro 


Arg 


Ser 


Leu 


Ala 


Arg 


Leu 




1170 








1175 








1180 








Ala 


Cys 


Ser 


Ser 


Gly 


Gly Asp 


Gly 


Val 


Ser 


Asp 


Lys 


Ser 


Val 


Asn 


Asn 


1185 








1190 








1195 








1200 


Gly 


Phe 


Phe 


Ser 


Gly 


He 


Trp 


Ala 


Leu 


Cys 


Ala 


Asn 


Gin 


Asp 


Leu 


Glu 










1205 








1210 








1215 


Ser 


Val 


Val 


Leu 


Gly 


Ser 


Thr 


Val 


Val 


Asp 


Pro 


Leu 


Lys 


Pro 


Thr 


Lys 








1220 








1225 








1230 




Val 


Phe 


Asn 


Gin 


Ser 


Leu 


Ser 


Glu 


Lys 


Glu 


Leu 


Lys 


Glu 


Lys 


Arg 


Gin 






1235 








1240 








1245 






Gin 


Met 


Cys 


Leu 


Asp 


Ala 


Ala 


Asn 


Tyr 


Phe 


Lys 


Asp 


His 


Asn 


Val 


Ser 




1250 








1255 








1260 








Lys 


Leu 


Asn 


He 


Tyr 


Glu 


Cys 


Phe 


Lys 


Met 


Met 


Glu 


Glu 


Cys 


He 


Met 


1265 








1270 








1275 








1280 


Arg 


Thr 


Ala 


Leu 


Asn 


Gly 


Lys 


Thr 


Ser 


Asn 


Asp 


Ser 


Glu 


Phe 


Phe 


Ser 










1285 








1290 








1295 


Asn 


Leu 


He 


Thr 


Arg 


Tyr 


Gly 


Ser 


Gly 


Thr 


Asn 


Ser 


Pro 


Ala 


Ser 


Arg 








1300 








1305 








1310 




Leu 


Trp 


Thr 


He 


Thr 


Val 


Arg 


Glu 


Cys 


Phe 


Asn 


Asn 


Ser 


Leu 


Pro 


He 






1315 








1320 








1325 






Asp 


Trp 


Gly 


Ser 


Leu 


Val 


Lys 


Asp 


Trp 


Asp 


Gly 


Ser 


Asp 


Met 


Leu 


Asn 




1330 








1335 








1340 








Leu 


Lys 


Ala 


Gly 


Val 


Ser 


Asn 


Val 


Asp 


Glu 


Ser 


Gly 


Ala 


Val 


Phe 


Glu 


1345 








1350 








1355 








1360 


Leu 


Ser 


Glu 


Phe 


Leu 


Gly 


Val 


Ser 


Ala 


Arg 


Ala 


Phe 


Phe 


Gly 


Lys 


Asp 



1365 1370 1375 



Leu Asp Thr Asn Leu Asp Ala Asp Thr Trp Glu Cys Leu Leu Asn Asp 
1380 1385 1390 
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Asp Asn Lys Asp Trp Lys Ala Gin Val Ala Lys Ala Tyr Glu Phe Ala 

1395 1400 1405 

Leu Lys Asp Asn Asp lie Arg Ser Val Glu Asn Phe lie Asn Ser Ser 

1410 1415 1420 

Asn Leu Leu Thr Asn Asn Asn Val lie Lys Lys Leu Lys lie Lys Pro 
1425 1430 1435 1440 

Thr Pro Ser Asn Asp Val Arg His Gin lie Trp Val Glu Asp Glu Tyr 

1445 1450 1455 

Tyr Pro Arg Asn Lys Ser Thr Leu Arg Ser Arg Ala Glu Trp Met Ala 

1460 1465 1470 

Ala Thr Glu Glu Val Leu Lys Thr Glu Met Ser Leu Ser Cys Val Leu 

1475 1480 1485 

Ala Met Val Ala Met Tyr Arg lie Met Met Gin Gly Glu Ser Val Arg 

1490 1495 1500 

Glu lie Ala Thr Ala Pro Leu Arg Leu Ser Val Asp Lys Met Val Pro 
1505 1510 1515 1520 

Leu lie Arg Cys Phe Lys lie Thr Ser Lys Trp Cys Ser Cys Thr Gly 

1525 1530 1535 

Lys Gly Asp Ser Pro Lys Lys Ala Asp Ala Ser lie Lys Glu Gly Arg 

1540 1545 1550 

Phe Tyr Asp lie Glu Glu Asp Pro Leu His Phe Tyr Arg Phe Ala Ala 

1555 1560 1565 

Tyr Val lie Gly Gin Val Asn Asp lie Val lie Glu Glu Met Thr Arg 

1570 1575 1580 

Lys lie Leu Met Ser Phe Asp Phe Asn Gly Phe Asp Thr Ser Asn Trp 
1585 1590 1595 1600 

Leu Gin Phe lie Tyr Phe Ser His Val Leu Met Gly Arg Arg Ser Arg 

1605 1610 1615 

Leu Leu Ser Arg Pro Leu Ser Leu Val Lys Asn Leu Val Ser Val Ser 

1620 1625 1630 

Ser Leu Ala Asp Lys Asn Ser Glu Lys Ser Asn Asp Met Tyr Glu Lys 

1635 1640 1645 

Arg Val Gly Lys Val Met Lys Arg lie Ala Arg Leu Val Leu Val Lys 

1650 1655 1660 

Ala Ala Asp Ser Val Arg Ala Ser Ser Asn Asp Leu Leu Asp Cys Cys 
1665 1670 1675 1680 

lie Leu Asp Val Asn Asp Val Ser Val Lys Ser Leu Asp Glu Phe Arg 

1685 1690 1695 

Ala Lys Thr Arg Gin Glu Leu Gin Glu Thr Arg He Asp Thr Asn Tyr 

1700 1705 1710 

Asn Leu Val Ser Asn Ser Cys Thr Thr Ala Gin Leu Ala Ala Val Glu 

1715 1720 1725 

Lys Ser Ser Arg He He Asn Thr Asn He Ser Phe His Asn He Pro 

1730 1735 1740 

Ala Gly Gin Ala Lys Val Met Asp Ala Asn Glu Glu Ala Phe He Asp 
1745 1750 1755 1760 

Pro Ser Leu Glu Glu He Asn Lys Glu Asp Asn Ser Gly Ala Lys Gin 

1765 1770 1775 

Met Thr Gly Lys Gly Gly Ser Asn Arg Gly Arg Ser Lys Lys Ser Gly 

1780 1785 1790 

Gly Gly Gly Phe Asn Asn Ala Gly Gly Phe Tyr Asn Asp Asp Ser Ser 

1795 1800 1805 

Arg Gly Ser Ser Ser Val Val Asp Glu Asp Ser Arg Ser Arg Thr Gly 

1810 1815 1820 

Phe Ser Gin He His Met Asp Ala Arg Asn Glu Glu Asp Arg Glu Ser 
1825 1830 1835 1840 

Gly Leu Phe Ser Tyr Asp Gly Tyr Val Leu Asn Arg He Lys Asn Met 

1845 1850 1855 

He Thr Gin Asn Gin He Asn Asn Asp He Val Lys Val He Ser Asp 

1860 1865 1870 

He Glu Asn Phe Phe Lys He Cys Val Pro Phe Ser Lys Lys Glu Tyr 
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1875 1880 1885 

Ala Leu Tyr Gly Val Thr Glu Thr Ala Leu Ser Ala Gly Met Asp Ala 

1890 1895 1900 

lie Glu Arg Trp Asn Lys Ala Val Glu Glu Glu Thr Asn Lys lie Arg 
1905 1910 1915 1920 

Lys Glu Cys Arg Asp Leu Thr Asp Thr Gly Ser Val Tyr Asp Met Asn 

1925 1930 1935 

lie lie Cys Pro Gly Asp Tyr Met Ser Ser Val Gly Glu Gly Gly Asn 

1940 1945 1950 

Gly Gly Cys Gly Gly Gly Ser Ser Ser Ser Gly His Leu Leu Ser Asn 

1955 1960 1965 

Asn Asn Asn Glu Ala Asn Gin Thr Asn Glu lie Ser Glu Asp Gin Leu 

1970 1975 1980 

Lys His Glu Gly Ser Asp Cys Ser Phe Trp Phe Asn Phe Tyr Lys Lys 
1985 1990 1995 2000 

Val Val Asn Ser Ser Glu Lys Lys Gin Gly Lys Gly Ser Val Leu Ala 

2005 2010 2015 

Asn Thr Gly His Glu Gly Arg lie Val Gly Arg Pro Leu Arg Thr Phe 

2020 2025 2030 

lie Gin Tyr Lys Gly Lys Gly Phe Ala Glu Thr Lys Val Leu Thr Arg 

2035 2040 2045 

Tyr Phe Ser Asn His Asp Ser Tyr Trp Ser Gin Val Met Pro lie Cys 

2050 2055 2060 

Tyr lie Lys Asn Met Ala Leu Gly Asp Glu Asp Lys Ser Lys Lys Lys 
2065 2070 2075 2080 

Phe Gly Lys Arg Pro Trp Lys Asn Phe Asn Asn Asn Ser Asn Ser Ser 

2085 2090 2095 

Ser Asn Ser Ser Val Lys Tyr Val Ser lie Gin Asp Leu Glu Lys Lys 

2100 2105 2110 

Asp Ser Leu Lys Asn Val Pro Met Gly Tyr Asp Glu Asp Leu Leu Ser 

2115 2120 2125 

Leu Tyr Asp Asp Ser Leu Thr Thr Ser Thr Glu Lys Leu Glu Asn lie 

2130 2135 2140 

Lys lie Val Asn Asp Ser Lys Asp Ala Tyr Val lie Leu Gly Ser Ser 
2145 2150 2155 2160 

Asn Gin Ser Ser Phe Asp Gin Thr Phe Ser Gin Gin Tyr Phe Thr His 

2165 2170 2175 

Gin Lys lie Ser Asn lie Asn Thr Tyr Lys Ser Leu Gly Lys Met Trp 

2180 2185 2190 

Asn Cys Asn Asn Gly Met Ser Pro Lys Asn Gin He Val Leu Leu Lys 

2195 2200 2205 

Lys Leu Leu Phe Lys Asn Leu Asn He Leu Trp He Lys Leu Tyr Glu 

2210 2215 2220 

Arg His Val Leu Cys Asn Trp Gly Cys He His Pro Asn Ser Ser Lys 
2225 2230 2235 2240 

Asn Ser His Phe Glu Met Thr Lys Asn Asn Ala Pro Cys Gly Val Thr 

2245 2250 2255 

Asp Ser Asn Pro Pro Leu Ser Val Tyr His Ser Gly Phe Leu Ser Val 

2260 2265 2270 

Glu Asp Tyr Gly Gin Leu Leu Lys Asp Thr Phe Pro Leu Met Asn Leu 

2275 2280 2285 

His Arg Thr Phe Ser Ala Lys Ser Lys Asp Asn Asn Ser Ser Asp Pro 

2290 2295 2300 

Ser Pro Glu Lys He Ser Ala Ala Ser Leu Ala Lys Ala Val Tyr Ala 
2305 2310 2315 2320 

Arg Glu Val Leu Ser Ser Cys Leu Asp Pro Glu Gly Asn Phe Cys Thr 

2325 2330 2335 

Ser Trp He Thr Asn Ser Cys Ser Val Leu Phe Thr Pro Gly Thr Asn 

2340 2345 2350 

He Arg Arg Gly Gly Asp Phe Phe Asn Lys Ser Cys Tyr Arg Gin Gin 
. 2355 2360 2365 
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Asp Asn Asp Tyr Cys Phe lie Gly Lys Glu Glu Thr Lys Lys Cys Pro 

2370 2375 2380 

Asn Phe Val Ser Ser Glu lie Glu lie Val Ser lie Leu Lys Thr Ala 
2385 2390 2395 2400 

Val Phe Leu Ser Thr Asn Ser Asp Gly His Lys Arg Val Leu Arg Val 

2405 2410 2415 

lie Asn Tyr Asn Lys Asp His Ser Gly Ala Gly lie Asp Thr Gly Cys 

2420 2425 2430 

Ala Asp Asp Glu Asp Asp Asp Asp Asp Gin Gly Gly Thr Asp Lys Thr 

2435 2440 2445 

Cys Leu Leu Gin Glu Asp Ser Met Asp Ala Lys Arg Met Leu lie Ser 

2450 2455 2460 

Met Arg Ser Val lie Asn Gly Lys Ser Leu Asp Glu Ser Ser Leu Ala 
2465 2470 2475 2480 

lie Lys Lys Asp Asn Phe Asn Phe Leu Ala Gly Thr Asp Lys Gly Phe 

2485 2490 2495 

Tyr Leu Asp Asn Ser Phe Phe Asn Ser Pro Val Gin Gly Lys Phe Val 

2500 2505 2510 

Ala Pro Arg Gly Thr Lys lie Phe Lys Lys Cys Cys Asp Phe Leu Leu 

2515 2520 2525 

Asn Lys Gly Thr Gly Gly Val Phe Ala Arg lie Phe Phe Thr Asp Trp 

2530 2535 2540 

Ala Cys lie Val Ser Ser Ser Lys Gly Lys Asn Asn Lys Lys Ala lie 
2545 2550 2555 2560 

Glu Ser Thr Leu Gin lie Arg Asn Gly Gly Cys Phe Ser Leu Thr Pro 

2565 2570 2575 

Ser Met Phe Asp Asn Glu Ser Glu Gin Gly Glu Leu Phe His Asp Arg 

2580 2585 2590 

Tyr Cys Pro Asp Phe Leu Ser Asp Tyr Asn Lys Gin Asn lie Phe Ser 

2595 2600 2605 

Glu Gin Ala Tyr Lys Cys Ser Phe Leu Ala Asn Pro Val Cys Pro Ala 

2610 2615 2620 

Lys Asn Met Leu Lys Arg Ala Lys Asn lie Arg Leu Cys lie Thr Asn 
2625 2630 2635 2640 

Ala Gly Thr Ala Leu lie Ser Lys lie Met Ala Glu Val Glu Lys Met 

2645 2650 2655 

Gly Asn Ala Arg Thr Phe lie Ser Asn Gly Thr Ala lie Pro Phe Arg 

2660 2665 2670 

Leu Ala Glu Asn Thr Ala Cys lie Ser Val Asp Asn Asn Arg Tyr Phe 

2675 2680 2685 

Leu lie Asp Gly Thr Tyr Leu Leu Gly Gly Arg Leu Glu Gly lie Asn 

2690 2695 2700 

Leu Val Thr Asp Met Tyr Thr Arg Cys Lys Leu Lys Ala Glu Lys His 
2705 2710 2715 2720 

Val lie Leu Asn Ser Leu Phe Ser Thr Glu Phe lie Ser Ala Ala Ser 

2725 2730 2735 

Ser Met Glu Gly Thr Thr Met Gly Arg Gly Leu Cys Leu lie Glu His 

2740 2745 2750 

Val Ser Tyr Met Lys Asn Thr Asp Ser Val Ser Asn Met Asn Lys Asn 

2755 2760 2765 

Phe Trp Ser Met Ala Glu Asp Gin Glu Glu Thr Asp Glu Asn Glu Asp 

2770 2775 2780 

Asp Asp Asp Glu Asn Glu Glu Asp Glu Asp Glu Asn Glu Glu Asn Thr 
2785 2790 2795 2800 

Glu Asn Thr Ser Val Val Lys Tyr Glu Pro Val Ser Lys Thr Ala Phe 

2805 2810 2815 

Ser Ser Ser Leu Lys Pro Pro Ser lie Phe lie Ala Asp Glu Asp Tyr 

2820 2825 2830 

lie Phe Leu Ser lie Leu Tyr Glu Leu Ala Lys Ala Thr Ser Asp Cys 

2835 2840 2845 

Glu Thr Ala Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser 
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2850 2855 2860 

Ser Lys His Ser Ser Ser Ser Ser Ser Ser Asn Lys Lys Arg Lys Gin 
2865 2870 2875 2880 

Lys Asp Asp Val Asn Ser Thr Thr Thr Ala Leu His Ala Leu Arg Lys 

2885 2890 2895 

Cys Tyr lie Ser Cys Val Asp Gin Lys Thr Giy Met Pro Arg Met Asp 

2900 2905 2910 

Val Val Tyr Leu Leu Arg Gly Leu Met Asn Phe Gly Gly Met Cys Thr 

2915 2920 2925 

Ala lie Ala Ser Gly Asp Gly Glu Lys Ala His His Met Val Gin Thr 

2930 2935 2940 

Leu Cys Ser Val Asn lie Ala Thr Lys Thr Ala Val Val Phe Val Gly 
2945 2950 2955 2960 

Thr Lys Gly Asn Asn Leu Lys Thr Thr Leu Val Asp Leu Cys Lys Arg 

2965 2970 2975 

Thr Trp Phe Glu Arg Phe Thr Asn lie Asn Val Thr Ala Leu Asn Asn 

2980 2985 2990 

Ala Gly Asp Ser Ser Ser Ser Thr Gin Ala Asn Leu Ala Ser Phe Ala 

2995 3000 3005 

Gly Lys Lys Gly lie Val lie lie Asp Glu Val Gly His Gin Gly Ser 

3010 3015 3020 

Phe Gly Ser Lys Lys Ser Ser Ser Glu Asp Asp Lys Asp Glu Ser Ala 
3025 3030 3035 3040 

Ser Arg Ser Gly Asn Val Asp Phe Gly Gly Ser Gly Gly Glu Met Asn 

3045 3050 3055 

Ser Val Asp lie Asn Glu Ala Arg Asn Ala Tyr Gly Asp Gly Gly Asn 

3060 3065 3070 

Ser Lys lie Val Phe Ser Asn lie Asn Arg Leu Met Thr Glu Ser Lys 

3075 3080 3085 

Leu Lys Val Cys Asp Gin Glu Tyr Asp Phe He Ser Glu Leu Lys His 

3090 3095 3100 

Glu Lys Asn Arg Lys Asn Ala Cys Asn Asp Thr Lys Lys Arg Lys Arg 
3105 3110 3115 3120 

Gly Gly Glu He Glu Asp Glu Gly Val Glu Cys Glu Glu He Glu Arg 

3125 3130 3135 

Asn Asp Gly Lys Asn Asp Glu Asn Gly Val Arg He Lys Asp Pro He 

3140 3145 3150 

Asn He Ser Phe Phe Ala Arg Lys Ala His Trp Trp Asn Cys Ser Ser 

3155 3160 3165 

Gly Val Val Ser Thr Thr Phe Lys Glu Lys Asn He Val Tyr Asn Met 

3170 3175 3180 

Leu His Arg Gly Ala Met Pro Phe Ser He Lys Asp Cys Thr Asp Ser 
3185 3190 3195 3200 

Pro Trp Leu Asn Glu Thr Asp Ala Val Tyr Arg His Cys Lys Lys Pro 

3205 3210 3215 

He Glu Tyr Glu Gly Lys Phe Ser Lys Ser Glu Val Lys Thr Ala Leu 

3220 3225 3230 

Lys Cys He Leu Gly Lys Phe Gly Ser Lys He Cys Asp Asn Glu Ser 

3235 3240 3245 

Phe Glu Ser He He Asp Glu Asn Cys Gin Val Asn Asn Leu His Ser 

3250 3255 3260 

Trp Asn Asp Cys Lys Glu Asp He Asp Glu Trp Asn Glu Lys Phe Met 
3265 3270 3275 3280 

Ser Lys Asn Lys Lys Asn Lys Gin Asn Met Lys He Glu Asp Lys Val 

3285 3290 3295 

Asp Ala He Met Asn He He Gin Lys Asn Asn Gly Leu Leu Lys Trp 

3300 3305 3310 

Asn Thr Ser Phe Asp Arg Asp Gly Ser Pro Val Leu Val Cys Asn Pro 

3315 3320 3325 

Ala Thr Glu Arg Phe Ser Glu Met He Thr Ser Ser Leu Ser Ala Gin 
3330 3335 3340 
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Asp Met Leu Glu lie Lys Lys Tyr Leu Gly Asp Asn Cys Leu Ser Thr 
3345 3350 3355 3360 

Asn Gly Gly Val Lys Lys Ser Val lie Asp Gly Asn Thr Ser Ala Pro 

3365 3370 3375 

Gly Val Leu lie Ala Tyr His Cys Val Tyr Thr Gly Lys lie Ser Asp 

3380 3385 3390 

Asp Leu Ser Lys Thr Asn Tyr Pro Val Leu Leu Pro Pro Pro Lys Lys 

3395 3400 3405 

Gin His Phe Val Ala Val Asp Asp Ala Ala Glu Lys Ala Leu Leu Gly 

3410 . 3415 3420 

Pro Thr Leu Ser Asn lie Asn lie Asp Ser lie Arg Asn He Lys Thr 
3425 3430 3435 3440 

He Ser Arg Lys Leu Ser Ser He He Lys Asp Pro Glu Ala Ala Lys 

3445 3450 3455 

Leu Leu Val Asp Arg Asp Leu Asp Phe Met Asn Met Tyr Glu Arg Tyr 

3460 3465 3470 

Asp Ala Ser Leu Phe Asp Val Val Lys Lys Pro Ser Lys Tyr Ser Phe 

3475 3480 3485 

Pro Gly Phe Thr Ser Asp Gly Ser Val Val Leu Ser Thr Ser Thr Ser 

3490 3495 3500 

Asp Cys Glu Asn Val Leu Ser Cys Leu Lys Lys Arg He Glu Lys Asp 
3505 3510 3515 3520 

Lys Met Ser Ala Lys Asn Ser Gly Ser Phe He Arg Met Cys Met Asp 

3525 3530 3535 

Lys Asn Leu Leu Ser Asp Glu Lys Asp Asp Ser Ser Ser Asn Ser Ser 

3540 3545 3550 

Lys Asn Thr Ser Ser Leu Pro Lys Thr Asp Asp Asn Ser Ser Asp He 

3555 3560 3565 

Ala Asn Phe Leu Ser Val Phe Gly Glu Asn Arg Gin Gin Ser Ser Gin 

3570 3575 3580 

Phe Ser Phe Asn Ser Ser Gly Gly Gly Asp Ser Asn Lys Glu Ala Cys 
3585 3590 3595 3600 

Phe Asn Val Asp Thr Pro Lys Arg Arg Gin Leu Val Ser Ala Leu Gin 

3605 3610 3615 

Lys His Asn Ser Asp Gly Ser Ser Ser He He Thr Glu He Ala Lys 

3620 3625 3630 

Ala He Pro Gin Lys Asn Asp Val Ser Ser Ser He Thr Lys His Met 

3635 3640 3645 

Leu Pro Gly Gin Phe Pro Ser Ser Leu Leu Lys Asn Met Thr Ser Pro 

3650 3655 3660 

Gin Asn Ser Val Met He Arg Gly He Phe Gin Gin Gly Ala Lys Ser 
3665 3670 3675 3680 

Ser He Thr Val Ser Pro He Met Met Ser Asn Ser Tyr He Phe Ser 

3685 3690 3695 

Phe Phe Val Asp Glu Ala Met Ser Lys Arg Leu He Val Phe Pro Cys 

3700 3705 3710 

Asp Thr Thr Phe Val Phe Glu Asn Lys Asn Glu Asp Val Lys Lys He 

3715 3720 3725 

He Gly Leu Leu Asp Arg Gly Met Lys Tyr He His Ser Ser Leu Met 

3730 3735 3740 

Met Glu Arg Cys He Lys Phe Gly Lys His Gly He Lys Gin Arg Gin 
3745 3750 3755 3760 

His Glu Phe Asn His His Lys Lys Ala Trp Asn Asp Phe Ser Gly His 

3765 3770 3775 

Ser Ser Asp Asn Lys Lys Lys Asp Arg He Ser Asp Val Ser Ser Val 

3780 3785 3790 

Leu Pro Ser Val Leu Met Lys Asn Leu He Arg Asn Lys Val Leu Glu 

3795 3800 3805 

Leu Arg Asp Val Lys Ser Val Ser Arg Leu Glu Glu Asn Thr Asn Thr 

3810 3815 3820 

Phe Phe His Leu Tyr Thr Ser Met Ser Leu Cys Ala Lys Ala Ala Thr 
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3825 3830 3835 3840 

Asn Tyr Gly Glu Ser Ser Ser Ser Ser Ala Thr lie Thr Glu Val Glu 

3845 3850 3855 

Glu Asp Asn Ser Cys Asp Ala Glu Glu Gin Gin Leu Arg Arg Lys Lys 

3860 3865 3870 

Pro Ala Asn Tyr Glu Ser Met Cys Asn Lys Leu Pro Ser Pro Leu Gin 

3875 3880 3885 

Met Cys Gin lie Asn Pro Lys Ser Leu Asn Thr Met Ala Met Asn lie 

3890 3895 3900 

Ala Arg Ser Arg Gin Gly Ala Trp Ala Gin Leu Asn Ser Met Leu Asn 
3905 3910 3915 3920 

Ser Val Leu Phe Val Glu Met Pro Phe Val Lys Thr Thr Arg Phe Phe 

3925 3930 3935 

Gly Arg Asp Phe Asn lie Lys Met His Ser Pro Ala Thr Lys Asn Arg 

3940 3945 3950 

Pro Ala lie Asn Phe Asp Asn Cys He Gly Met Ser Leu Pro Asn Pro 

3955 3960 3965 

Asp Met Asp Val Val Gly Tyr Asp Lys Glu Gly Glu Leu He Gly Val 

3970 3975 3980 

Gly Ser Ser Leu Thr Lys His Leu Cys Asp Ala Trp Gly Ser Met Asp 
3985 3990 3995 4000 

Val Arg Asp Leu Met Tyr Ser Cys His His Leu His Met Leu Phe Glu 

4005 4010 4015 

Met Ala Leu Gin Tyr Thr Glu Cys Lys Arg Arg Leu Ser Ser Leu Lys 

4020 4025 4030 

Thr Leu Lys Ser Asp Lys Thr Gly Val Asp Tyr Val Ala Val Met Leu 

4035 4040 4045 

Ala Cys Met Val Tyr Gin Leu Met Val Ser Asn Leu Lys Tyr Pro Val 

4050 4055 4060 

Phe Leu Ser Ser Ser Ser His Lys Arg Ala Asn Thr Glu Asp He Ala 
4065 4070 4075 4080 

Asp Glu Asn Gin Val Ser Ser Leu Ser Val Pro Met Phe Leu Ala Met 

4085 4090 4095 

Val Val Asn Lys Pro Leu His Ala Leu Arg His Ser Thr Asn Leu Ala 

4100 4105 4110 

Leu Pro Asn Ala Ser Gin Lys Ser Asp His Ser Asp He Val Lys Tyr 

4115 4120 4125 

He Val Met Asn Gin Trp Gly Leu Arg Leu Asn Pro Asp Tyr Leu Cys 

4130 4135 4140 

Pro Asn Cys Val Lys His Val Leu 
4145 4150 



<210> 168 

<211> 315 

<212> DNA 

<213> SHRIMP 



<400> 168 

atgtacctga 

aaaatgtacc 

acttcttcct 

atccagtact 

tcaaaggatc 

atatctcagt 



gccacattag acaaacacct ttggttgaag 
atcataacaa caataatcag cactcctttg 
ccatcaactg ttcaagctgt tcccgtgaaa 
tcaacaaaac tagcagaaat aatactgcac 
gcaactattc ttctttcgag tatgctgaaa 
ggtga 



aaagacgtgc tctcaccttc 60 
tgaattgcca gtgcaggcga 120 
ctttcaactc agtaaaggct 180 
atcatttcaa gatgccggct 240 
cggcagttgc tgctcacaat 300 

315 



<210> 169 

<211> 104 

<212> PRT 

<213> SHRIMP 
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<400> 169 




























Met 


Tyr 


Leu 


Ser 


His 


lie 


Arg 


Gin 


Thr 


Pro 


Leu 


Val 


Glu 


Glu 


Arg 


Arg 


1 






5 










10 










15 




Ala 


Leu 


Thr 


Phe 
20 


Lys 


Met 


Tyr 


His 


His 
25 


Asn 


Asn 


Asn 


Asn 


Gin 
30 


His 


Ser 


Phe 


val 


Asn 
35 


Cys 


Gin 


Cys 


Arg 


Arg 
AO 


Thr 


Ser 


Ser 


Ser 


lie 
45 


Asn 


Cys 


Ser 


Ser 


Cys 
50 


Ser 


Arg 


Glu 


Thr 


Phe 
55 


Asn 


Ser 


Val 


Lys 


Ala 
60 


lie 


Gin 


Tyr 


Phe 


Asn 


Lys 


Thr 


Ser 


Arg 


Asn 


Asn 


Thr 


Ala 


His 


His 


Phe 


Lys 


Met 


Pro 


Ala 


65 










70 










75 










80 


Ser 


Lys 


Asp 


Arg 


Asn 
85 


Tyr 


Ser 


Ser 


Phe 


Glu 
90 


Tyr 


Ala 


Glu 


Thr 


Ala 
95 


Val 


Ala 


Ala 


His 


Asn 


lie 


Ser 


Gin 


Trp 



















100 



<210> 170 
<211> 3696 
<212> DNA 
<213> SHRIMP 

<400> 170 

atgagcaaga gttctagttc tactgtgaaa tctgcatctt tttttaactc tttgatggaa 60 
aatgcacctt cctccaagat tgaactttta gaagatggat ggactaagaa agctgctgct 120 
gctgatacag atactccaac tgcaaaacct actggtctaa gtatctcgtt gatggatatt 180 
agtggttcaa tgggaagtgt gaaatcggct gtggctgata gttgttctgg catcatggct 24 0 
acactcaatg tgattgctcc aggcattcaa aatgctatag tttactacaa tgatttcgac 300 
aaacactcca ttgaaagtgg acctgttgtg cgtgcaccag actgttcaga gtgggaaggg 360 
ggagattttg tgaagcatat gcgcaagacc gaagtgtgcg gaggcggcgg cggtggcagt 420 
gaagctcttc attcttccct gatgtacgtg tttaacaaca tgattcctgc attcaagaaa 480 
atgcatggca ttaccaggga tgaaaaattc cccatcctca tttttgtgtt caccgatgag 540 
gatgtgcgta ttgccaatag tgacactgga aaattatgtg ccaactcgta cgattcggaa 600 
actgcaccag aagaagaatt catcatgaag acttggggac aaaaaccctt gactattctt 660 
gacatgagga aggcgctagt ggagaatgat tgttggctac gcatcctcaa tttctctaga 720 
tgtagtggca gtaaccagag tgaattgtgc caagaagatg tgatcaattt ttctggatac 780 
gacaacaaca gatggcaact ttttgaatcc tttgacagaa gatcgtgcaa tgtgcgcaag 8 40 
aatatcgcaa cctttattat gaggcaatct atttccctct tcaagaacct gaatgatcaa 900 
tttagtgcct tccccatctt gagggaaatc aaccaagaag aattgaatgt ttttattgaa 960 
agtgaaggaa gatcagagcc tgcaggattt gaaaagtatg gcgacgctca acgtgaatct 1020 
ttcaagtcta gggttttaaa tatggcacct ttagattttg gaagagttgt acagggagga 1080 
ggaagatata acaaccacaa gcgtagcgtg ttcttgaatt gtgcatatga tagtgccttt 1140 
tgttgctcta agcaaacctt taatccacag cagcaacaac aacaacaaca gtcttctagc 1200 
ggtggtggtg gtatcagcaa actggctgta gtgactcaaa gggcccagtc aattacaggg 1260 
ggaggaaatg ccgcctctac tctggcactg cacatgaacg cgtgtttcca gtcgttggac 1320 
gattttggaa ttgaccatac aaacctctgt gattgtaagg gttgtacaaa attgatggca 1380 
agtgttgaag ctacatctga tcaaggaaga aaaaccaagt tgtctcgcaa gtatgctcgc 1440 
gtacactggg ccaagatgtt tgctgaaaaa ctcttcaaaa tgatgatcaa ggaacaatca 1500 
atgatgtacg cttgcagtgc tgtgcctgat gaaattgggg ctatttatgc ctttgttaca 1560 
ggcaataatg caggagtgtg ctcgagggtc tctaccatcc tttctgatct gggaactgaa 1620 
tgcggaaaca aggcagaata tgccttccta aaagaaggaa agcacatgaa atctgctagc 1680 
tatgatgcac tccaagtgat taacaataca gatctcacac ctgaacaatc ctccatgttt 1740 
atgtggttct atgtgcccaa tgatgccttg gaagaagctg gcaaaatctt ccaccagtca 1800 
tttagtttct ccaattcata tacaggagga ggattattat cactggacga atacaagagg 1860 
ttcgagtttg gacagtgctt tgatttcatc aagaaattgg tatcatgtct aaagattact 1920 
agaaatgttg aagatgtgct tttggagacg tcaaaaacct ccaacagata ttttgccatc 1980 
ccagtctttt gtggatcaga tgaccagaag gaagtgcttc gagaagaatt ggccagcgat 2040 
ctatttggag gacgagaaga tgtggcagaa atgatgttta tcgacttgga gactgtgatt 2100 
caaaaattgg gaactttgta tgatgtacgt ctgagtctgc cagaaggagg ttatgcagca 2160 
atcaaatctg tgtgtgcagc cgcctcttgg gctgcctcct gtgaagtacc ctccaacact 2220 
tcaaacatga tcctctctat tgcaaaaatg gcattcacaa agtattacca agagcagaat 2280 
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agcagttccg 
ggagagatcg 
aataagatag 
atcttgacaa 
cgacttgacc 
ctaaagatgg 
atgggcaaac 
actgcatcac 
gagatggcta 
attgctgttt 
tacaagctat 
tgctgcaaat 
tgtccctggt 
ctgatggatt 
acatctggtg 
tatttcttga 
aataagagac 
agtgatcttc 
aagacgacag 
aatgacctgg 
tttaatagtc 
tggaatactt 
acttccccat 
gttaatgtac 



agactgattt 
agaataatct 
gagttgacaa 
aggctggcga 
tatcagagaa 
aatatgataa 
caatggtcta 
ccacaaccct 
gtgatatcat 
cagctgcagc 
tgaatgaggg 
ccaagtacat 
gcaggtacgc 
tgattgaatg 
atgtgatgta 
aactcaagaa 
cagcgcctct 
ccgattcctt 
tcaatctaat 
acgggaaaga 
tatttgtaaa 
tcttgattca 
tttctgttgt 
ttgtccgtca 



ggatattatt 
ttccggtgta 
gaactttacc 
ctcaaagatt 
ggacttgaaa 
tgatgaaaaa 
tcactgggac 
caatcttttg 
tggagctctc 
gaaggctgtg 
caatattgat 
gatgtctact 
caacaaactc 
tggagcacct 
tgaagaactg 
cactgcaatt 
tcaggtcact 
gttggctgcc 
tggactcggg 
ccccttcatt 
cccatggttg 
tgtaaagaag 
tgtaatgaat 
aaaaatatgt 



ttgccctcta 
gtatttttga 
aataaactag 
ggggagaaat 
tatatatgca 
ttgtgcttga 
aataaattga 
aatgccaaga 
aatcttcccc 
gggattttgt 
attccagttg 
cttggtccgg 
gtcgcaatgg 
tctt tggcta 
ggagagggtg 
gccgaaggaa 
tctccttctt 
attggcgagt 
gaagtaaagg 
tctctccagg 
gggtacaggc 
aatga tgtat 
gatggtagtg 
gtgtga 



ttcttgaagg 
ggtgcttaat 
aacacttttt 
atgagacgtt 
aaagatgtgg 
gatgcaaggg 
ccagagatcc 
aaattgatga 
caacagataa 
atgggaagac 
ctgtatgtgt 
acaagccgca 
ggagaggagg 
tggtggaaga 
aagagtttta 
ataaactgca 
caccgccaaa 
gtgcgatcga 
tagtggaaaa 
aatactgttc 
tcgatgagca 
ggaagttctt 
gtctgttgaa 



aactgccgat 
tacatgggcc 
ggctctgaga 
ccctgtacgt 
tgtaaaatct 
aaactacagg 
tcgtgccaaa 
caaagtaaag 
agacaatgaa 
ttgtcttctc 
agaatgtgat 
gaacagaaaa 
gaagaaactg 
agecatcagg 
cattatcgac 
acaaaataat 
gaaaatgagg 
gacaaaagaa 
tgttggacca 
atgggataaa 
atgggatgat 
gtgcaacaaa 
tgttgataat 



2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3696 



<210> 171 
<211> 1227 
<212> PRT 
<213> SHRIMP 



<400> 171 



Met 


Ser 


Lys 


Ser 


Ser 


Ser 


Ser 


Thr 


Val 


Lys 


Ser 


Ala 


Ser 


Phe 


Phe 


Asn 


1 








5 










10 










15 




Ser 


Leu 


Met 


Glu 


Asn 


Ala 


Pro 


Ser 


Ser 


Lys 


He 


Glu 


Leu 


Leu 


Glu 


Asp 








20 










25 










30 






Gly 


Trp 


Thr 


Lys 


Lys 


Ala 


Ala 


Ala 


Ala 


Asp 


Thr 


Asp 


Thr 


Pro 


Thr 


Ala 






35 










40 










45 








Lys 


Pro 


Thr 


Gly 


Leu 


Ser 


lie 


Ser 


Leu 


Met 


Asp 


He 


Ser 


Gly 


Ser 


Met 




50 










55 










60 










Gly 


Ser 


Val 


Lys 


Ser 


Ala 


Val 


Ala 


Asp 


Ser 


Cys 


Ser 


Gly 


He 


Met 


Ala 


65 










70 










75 










80 


Thr 


Leu 


Asn 


Val 


He 


Ala 


Pro 


Gly 


He 


Gin 


Asn 


Ala 


He 


Val 


Tyr 


Tyr 










85 










90 










95 




Asn 


Asp 


Phe 


Asp 


Lys 


His 


Ser 


He 


Glu 


Ser 


Gly 


Pro 


Val 


Val 


Arg 


Ala 








100 










105 










110 






Pro 


Asp 


Cys 


Ser 


Glu 


Trp 


Glu 


Gly 


Gly 


Asp 


Phe 


Val 


Lys 


His 


Met 


Arg 






115 










120 










125 








Lys 


Thr 


Glu 


Val 


Cys 


Gly 


Gly 


Gly 


Gly 


Gly 


Gly 


Ser 


Glu 


Ala 


Leu 


His 




130 










135 










140 










Ser 


Ser 


Leu 


Met 


Tyr 


Val 


Phe 


Asn 


Asn 


Met 


He 


Pro 


Ala 


Phe 


Lys 


Lys 


145 










150 










155 










160 


Met 


His 


Gly 


He 


Thr 


Arg 


Asp 


Glu 


Lys 


Phe 


Pro 


He 


Leu 


He 


Phe 


Val 










165 










170 










175 




Phe 


Thr 


Asp 


Glu 


Asp 


Val 


Arg 


He 


Ala 


Asn 


Ser 


Asp 


Thr 


Gly 


Lys 


Leu 








180 










185 










190 






Cys 


Ala 


Asn 


Ser 


Tyr 


Asp 


Ser 


Glu 


Thr 


Ala 


Pro 


Glu 


Glu 


Glu 


Phe 


lie 






195 










200 










205 








Met 


Lys 


Thr 


Trp 


Gly 


Gin 


Lys 


Pro 


Leu 


Thr 


He 


Leu 


Asp 


Met 


Arg 


Lys 




210 










215 










220 










Ala 


Leu 


Val 


Glu 


Asn 


Asp 


Cys 


Trp 


Leu 


Arg 


He 


Leu 


Asn 


Phe 


Ser 


Arg 


22S 










230 










235 










240 
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Cys Ser Gly Ser Asn Gin Ser Glu Leu Cys Gin Glu Asp Val He Asn 

245 250 255 

Phe Ser Gly Tyr Asp Asn Asn Arg Trp Gin Leu Phe Glu Ser Phe Asp 

260 265 270 

Arg Arg Ser Cys Asn Val Arg Lys Asn He Ala Thr Phe He Met Arg 

275 280 285 

Gin Ser He Ser Leu Phe Lys Asn Leu Asn Asp Gin Phe Ser Ala Phe 

290 295 300 

Pro He Leu Arg Glu He Asn Gin Glu Glu Leu Asn Val Phe He Glu 
305 310 315 320 



Ser 


Glu 


Gly 


Arg 


Ser 


Glu 


Pro 


Ala 


Gly 


Phe 


Glu 


Lys 


Tyr 


Gly 


Asp 


Ala 










325 










330 










335 




Gin 


Arg 


Glu 


Ser 


Phe 


Lys 


Ser 


Arg 


Val 


Leu 


Asn 


Met 


Ala 


Pro 


Leu 


Asp 








340 










345 










350 






Phe 


Gly 


Arg 


Val 


Val 


Gin 


Gly 


Gly 


Gly 


Arg 


Tyr 


Asn 


Asn 


His 


Lys 


Arg 






355 










360 










365 








Ser 


Val 


Phe 


Leu 


Asn 


Cys 


Ala 


Tyr 


Asp 


Ser 


Ala 


Phe 


Cys 


Cys 


Ser 


Lys 




370 










375 










380 










Gin 


Thr 


Phe 


Asn 


Pro 


Gin 


Gin 


Gin 


Gin 


Gin 


Gin 


Gin 


Gin 


Ser 


Ser 


Ser 


385 










390 










395 










400 


Gly 


Gly 


Gly 


Gly 


He 


Ser 


Lys 


Leu 


Ala 


Val 


Val 


Thr 


Gin 


Arg 


Ala 


Gin 










405 










410 










415 




Ser 


He 


Thr 


Gly 


Gly 


Gly 


Asn 


Ala 


Ala 


Ser 


Thr 


Leu 


Ala 


Leu 


His 


Met 








420 










425 










430 






Asn 


Ala 


Cys 


Phe 


Gin 


Ser 


Leu 


Asp 


Asp 


Phe 


Gly 


He 


Asp 


His 


Thr 


Asn 






435 










440 










445 








Leu 


Cys 


Asp 


Cys 


Lys 


Gly 


Cys 


Thr 


Lys 


Leu 


Met 


Ala 


Ser 


Val 


Glu 


Ala 




450 










455 










460 










Thr 


Ser 


Asp 


Gin 


Gly Arg 


Lys 


Thr 


Lys 


Leu 


Ser 


Arg 


Lys 


Tyr 


Ala 


Arg 


465 










470 










475 










480 


Val 


His 


Trp 


Ala 


Lys 


Met 


Phe 


Ala 


Glu 


Lys 


Leu 


Phe 


Lys 


Met 


Met 


He 










485 










490 










495 




Lys 


Glu 


Gin 


Ser 


Met 


Met 


Tyr 


Ala 


Cys 


Ser 


Ala 


Val 


Pro 


Asp 


Glu 


He 








500 










505 










510 






Gly 


Ala 


He 


Tyr 


Ala 


Phe 


Val 


Thr 


Gly 


Asn 


Asn 


Ala 


Gly 


Val 


Cys 


Ser 






515 










520 










525 








Arg 


Val 


Ser 


Thr 


He 


Leu 


Ser 


Asp 


Leu 


Gly 


Thr 


Glu 


Cys 


Gly 


Asn 


Lys 




530 










535 










540 










Ala 


Glu 


Tyr 


Ala 


Phe 


Leu 


Lys 


Glu 


Gly 


Lys 


His 


Met 


Lys 


Ser 


Ala 


Ser 


545 










550 










555 










560 


Tyr 


Asp 


Ala 


Leu 


Gin 


Val 


He 


Asn 


Asn 


Thr 


Asp 


Leu 


Thr 


Pro 


Glu 


Gin 










565 










570 










575 




Ser 


Ser 


Met 


Phe 


Met 


Trp 


Phe 


Tyr 


Val 


Pro 


Asn 


Asp 


Ala 


Leu 


Glu 


Glu 








580 










585 










590 






Ala 


Gly 


Lys 


He 


Phe 


His 


Gin 


Ser 


Phe 


Ser 


Phe 


Ser 


Asn 


Ser 


Tyr 


Thr 






595 










600 










605 








Gly 


Gly 


Gly 


Leu 


Leu 


Ser 


Leu 


Asp 


Glu 


Tyr 


Lys 


Arg 


Phe 


Glu 


Phe 


Gly 




610 










615 










620 










Gin 


Cys 


Phe 


Asp 


Phe 


He 


Lys 


Lys 


Leu 


Val 


Ser 


Cys 


Leu 


Lys 


He 


Thr 


625 










630 










635 










640 


Arg 


Asn 


Val 


Glu 


Asp 


Val 


Leu 


Leu 


Glu 


Thr 


Ser 


Lys 


Thr 


Ser 


Asn 


Arg 










645 










650 










655 




Tyr 


Phe 


Ala 


He 


Pro 


Val 


Phe 


Cys 


Gly 


Ser 


Asp 


Asp 


Gin 


Lys 


Glu 


Val 








660 










665 










670 






Leu 


Arg 


Glu 


Glu 


Leu 


Ala 


Ser 


Asp 


Leu 


Phe 


Gly 


Gly 


Arg 


Glu 


Asp 


Val 






675 










680 










685 








Ala 


Glu 


Met 


Met 


Phe 


He 


Asp 


Leu 


Glu 


Thr 


Val 


He 


Gin 


Lys 


Leu 


Gly 




690 










695 










700 










Thr 


Leu 


Tyr 


Asp 


Val 


Arg 


Leu 


Ser 


Leu 


Pro 


Glu 


Gly 


Gly 


Tyr 


Ala 


Ala 


705 










710 










715 










720 


He 


Lys 


Ser 


Val 


Cys 


Ala 


Ala 


Ala 


Ser 


Trp 


Ala 


Ala 


Ser 


Cys 


Glu 


Val 
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725 










730 










735 




Pro 


Ser 


Asn 


Thr 


Ser 


Asn 


Met 


He 


Leu 


Ser 


He 


Ala 


Lys 


Met 


Ala 


Phe 








740 










745 










750 






Thr 


Lys 


Tyr 


Tyr 


Gin 


Glu 


Gin 


Asn 


Ser 


Ser 


Ser 


Glu 


Thr 


Asp 


Leu 


Asp 






755 










760 










765 








He 


He 


Leu 


Pro 


Ser 


He 


Gly 


Thr 


Ala 


Asp 


Gly 


Glu 


He 


Glu 


Asn 


Asn 




770 










775 










780 










Leu 


Ser 


Gly 


Val 


Val 


Phe 


Leu 


Arg 


Cys 


Leu 


He 


Thr 


Trp 


Ala 


Asn 


Lys 


785 










790 










795 










800 


He 


Gly 


Val 


Asp 


Lys 


Asn 


Phe 


Thr 


Asn 


Lys 


Leu 


Glu 


His 


Phe 


Leu 


Ala 










805 










810 










815 




Leu 


Arg 


He 


Leu 


Thr 


Lys 


Ala 


Gly 


Asp 


Ser 


Lys 


He 


Gly 


Glu 


Lys 


Tyr 








820 










825 










830 






Glu 


Thr 


Phe 


Pro 


Val 


Arg 


Arg 


Leu 


Asp 


Leu 


Ser 


Glu 


Lys 


Asp 


Leu 


Lys 






835 










840 










845 








Tyr 


He 


Cys 


Lys 


Arg 


Cys 


Gly 


Val 


Lys 


Ser 


Leu 


Lys 


Met 


Glu 


Tyr 


Asp 




850 










855 










860 










Asn 


Asp 


Glu 


Lys 


Leu 


Cys 


Leu 


Arg 


Cys 


Lys 


Gly 


Asn 


Tyr 


Arg 


Met 


Gly 


865 










870 










875 










880 


Lys 


Pro 


Met 


Val 


Tyr 


His 


Trp 


Asp 


Asn 


Lys 


Leu 


Thr 


Arg 


Asp 


Pro 


Arg 










885 










890 










895 




Ala 


Lys 


Thr 


Asp 


Thr 


Thr 


Leu 


Asn 


Leu 


Leu 


Asn 


Ala 


Lys 


Lys 


He 


Asp 








900 










905 










910 






Asp 


Lys 


Val 


Lys 


Glu 


Met 


Ala 


Ser 


Asp 


He 


He 


Gly Ala 


Leu 


Asn 


Leu 






915 










920 










925 








Pro 


Pro 


Thr 


Asp 


Lys 


Asp 


Asn 


Glu 


He 


Ala 


Val 


Ser 


Ala 


Ala 


Ala 


Lys 




930 










935 










940 










Ala 


Val 


Gly 


He 


Leu 


Tyr 


Gly 


Lys 


Thr 


Cys 


Leu 


Leu 


Tyr 


Lys 


Leu 


Leu 


945 










950 










955 










960 


Asn 


Glu 


Gly 


Asn 


He 


Asp 


He 


Pro 


Val 


Ala 


Val 


Cys 


Val 


Glu 


Cys 


Asp 










965 










970 










975 




Cys 


Cys 


Lys 


Ser 


Lys 


Tyr 


Met 


Met 


Ser 


Thr 


Leu 


Gly 


Pro 


Asp 


Lys 


Pro 








980 










985 










990 






Gin 


Asn 


Arg 


Lys 


Cys 


Pro 


Trp 


Cys 


Arg 


Tyr 


Ala 


Asn 


Lys 


Leu 


Val 


Ala 






995 










1000 








1005 






Met 


Gly 


Arg 


Gly 


Gly 


Lys 


Lys 


Leu 


Leu 


Met 


Asp 


Leu 


He 


Glu 


Cys 


Gly 




1010 








1015 








1020 








Ala 


Pro 


Ser 


Leu 


Ala 


Met 


Val 


Glu 


Glu 


Ala 


He 


Arg 


Thr 


Ser 


Gly 


Asp 


1025 








1030 








1035 








1041 


Val 


Met 


Tyr 


Glu 


Glu 


Leu 


Gly 


Glu 


Gly 


Glu 


Glu 


Phe 


Tyr 


He 


He 


Asp 










1045 








1050 








1055 


Tyr 


Phe 


Leu 


Lys 


Leu 


Lys 


Asn 


Thr 


Ala 


He 


Ala 


Glu 


Gly 


Asn 


Lys 


Leu 








1060 








1065 








1070 




Gin 


Gin 


Asn 


Asn 


Asn 


Lys 


Arg 


Pro 


Ala 


Pro 


Leu 


Gin 


Val 


Thr 


Ser 


Pro 






1075 








1080 








1085 






Ser 


Ser 


Pro 


Pro 


Lys 


Lys 


Met 


Arg 


Ser 


Asp 


Leu 


Pro 


Asp 


Ser 


Leu 


Leu 




1090 








1095 








1100 








Ala 


Ala 


He 


Gly 


Glu 


Cys 


Ala 


He 


Glu 


Thr 


Lys 


Glu 


Lys 


Thr 


Thr 


Val 


1105 








1110 








1115 








1121 


Asn 


Leu 


He 


Gly 


Leu 


Gly 


Glu 


Val 


Lys 


Val 


Val 


Glu 


Asn 


Val 


Gly 


Pro 










1125 








1130 








1135 


Asn 


Asp 


Leu 


Asp 


Gly 


Lys 


Asp 


Pro 


Phe 


He 


Ser 


Leu 


Gin 


Glu 


Tyr 


Cys 








1140 








1145 








1150 




Ser 


Trp 


Asp 


Lys 


Phe 


Asn 


Ser 


Leu 


Phe 


Val 


Asn 


Pro 


Trp 


Leu 


Gly 


Tyr 






1155 








1160 








1165 






Arg 


Leu 


Asp 


Glu 


Gin 


Trp 


Asp 


Asp 


Trp 


Asn 


Thr 


Phe 


Leu 


He 


His 


Val 




1170 








1175 








1180 








Lys 


Lys 


Asn 


Asp 


Val 


Trp 


Lys 


Phe 


Leu 


Cys 


Asn 


Lys 


Thr 


Ser 


Pro 


Phe 


1185 








1190 








1195 








1201 


Ser 


Val 


Val 


Val 


Met 


Asn 


Asp 


Gly 


Ser 


Gly 


Leu 


Leu 


Asn 


Val 


Asp 


Asn 



1205 1210 1215 
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Val Asn Val Leu Val Arg Gin Lys lie Cys Val 
1220 1225 



<210> 172 
<211> 294 
<212> DNA 
<213> SHRIMP 

<400> 172 

atggacattt tggaagacat ctacaagagc gcgatcacgc tcgtactaca atcgcccgaa 60 

tttgtgaatg atgtaaaaca ggaagcttct caggtagttg aggggctaat accttcaatt 120 

agagaagctg tctttagacg gcttctagaa gaagaaagga aaaaacacga agacgaggtg 180 

ggagatgtgg aagataaaag acaagcagtg atagacaagg caaatacaat gattacaaca 240 

atggcggcag agtacctgga atctgtagat attttagaag agtttgggtt ttaa 294 

<210> 173 
<211> 93 
<212> PRT 
<213> SHRIMP 

<400> 173 

Met Asp lie Asp lie Tyr Lys Ser Ala lie Thr Leu Val Leu Gin Ser 

15 10 15 

Pro Glu Phe Val Asn Asp Val Lys Gin Glu Ala Ser Gin Val Val Glu 

20 25 30 

Gly Leu lie Pro Ser lie Arg Glu Ala Val Phe Arg Arg Leu Leu Glu 

35 40 45 

Glu Glu Arg Lys Lys His Glu Asp Glu Val Gly Asp Val Glu Asp Lys 

50 55 60 

Arg Gin Ala Val lie Asp Lys Ala Asn Thr Met lie Thr Thr Met Ala 
65 70 75 80 

Ala Glu Tyr Leu Glu Ser Val Asp lie Glu Phe Gly Phe 
85 90 



<210> 174 
<211> 1530 
<212> DNA 
<213> SHRIMP 



<400> 174 

atggcgtccg 

atcgagtcta 

ttgcataaaa 

gcaggcaacc 

aatttctata 

gatttgagga 

actttctccc 

tgcaagtaca 

gtcgaaaaga 

cacgatatag 

agttgtcatg 

aatatgtacc 

gttacacatt 

ccaaacgagg 

accaacaaat 

agaataggag 

acaaacagat 

atgtgttcaa 

tctgcctacg 



ggtttgctat 
tcaaaagtat 
atactccaaa 
cctgcttgga 
caaggaaagg 
atatcaacaa 
ctctatatag 
taattgtaac 
aactgttaag 
cacatcgatt 
accaatgctt 
ccacatttaa 
cggccgaaaa 
tataccttga 
tggatgctat 
acaaaagtac 
tcaatgaaga 
attattgtac 
ggccaacatc 



caagggaatc 
acggagaagc 
acatgagg tg 
aaaaacgtca 
tgctgaatct 
tatagtgaaa 
aa ttctcact 
tgctgattta 
tatgggtagt 
atcctacaaa 
agttgttcca 
aaatt ttgat 
aatggacggt 
cgccgcaagg 
cgatgaagaa 
gtgtctcgtg 
cacaccccta 
aatgagagat 
ttaccc tatc 



gttaaaaact 
gaactcgctg 
gatgaaattg 
ttatttcttc 
gaatctgatt 
agagaagctt 
gatagactat 
ttgatgggtt 
attttagggg 
ggcctccgca 
gtgagtatgt 
caatgcatgg 
aagcatgaac 
aggaaatacc 
gcaagggaag 
tttgcattat 
tattctggta 
gagggtggtt 
atcttcaata 



atagaagaat 
aaggtgtata 
tgaataaaat 
aacatcattc 
ggctcaaaag 
taccccatga 
tcaatgcagc 
gtgggataac 
gcgaatcaat 
tagaaaatcc 
tagggaagat 
cattattttt 
gtaataaggt 
tagaagaaaa 
aatatggaaa 
ctgcacgaga 
cagaaagagg 
tcaggccccg 
ctttatatga 



accgtccatt 
tatcgtctct 
acgcctctca 
acaaatgagg 
actaccagaa 
caagtctttc 
aattcacaac 
caacaacaaa 
ggtaccttta 
tatagtgggt 
tttttcaagt 
gaatgcagtt 
catccatatg 
actagaggaa 
cgaaatagga 
ctttttcctc 
aatcagattc 
tttgatcatg 
tcaattcaat 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 
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gtgcaatatt 
gcaccagaac 
agaatttttt 
ggtgagggaa 
ataattaaag 
gaggaagaag 
tcactagaga 



atccatgtgt 
cagaatcatt 
ctggtgatgg 
agaatcagaa 
catataatga 
aggaacaaca 



ttctggagtt 
agtggacatt 
tgaaacagta 
gtttaaccga 
agaacgaact 
aacagcagca 
ttgggtgtag 



gttttatctt 
gttgtacgtt 
taccaggatg 
gaagagcgca 
aaagaagatg 
acagtgacag 



ttattggcga 
ctataaaaaa 
gacgtagggt 
ccattttaaa 
aagatgaaga 
tagaaagtga 



tgatcagtta 1200 
tccgtctatt 1260 
cgatgttggt 1320 
tgtattgagg 1380 
ggaagaagag 14 4 0 
ttgggatctc 1500 
1530 



<210> 175 
<211> 507 
<212> PRT 
<213> SHRIMP 



<400> 175 

Met Ala Ser Gly 

1 

lie Pro Ser He 
20 

Ala Glu Gly Val 

35 

Glu Val Asp Glu 
50 

Cys Leu Glu Lys 
65 

Asn Phe Tyr Thr 

Arg Leu Pro Glu 
100 

Ala Leu Pro His 
115 

Leu Thr Asp Arg 
130 

He Val Thr Ala 
145 

Val Glu Lys Lys 

Met Val Pro Leu 
180 

Arg He Glu Asn 
195 

Val Pro Val Ser 
210 

Thr Phe Lys Asn 

225 

Val Thr His Ser 

Val He His Met 
260 

Tyr Leu Glu Glu 
275 

Glu Glu Ala Arg 
290 

Lys Ser Thr Cys 
305 

Thr Asn Arg Phe 

Arg Phe Met Cys 
340 

Arg Pro Arg Leu 
355 

He Phe Asn Thr 



Phe Ala He Lys 
5 

He Glu Ser He 

Tyr He Val Ser 
40 

He Val Asn Lys 
55 

Thr Ser Leu Phe 
70 

Arg Lys Gly Ala 
85 

Asp Leu Arg Asn 

Asp Lys Ser Phe 
120 

Leu Phe Asn Ala 
135 

Asp Leu Leu Met 
150 

Leu Leu Ser Met 
165 

His Asp He Ala 

Pro He Val Gly 
200 

Met Leu Gly Lys 
215 

Phe Asp Gin Cys 
230 

Ala Glu Lys Met 
245 

Pro Asn Glu Val 

Lys Leu Glu Glu 
280 

Glu Glu Tyr Gly 

295 

Leu Val Phe Ala 
310 

Asn Glu Asp Thr 
325 

Ser Asn Tyr Cys 

He Met Ser Ala 
360 

Leu Tyr Asp Gin 



Gly He Val Lys 
10 

Lys Ser He Arg 
25 

Leu His Lys Asn 

He Arg Leu Ser 
60 

Leu Gin His His 

75 

Glu Ser Glu Ser 
90 

He Asn Asn He 
105 

Thr Phe Ser Pro 

Ala He His Asn 
140 

Gly Cys Gly He 
155 

Gly Ser He Leu 
170 

His Arg Leu Ser 
185 

Ser Cys His Asp 

He Phe Ser Ser 
220 

Met Ala Leu Phe 

235 

Asp Gly Lys His 
250 

Tyr Leu Asp Ala 
265 

Thr Asn Lys Leu 

Asn Glu He Gly 
300 

Leu Ser Ala Arg 
315 

Pro Lys Gly Thr 
330 

Thr Met Arg Asp 
345 

Tyr Gly Pro Thr 
Phe Asn Val Gin 



Asn 


Tyr 


Arg 


Arg 






15 




Arg 


Ser 


Glu 


Leu 




30 






Thr 


Pro 


Lys 


His 


45 








Ala 


Gly 


Asn 


Pro 


Ser 


Gin 


Met 


Arg 








80 


Asp 


Trp 


Leu 


Lys 






95 




Val 


Lys 


Arg 


Glu 




110 






Leu 


Tyr 


Arg 


He 


125 








Cys 


Lys 


Tyr 


He 


Thr 


Asn 


Asn 


Lys 








160 


Gly 


Gly 


Glu 


Ser 






175 




Tyr 


Lys 


Gly 


Leu 




190 






Gin 


Cys 


Leu 


Val 


205 








Asn 


Met 


Tyr 


Pro 


Leu 


Asn 


Ala 


Val 








240 


Glu 


Arg 


Asn 


Lys 






255 




Ala 


Arg 


Arg 


Lys 




270 






Asp 


Ala 


He 


Asp 


285 








Arg 


He 


Gly 


Asp 


Asp 


Phe 


Phe 


Leu 








320 


Glu 


Arg 


Gly 


He 






335 




Glu 


Gly 


Gly 


Phe 




350 






Ser 


Tyr 


Pro 


He 


365 








Tyr 


Tyr 


Pro 


Cys 
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370 375 380 

Val Ser Gly Val Val Leu Ser Phe lie Gly Asp Asp Gin Leu Ala Pro 
385 390 395 400 

Glu Pro Glu Ser Leu Val Asp lie Val Val Arg Ser lie Lys Asn Pro 

405 410 415 

Ser lie Arg lie Phe Ser Gly Asp Gly Glu Thr Val Tyr Gin Asp Gly 

420 425 430 

Arg Arg Val Asp Val Gly Gly Glu Gly Lys Asn Gin Lys Phe Asn Arg 

435 440 445 

Glu Glu Arg Thr lie Leu Asn Val Leu Arg lie lie Lys Ala Tyr Asn 

450 455 460 

Glu Glu Arg Thr Lys Glu Asp Glu Asp Glu Glu Glu Glu Glu Glu Glu 
465 470 475 480 

Glu Glu Glu Gin Gin Thr Ala Ala Thr Val Thr Val Glu Ser Asp Trp 

485 490 495 

Asp Leu Ser Leu Glu Arg Gly Glu Asn Trp Val 
500 505 



<210> 176 
<211> 246 
<212> DNA 
<213> SHRIMP 

<400> 176 

atgacttgtc cagaaatctc taaacacatt tctggaacag acagacgttt ctggaacacg 60 
gctgacccag gtggcctcag ctatcctttc aaccctcttt ttacccttca tctccatctc 120 
aaaaactttt caaaaatttt ttcagctcac tccagtttag ggggtggacc gctgactagg 180 
ccttatgtca agttcgaagg gtggaccgct gggtcgaccc aacgtcagat tacagagagg 240 
agctag 246 

<210> 177 
<211> 77 
<212> PRT 
<213> SHRIMP 

<400> 177 

Met Thr Cys Pro Glu lie Ser Lys His Gly Thr Asp Arg Arg Phe Trp 

15 10 15 

Asn Thr Ala Asp Pro Gly Gly Leu Ser Tyr Pro Phe Asn Pro Leu Phe 

20 25 30 

Thr Leu His Leu His Leu Lys Asn Phe Ser Lys lie Phe Ser Ala His 

35 40 45 

Ser Ser Leu Gly Gly Gly Pro Leu Trp Tyr Val Lys Phe Glu Gly Trp 

50 55 60 

Thr Ala Gly Ser Thr Gin Arg Gin lie Thr Glu Arg Ser 
65 70 75 



<210> 178 
<211> 738 
<212> DNA 
<213> SHRIMP 

<400> 178 

atggtttcca ccaggtctat ggaagcaaaa 
tctcccacga ccagtaagag aaaggcggag 
gaatcagtag aaacacaccc gccgagtaag 
attgatgaaa atgttgatgg tgatgtgcag 
atggagagag aaagacttgc cgaagccatg 
gccgcaacgg aagcgaggaa agaaatagcc 



gctgcagcag cagcaaaagc aaaagaagtt 60 
gacctcactg aaggaacaga agaagaagaa 120 
ctccccagag tcgatgaaga tgaagtctat 180 
atcctcgcct catcaatcga agtcgccaga 240 
gtccgagaca taaaaatcga ggaagaaaaa 300 
tctcgcctaa tttataaaga aatggtatat 360 
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cttttgcctc 
aacgaaatga 
tttgaattcc 
ggttcgagat 
atgtgtaata 
gttgaagtta 
atagtaaaac 



aactggaaaa 
ccattacaga 
ctatactgac 
tctacaacga 
aattcataga 
gtgtatcaag 
acacctag 



catgactaac 
ccgcacgttc 
tgatattgct 
tatgaagatt 
gtctgttgtg 
tgaacttgaa 



cgcctccgtc 
agtgatttgc 
ttccttgccc 
ggacctataa 
caaaaggtga 
gggtcacctt 



cgagatcact 
agatattcaa 
gtgaaaaatc 
cagcctacaa 
aggcagaaat 
tttgggattt 



tctcaggcac 420 
caaagtcact 480 
acgtgtcgag 540 
attgaatttg 600 
atccccattt 660 
caagcaaaga 720 
738 



<210> 179 
<211> 245 
<212> PRT 
<213> SHRIMP 



<400> 179 



Met 


Val 


Ser 


Thr 


Arg 


Ser 


Met 


Glu 


Ala 


Lys 


Ala 


Ala 


Ala 


Ala 


Ala 


Lys 


1 








5 










10 










15 




Ala 


Lys 


Glu 


Val 


Ser 


Pro 


Thr 


Thr 


Ser 


Lys 


Arg 


Lys 


Ala 


Glu 


Asp 


Leu 








20 










25 










30 






Thr 


Glu 


Gly 


Thr 


Glu 


Glu 


Glu 


Glu 


Glu 


Ser 


Val 


Glu 


Thr 


His 


Pro 


Pro 






35 










40 










45 








Ser 


Lys 


Leu 


Pro 


Arg 


Val 


Asp 


Glu 


Asp 


Glu 


Val 


Tyr 


He 


Asp 


Glu 


Asn 




50 










55 










60 










Val 


Asp 


Gly 


Asp 


Val 


Gin 


He 


Leu 


Ala 


Ser 


Ser 


He 


Glu 


Val 


Ala 


Arg 


65 










70 










75 










80 


Met 


Glu 


Arg 


Glu 


Arg 


Leu 


Ala 


Glu 


Ala 


Met 


Val 


Arg 


Asp 


He 


Lys 


He 










85 










90 










95 




Glu 


Glu 


Glu 


Lys 


Ala 


Ala 


Thr 


Glu 


Ala 


Arg 


Lys 


Glu 


He 


Ala 


Ser 


Arg 








100 










105 










110 






Leu 


He 


Tyr 


Lys 


Glu 


Met 


Val 


Tyr 


Leu 


Leu 


Pro 


Gin 


Leu 


Glu 


Asn 


Met 






115 










120 










125 








Thr 


Asn 


Arg 


Leu 


Arg 


Pro 


Arg 


Ser 


Leu 


Leu 


Arg 


His 


Asn 


Glu 


Met 


Thr 




130 










135 










140 










lie 


Thr 


Asp 


Arg 


Thr 


Phe 


Ser 


Asp 


Leu 


Gin 


He 


Phe 


Asn 


Lys 


Val 


Thr 


145 










150 










155 










160 


Phe 


Glu 


Phe 


Pro 


He 


Leu 


Thr 


Asp 


He 


Ala 


Phe 


Leu 


Ala 


Arg 


Glu 


Lys 










165 










170 










175 




Ser 


Arg 


Val 


Glu 


Gly 


Ser 


Arg 


Phe 


Tyr 


Asn 


Asp 


Met 


Lys 


He 


Gly 


Pro 








180 










185 










190 






He 


Thr 


Ala 


Tyr 


Lys 


Leu 


Asn 


Leu 


Met 


Cys 


Asn 


Lys 


Phe 


He 


Glu 


Ser 






195 










200 










205 








Val 


Val 


Gin 


Lys 


Val 


Lys 


Ala 


Glu 


He 


Ser 


Pro 


Phe 


Val 


Glu 


Val 


Ser 




210 










215 










220 










Val 


Ser 


Ser 


Glu 


Leu 


Glu 


Gly 


Ser 


Pro 


Phe 


Trp 


Asp 


Phe 


Lys 


Gin 


Arg 


225 










230 










235 










240 


He 


Val 


Lys 


His 


Thr 

























245 



<210> 180 
<211> 1221 
<212> DNA 
<213> SHRIMP 



<400> 180 

atgtctcaca 

accactacag 

gcttctgctg 

ccacaaggta 

tctactggaa 

aagaacaagt 

cgtagcagtg 



tcaactctac 
cccctatgat 
ttacaagtat 
tctctgttac 
cgaggatgtc 
ccaagtctaa 
acgaggacga 



ctctgctgcc 
tgctgccgcc 
caactctaat 
ggccatgcct 
ttctacaccc 
gaagaaggat 
accatctctt 



acgacttcat 
agagctgctg 
tctacgtctt 
cccgtgccag 
aatgtggatg 
tcaaagagga 
gttatcgacg 



ccaacactct 
ccatcgcctc 
cttctgcaat 
cacttaca tc 
ttatacctgt 
agaagaacca 
acggttctgg 



gccgatttgc 60 
tcggacttct 120 
gttccgagta 180 
tctgactgaa 240 
tcctggcccc 300 
gaatggcaac 360 
aagacagtct 420 
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aagaacaaga 
gacactctcg 
attgacgcag 
gcactcactt 
acttcttacg 
acagttagca 
agtaaggtga 
gctagctgta 
tttgatttta 
ttttttgaga 
agaggcatct 
caacaacgcc 
acaacttctg 
caagaaatca 



aata ttcttg 
ccccacctag 
ggctaagtaa 
ctggagttgg 
gagattgcga 
gaggcacatc 
tccataccgt 
tggcctacgc 
tggctatgac 
agcgcgataa 
ttacacgcac 
gtgaagctgc 
ctgccggagt 
gataccagta 



ggtcacatct 
gcccttcctt 
tccagtcact 
agataatgga 
aacacccata 
tcttctgcac 
taaaccaaga 
tgtggacctt 
tgcatgggca 
ggacatcgga 
cactgaaatg 
tgccgctgcc 
tgg tgctact 
a 



cttgctacta 
cccacacccg 
cgccaaatca 
ctatatcctt 
cctggacctg 
agagaagagg 
aacagcaagt 
ctaaacaacc 
gcccgtcaac 
gaatatagga 
aataaacgcg 
gctgccaccg 
tcttctgcta 



ctacggctga 
aagaaggaaa 
ccggagaagt 
cccacttcac 
cttttgtcct 
cagaattctt 
actccaatat 
ataatatcac 
gttgtggaga 
ataaggtggt 
caaagattat 
gtgccaccgc 
ctactaactc 



aagaaacaac 
tatgtctgaa 
ttatagcgct 
ggttgctgac 
cgacgacggg 
gaatgatgga 
tcaacgtgcc 
ctctgaccaa 
aatggccaag 
ccaatacaac 
cctggaacaa 
ccctatccct 
tctcgaatat 



480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1221 



<210> 181 
<211> 402 
<212> PRT 
<213> SHRIMP 



<400> 181 
Met Ser His lie 
1 

Leu Pro lie Cys 
20 

Ala Ala lie Ala 
35 

Ser Asn Ser Thr 
50 

Ser Val Thr Ala 
65 

Ser Thr Gly Thr 

Val Pro Gly Pro 
100 

Arg Lys Lys Asn 
115 

Ser Leu Val lie 
130 

Tyr Ser Trp Val 
145 

Asp Thr Leu Ala 

Asn Met Ser Glu 
180 

He Thr Gly Glu 
195 

Asn Gly Pro Ser 
210 

Glu Thr Pro He 
225 

Ser Arg Gly Thr 

Asp Gly Ser Lys 
260 

Ser Asn He Gin 
275 

Leu Asn Asn His 
290 

Thr Ala Trp Ala 

305 



Asn Ser Thr Ser 
5 

Thr Thr Thr Ala 

Ser Arg Thr Ser 
40 

Ser Ser Ser Ala 
55 

Met Pro Pro Val 
70 

Arg Met Ser Ser 
85 

Lys Asn Lys Ser 

Gin Asn Gly Asn 
120 

Asp Asp Gly Ser 
135 

Thr Ser Leu Ala 
150 

Pro Pro Arg Pro 
165 

He Asp Ala Gly 

Val Tyr Ser Ala 
200 

His Phe Thr Val 

215 

Pro Gly Pro Ala 
230 

Ser Leu Leu His 
245 

Val He His Thr 

Arg Ala Ala Ser 
280 

Asn He Thr Ser 
295 

Ala Arg Gin Arg 

310 



Ala Ala Thr Thr 
10 

Pro Met He Ala 
25 

Ala Ser Ala Val 

Met Phe Arg Val 
60 

Pro Ala Leu Thr 
75 

Thr Pro Asn Val 
90 

Lys Ser Lys Lys 
105 

Arg Ser Ser Asp 

Gly Arg Gin Ser 
140 

Thr Thr Thr Ala 
155 

Phe Leu Pro Thr 
170 

Leu Ser Asn Pro 

185 

Ala Leu Thr Ser 

Ala Asp Thr Ser 
220 

Phe Val Leu Asp 
235 

Arg Glu Glu Ala 
250 

Val Lys Pro Arg 

265 

Cys Met Ala Tyr 

Asp Gin Phe Asp 
300 

Cys Gly Glu Met 
315 



Ser Ser Asn Thr 
15 

Ala Ala Arg Ala 
30 

Thr Ser He Asn 
45 

Pro Gin Gly He 

Ser Leu Thr Glu 
80 

Asp Val He Pro 
95 

Lys Asp Ser Lys 
110 

Glu Asp Glu Pro 
125 

Lys Asn Lys Lys 

Glu Arg Asn Asn 
160 

Pro Glu Glu Gly 
175 

Val Thr Arg Gin 
190 

Gly Val Gly Asp 
205 

Tyr Gly Asp Cys 

Asp Gly Thr Val 
240 

Glu Phe Leu Asn 
255 

Asn Ser Lys Tyr 
270 

Ala Val Asp Leu 
285 

Phe Met Ala Met 

Ala Lys Phe Phe 
320 
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Glu 


Lys 


Arg 


Asp 


Lys 


Asp 


He 


Gly 










325 








Tyr 


Asn 


Arg 


Gly 


He 


Phe 


Thr 


Arg 








340 










Lys 


lie 


He 


Gin 


Gin 


Gin 


Arg 


Arg 






355 










360 


Thr 


Gly 


Ala 


Thr 


Ala 


Pro 


He 


Pro 




370 










375 




Ala 


Thr 


Ser 


Ser 


Ala 


Thr 


Thr 


Asn 


385 










390 






Tyr 


Gin 















Glu Tyr Arg Asn Lys Val Val Gin 

330 335 
Thr Thr Glu Met Asn Lys Arg Ala 
345 350 
Glu Ala Ala Ala Ala Ala Ala Ala 
365 

Thr Thr Ser Ala Ala Gly Val Gly 
380 

Ser Leu Glu Tyr Gin Glu He Arg 
395 400 



<210> 182 
<211> 1617 
<212> DNA 
<213> SHRIMP 



<400> 182 

atggaagact 

gaaaattatg 

ctaaaggaag 

ggtaacgaaa 

acatccagtc 

aaatggaacg 

gctctagtta 

ctttcaaatg 

gggtgtgggt 

gtgcaacacg 

tacaatggct 

gtaccaatgg 

attggaaatt 

aatcaaaaat 

ctagttagcg 

gcctcagaag 

tccagatatg 

ttgtgcagaa 

gtctcttcgg 

tccataactg 

aaccatttca 

aggttgttga 

aatggaaaag 

ggcaaggaca 

cgagagcgtc 

ggggaagaag 

gaaggcgatt 



ttaaacaatt 
aacgggtact 
gacattttat 
gattgagaca 
tcccaaaaca 
aatcactacc 
atgacaagaa 
cggccgttaa 
tttctcctag 
gtggtacctc 
taagtatcgc 
ataaacttgg 
acatgtcatg 
tagataattt 
aaagaagtgt 
aacaaaggat 
acggcggtgc 
ccaatggaaa 
acgactacag 
gctcagatgc 
ttgcatccgg 
tggagatggt 
tatatcaaga 
taataaaaaa 
gcatgatttt 
aagaagtagc 
atgttgccct 



aaaagtaaaa 
attaacattc 
agttcgtctt 
attaacagga 
aggtagtttc 
aagcccattc 
ctttaaattt 
gaaatgtgat 
aaattgtacc 
ttctcgtgta 
aaaccccata 
attacttttc 
ccttttcaat 
cgaaaaggaa 
tctggaagaa 
ttctcttcga 
ctgtctggtc 
tggttccttt 
aaagagggac 
acctatatgc 
aaagggtaat 
cagagaggtt. 
tggtgtagaa 
ggaagaaaca 
caacacagtt 
cacaagcagt 
tttgaacaag 



aatggtattt 
aaatcagtca 
agagacaagg 
gatcctacgc 
ttagaagatg 
caggaaatga 
tcacccctat 
ta tatgataa 
cgtactctta 
tcagtgtatg 
gttggcagtt 
tacaacagca 
gctgcagt tg 
atcaga t ttg 
aaacttaaag 
ga tgtgcata 
tttgcctttt 
tactctgcca 
gtggatgaac 
atcagagata 
gaaatatcat 
actggatcag 
ataaaagtga 
ttaccagagg 
aaggcaattg 
ggaggaacaa 
gcatgcaaag 



gtttgtctgg 
agagtgtcag 
aagtactcca 
ttcagattgg 
aagaccctaa 
acaaaattgt 
acagaatcat 
tcacaacaga 
agaatatgga 
atatctgtga 
tttcaaatat 
cacacccgtc 
tatacacgct 
caaaaaatga 
aatccaaaaa 
aaaagtcctc 
ctgaccgaga 
cagaagaagg 
gtaggcccag 
gtatacgaaa 
tcatcgatcc 
acatcaaaat 
ttgacccctc 
aggaaaggaa 
agacgtacaa 
agagaaagag 
aaattaaagt 



ggaaaatacc 
gagaagtgag 
catcaagaac 
actaaaatac 
ttatggaaaa 
ggaagaaaag 
acatgaacgt 
cttcttagta 
acaagtgtta 
taggttaacg 
gtgcctaatt 
agctaaaagc 
agaaaagagt 
agtcaacctt 
gctatatgct 
aattgcatca 
tttctccttg 
aatcagatac 
attggtcatg 
ccattttaat 
tccgaatgaa 
cttcatggat 
ttctaaagaa 
gcgtctgcgc 
cgaggaacgt 
ggaggagaaa 
ttgttga 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1617 



<210> 183 
<211> 534 
<212> PRT 
<213> SHRIMP 



<400> 183 

Met Glu Asp Phe Lys Gin Leu Lys Val Lys Asn Gly He Cys Leu Ser 

15 10 15 

Gly Glu Asn Thr Glu Asn Tyr Glu Arg Val Leu Leu Thr Phe Lys Ser 

20 25 30 

Val Lys Ser Val Arg Arg Ser Glu Leu Lys Glu Gly His Phe He Val 

35 40 45 

Arg Leu Arg Asp Lys Glu Val Leu His He Lys Asn Gly Asn Glu Arg 
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50 










55 










60 










Leu 


Arg 


Gin 


Leu 


Thr 


Gly 


Asp 


Pro 


Thr 


Leu 


Gin 


He 


Gly 


Leu 


Lys 


Tyr 


65 










70 










75 










80 


Thr 


Ser 


Ser 


Leu 


Pro 


Lys 


Gin 


Gly 


Ser 


Phe 


Leu 


Glu 


Asp 


Glu 


Asp 


Pro 










85 










90 










95 




Asn 


Tyr 


Gly 


Lys 


Lys 


Trp 


Asn 


Glu 


Ser 


Leu 


Pro 


Ser 


Pro 


Phe 


Gin 


Glu 








100 










105 










110 






Met 


Asn 


Lys 


He 


Val 


Glu 


Glu 


Lys 


Ala 


Leu 


Val 


Asn 


Asp 


Lys 


Asn 


Phe 






115 










120 










125 








Lys 


Phe 


Ser 


Pro 


Leu 


Tyr 


Arg 


He 


He 


His 


Glu 


Arg 


Leu 


Ser 


Asn 


Ala 




130 










135 










140 










Ala 


Val 


Lys 


Lys 


Cys 


Asp 


Tyr 


Met 


He 


He 


Thr 


Thr 


Asp 


Phe 


Leu 


Val 


145 










150 










155 










160 


Gly 


Cys 


Gly 


Phe 


Ser 


Pro 


Arg 


Asn 


Cys 


Thr 


Arg 


Thr 


Leu 


Lys 


Asn 


Met 










165 










170 










175 




Glu 


Gin 


Val 


Leu 


Val 


Gin 


His 


Gly 


Gly 


Thr 


Ser 


Ser 


Arg 


Val 


Ser 


Val 








180 










185 










190 






Tyr 


Asp 


He 


Cys 


Asp 


Arg 


Leu 


Thr 


Tyr 


Asn 


Gly 


Leu 


Ser 


He 


Ala 


Asn 






195 










200 










205 








Pro 


He 


Val 


Gly 


Ser 


Phe 


Ser 


Asn 


Met 


Cys 


Leu 


He 


Val 


Pro 


Met 


Asp 




210 










215 










220 










Lys 


Leu 


Gly 


Leu 


Leu 


Phe 


Tyr 


Asn 


Ser 


Thr 


His 


Pro 


Ser 


Ala 


Lys 


Ser 


225 










230 










235 










240 


lie 


Gly 


Asn 


Tyr 


Met 


Ser 


Cys 


Leu 


Phe 


Asn 


Ala 


Ala 


Val 


Val 


Tyr 


Thr 










245 










250 










255 




Leu 


Glu 


Lys 


Ser 


Asn 


Gin 


Lys 


Leu 


Asp 


Asn 


Phe 


Glu 


Lys 


Glu 


He 


Arg 








260 










265 










270 






Phe 


Ala 


Lys 


Asn 


Glu 


Val 


Asn 


Leu 


Leu 


Val 


Ser 


Ser 


Val 


Leu 


Glu 


Glu 






275 










280 










285 








Lys 


Leu 


Lys 


Glu 


Ser 


Lys 


Lys 


Leu 


Tyr 


Ala 


Ala 


Ser 


Glu 


Glu 


Gin 


Arg 




290 










295 










300 










He 


Ser 


Leu 


Arg 


Asp 


Val 


His 


Lys 


Lys 


Ser 


Ser 


He 


Ala 


Ser 


Ser 


Arg 


305 










310 










315 










320 


Tyr 


Asp 


Gly 


Gly 


Ala 


Cys 


Leu 


Val 


Phe 


Ala 


Phe 


Ser 


Asp 


Arg 


Asp 


Phe 










325 










330 










335 




Ser 


Leu 


Leu 


Cys 


Arg 


Thr 


Asn 


Gly 


Asn 


Gly 


Ser 


Phe 


Tyr 


Ser 


Ala 


Thr 








340 










345 










350 






Glu 


Glu 


Gly 


He 


Arg 


Tyr 


Val 


Ser 


Ser 


Asp 


Asp 


Tyr 


Arg 


Lys 


Arg 


Asp 






355 










360 










365 








Val 


Asp 


Glu 


Arg 


Arg 


Pro 


Arg 


Leu 


Val 


Met 


Ser 


He 


Thr 


Gly 


Ser 


Asp 




370 










375 










380 










Ala 


Pro 


He 


Cys 


He 


Arg 


Asp 


Ser 


He 


Arg 


Asn 


His 


Phe 


Asn 


Asn 


His 


385 










390 










395 










400 


Phe 


He 


Ala 


Ser 


Gly 


Lys 


Gly 


Asn 


Glu 


He 


Ser 


Phe 


He 


Asp 


Pro 


Pro 










405 










410 










415 




Asn 


Glu 


Arg 


Leu 


Leu 


Met 


Glu 


Met 


Val 


Arg 


Glu 


Val 


Thr 


Gly 


Ser 


Asp 








420 










425 










430 






He 


Lys 


He 


Phe 


Met 


Asp 


Asn 


Gly 


Lys 


Val 


Tyr 


Gin 


Asp 


Gly 


Val 


Glu 






435 










440 










445 








He 


Lys 


Val 


He 


Asp 


Pro 


Ser 


Ser 


Lys 


Glu 


Gly 


Lys 


Asp 


He 


He 


Lys 




450 










455 










460 










Lys 


Glu 


Glu 


Thr 


Leu 


Pro 


Glu 


Glu 


Glu 


Arg 


Lys 


Arg 


Leu 


Arg 


Arg 


Glu 


465 










470 










475 










480 


Arg 


Arg 


Met 


He 


Phe 


Asn 


Thr 


Val 


Lys 


Ala 


He 


Glu 


Thr 


Tyr 


Asn 


Glu 










485 










490 










495 




Glu 


Arg 


Gly 


Glu 


Glu 


Glu 


Glu 


Val 


Ala 


Thr 


Ser 


Ser 


Gly 


Gly 


Thr 


Lys 








500 










505 










510 






Arg 


Lys 


Arg 


Glu 


Glu 


Lys 


Glu 


Gly 


Asp 


Tyr 


Val 


Leu 


Asn 


Lys 


Ala 


Cys 






515 










520 










525 








Lys 


Glu 


He 


Lys 


Val 


Cys 























530 
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<210> 184 
<211> 1386 
<212> DNA 
<213> SHRIMP 



<400> 184 

atggactcat 

cttccgccca 

atggatatcg 

cccgacggct 

agtgtaccta 

aaccatagtg 

ttcttaagat 

gagagaacga 

cgtggaactg 

agaaaccaac 

gaaggtttta 

aacgaaactg 

aatatagatg 

atggactctt 

gattcatctg 

ataagacgtc 

ggaagaaatt 

aaaagagagg 

caagtgccca 

caccaccacc 

tccaacaatt 

cgtgatcgca 

tcaaggtata 

tattag 



ctgcatctgt 
gacgtgccac 
aacctatggg 
gttatggaca 
ctggaacgat 
cagaaaaaag 
attgtga tgt 
ttattgactc 
gtggatttgg 
aagaagagaa 
ttgacattcc 
gtgtgacata 
tcgtcagaga 
ctgactcatc 
actctgaatc 
atcagtatca 
ctgagaaaat 
acgaacccag 
aggactatta 
gccaccacca 
cttctgacgt 
actctcgatt 
caactttcga 



cgtgtttatg 
gcccggttct 
actggccaag 
aattgtgtca 
tgatgtggat 
tgtgccaatc 
cgaggaagaa 
tagttctaaa 
atctacagat 
caaaaaggaa 
ttttcttgaa 
ccctaatacg 
attggaagct 
cgattcttct 
atctgatgat 
ccggcgccag 
ggagatggac 
atacgaagaa 
ttgtgagcaa 
acaccgccac 
tactgcttac 
ctcagaaaga 
cccttataga 



agattcgccc 
gtcgcctacg 
atctctactg 
cgttctggga 
tataggggag 
agaaagggaa 
cagattgtgt 
aaggacaaca 
aacccaaatt 
gaattggaag 
ggtttcgaaa 
aatcaagatg 
gaatttagta 
tcttcttcct 
tcagaaggag 
ttgagttatt 
agagtaactc 
agagaaagat 
tacgaacact 
caacaccaga 
gtcaatgaaa 
cccaacaatg 
tatggcgcaa 



ctcccgggga 
acctatttcc 
gatatggaat 
tgacatggaa 
aattgaaagt 
ccagcattgc 
atattaatga 
aaaatcaagc 
ttactgaaac 
aaggggagat 
atatcctcgc 
tggaagaaaa 
gtggaattgg 
ctgactcatc 
gggataataa 
cggatgacgt 
acataaaaac 
atattcatcc 
acgacgcccc 
ggcactttaa 
attcccccac 
gcggttataa 
gaagagggcg 



ggaaactgca 
ctctgaagaa 
agacaagttt 
gaacaacact 
ga ttctgcgc 
ccagttgatt 
aaccacggga 
aagaagcgtg 
caccgtctca 
cgtagaaatg 
agaacaaagc 
agatactaaa 
gagtggctcc 
cgattcgtct 
ggtccgaaga 
caatggaggg 
tgaacacata 
aagaagaatg 
tgctgctgct 
ccaaccccgc 
gaggccatgc 
ccggatcaac 
tggaggagta 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1386 



<210> 185 
<211> 457 
<212> PRT 
<213> SHRIMP 



<400> 185 
Met Asp Ser Ser 
1 

Glu Glu Thr Ala 

20 

Tyr Asp Leu Phe 

35 

Ala Lys lie Ser 
50 

Tyr Gly Gin He 
65 

Ser Val Pro Thr 

Val He Leu Arg 
100 

Gly Thr Ser He 
115 

Glu Glu Gin He 
130 

He Asp Ser Ser 
145 

Arg Gly Thr Gly 



Ala Ser Val Val 
5 

Leu Pro Pro Arg 

Pro Ser Glu Glu 
40 

Thr Gly Tyr Gly 

55 

Val Ser Arg Ser 
70 

Gly Thr He Asp 
85 

Asn His Ser Ala 

Ala Gin Leu He 
120 

Val Tyr He Asn 
135 

Ser Lys Lys Asp 
150 

Gly Phe Gly Ser 
165 



Phe Met Arg Phe 
10 

Arg Ala Thr Pro 
25 

Met Asp He Glu 

He Asp Lys Phe 
60 

Gly Met Thr Trp 
75 

Val Asp Tyr Arg 
90 

Glu Lys Ser Val 
105 

Phe Leu Arg Tyr 

Glu Thr Thr Gly 
140 

Asn Lys Asn Gin 
155 

Thr Asp Asn Pro 
170 



Ala Pro Pro Gly 
15 

Gly Ser Val Ala 
30 

Pro Met Gly Leu 
45 

Pro Asp Gly Cys 

Lys Asn Asn Thr 
80 

Gly Glu Leu Lys 
95 

Pro He Arg Lys 
110 

Cys Asp Val Glu 
125 

Glu Arg Thr"^ He 

Ala Arg Ser Val 
160 

Asn Phe Thr Glu 
175 
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Thr 


Thr 


Val 


Ser 


Arg 


Asn 


Gin 


Gin 


Glu 


Glu 


Asn 


Lys 


Lys 


Glu 


Glu 


Leu 








180 










185 










190 






Glu 


Glu 


Gly 


Glu 


He 


Val 


Glu 


Met 


Glu 


Gly 


Phe 


He 


Asp 


He 


Pro 


Phe 






195 










200 










205 








Leu 


Glu 


Gly 


Phe 


Glu 


Asn 


He 


Leu 


Ala 


Glu 


Gin 


Ser 


Asn 


Glu 


Thr 


Gly 




210 










215 










220 










Val 


Thr 


Tyr 


Pro 


Asn 


Thr 


Asn 


Gin 


Asp 


Val 


Glu 


Glu 


Lys 


Asp 


Thr 


Lys 


225 










230 










235 










240 


Asn 


lie 


Asp 


Val 


Val 


Arg 


Glu 


Leu 


Glu 


Ala 


Glu 


Phe 


Ser 


Ser 


Gly 


He 










245 










250 










255 




Gly 


Ser 


Gly 


Ser 


Met 


Asp 


Ser 


Ser 


Asp 


Ser 


Ser 


Asp 


Ser 


Ser 


Ser 


Ser 








260 










265 










270 






Ser 


Ser 


Asp 


Ser 


Ser 


Asp 


Ser 


Ser 


Asp 


Ser 


Ser 


Asp 


Ser 


Glu 


Ser 


Ser 






275 










280 










285 








Asp 


Asp 


Ser 


Glu 


Gly 


Gly 


Asp 


Asn 


Lys 


Val 


Arg 


Arg 


He 


Arg 


Arg 


His 




290 










295 










300 










Gin 


Tyr 


His 


Arg 


Arg 


Gin 


Leu 


Ser 


Tyr 


Ser 


Asp 


Asp 


Val 


Asn 


Gly 


Gly 


305 










310 










315 










320 


Gly 


Arg 


Asn 


Ser 


Glu 


Lys 


Met 


Glu 


Met 


Asp 


Arg 


Val 


Thr 


His 


He 


Lys 










325 










330 










335 




Thr 


Glu 


His 


lie 


Lys 


Arg 


Glu 


Asp 


Glu 


Pro 


Arg 


Tyr 


Glu 


Glu 


Arg 


Glu 








340 










345 










350 






Arg 


Tyr 


lie 


His 


Pro 


Arg 


Arg 


Met 


Gin 


Val 


Pro 


Lys 


Asp 


Tyr 


Tyr 


Cys 






355 










360 










365 








Glu 


Gin 


Tyr 


Glu 


His 


Tyr 


Asp 


Ala 


Pro 


Ala 


Ala 


Ala 


His 


His 


His 


Arg 




370 










375 










380 










His 


His 


Gin 


His 


Arg 


His 


Gin 


His 


Gin 


Arg 


His 


Phe 


Asn 


Gin 


Pro 


Arg 


385 










390 










395 










400 


Ser 


Asn 


Asn 


Ser 


Ser 


Asp 


Val 


Thr 


Ala 


Tyr 


Val 


Asn 


Glu 


Asn 


Ser 


Pro 










405 










410 










415 




Trp 


Cys 


Arg 


Asp 


Arg 


Asn 


Ser 


Arg 


Phe 


Ser 


Pro 


Asn 


Asn 


Gly 


Gly 


Tyr 








420 










425 










430 






Asn 


Arg 


lie 


Asn 


Ser 


Arg 


Tyr 


Thr 


Thr 


Phe 


Asp 


Pro 


Tyr 


Arg 


Tyr 


Gly 






435 










440 










445 








Ala 


Arg 


Arg 


Gly 


Arg 


Gly 


Gly 


Val 


Tyr 

















450 455 



<210> 186 
<211> 1014 
<212> DNA 
<213> SHRIMP 



<400> 186 

atgtcctctt 

tgcacatcct 

gaacttggaa 

ttaaacaaaa 

ttctttcttc 

aaactggcca 

tccagaatga 

cgcaatatcc 

agagaacaga 

gaaacccctt 

ttaaagtatg 

atgcctactg 

aatcctagta 

gataaat tat 

caacattgta 

agctctgatt 

tatttggtac 



ctcaaggttt 
cttccttgtc 
tacctcttgt 
gggacacaat 
gttttggaaa 
atgctatcgc 
tgacgactgc 
aaaaatttga 
taggtttaag 
tctttgcacc 
attctgagtc 
aggatgtaaa 
tgaggttagt 
ccaaggaaca 
tcattggagg 
gtggatttaa 
ttttattttt 



gaataataat 
tttagaggag 
tgaaggaaag 
acgtgtaatt 
gaagaagact 
cctatccatg 
aaagaagatc 
agatgtacat 
tgtattaggt 
ctttaataaa 
tgtttctatt 
gacatcaaga 
gtttcgtccc 
agactcggct 
gagaaatgta 
aaagatgtta 
tcatataatc 



atgtgcacca 
agtgtggatt 
gaagtactac 
ggtgacgagc 
tttaatccac 
ggtgttgtcc 
cctgttccag 
tatatagata 
aaaaatgatg 
tctaaaatag 
ataagagatg 
agttgtataa 
atgtactgga 
ttgattgaaa 
ttattgtatt 
actaatgatg 
tgtcctatta 



cagaaatctt 
atttagaaaa 
tggaatttgc 
aaggagacgt 
aaacaaaaat 
cagaacctgc 
aatcatacaa 
tcaattttca 
tccaaaagaa 
gaggtgaatg 
tgtttaattt 
acccttccca 
gaaattctaa 
agtatatggg 
gtataactgc 
aaataaaaca 
tacaatccaa 



gctgcccaaa 
ggattttgaa 
ctacaaaata 
atgtagcgtc 
gtggctagtg 
ctgcacgtgt 
aaatgttaat 
gtcctttgta 
gaagaaggaa 
catagaagat 
attgggtgaa 
caatgatacg 
gctggtcatg 
aggagaacat 
actatgtttt 
attgatatgg 
atga 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1014 
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<210> 187 
<211> 335 
<212> PRT 
<213> SHRIMP 



<400> 187 



Met 


Ser 


Ser 


Ser 


1 








Pro 


Lys 


Cys 


Thr 








20 


Leu 


Glu 


Lys 


Asp 






35 




Glu 


Val 


Leu 


Leu 




50 






lie 


Arg 


Val 


He 


65 








Leu 


Arg 


Phe 


Gly 


Leu 


Val 


Lys 


Leu 








100 


Glu 


Pro 


Ala 


Cys 






115 




Pro 


Val 


Pro 


Glu 




130 






Glu 


Asp 


Val 


His 


145 








Gin 


He 


Gly 


Leu 


Lys 


Glu 


Glu 


Thr 








ISO 


Gly 


Glu 


Cys 


He 






195 




He 


Arg 


Asp 


Val 




210 






Lys 


Thr 


Ser 


Arg 


225 








Ser 


Met 


Arg 


Leu 


Val 


Met 


Asp 


Lys 








260 


Tyr 


Met 


Gly 


Gly 






275 




Leu 


Leu 


Tyr 


Cys 




290 






Lys 


Lys 


Met 


Leu 


305 








Val 


Leu 


Leu 


Phe 



Gin Gin Asn Asn 
5 

Ser Ser Ser Leu 

Phe Glu Glu Leu 
40 

Glu Phe Ala Tyr 
55 

Gly Asp Glu Gin 
70 

Lys Lys Lys Thr 
85 

Ala Asn Ala He 

Thr Cys Ser Arg 
120 

Ser Tyr Lys Asn 
135 

Tyr He Asp He 
150 

Ser Val Leu Gly 
165 

Pro Phe Phe Ala 

Glu Asp Leu Lys 
200 

Phe Asn Leu Leu 
215. 

Ser Cys He Asn 
230 

Val Phe Arg Pro 
245 

Leu Ser Lys Glu 

Glu His Gin His 
280 

He Thr Ala Leu 
295 

Thr Asn Asp Glu 
310 

Phe His He He 
325 



Met Cys Thr Thr 
10 

Ser Leu Glu Glu 
25 

Gly He Pro Leu 

Lys He Leu Asn 
60 

Gly Asp Val Cys 

75 

Phe Asn Pro Gin 
90 

Ala Leu Ser Met 
105 

Met Met Thr Thr 

Val Asn Arg Asn 
140 

Asn Phe Gin Ser 
155 

Lys Asn Asp Val 
170 

Pro Phe Asn Lys 
185 

Tyr Asp Ser Glu 

Gly Glu Met Pro 
220 

Pro Ser His Asn 
235 

Met Tyr Trp Arg 
250 

Gin Asp Ser Ala 

265 

Cys He He Gly 

Cys Phe Ser Ser 
300 

He Lys Gin Leu 
315 

Cys Pro He He 
330 



Glu 


He 


Leu 


Leu 






15 




Ser 


Val 


Asp 


Tyr 




30 






Val 


Glu 


Gly 


Lys 


45 








Lys 


Arg 


Asp 


Thr 


Ser 


Val 


Phe 


Phe 








80 


Thr 


Lys 


Met 


Trp 






95 




Gly 


Val 


Val 


Pro 




110 






Ala 


Lys 


Lys 


He 


125 








He 


Gin 


Lvs 
J 


Phe 


Phe 


Val 


Arg 


Glu 








160 


Gin 


Lys 


Lys 


Lys 






175 




Ser 


Lys 


He 


Gly 




190 






Ser 


Val 


Ser 


He 


205 








Thr 


Glu 


Asp 


Val 


Asp 


Thr 


Asn 


Pro 








240 


Asn 


Ser 


Lys 


Leu 






255 




Leu 


He 


Glu 


Lys 




270 






Gly 


Arg 


Asn 


Val 


285 








Asp 


Cys 


Gly 


Phe 


He 


Trp 


Tyr 


Leu 








320 


Gin 


Ser 


Lys 








335 





<210> 188 
<211> 3627 
<212> DNA 
<213> SHRIMP 

<400> 188 

atgggtgtcc cagaagccaa aaaagtgtac 
gtcatcaagg agaaaactgg atatgaagat 
tgttcaggag aagaagattg cacaactagc 
atcaactcca agaacttcct ggattttgga 



gaaaatgcgt acggtgcaca gaatgggaga 60 
tgttacgatg acgaagacga cgaggattat 120 
tcacttctca aagccacttc tcttgccaac 180 
agaggcaaga aatcttcctc ttcttcacct 240 
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acctgtgact 
atgttgggca 
atcctgttta 
gttatcaggg 
aatttttact 
cacgagaaaa 
gaagttgatt 
aatcacaaaa 
atgaagaaga 
ttcaattctt 
cagatggagg 
ctcaagtcta 
gccatcaaga 
gcaaaatcaa 
tgcgagcgga 
atgaggatcc 
ttcaccgaag 
cctgggaaac 
atgtctggta 
cacaccaacg 
ctggtgaaat 
caatttaagg 
aacagtaatt 
agaacaaagc 
agtacactat 
tgtgagtacg 
tgtgtgggcg 
tgtgatgata 
tgtgataaga 
atagattttg 
gaaactgacg 
aatgtcatgt 
accttatcag 
gggtcggatg 
agtggtaatg 
agtgcacctg 
gagagccaca 
gatgtttttg 
ggaggaggaa 
actgaacaaa 
gatgatgtag 
gatctagaaa 
gactttttag 
aatatagaag 
cccgtgatgg 
cttgacaata 
gacgccaata 
ggcatcgacg 
gttatcgtgt 
atgcccataa 
agaactaatt 
aaaaagactc 
ttggacgatt 
actctcaata 
caattcatgc 
tcttcattct 
cccaa ttctc 



acacactcga 
ggcaaattgc 
tcaatactac 
aagaagatac 
gcagggatga 
ttaaggctag 
ggctaactga 
tggacgacta 
ggaagttttt 
ccccgacagc 
gggtcatctt 
aaaaccttaa 
tgatgacgag 
ggaatgaaaa 
tgatgatgcg 
ctgttctaaa 
cagaggatga 
acttctccct 
acagaaaaga 
atgaaaatca 
ggaacctgat 
actctgaaac 
cttcgggtga 
caaatacgat 
tgattatgct 
acacatgggt 
gatgggagat 
gtgttggtaa 
aactagtact 
tacgagaccc 
ttagtttgaa 
catatttgac 
atatcagtat 
tagaatctga 
accctcaaaa 
ttactaaaaa 
ctgaaggtgg 
gtaataattt 
tagtatcgtc 
ataacagcac 
aggaggagga 
ctgacgactg 
aagctgtaac 
aggatgaaca 
atgaagaaaa 
taacgtacaa 
agagagaaga 
atgaaagatt 
atatgaacat 
tacatagtga 
ttatgtctgc 
ccagaatcaa 
ttaaagacaa 
aagagggaaa 
cagtgctcgg 
acctccagca 
ctcgtcctga 



catggttga t 
aaccaccatg 
aaatcaacag 
tatgcattca 
aaagaaggtt 
aattgaaaga 
acctccaatc 
tgagggtctc 
gactaacagt 
agttttgaaa 
gtactacgct 
agggaggtat 
agccaaacga 
ttgcctcaat 
tcttcctcaa 
gactacgggg 
tatcgtacgt 
actagaaaag 
atccaagagg 
aacactttct 
ggttggacat 
gttcttaaag 
aaatcaagca 
atataatttg 
aactaaagga 
aaggaaagaa 
ttttttgagt 
tatttctatg 
tcctttctgg 
gcttcagttg 
acacgcactt 
aatggggtcc 
acatctttcc 
tactgaagat 
atctacccgt 
attatcatcg 
tataattaat 
agatacgaat 
aataccaggt 
gagtgacttt 
ggaaaaaatg 
gttagatgag 
tgcagccacg 
ttcaagatgt 
gtgggcagaa 
tgacaatagt 
cgggaacaat 
ctctggtgtt 
gtgcggtcta 
attttgtggt 
tgcactagaa 
gcaaataaaa 
ggtatcgaca 
aataccgagt 
cgtggccata 
tcagcagcgg 
acagtaa 



ttgcccactt 
ttgaagggac 
ataatagacg 
cggatgcaga 
atttctgaat 
gtgcctatta 
gaagacatga 
gattactgga 
ttcttgttca 
tccagattca 
tttaggatga 
actgtactct 
cagatcagac 
aggaagacca 
ggtctgatgg 
agcaaatgca 
ttaacatcat 
aggaagatat 
tgggaagatt 
atgaatcttt 
cataaaaacg 
attggccatg 
aacgagttgg 
gcagaatcgc 
tcagactaca 
gttgcagtat 
gaacaagaag 
gggaatttgt 
acaatcaaat 
tgttttcctc 
catagagccc 
tttaatcaac 
ggtgctaata 
cttatcccat 
aagaaaagta 
tcagttttcg 
gataggggaa 
ccagaagcat 
ctatctactg 
ttagat t tct 
gaggaaggtg 
gcaaggaaag 
aacgaaatga 
tctgtatcat 
attgttaacg 
ttattatcac 
agtaacgtgg 
ccttatagtg 
gaggataata 
aaaacagaag 
tacacaatgt 
cgcaagaact 
tgcacagaca 
gaaaacacta 
tctaagccat 
caagacgtat 



ataatgtatc 
agaaaaatat 
ttttacatga 
acaagaaaca 
tcttttctag 
taatcccttc 
tgatggctcc 
tcaacaaaca 
ggaacgtgcc 
aagatgcatt 
tccgagtgat 
ttacggatgg 
aagaaagaag 
acga tctact 
catctgcatt 
tgtacctttc 
gccttttgaa 
tggagtacga 
tattgaatgt 
tctcccacga 
tgtgtcgtct 
tcaagttctt 
gtggttttgc 
cgctcatgct 
a tagtgcaat 
ttgaaaacac 
ctaggaagaa 
ctaaaagtaa 
ttttctatct 
caacacactt 
tggctgtaaa 
gtgtttttgg 
a taatgagag 
tt tctaataa 
aggttaatgc 
aatcaattcg 
tattgactaa 
tgggagaaga 
agcagacgtc 
ttaaaaaatt 
aaaaagagga 
cgttcgagta 
cgtcatcgct 
caaaacttaa 
aattcgacaa 
gtctaagtgg 
tgctctatga 
taaaaaccat 
ctattgtgta 
aggataaaat 
tgcagtatat 
gggagaggat 
actataa taa 
aatggctacc 
ggtcgcccct 
cactaactaa 



agatctggtc 
gggtcagatg 
cggttttaat 
tatttatgaa 
aaaatataaa 
atcccaagag 
tccagtttct 
cacagatgtt 
tactacttca 
ttttgctagc 
gaaaaatctc 
aaaggctcca 
taaggaaaag 
attctactcg 
actggacata 
taatgcatca 
tctggagact 
ttcttataat 
tttgaagcaa 
ttctgatgtg 
tactggtaca 
taggtgcatg 
agctaaaaga 
ttcacctgaa 
tgttagtaac 
gtactgtact 
taataaagac 
ttgtagaaaa 
gtctcaagct 
gattgatttg 
tgcagcggca 
tacaataaca 
taaaaataca 
taaacgtaaa 
aaccagaaaa 
tggatttttt 
agaaaggatt 
aaatggagga 
aatcttaaaa 
caatgaaatg 
agaggaggca 
taaagattct 
cgctaaaaat 
caataaacaa 
atgcatttct 
agttttaatg 
accagttcaa 
gaacctttta 
ccagcaatta 
atgcacagat 
gcctgaacta 
tccaaaagta 
acttttggca 
ctctcagggg 
cacgctgtgg 
cataactcct 



300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3627 



<210> 189 
.<211> 1204 
<212> PRT 
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<213> SHRIMP 
<400> 189 



Met 


Gly 


Val 


Pro 


Glu 


Ala 


Lys 


Lys 


Val 


Tyr 


Glu 


Asn 


Ala 


Tyr 


Gly 


Ala 


1 








5 










10 










15 




Gin 


Asn 


Gly 


Arg 


Val 


He 


Lys 


Glu 


Lys 


Thr 


Gly 


Tyr 


Glu 


Asp 


Cys 


Tyr 








20 










25 










30 






Asp 


Asp 


Glu 


Asp 


Asp 


Glu 


Asp 


Tyr 


Cys 


Ser 


Gly 


Glu 


Glu 


Asp 


Cys 


Thr 






35 










40 










45 








Thr 


Ser 


Ser 


Leu 


Leu 


Lys 


Ala 


Thr 


Ser 


Leu 


Ala 


Asn 


He 


Asn 


Ser 


Lys 




50 










55 










60 










Asn 


Phe 


Leu 


Asp 


Phe 


Gly Arg 


Gly 


Lys 


Lys 


Ser 


Ser 


Ser 


Ser 


Ser 


Pro 


65 










70 










75 










80 


Thr 


Cys 


Asp 


Tyr 


Thr 


Leu 


Asp 


Met 


Val 


Asp 


Leu 


Pro 


Thr 


Tyr 


Asn 


Val 










85 










90 










95 




Ser 


Asp 


Leu 


Val 


Met 


Leu 


Gly 


Arg 


Gin 


He 


Ala 


Thr 


Thr 


Met 


Leu 


Lys 








100 










105 










110 






Gly 


Gin 


Lys 


Asn 


Met 


Gly Gin 


Met 


He 


Leu 


Phe 


He 


Asn 


Thr 


Thr 


Asn 






115 










120 










125 








Gin 


Gin 


He 


He 


Asp 


Val 


Leu 


His 


Asp 


Gly 


Phe 


Asn 


Val 


He 


Arg 


Glu 




130 










135 










140 










Glu 


Asp 


Thr 


Met 


His 


Ser 


Arg 


Met 


Gin 


Asn 


Lys 


Lys 


His 


He 


Tyr 


Glu 


145 










150 










155 










160 


Asn 


Phe 


Tyr 


Cys 


Arg 


Asp 


Glu 


Lys 


Lys 


Val 


He 


Ser 


Glu 


Phe 


Phe 


Ser 










165 










170 










175 




Arg 


Lys 


Tyr 


Lys 


His 


Glu 


Lys 


He 


Lys 


Ala 


Arg 


He 


Glu 


Arg 


Val 


Pro 








180 










185 










190 






lie 


He 


He 


Pro 


Ser 


Ser 


Gin 


Glu 


Glu 


Val 


Asp 


Trp 


Leu 


Thr 


Glu 


Pro 






195 










200 










205 








Pro 


He 


Glu 


Asp 


Met 


Met 


Met 


Ala 


Pro 


Pro 


Val 


Ser 


Asn 


His 


Lys 


Met 




210 










215 










220 










Asp 


Asp 


Tyr 


Glu 


Gly 


Leu 


Asp 


Tyr 


Trp 


He 


Asn 


Lys 


His 


Thr 


Asp 


Val 


225 










230 










235 










240 


Met 


Lys 


Lys 


Arg 


Lys 


Phe 


Leu 


Thr 


Asn 


Ser 


Phe 


Leu 


Phe 


Arg 


Asn 


Val 










245 










250 










255 




Pro 


Thr 


Thr 


Ser 


Phe 


Asn 


Ser 


Ser 


Pro 


Thr 


Ala 


Val 


Leu 


Lys 


Ser 


Arg 








260 










265 










270 






Phe 


Lys 


Asp 


Ala 


Phe 


Phe 


Ala 


Ser 


Gin 


Met 


Glu 


Gly 


Val 


He 


Leu 


Tyr 






275 










280 










285 








Tyr 


Ala 


Phe 


Arg 


Met 


He 


Arg 


Val 


Met 


Lys 


Asn 


Leu 


Leu 


Lys 


Ser 


Lys 




290 










295 










300 










Asn 


Leu 


Lys 


Gly 


Arg 


Tyr 


Thr 


Val 


Leu 


Phe 


Thr 


Asp 


Gly 


Lys 


Ala 


Pro 


305 










310 










315 










320 


Ala 


He 


Lys 


Met 


Met 


Thr 


Arg 


Ala 


Lys 


Arg 


Gin 


He 


Arg 


Gin 


Glu 


Arg 










325 










330 










335 




Ser 


Lys 


Glu 


Lys 


Ala 


Lys 


Ser 


Arg 


Asn 


Glu 


Asn 


Cys 


Leu 


Asn 


Arg 


Lys 








340 










345 










350 






Thr 


Asn 


Asp 


Leu 


Leu 


Phe 


Tyr 


Ser 


Cys 


Glu 


Arg 


Met 


Met 


Met 


Arg 


Leu 






355 










360 










365 








Pro 


Gin 


Gly 


Leu 


Met 


Ala 


Ser 


Ala 


Leu 


Leu 


Asp 


He 


Met 


Arg 


He 


Pro 




370 










375 










380 










Val 


Leu 


Lys 


Thr 


Thr 


Gly 


Ser 


Lys 


Cys 


Met 


Tyr 


Leu 


Ser 


Asn 


Ala 


Ser 


385 










390 










395 










400 


Phe 


Thr 


Glu 


Ala 


Glu 


Asp 


Asp 


He 


Val 


Arg 


Leu 


Thr 


Ser 


Cys 


Leu 


Leu 










405 










410 










415 




Asn 


Leu 


Glu 


Thr 


Pro 


Gly 


Lys 


His 


Phe 


Ser 


Leu 


Leu 


Glu 


Lys 


Arg 


Lys 








420 










425 










430 






lie 


Tyr 


Asp 


Ser 


Tyr 


Asn 


Met 


Ser 


Gly 


Asn 


Arg 


Lys 


Glu 


Ser 


Lys 


Arg 






435 










440 










445 








Trp 


Glu 


Asp 


Leu 


Leu 


Asn 


Val 


Leu 


Lys 


Gin 


His 


Thr 


Asn 


Asp 


Glu 


Asn 



450 455 460 
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Gin 


Thr 


Leu 


Ser 


Met 


Asn 


Leu 


Phe 


Ser 


His 


Asp 


Ser 


Asp 


Val 


Leu 


Val 


465 










470 










475 










480 


Lys 


Trp 


Asn 


Leu 


Met 


Val 


Gly 


His 


His 


Lys 


Asn 


Val 


Cys 


Arg 


Leu 


Thr 










485 










490 










495 




Gly 


Thr 


Gin 


Phe 


Lys 


Asp 


Ser 


Glu 


Thr 


Phe 


Leu 


Lys 


He 


Gly 


His 


Val 








500 










505 










510 






Lys 


Phe 


Phe 


Arg 


Cys 


Met 


Asn 


Ser 


Asn 


Ser 


Ser 


Gly 


Glu 


Asn 


Gin 


Ala 






515 










520 










525 








Asn 


Glu 


Leu 


Gly 


Gly 


Phe 


Ala 


Ala 


Lys 


Arg 


Arg 


Thr 


Lys 


Pro 


Asn 


Thr 




530 










535 










540 










He 


Tyr 


Asn 


Leu 


Ala 


Glu 


Ser 


Pro 


Leu 


Met 


Leu 


Ser 


Pro 


Glu 


Ser 


Thr 


545 








550 










555 










560 


Leu 


Leu 


He 


Met 


Leu 


Thr 


Lys 


Gly 


Ser 


Asp 


Tyr 


Asn 


Ser 


Ala 


He 


Val 










565 










570 










575 




Ser 


Asn 


Cys 


Glu 


Tyr 


Asp 


Thr 


Trp 


Val 


Arg 


Lys 


Glu 


Val 


Ala 


Val 


Phe 








580 










585 










590 






Glu 


Asn 


Thr 


Tyr 


Cys 


Thr 


Cys 


Val 


Gly 


Gly 


Trp 


Glu 


He 


Phe 


Leu 


Ser 






595 










600 










605 








Glu 


Gin 


Glu 


Ala 


Arg 


Lys 


Asn 


Asn 


Lys 


Asp 


Cys 


Asp 


Asp 


Ser 


Val 


Gly 




610 










615 










620 










Asn 


He 


Ser 


Met 


Gly 


Asn 


Leu 


Ser 


Lys 


Ser 


Asn 


Cys 


Arg 


Lys 


Cys 


Asp 


625 










630 










635 










640 


Lys 


Lys 


Leu 


Val 


Leu 


Pro 


Phe 


Trp 


Thr 


He 


Lys 


Phe 


Phe 


Tyr 


Leu 


Ser 










645 










650 










655 




Gin 


Ala 


He 


Asp 


Phe 


Val 


Arg 


Asp 


Pro 


Leu 


Gin 


Leu 


Cys 


Phe 


Pro 


Pro 








660 










665 










67 0 






Thr 


His 


Leu 


He 


Asp 


Leu 


Glu 


Thr 


Asp 


Val 


Ser 


Leu 


Lys 


His 


Ala 


Leu 






675 










680 










685 








His 


Arg 


Ala 


Val 


Asn 


Ala 


Ala 


Ala 


Asn 


Val 


Met 


Ser 


Tyr 


Leu 


Thr 


Met 




690 










695 










700 










Gly 


Ser 


Phe 


Asn 


Gin 


Arg 


Val 


Phe 


Gly 


Thr 


He 


Thr 


Thr 


Leu 


Ser 


Asp 


705 










710 










715 










720 


He 


Ser 


He 


His 


Leu 


Ser 


Gly 


Ala 


Asn 


Asn 


Asn 


Glu 


Ser 


Lys 


Asn 


Thr 










725 










730 










735 




Gly 


Ser 


Asp 


Val 


Glu 


Ser 


Asp 


Thr 


Glu 


Asp 


Leu 


He 


Pro 


Phe 


Ser 


Asn 








740 










745 










750 






Asn 


Lys 


Arg 


Lys 


Ser 


Gly 


Asn 


Asp 


Pro 


Gin 


Lys 


Ser 


Thr 


Arg 


Lys 


Lys 






755 










760 










765 








Ser 


Lys 


Val 


Asn 


Ala 


Thr 


Arg 


Lys 


Ser 


Ala 


Pro 


Val 


Thr 


Lys 


Lys 


Leu 




770 










775 










780 










Ser 


Ser 


Ser 


Val 


Phe 


Glu 


Ser 


He 


Arg 


Gly 


Phe 


Phe 


Glu 


Ser 


His 


Thr 


785 










790 










795 










800 


Glu 


Gly 


Gly 


He 


He 


Asn 


Asp 


Arg 


Gly 


He 


Leu 


Thr 


Lys 


Glu 


Arg 


He 










805 










810 










815 




Asp 


Val 


Phe 


Gly 


Asn 


Asn 


Leu 


Asp 


Thr 


Asn 


Pro 


Glu 


Ala 


Leu 


Gly 


Glu 








820 










825 










830 






Glu 


Asn 


Gly 


Gly 


Gly 


Gly 


Gly 


He 


Val 


Ser 


Ser 


He 


Pro 


Gly 


Leu 


Ser 






835 










840 










845 








Thr 


Glu 


Gin 


Thr 


Ser 


He 


Leu 


Lys 


Thr 


Glu 


Gin 


Asn 


Asn 


Ser 


Thr 


Ser 




850 










855 










860 










Asp 


Phe 


Leu 


Asp 


Phe 


Phe 


Lys 


Lys 


Phe 


Asn 


Glu 


Met 


Asp 


Asp 


Val 


Glu 


865 










870 










875 










880 


Glu 


Glu 


Glu 


Glu 


Lys 


Met 


Glu 


Glu 


Gly 


Glu 


Lys 


Glu 


Glu 


Glu 


Glu 


Ala 










885 










890 










895 




Asp 


Leu 


Glu 


Thr 


Asp 


Asp 


Trp 


Leu 


Asp 


Glu 


Ala 


Arg 


Lys 


Ala 


Phe 


Glu 








900 










905 










910 






Tyr 


Lys 


Asp 


Ser 


Asp 


Phe 


Leu 


Glu 


Ala 


Val 


Thr 


Ala 


Ala 


Thr 


Asn 


Glu 






915 










920 










925 








Met 


Thr 


Ser 


Ser 


Leu 


Ala 


Lys 


Asn 


Asn 


He 


Glu 


Glu 


Asp 


Glu 


His 


Ser 




930 










935 










940 










Arg 


Gys 


Ser 


Val 


Ser 


Ser 


Lys 


Leu 


Asn 


Asn 


Lys 


Gin 


Pro 


Val 


Met 


Asp 
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945 950 955 960 

Glu Glu Lys Trp Ala Glu He Val Asn Giu Phe Asp Lys Cys He Ser 

965 970 975 

Leu Asp Asn He Thr Tyr Asn Asp Asn Ser Leu Leu Ser Arg Leu Ser 

980 985 990 

Gly Val Leu Met Asp Ala Asn Lys Arg Glu Asp Gly Asn Asn Ser Asn 

995 1000 1005 

Val Val Leu Tyr Glu Pro Val Gin Gly He Asp Asp Glu Arg Phe Ser 

1010 1015 1020 

Gly Val Pro Tyr Ser Val Lys Thr Met Asn Leu Leu Val He Val Tyr 
1025 1030 1035 1040 

Met Asn Met Cys Gly Leu Glu Asp Asn Thr He Val Tyr Gin Gin Leu 

1045 1050 1055 

Met Pro He He His Ser Glu Phe Cys Gly Lys Thr Glu Glu Asp Lys 

1060 1065 1070 

He Cys Thr Asp Arg Thr Asn Phe Met Ser Ala Ala Leu Glu Tyr Thr 

1075 1080 1085 

Met Leu Gin Tyr Met Pro Glu Leu Lys Lys Thr Pro Arg He Lys Gin 

1090 1095 1100 

He Lys Arg Lys Asn Trp Glu Arg He Pro Lys Val Leu Asp Asp Phe 
1105 1110 1115 1120 

Lys Asp Lys Val Ser Thr Cys Thr Asp Asn Tyr Asn Lys Leu Leu Ala 

1125 1130 1135 

Thr Leu Asn Lys Glu Gly Lys He Pro Ser Glu Asn Thr Lys Trp Leu 

1140 1145 H50 

Pro Ser Gin Gly Gin Phe Met Pro Val Leu Gly Val Ala He Ser Lys 

1155 1160 1165 

Pro Trp Ser Pro Leu Thr Leu Trp Ser Ser Phe Tyr Leu Gin His Gin 

1170 1175 1180 

Gin Arg Gin Asp Val Ser Leu Thr Asn He Thr Pro Pro Asn Ser Pro 
1185 1190 1195 1200 

Arg Pro Glu Gin 



<210> 190 
<211> 414 
<212> DNA 
<213> SHRIMP 



<400> 190 

atggatggag 

gagacactta 

aattctattg 

ttccctactg 

attaagaaga 

ttgctgcaga 

ggtaaaaaca 



attcttcctc 
aggaacaagg 
agactgtgaa 
aatctataac 
ggaaactcgc 
ttccaagcac 
ccagactacg 



<210> 191 
<2H> 137 
<212> PRT 
<213> SHRIMP 



cttacagtta 
tgtgtgggaa 

actctacaaa 
ttctggcaag 
tacaaccttt 
cagagtgtgt 



ttgagtgaat 
ttggccctag 
gactacactg 
caacaacagg 
tctccgagaa 
aaagcagctc 
aaaaattacg 



ccgaatttga 
aagtgttcaa 
ttcttcgatc 
aagaggaaga 
gtctctgtag 
cacgaagttc 
gcgcccatca 



ttacgtggtc 60 
cgaggtgtcc 120 
tagaaactac 180 
aagtacccct 240 
agagctgcgt 300 
ttctaggagg 360 
gtga 414 



<400> 191 

Met Asp Gly Asp Ser Ser Ser Leu Gin Leu Leu Ser Glu Ser Glu Phe 

15 10 15 

Asp Tyr Val Val Glu Thr Leu Lys Glu Gin Gly Val Trp Glu Leu Ala 

20 25 30 

Leu Glu Val Phe Asn Glu Val Ser Asn Ser He Glu Thr Val Lys Glu 
35 40 45 
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Glu Glu Asp Tyr 
50 

Ser lie Thr Leu 
65 

lie Lys Lys Arg 

Arg Glu Leu Arg 
100 

Ala Pro Arg Ser 
115 

Val Cys Lys Asn 
130 



Thr Val Leu Arg 
55 

Tyr Lys Gin Gin 
70 

Lys .Leu Ala Ser 
85 

Leu Leu Gin He 

Ser Ser Arg Arg 
120 

Tyr Gly Ala His 
135 



Ser Arg Asn Tyr 
60 

Gin Glu Glu Glu 
75 

Gly Lys Ser Pro 
90 

Pro Ser Thr Thr 
105 

Gly Lys Asn Thr 
Gin 



Phe Pro Thr Glu 

Glu Ser Thr Pro 
80 

Arg Ser Leu Cys 
95 

Thr Phe Lys Ala 
110 

Arg Leu Arg Arg 
125 



<210> 192 
<211> 924 
<212> DNA 
<213> SHRIMP 



<400> 192 

atgtggtgtt 

cttttcagaa 

tgtcaagttc 

tggttccaga 

gtcgacccag 

ctatctcaaa 

agctcgaccg 

accagaaatg 

tttctggaga 

ttttactctt 

gggtggaccg 

atgtcagatt 

ttctggacca 

ccccctttta 

cacaaccacc 

tggagtagtc 



ctacacacct 
attttttcag 
cgagggtgga 
aacgtacatt 
aaatgagagg 
aacttttcaa 
aatgtcaagt 
taatagctcc 
ctggtcgacc 
cctccatctc 
ctagctcgac 
acaccagaaa 
ggtgtttctg 
aaaagggccc 
accaccacca 
ttctttgact 



ttcatatagt 
ggcactcgag 
cctctgggtc 
aacttttgtg 
gtataacctt 
aaattttttc 
tccgagggtg 
agaaacgtac 
cagaaatgag 
aaaaactttt 
cgaatgtcaa 
tgtaatagct 
gacgaagata 
acgtgtatat 
tggaagaaca 
ttag 



gagtttttta 
tttagggggt 
gggccaatgt 
tgtttctgga 
ctgt tt tcac 
agggcactcc 
gaccgctggg 
attaactttt 
agggta tacc 
caaaaatttt 
gttccgaggg 
ccagaaa tag 
t tgcttgatg 
aatgactgta 
tctatccttc 



ctcctcttca 
ggaccgctag 
cagattacac 
acaggcgttt 
tagagaattt 
agtttagggg 
tcgggccaat 
gtgtgtttct 
cttctgtttt 
ttagggcgct 
tggacctctg 
aggaagtatc 
gcaaccctct 
ccacacctca 
aacaaaccct 



atctcaaaaa 
ctcgaccgaa 
cagaaattgt 
ctggagactg 
tactcttcct 
gtggaccgct 
gtcagattac 
ggaacaggcg 
cactagagaa 
ccagtttagg 
ggtcgggcca 
ttatggccat 
ccccctccct 
ttcaaaccat 
ctccagaaaa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

924 



<210> 193 
<211> 305 
<212> PRT 
<213> SHRIMP 



<400> 193 



Met 


Trp 


Cys 


Ser 


Thr 


His 


Leu 


Ser 


Tyr 


Ser 


Glu 


Phe 


Phe 


Thr 


Pro 


Leu 


1 








5 










10 










15 




Gin 


Ser 


Gin 


Lys 
20 


Leu 


Phe 


Arg 


Asn 


Phe 
25 


Phe 


Arg 


Ala 


Leu 


Glu 

30 


Phe 


Arg 


Gly 


Trp 


Thr 

35 


Ala 


Ser 


Ser 


Thr 


Glu 
40 


Cys 


Gin 


Val 


Pro 


Arg 
45 


Val 


Asp 


Leu 


Trp 


Val 
50 


Gly 


Pro 


Met 


Ser 


Asp 
55 


Tyr 


Thr 


Arg 


Asn 


Cys 
60 


Trp 


Phe 


Gin 


Lys 


Arg 


Thr 


Leu 


Thr 


Phe 


Val 


Cys 


Phe 


Trp 


Asn 


Arg 


Arg 


Phe 


Trp 


Arg 


Leu 


65 










70 










75 










80 


Val 


Asp 


Pro 


Glu 


Met 
85 


Arg 


Gly 


Tyr 


Asn 


Leu 
90 


Leu 


Phe 


Ser 


Leu 


Glu 
95 


Asn 


Phe 


Thr 


Leu 


Pro 
100 


Leu 


Ser 


Gin 


Lys 


Leu 
105 


Phe 


Lys 


Asn 


Phe 


Phe 
110 


Arg 


Ala 


Leu 


Gin 


Phe 
115 


Arg 


Gly 


Trp 


Thr 


Ala 
120 


Ser 


Ser 


Thr 


Glu 


Cys 
125 


Gin 


Val 


Pro 


Arg 


Val 
130 


Asp 


Arg 


Trp 


Val 


Gly 
135 


Pro 


Met 


Ser 


Asp 


Tyr 
140 


Thr 


Arg 


Asn 


Val 
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lie 


Ala 


Pro 


Glu 


Thr 


Tyr 


He 


Asn 


Phe 


Cys 


Val 


Phe 


Leu 


Glu 


Gin 


Ala 


145 










150 










155 










160 


Phe 


Leu 


Glu 


Thr 


Gly 


Arg 


Pro 


Arg 


Asn 


Glu 


Arg 


Val 


Tyr 


Pro 


Ser 


Val 










165 










170 










175 




Phe 


Thr 


Arg 


Glu 


Phe 


Tyr 


Ser 


Ser 


Ser 


He 


Ser 


Lys 


Thr 


Phe 


Gin 


Lys 








180 










185 










190 






Phe 


Phe 


Arg 


Ala 


Leu 


Gin 


Phe 


Arg 


Gly 


Trp 


Thr 


Ala 


Ser 


Ser 


Thr 


Glu 






195 










200 










205 








Cys 


Gin 


Val 


Pro 


Arg 


Val 


Asp 


Leu 


Trp 


Val 


Gly 


Pro 


Met 


Ser 


Asp 


Tyr 




210 










215 










220 










Thr 


Arg 


Asn 


Val 


He 


Ala 


Pro 


Glu 


He 


Glu 


Glu 


Val 


Ser 


Tyr 


Gly 


His 


225 










230 










235 










240 


Phe 


Trp 


Thr 


Arg 


Cys 


Phe 


Trp 


Thr 


Lys 


He 


Leu 


Leu 


Asp 


Gly 


Asn 


Pro 










245 










250 










255 




Leu 


Pro 


Leu 


Pro 


Pro 


Pro 


Phe 


Lys 


Lys 


Gly 


Pro 


Arg 


Val 


Tyr 


Asn 


Asp 








260 










265 










270 






Cys 


Thr 


Thr 


Pro 


His 


Ser 


Asn 


His 


His 


Asn 


His 


His 


His 


His 


His 


Gly 






275 










280 










285 








Arg 


Thr 


Ser 


He 


Leu 


Gin 


Gin 


Thr 


Leu 


Ser 


Arg 


Lys 


Trp 


Ser 


Ser 


Leu 



290 295 300 



Leu 
305 



<210> 194 
<211> 447 
<212> DNA 
<213> SHRIMP 



<400> 194 

atgacccacc 

accccatatc 

ccccacctta 

caga tctgga 

ctagcggtgt 

agaagagcaa 

aagctacaag 

ctaagttact 



ttgtccttct 
tctcacccca 
ttcactcgct 
ccccagtctt 
atataaggcg 
cacaacaagc 
aatggcctct 
tagttctgag 



tatcctatcc 
tctcacctat 
ccagtttcaa 
cccttttatc 
gcgcggccag 
actctctctc 
ccagcccccg 
gaactaa 



ttatccttgt 
acccccatct 
caccctgttc 
cctaaccggc 
gccagaagca 
cttctaccta 
ccgcaccaag 



<210> 195 
<211> 146 
<212> PRT 
<213> SHRIMP 



cccctgtcta 
cacccatcac 
ttgccgagcc 
accatt ta tg 
tcagttctct 
gaagagacct 
tccttacacc 



ccaccacctc 
ctctatattc 
aacccataac 
cccccaggcg 
gcaagccagc 
gccaatactc 
atgttggact 



60 

120 

180 

240 

300 

360 

420 

447 



<400> 195 
Met Thr His 
1 

Tyr His His 

He Ser Pro 
35 

Phe Gin His 
50 

Pro Val Phe 
65 

Val Tyr He 

Ala Ser Arg 

Arg Asp Leu 
115 



Leu Val Leu 
5 

Leu Thr Pro 
20 

He Thr Ser 

Pro Val Leu 

Pro Phe He 
70 

Arg Arg Arg 
85 

Arg Ala Thr 
100 

Pro He Leu 



Leu He Leu 

Tyr Leu Ser 
25 

He Phe Pro 
40 

Ala Glu Pro 

55 

Pro Asn Arg 

Gly Gin Ala 

Gin Gin Ala 
105 

Lys Leu Gin 
120 



Ser Leu 
10 

Pro His 

His Leu 

Thr His 

His His 

75 
Arg Ser 
90 

Leu Ser 
Glu Trp 



Ser Leu 

Leu Thr 

He His 
45 

Asn Gin 
60 

Leu Cys 

He Ser 

Leu Leu 

Pro Leu 
125 



Ser Pro Val 
15 

Tyr Thr Pro 

30 

Ser Leu Gin 

He Trp Thr 

Pro Gin Ala 
80 

Ser Leu Gin 
95 

Leu Pro Arg 
110 

Gin Pro Pro 
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Pro His Gin Val Leu Thr Pro Cys Trp Thr Leu Ser Tyr Leu Vai Leu 

130 135 140 

Arg Asn 
145 



<210> 196 
<211> 339 
<212> DNA 
<213> SHRIMP 



<400> 196 

atgttggact 

actagctctc 

aagatcttcc 

attttctatg 

cctttactct 

cccgcctcac 



ctaagttact tagttctgag gaactaaagg aactaacttc atacgtctcg 60 
gccggtctga tatgaagaaa cacttgctcc atctattcga ggagcacgag 120 
aattcataca aggtaagcac aagttctcac tatacacttt ggactttgaa 180 
ttatgctgaa tattttgttg gttgaagtga aaaatattct aagtccaatt 240 
ttgacagaaa tctccaacca gtacggagac tatggatgtt tcacaatggc 300 
ctgaacgctg cagccgatct cttggataa 339 



<210> 197 
<211> 110 
<212> PRT 
<213> SHRIMP 



<400> 197 

Met Leu Asp Ser Lys Leu Leu Ser Ser Glu Glu Leu Lys Glu Leu Thr 

15 10 15 

Ser Tyr Val Ser Thr Ser Ser Arg Arg Ser Asp Met Lys Lys His Leu 

20 25 30 

Leu His Leu Phe Glu Glu His Glu Lys lie Phe Gin Phe lie Gin Gly 

35 40 45 

Lys His Lys Phe Ser Leu Tyr Thr Leu Asp Phe Glu lie Phe Tyr Val 

50 55 60 

Met Leu Asn lie Leu Leu Val Glu Val Lys Asn lie Leu Ser Pro lie 
65 70 75 80 

Pro Leu Leu Phe Asp Arg Asn Leu Gin Pro Val Arg Arg Leu Trp Met 

85 90 95 

Phe His Asn Gly Pro Asp Glu Arg Cys Ser Arg Ser Leu Gly 
100 105 110 



<210> 198 

<211> 3438 

<212> DNA 

<213> SHRIMP 



<400> 198 

atggccccgc 

ctatctcccg 

aaggcactgg 

gttaaagcca 

ccccaacgcc 

atgggagtgt 

ccctctggac 

gtgtggagtt 

aggacagtct 

gtgtctaatg 

gtggctccaa 

tcgacctcta 

ggcggcggaa 

ggggacaatc 



ctcacctgaa 
aaggcgccca 
tgagcttttg 
tggagttcca 
ccgtgactga 
ctgcaaactt 
aagaagtcga 
tgcttgaaga 
tagtttgcac 
tgaacgtact 
gtaataataa 
ataatagtaa 
gatcaaagaa 
atggcatgt t 



cgctgcagcc 
aacttcgtcc 
ccagaagacc 
aggagacgat 
gaggcagatg 
ctctccagtc 
ctcatcaact 
gtgtggtagt 
cctgtttata 
taaccagttg 
tagtgtccca 
caacatcagt 
agttacagcc 
tgccggtgcc 



gatctcttgg 
ccggctgctt 
cgcttcacca 
tttaactact 
ttcgccctta 
tctgatgacg 
tccgctcaaa 
ggctctaata 
atccaagtgt 
tttggacatg 
tcaactgttg 
aacaagcgtg 
acagccaaaa 
cctgttgatg 



ataaggtgat 
gtgttggtgc 
ccaacattgt 
ctgccttgtg 
tgaagagtga 
tcatcaaccc 
tttctggtat 
gtaatagttc 
tcaagttttt 
ttgtttttgg 
ttaacaacaa 
ttggtggtag 
atccctttaa 
ttaatttgga 



gtccggacct 
gaaggt tgtg 
gatgagagaa 
tgcaagtatg 
ggacgaagaa 
t tcaagcctc 
gtttcaaaac 
ccctgtctct 
ggtgactaaa 
atcact tgat 
caacaaaccc 
taataacagt 
taa tgtagat 
tgactttg tt 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 
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tttccccaag ttgaaactct tacaagtaag 
gatgaagatt tgagtaaaat gtgccgtaaa 
tttaatgtgt tcatctctga gattaacccc 
ggatttttga ctgcatggga taagtttgta 
cgtaactata tcctcactcg ctcaaactat 
tcaattaagg ggactgttta ttataatgac 
aaccctgatc tggacaagag ctggggtaac 
aacttgatgg cattcatggg tattgacatt 
tatatgtttg caaagcaatt ccaacatccg 
gtatacgcaa aggttgccgc aggattgaag 
cctgaatacg aaaattgtca gtttatcaag 
gagttatacg ccaaacaact tctccgcaca 
ggacagagtg cattcccatt tgcaaatatt 
attcacggtg acacaatcat tgaattgatg 
tcaaaaattg aggacgaaaa catcatcaag 
agagtacaag ccgagaagac cctgtatttt 
agaacagagg aagaattctt caggcctaca 
gtaccttcta atgtggagga agaggaagag 
gaggaagtag aaagggaaga aggatctgat 
gaagaaatgg aggaggagaa ggaagaagaa 
ggtaatgaga accaagaaga agaacaacaa 
aataaggatg cagatagtga cagcgacagt 
agcagcagta gtagtagtag cagcagtagt 
gctgaaaaga agaaagaaga ggaagtacct 
gaaaggccat cagaagctgc ttcctctccc 
caacaactat caccatcatt ggacatactc 
attcctgcgc ctgagcctat cgttgctaca 
aagacaggat ttagttactc ttcattcgta 
acttccccta ctgaaccagc agctgtgccc 
aatgactttt tggacatgtt ggacggttta 
tgcgacgtga ccgcaaaatt ctttgagggc 
tgcacaggtt tcgatgatct tcttaaagcc 
tgctttacct ccccgattca cccttctagc 
tgcagttcta ttaagaggtc tagggcaggt 
acaaatgagg ttgaaaagga agcccctaaa 
aagtcccacc gtggttcctc tggttctgct 
gaatcagagg atgaaaggga tatgaaatca 
catgttaaac attcccctaa atatgatgct 
aatgttaata gtacaacatg catgtcgtca 
agccataata aaagccactc tcgtaaacac 
aaccaacaat gctcaatcaa tactcaaaat 
agttttagaa gttttagtcc taagaaagat 
acaaagccag ttaggcccta taacaagaag 
gtacagaggt ctgcctga 



agcaccatcc ctaaagaaga ggtaaatgta 900 
actgccctta cccccctaga aattcatacc 960 
tccaaatatg accgttcaat gttttgcaag 1020 
gagggggata ctgctggcgt taaacgcttc 1080 
gcctcagccg ccagggccgt gtatgaagcg li40 
aagtcaaagt ttctgttcca cgataatgtt 1200 
aagaatggga agaaacctag actcccagct 1260 
gtaaaggtgt gcgctaaggg gattcaaaag 1320 
gaagtggaag aacttgtgcc tcctatggct 138 0 
tcggggactt tgtttgatga ctgggacctg 14 4 0 
tatgacacag aagggtgcaa aaagcacagt 1500 
ggacttaatc aatacaataa actggaagag 1560 
gtgacggtaa catccgcctc tagtgatgat 1620 
tacaagacaa aggatggcgt aaagggagtc 168 0 
gtgaatccag cagaagaaaa gaagaataat 174 0 
gagattgatt ccgatgatga ggtgtgtgag 1800 
tctgttgttg ctgccccgac aacacccctc 1860 
gaagaagagc agatggaaga agaggaggaa 192 0 
aaggaagatg acggagacgc accagcacag 1980 
caacaacaac agccagaaga agaaagcaat 204 0 
caacaacaac aaccagaaag agaagaggag 2100 
gatagcagca gcagcagtag tagcagcagt 2160 
agcagcagca gtagtagcag tgaaaatgaa 2220 
gccaagattc agaagagaaa gaggctaagt 2280 
aagagaatga gagtagaaga agaacaacaa 234 0 
cagactgcag ttgatgagat gatggaagaa 24 00 
acctcaccca aggcagcgac acttgcactc 2460 
agaggagatg acctttcagt agctggtaat 2520 
gctgctgcca cttgcacttc cgatgttgga 2580 
cctggcgata tagtaatgca acctggcgaa 2640 
atcaccctac cagatggtac tgataatgaa 2700 
accgagactg ataacattat aaccaccaca 2760 
aactcagccc ccagaaagga tattgataat 2820 
tcactttttg acactgatga tgatagtgaa 2880 
cgtaagaagc acttgaaaaa gaggcgtaac 2 940 
tcttcttctc attgtatgag tagtgatgaa 3000 
acatcaaagg ttcacaagtc accaaaagct 3060 
gtaaatagtg atgtaaataa ctcatacaac 3120 
tcagatagtg atgcagaagc acagcctaaa 3180 
tcttcttcct ccacaagtga taagaaacag 324 0 
gtcaagaaga ctgttgtaca gtctccacct 3300 
gagcttggtg atttcttgtc acgcaagcac 3360 
cgtgataatg ttaacaccac taataatgta 3420 

3438 



<210> 199 
<211> 1141 
<212> PRT 
<213> SHRIMP 

<400> 199 

Met Ala Pro Pro His Leu Asn Ala Ala Ala Asp Leu Leu Asp Lys Val 

15 10 15 

Met Ser Gly Pro Leu Ser Pro Glu Gly Ala Gin Thr Ser Ser Pro Ala 

20 25 30 

Ala Cys Val Gly Ala Lys Val Val Lys Ala Leu Val Ser Phe Cys Gin 

35 40 45 

Lys Thr Arg Phe Thr Thr Asn lie Val Met Arg Glu Val Lys Ala Met 

50 55 60 

Glu Phe Gin Gly Asp Asp Phe Asn Tyr Ser Ala Leu Cys Ala Ser Met 
65 70 75 80 
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Pro 


Gin 


Arg 


Pro 


Val 


Thr 


Glu 


Arg 


Gin 


Met 


Phe 


Ala 


Leu 


Met 


Lys 


Ser 










85 










90 










95 




Glu 


Asp 


Glu 


Glu 


Met 


Gly 


Val 


Ser 


Ala 


Asn 


Phe 


Ser 


Pro 


Val 


Ser 


Asp 








100 










105 










110 






Asp 


Val 


He 


Asn 


Pro 


Ser 


Ser 


Leu 


Pro 


Ser 


Gly 


Gin 


Glu 


Val 


Asp 


Ser 






115 










120 










125 








Ser 


Thr 


Ser 


Ala 


Gin 


He 


Ser 


Gly 


Met 


Phe 


Gin 


Asn 


Val 


Trp 


Ser 


Leu 




130 










135 










140 










Leu 


Glu 


Glu 


Cys 


Gly 


Ser 


Gly 


Ser 


Asn 


Ser 


Asn 


Ser 


Ser 


Pro 


Val 


Ser 


145 










150 










155 










160 


Arg 


Thr 


Val 


Leu 


Val 


Cys 


Thr 


Leu 


Phe 


He 


He 


Gin 


Val 


Phe 


Lys 


Phe 










165 










170 










175 




Leu 


Val 


Thr 


Lys 


Val 


Ser 


Asn 


Val 


Asn 


Val 


Leu 


Asn 


Gin 


Leu 


Phe 


Gly 








180 










185 










190 






His 


Val 


Val 


Phe 


Gly 


Ser 


Leu 


Asp 


Val 


Ala 


Pro 


Ser 


Asn 


Asn 


Asn 


Ser 






195 










200 










205 








Val 


Pro 


Ser 


Thr 


Val 


Val 


Asn 


Asn 


Asn 


Asn 


Lys 


Pro 


Ser 


Thr 


Ser 


Asn 




210 










215 










220 










Asn 


Ser 


Asn 


Asn 


He 


Ser 


Asn 


Lys 


Arg 


Val 


Gly 


Gly 


Ser 


Asn 


Asn 


Ser 


225 










230 










235 










240 


Gly 


Gly 


Gly 


Arg 


Ser 


Lys 


Lys 


Val 


Thr 


Ala 


Thr 


Ala 


Lys 


Asn 


Pro 


Phe 










245 










250 










255 




Asn 


Asn 


Val 


Asp 


Gly 


Asp 


Asn 


His 


Gly 


Met 


Phe 


Ala 


Gly 


Ala 


Pro 


Val 








260 










265 










270 






Asp 


Val 


Asn 


Leu 


Asp 


Asp 


Phe 


Val 


Phe 


Pro 


Gin 


Val 


Glu 


Thr 


Leu 


Thr 






275 










280 










285 








Ser 


Lys 


Ser 


Thr 


He 


Pro 


Lys 


Glu 


Glu 


Val 


Asn 


Val 


Asp 


Glu 


Asp 


Leu 




290 










295 










300 










Ser 


Lys 


Met 


Cys 


Arg 


Lys 


Thr 


Ala 


Leu 


Thr 


Pro 


Leu 


Glu 


He 


His 


Thr 


305 










310 










315 










320 


Phe 


Asn 


Val 


Phe 


He 


Ser 


Glu 


He 


Asn 


Pro 


Ser 


Lys 


Tyr 


Asp 


Arg 


Ser 










325 










330 










335 




Met 


Phe 


Cys 


Lys 


Gly 


Phe 


Leu 


Thr 


Ala 


Trp 


Asp 


Lys 


Phe 


Val 


Glu 


Gly 








340 










345 










350 






Asp 


Thr 


Ala 


Gly 


Val 


Lys 


Arg 


Phe 


Arg 


Asn 


Tyr 


He 


Leu 


Thr 


Arg 


Ser 






355 










360 










365 








Asn 


Tyr 


Ala 


Ser 


Ala 


Ala 


Arg 


Ala 


Val 


Tyr 


Glu 


Ala 


Ser 


He 


Lys 


Gly 




37 0 










375 










380 










Thr 


Val 


Tyr 


Tyr 


Asn 


Asp 


Lys 


Ser 


Lys 


Phe 


Leu 


Phe 


His 


Asp 


Asn 


Val 


385 










390 










395 










400 


Asn 


Pro 


Asp 


Leu 


Asp 


Lys 


Ser 


Trp 


Gly 


Asn 


Lys 


Asn 


Gly 


Lys 


Lys 


Pro 










405 










410 










415 




Arg 


Leu 


Pro 


Ala 


Asn 


Leu 


Met 


Ala 


Phe 


Met 


Gly 


He 


Asp 


He 


Val 


Lys 








420 










425 










430 






Val 


Cys 


Ala 


Lys 


Gly 


He 


Gin 


Lys 


Tyr 


Met 


Phe 


Ala 


Lys 


Gin 


Phe 


Gin 






435 










440 










445 








His 


Pro 


Glu 


Val 


Glu 


Glu 


Leu 


Val 


Pro 


Pro 


Met 


Ala 


Val 


Tyr 


Ala 


Lys 




450 










455 










4 60 










Val 


Ala 


Ala 


Gly 


Leu 


Lys 


Ser 


Gly 


Thr 


Leu 


Phe 


Asp 


Asp 


Trp 


Asp 


Leu 


465 










470 










475 










480 


Pro 


Glu 


Tyr 


Glu 


Asn 


Cys 


Gin 


Phe 


He 


Lys 


Tyr 


Asp 


Thr 


Glu 


Gly 


Cys 










485 










4 90 










495 




Lys 


Lys 


His 


Ser 


Glu 


Leu 


Tyr 


Ala 


Lys 


Gin 


Leu 


Leu 


Arg 


Thr 


Gin 


Gin 








500 










505 










510 






Tyr 


Asn 


Lys 


Leu 


Glu 


Glu 


Gly 


Gin 


Ser 


Ala 


Phe 


Pro 


Phe 


Ala 


Asn 


He 






515 










520 










525 








Val 


Thr 


Val 


Thr 


Ser 


Ala 


Ser 


Ser 


Asp 


Asp 


He 


His 


Gly 


Asp 


Thr 


He 




530 










535 










540 










He 


Glu 


Leu 


Met 


Tyr 


Lys 


Thr 


Lys 


Asp 


Gly 


Val 


Lys 


Gly 


Val 


Ser 


Lys 


545 










550 










555 










560 


He 


Glu 


Asp 


Glu 


Asn 


He 


He 


Lys 


Val 


Asn 


Pro 


Ala 


Glu 


Glu 


Lys 


Lys 
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565 570 575 



Asn 


Asn 


Arg 


Val 


Gin 


Ala 


Glu 


Lys 


Thr 


Leu 


Tyr 


Phe 


Glu 


He 


Asp 


Ser 








580 










585 










590 






Asp 


Asp 


Glu 


Val 


Cys 


Glu 


Arg 


Thr 


Glu 


Glu 


Glu 


Phe 


Phe 


Arg 


Pro 


Thr 






595 










600 










605 








Ser 


Val 


Val 


Ala 


Ala 


Pro 


Thr 


Thr 


Pro 


Leu 


Val 


Pro 


Ser 


Asn 


Val 


Glu 




610 










615 










620 










Glu 


Glu 


Glu 


Glu 


Glu 


Glu 


Glu 


Gin 


Met 


Glu 


Glu 


Glu 


Glu 


Glu 


Glu 


Glu 


625 










630 










635 










640 


Val 


Glu 


Arg 


Glu 


Glu 


Gly 


Ser 


Asp 


Lys 


Glu 


Asp 


Asp 


Gly 


Asp 


Ala 


Pro 










645 










650 










655 




Ala 


Gin 


Glu 


Glu 


Met 


Glu 


Glu 


Glu 


Lys 


Glu 


Glu 


Glu 


Gin 


Gin 


Gin 


Gin 








660 










665 










670 






Pro 


Glu 


Glu 


Glu 


Ser 


Asn 


Gly 


Asn 


Glu 


Asn 


Gin 


Glu 


Glu 


Glu 


Gin 


Gin 






675 










680 










685 








Gin 


Gin 


Gin 


Gin 


Pro 


Glu 


Arg 


Glu 


Glu 


Glu 


Asn 


Lys 


Asp 


Ala 


Asp 


Ser 




690 










695 










700 










Asp 


Ser 


Asp 


Ser 


Asp 


Ser 


Ser 


Ser 


Ser 


Ser 


Ser 


Ser 


Ser 


Ser 


Ser 


Ser 


705 










710 










715 










720 


Ser 


Ser 


Ser 


Ser 


Ser 


Ser 


Ser 


Ser 


Ser 


Ser 


Ser 


Ser 


Ser 


Ser 


Ser 


Glu 










725 










730 










735 




Asn 


Glu 


Ala 


Glu 


Lys 


Lys 


Lys 


Glu 


Glu 


Glu 


Val 


Pro 


Ala 


Lys 


He 


Gin 








740 










745 










750 






Lys 


Arg 


Lys 


Arg 


Leu 


Ser 


Pro 


Ser 


Glu 


Ala 


Ala 


Ser 


Ser 


Pro 


Lys 


Arg 






755 










760 










765 








Met 


Arg 


Val 


Glu 


Glu 


Glu 


Gin 


Gin 


Gin 


Gin 


Leu 


Ser 


Pro 


Ser 


Leu 


Asp 




770 










775 










780 










He 


Leu 


Gin 


Thr 


Ala 


Val 


Asp 


Glu 


Met 


Met 


Glu 


Glu 


He 


Pro 


Ala 


Pro 


785 










790 










795 










800 


Glu 


Pro 


He 


Val 


Ala 


Thr 


Thr 


Ser 


Pro 


Lys 


Ala 


Ala 


Thr 


Leu 


Ala 


Leu 










805 










810 










815 




Lys 


Thr 


Gly 


Phe 


Ser 


Tyr 


Ser 


Ser 


Phe 


Val 


Arg 


Gly 


Asp 


Asp 


Leu 


Ser 








820 










825 










830 






Val 


Ala 


Gly 


Asn 


Thr 


Ser 


Pro 


Thr 


Glu 


Pro 


Ala 


Ala 


Val 


Pro 


Ala 


Ala 






835 










840 










845 








Ala 


Thr 


Cys 


Thr 


Ser 


Asp 


Val 


Gly 


Asn 


Asp 


Phe 


Leu 


Asp 


Met 


Leu 


Asp 




850 










855 










860 










Gly 


Leu 


Pro 


Gly 


Asp 


He 


Val 


Met 


Gin 


Pro 


Gly 


Glu 


Cys 


Asp 


Val 


Thr 


865 










870 










875 










880 


Ala 


Lys 


Phe 


Phe 


Glu 


Gly 


He 


Thr 


Leu 


Pro 


Asp 


Gly 


Thr 


Asp 


Asn 


Glu 










885 










890 










895 




Cys 


Thr 


Gly 


Phe 


Asp 


Asp 


Leu 


Leu 


Lys 


Ala 


Thr 


Glu 


Thr 


Asp 


Asn 


He 








900 










905 










910 






He 


Thr 


Thr 


Thr 


Cys 


Phe 


Thr 


Ser 


Pro 


He 


His 


Pro 


Ser 


Ser 


Asn 


Ser 






915 










920 










925 








Ala 


Pro 


Arg 


Lys 


Asp 


He 


Asp 


Asn 


Cys 


Ser 


Ser 


He 


Lys 


Arg 


Ser 


Arg 




930 










935 










9A0 










Ala 


Gly 


Ser 


Leu 


Phe 


Asp 


Thr 


Asp 


Asp 


Asp 


Ser 


Glu 


Thr 


Asn 


Glu 


Val 


945 










950 










955 










960 


Glu 


Lys 


Glu 


Ala 


Pro 


Lys 


Arg 


Lys 


Lys 


His 


Leu 


Lys 


Lys 


Arg 


Arg 


Asn 










965 










970 










975 




Lys 


Ser 


His 


Arg 


Gly 


Ser 


Ser 


Gly 


Ser 


Ala 


Ser 


Ser 


Ser 


His 


Cys 


Met 








980 










985 










990 






Ser 


Ser 


Asp 


Glu 


Glu 


Ser 


Glu 


Asp 


Glu 


Arg 


Asp 


Met 


Lys 


Ser 


Thr 


Ser 






995 










1000 








1005 






Lys 


Val 


His 


Lys 


Ser 


Pro 


Lys 


Ala 


His 


Val 


Lys 


His 


Ser 


Pro 


Lys 


Tyr 




1010 








1015 








1020 








Asp 


Ala 


Val 


Asn 


Ser 


Asp 


Val 


Asn 


Asn 


Ser 


Tyr 


Asn 


Asn 


Val 


Asn 


Ser 


1025 








1030 








1035 








1041 


Thr 


Thr 


Cys 


Met 


Ser 


Ser 


Ser 


Asp 


Ser 


Asp 


Ala 


Glu 


Ala 


Gin 


Pro 


Lys 



1045 1050 1055 
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Ser His Asn Lys Ser His Ser Arg Lys His Ser Ser Ser Ser Thr Ser 

1060 1065 1070 

Asp Lys Lys Gin Asn Gin Gin Cys Ser lie Asn Thr Gin Asn Val Lys 

1075 1080 1085 

Lys Thr Val Val Gin Ser Pro Pro Ser Phe Arg Ser Phe Ser Pro Lys 

1090 1095 1100 

Lys Asp Glu Leu Gly Asp Phe Leu Ser Arg Lys His Thr Lys Pro Val 
1105 1110 1115 1120 

Arg Pro Tyr Asn Lys Lys Arg Asp Asn Val Asn Thr Thr Asn Asn Val 

1125 1130 1135 

Val Gin Arg Ser Ala 
1140 



<210> 200 
<211> 1875 
<212> DNA 
<213> SHRIMP 



<400> 200 

atgtcctcga 

gacgaaaaag 

actgtatctt 

tcaaactttg 

cctcc tccct 

gatgaatgtg 

tctatccctg 

gaaattaaga 

gaacctcctg 

cctagtatcg 

tccatagatg 

aagaaagggt 

cacgagaatg 

gaacaaagat 

agagtattga 

atgacattct 

gatga tggag 

gtagtgttcc 

ctcacgcctg 

ggaacaaata 

acttcatcag 

actcctttaa 

ggaggtggaa 

attcctacct 

acgttcccct 

gaggataaag 

ccatcatata 

tctcttgcaa 

ttttgtaata 

ggacaata tt 

tacattgttt 

gacatggatg 



cgacctctcc 
atgtcaagca 
ttagtgataa 
atatgtttgg 
ctacaacaac 
acgaagaagg 
tcgggatctc 
aactcaaaaa 
tcgcagcaga 
atatgtgcgg 
tgtatacttc 
ccaagaaatg 
tatgcatggc 
tcttggctga 
acaaagcttt 
atactaacct 
gaatacatgg 
acaagaggga 
acactagtaa 
ctgatatgca 
tgggtgtgaa 
cgaatgatgt 
attctagaat 
gcttcacagg 
tagaatgtga 
tctttcatcc 
acaagggaca 
agcgtagagg 
tggatgtatg 
gtaatagatg 
gcagagacgc 
actga 



atcgtctttg 
agaagtctcg 
cgatgaggac 
tgattcagat 
accacttcct 
agcagcagca 
tgaagctgaa 
gatgatgaaa 
ttataaaaca 
tgtcgcccct 
ttcctatgta 
tcaattcctt 
ccctcccggt 
acatggagat 
ccccaatctg 
ttgttggata 
cgatgcgtca 
agaacaaaag 
tatgggaaca 
agaattccaa 
taccaaccca 
accagaattg 
gagtgcaaat 
aggtgcgaat 
cgaagatgat 
attttatgag 
gacttgtaat 
aacagtcccc 
tactgccatg 
tatcgtaagg 
ttcatgtgat 



tgggatgatg 

aaccgtcccc 

gataacaagg 

aacatgccat 

actcctcgat 

gcatcagcac 

ttgaaaaaaa 

gatcctctcc 

agggcaaaca 

cctcaatttt 

tttcctcctc 

aaggggagaa 

aaaaggggag 

gaatacaagg 

attgctcgtg 

gttaatggag' 

gagtatgcaa 

accaatggga 

agcactgatg 

tcaatgggaa 

cttcccaacc 

gacatgatgt 

acaggaacat 

gtagtagtgc 

ccaagtattc 

tatatggcca 

gttgtccagg 

aaattctgta 

tgcaaatggg 

aggtcatgta 

gttcccaagt 



acgatgatga 
ccattttttc 
gagaagaaga 
caact tctac 
ccatcatgga 
cgtctattgc 
tggaaaagaa 
ctcacctata 
tttcccttta 
gcgctgaatt 
ccacacctgc 
aggctttgag 
gtgtattttt 
tcccaaggat 
cagacacact 
ttgtcgtatg 
caggagaaaa 
gtgccagtaa 
tgcaagaatt 
caaataccaa 
ctcccccaag 
ggctttattc 
ctcccctgtc 
ctaatggatt 
ccaattctta 
aatatctatc 
agtggttcaa 
gtaacatttc 
cgaagactgt 
ca tccatgct 
gcagggaaag 



tgacgaagaa 
gtacatggaa 
atgttttgga 
tgcccctttc 
tactgattcg 
cgcctcttct 
aaagaggaag 
tgtaggagga 
taaagttgac 
gcccacccca 
catgcataat 
gaaatggatt 
ggctcacttg 
gtttgtttca 
gtgcagtgat 
ctttgataaa 
tt ttgatact 
gaagaggcgt 
ccaaacgatg 
ccccatagag 
attggtaatt 
gccttccaga 
taacacccca 
tgtccctccc 
caattacgaa 
ccctcttgtt 
gggatccttc 
ccacgctttc 
aattagacat 
cgcatatcac 
ggttcgcaac 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1875 



<210> 201 
<211> 620 
<212> PRT 
<213> SHRIMP 



<400> 201 

Met Ser Ser Thr Thr Ser Pro Ser Ser Leu Trp Asp Asp Asp Asp Asp 
15 10 15 
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Asp Asp Glu Glu Asp Glu I*ys Asp Val Lys Gin Glu Val Ser Asn Arg 

20 25 30 

Pro Pro lie Phe Ser Tyr Met Val Ser Phe Ser Asp Asn Asp Glu Asp 

35 40 45 

Asp Asn Lys Gly Glu Glu Glu Cys Phe Gly Ser Asn Phe Asp Met Phe 

50 55 60 

Gly Asp Ser Asp Asn Met Pro Ser Thr Ser Thr Ala Pro Phe Pro Pro 
65 70 75 80 

Pro Ser Thr Thr Thr Pro Leu Pro Thr Pro Arg Ser lie Met Asp Thr 

85 90 95 

Asp Ser Asp Glu Cys Asp Glu Glu Gly Ala Ala Ala Ala Ser Ala Pro 

100 105 110 

Ser lie Ala Ala Ser Ser Ser lie Pro Val Gly He Ser Glu Ala Glu 

115 120 125 

Leu Lys Lys Met Glu Lys Lys Lys Arg Lys Glu He Lys Lys Leu Lys 

130 135 140 

Lys Met Met Lys Asp Pro Leu Pro His Leu Tyr Val Gly Gly Glu Pro 
145 150 155 160 

Pro Val Ala Ala Asp Tyr Lys Thr Arg Ala Asn He Ser Leu Tyr Lys 

165 170 175 

Val Asp Pro Ser He Asp Met Cys Gly Val Ala Pro Pro Gin Phe Cys 

180 185 190 

Ala Glu Leu Pro Thr Pro Ser He Asp Val Tyr Thr Ser Ser Tyr Val 

195 200 205 

Phe Pro Pro Pro Thr Pro Ala Met His Asn Lys Lys Gly Ser Lys Lys 

210 215 220 

Cys Gin Phe Leu Lys Gly Arg Lys Ala Leu Arg Lys Trp He His Glu 
225 230 235 240 

Asn Val Cys Met Ala Pro Pro Gly Lys Arg Gly Gly Val Phe Leu Ala 

245 250 255 

His Leu Glu Gin Arg Phe Leu Ala Glu His Gly Asp Glu Tyr Lys Val 

260 265 270 

Pro Arg Met Phe Val Ser Arg Val Leu Asn Lys Ala Phe Pro Asn Leu 

275 280 285 

He Ala Arg Ala Asp Thr Leu Cys Ser Asp Met Thr Phe Tyr Thr Asn 

290 295 300 

Leu Cys Trp He Val Asn Gly Val Val Val Cys Phe Asp Lys Asp Asp 
305 310 315 320 

Gly Gly He His Gly Asp Ala Ser Glu Tyr Ala Thr Gly Glu Asn Phe 

325 330 335 

Asp Thr Val Val Phe His Lys Arg Glu Glu Gin Lys Thr Asn Gly Ser 

340 345 350 

Ala Ser Lys Lys Arg Arg Leu Thr Pro Asp Thr Ser Asn Met Gly Thr 

355 360 365 

Ser Thr Asp Val Gin Glu Phe Gin Thr Met Gly Thr Asn Thr Asp Met 

370 375 380 

Gin Glu Phe Gin Ser Met Gly Thr Asn Thr Asn Pro He Glu Thr Ser 
385 390 395 400 

Ser Val Gly Val Asn Thr Asn Pro Leu Pro Asn Pro Pro Pro Arg Leu 

405 410 415 

Val He Thr Pro Leu Thr Asn Asp Val Pro Glu Leu Asp Met Met Trp 

420 425 430 

Lys Pro Ser Arg Gly Gly Gly Asn Ser Arg Met Ser Ala Asn Thr Gly 

435 440 445 

Thr Ser Pro Leu Ser Asn Thr Pro He Pro Thr Cys Phe Thr Gly Gly 

450 455 460 

Ala Asn Val Val Val Pro Asn Gly Phe Val Pro Pro Thr Phe Pro Leu 
465 470 475 480 

Glu Cys Asp Glu Asp Asp Pro Ser He Pro Asn Ser Tyr Asn Tyr Glu 

485 490 495 

Glu Asp Lys Val Phe His Pro Phe Tyr Glu Tyr Met Ala Lys Tyr Leu 
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500 










505 










510 






Ser 


Pro 


Leu 
515 


Val 


Pro 


Ser 


Tyr 


Asn 
520 


Lys 


Gly 


Gin 


Thr 


Cys 
525 


Asn 


Val 


Val 


Gin 


Glu 
530 


Trp 


Phe 


Lys 


Gly 


Ser 
535 


Phe 


Ser 


Leu 


Ala 


Lys 
540 


Arg 


Arg 


Gly 


Thr 


Val 


Pro 


Lys 


Phe 


Cys 


Ser 


Asn 


He 


Ser 


His 


Ala 


Phe 


Phe 


Cys 


Asn 


Met 


545 






550 










555 










560 


Asp 


Val 


Cys 


Thr 


Ala 


Met 


Cys 


Lys 


Trp 


Ala 


Lys 


Thr 


Val 


He 


Arg 


His 






565 










570 










575 




Gly 


Gin 


Tyr 


Cys 


Asn 


Arg 


Cys 


He 


Val 


Arg 


Arg 


Ser 


Cys 


Thr 


Ser 


Met 




580 










585 










590 






Leu 


Ala 


Tyr 

595 


His 


Tyr 


He 


Val 


Cys 
600 


Arg 


Asp 


Ala 


Ser 


Cys 
605 


Asp 


Val 


Pro 


Lys 


Cys 
610 


Arg 


Glu 


Arg 


Val 


Arg 
615 


Asn 


Asp 


Met 


Asp 


Asp 
620 











<210> 202 

<211> 2820 

<212> DNA 

<213> SHRIMP 

<400> 202 

atgtctgagc cttcagttta tgcttttatt gacataaaag aaatagaaaa tggttgggaa 60 
aaagaattcg ggcttttagt acaaccagga cagaaattag ctcctttcag ggatatttct 120 
tatgactcaa gcaaacttga ttgtgacgca ttttcttgca taccttcaga tatacttcat 180 
tctgataatg aaaaaagagt aggagagtgc aactttgccg aacacacctc tgtctcattt 240 
cctgtcaaga accctgaggg aaaaacattg cgccatttca cggcatgtgg tccagggtgt 300 
taccggaggt acaagcaaag agacccccat actggtttgc cagtagccag aggcgtcctg 360 
atgcaagatc acgttgacca tgaaactgga aataaaatgt gtgaatatct gaaccagagc 420 
ttagtcatgt gggctgctgt cccttggata cgacctggag accttactga aggttacaac 480 
acaacacacg tgcctggctt tgcattcaag gaggatgacg aaagggattc aaaaagagtt 540 
aaatatgaaa atgtggtcat ttcaaaggct tattgtgatt tctttaaaca gtattatgac 600 
gcagactctg gctcatgtta tcgatctgga tggatgaaat ttgtccattt aatgtttggg 660 
cagtatttta ctaatctttc ttacaattta gctaacccaa aaccttacaa tttaactgga 720 
aatacatggt ctgatgtagt ttctgtattg acagataacc ctattgtaga tgcgggtgct 780 
gctccgtcca gatctgaaat ggatgaaatt atcaccaaga agaagttcaa cgtattccct 840 
tctgagcaga catctgctcg ccagaaagca gagaatataa tacgttctca gtacggagat 900 
ggtgtagaaa tagacccatc ttctgtggat gctttaatgc agtttgttaa tagggaaggt 960 
gtagtaggaa cagagaaaaa atccgaccgc ctcatgcgag tggcagacgc tgttatggat 1020 
gcggctatgc gccttcaagt catgggtctg gacgacagtc aatctagacg attattgtta 1080 
aaaaatatga ttaaaatgag tagaaacaac ccagaatatg caagacattt ttccagttct 1140 
ctcaaattaa ttggagtaac tttggccata aaaagatctg ttttctctaa aggcgcttca 1200 
gctaaaagaa aggaaacggc cattaataat ggcgaacagc atagaagaag cagatggtca 1260 
cccgagacgg tgacagaaga agatgcactt ttatttgcaa gagaaaatat cactgaagac 1320 
ccaaaacacc ctgcgccctt tgtggacatt ttacactcac cagatatcaa ttcatctatc 1380 
aagagcggtt cttcttcctc aatatggaac gatatacttt caagaatttc ttcgaccaga 1440 
aaactggaag aaaaggcgag tgttttcgtt aaaaatcttg tggtgaaagt agtgagacag 1500 
ttcttggaca ttttagaagg taaactgttt tcggacggat acgaatggga cgataatatt 1560 
cctctgatga taggcgtgga ccaaatattg agagaagtca ttaaggcggc cagtaatatg 1620 
tgcgctagat ttgcctcctc agctctggag tctagtttgg ttactggctt tatagactcg 1680 
gcgagtgcta tcacttctag gttagccgta caactggcag ccagaacgtt ctccgtattc 1740 
ttggaggagt ctgttataga atttgtagtt gccgcaagcc ttcggctagc gatacaggca 1800 
tttgccgatc tggcaactct tgccgcgtca gccttaaccg ttattggaat tgttatattc 1860 
gtaatacaag tacttgggtt gattttggat cttgcgctag ggttaggttg gtacgatcac 1920 
atttttagcc cagaggattt aaagaagcag gttctagtgt ttaggagaga gtttgcaaag 1980 
gcaggaaatg tagatgtggg tgtagctcaa ccagtcaccc ctgaagaaat cgtcgctatc 2040 
aacgttttcc ttcaaactga agaaaacggg gaagaaaaga aggaagaagg agcacgaaaa 2100 
tcaaagattg attttcttca aaaatacttc cattctactc ctttgatggg aaagaaaagt 2160 
aagtttgtct acatacaaga agcagctcaa gaatacttgg gaggaagaac aatgaacgct 2220 
tttgggcagc gtataataac agctgctgat gatagtgaca ccaccaccac cacacaagag 2280 
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ggaaggaggg atgacgaaac agtgactaag aaaatgagga gtattattct agaaacaggt 2340 
caaactctta aggattactc gtctgctgtt aactataacg cctcccgtct agattacgtg 2400 
ggagaggaat gggtaagaaa tactgcccta aaagaagaga caaggagcaa cactactagt 24 60 
gataacctat tcaagaaaac tgtttctctt gctagtatgg ccggcgcatt tctggttcta 2520 
ggaataggtg tattggtagc gtcccatatt acacttttac gtttcaccaa tattggtcta 2580 
gcttttgcgt tcgcaggtct cctagcattt attgcactta tgagtatatc atatataaac 2640 
atgaatgcta tgggtgtagt gaattcggac gcaatataca ggtctactgc tctagttgga 2700 
gatatcaaaa cagaccccag aagagtagga atggtccagc gccacgtagg tgtcggggct 27 60 
aaatacaaca tgattacaga tttcgtctct ccaatgttag acgagatcga gagtgactaa 2820 



<210> 203 
<211> 931 
<212> PRT 
<213> SHRIMP 

<400> 203 



Met 


Ser 


Glu 


Pro 


Ser 


Val 


Tyr 


Ala 


Phe 


He 


Asp 


He 


Lys 


Glu 


He 


Glu 


1 








5 










10 










15 




Asn 


Gly 


Trp 


Glu 


Lys 


Glu 


Phe 


Gly 


Leu 


Leu 


Val 


Gin 


Pro 


Gly 


Gin 


Lys 








20 










25 










30 






Leu 


Ala 


Pro 


Phe 


Arg 


Asp 


He 


Ser 


Tyr 


Asp 


Ser 


Ser 


Lys 


Leu 


Asp 


Cys 






35 










40 










45 








Asp 


Ala 


Phe 


Ser 


Cys 


He 


Pro 


Ser 


Asp 


He 


Leu 


His 


Ser 


Asp 


Asn 


Glu 




50 










55 










60 










Lys 


Arg 


Val 


Gly 


Glu 


Cys 


Asn 


Phe 


Ala 


Glu 


His 


Thr 


Ser 


Val 


Ser 


Phe 


65 








70 










75 










80 


Pro 


Val 


Lys 


Asn 


Pro 


Glu 


Gly 


Lys 


Thr 


Leu 


Arg 


His 


Phe 


Thr 


Ala 


Cys 










85 










90 










95 




Gly 


Pro 


Gly 


Cys 


Tyr 


Arg 


Arg 


Tyr 


Lys 


Gin 


Arg 


Asp 


Pro 


His 


Thr 


Gly 








100 










105 










110 






Leu 


Pro 


Val 


Arg 


Val 


Leu 


Met 


Gin 


Asp 


His 


Val 


Asp 


His 


Glu 


Thr 


Gly 






115 










120 










125 








Asn 


Lys 


Met 


Cys 


Glu 


Tyr 


Leu 


Asn 


Gin 


Ser 


Leu 


Val 


Met 


Trp 


Ala 


Ala 




130 










135 










140 










Val 


Pro 


Trp 


He 


Arg 


Pro 


Gly 


Asp 


Leu 


Thr 


Glu 


Gly 


Tyr 


Asn 


Thr 


Thr 


145 










150 










155 










160 


His 


Val 


Pro 


Gly 


Phe 


Ala 


Phe 


Lys 


Glu 


Asp 


Asp 


Glu 


Arg 


Asp 


Ser 


Lys 










165 










170 










175 




Arg 


Val 


Lys 


Tyr 


Glu 


Asn 


Val 


Val 


He 


Ser 


Lys 


Ala 


Tyr 


Cys 


Asp 


Phe 






180 










185 










190 






Phe 


Lys 


Gin 


Tyr 


Tyr 


Asp 


Ala 


Asp 


Ser 


Gly 


*Ser 


Cys 


Tyr 


Arg 


Ser 


Gly 






195 










200 










205 








Trp 


Met 


Lys 


Phe 


Val 


His 


Leu 


Met 


Phe 


Gly 


Gin 


Tyr 


Phe 


Thr 


Asn 


Leu 




210 










215 










220 










Ser 


Tyr 


Asn 


Leu 


Ala 


Asn 


Pro 


Lys 


Pro 


Tyr 


Asn 


Leu 


Thr 


Gly 


Asn 


Thr 


225 








230 










235 










240 


Trp 


Ser 


Asp 


Val 


Val 


Ser 


Val 


Leu 


Thr 


Asp 


Asn 


Pro 


He 


Val 


Asp 


Ala 










245 










250 










255 




Gly 


Ala 


Ala 


Pro 


Ser 


Arg 


Ser 


Glu 


Met 


Asp 


Glu 


He 


He 


Thr 


Lys 


Lys 






260 










265 










270 






Lys 


Phe 


Asn 


Val 


Phe 


Pro 


Ser 


Glu 


Gin 


Thr 


Ser 


Ala 


Arg 


Gin 


Lys 


Ala 




275 










280 










285 








Glu 


Asn 


He 


He 


Arg 


Ser 


Gin 


Tyr 


Gly 


Asp 


Gly 


Val 


Glu 


He 


Asp 


Pro 




290 










295 










300 










Ser 


Ser 


Val 


Asp 


Ala 


Leu 


Met 


Gin 


Phe 


Val 


Asn 


Arg 


Glu 


Gly 


Val 


Val 


305 










310 










315 










320 


Gly 


Thr 


Glu 


Lys 


Lys 


Ser 


Asp 


Arg 


Leu 


Met 


Arg 


Val 


Ala 


Asp 


Ala 


Val 










325 










330 










335 




Met 


Asp 


Ala 


Ala 


Met 


Arg 


Leu 


Gin 


Val 


Met 


Gly 


Leu 


Asp 


Asp 


Ser 


Gin 








340 










345 










350 
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Ser Arg Arg Leu Leu Leu Lys Asn Met lie Lys Met Ser Arg Asn Asn 

355 360 365 

Pro Glu Tyr Ala Arg His Phe Ser Ser Ser Leu Lys Leu lie Gly Val 

370 375 380 

Thr Leu Ala lie Lys Arg Ser Val Phe Ser Lys Gly Ala Ser Ala Lys 
385 390 395 400 

Arg Lys Glu Thr Ala lie Asn Asn Gly Glu Gin His Arg Arg Ser Arg 

405 410 415 

Trp Ser Pro Glu Thr Val Thr Glu Glu Asp Ala Leu Leu Phe Ala Arg 

420 425 430 

Glu Asn lie Thr Glu Asp Pro Lys His Pro Ala Pro Phe Val Asp lie 

435 440 445 

Leu His Ser Pro Asp lie Asn Ser Ser lie Lys Ser Gly Ser Ser Ser 

450 455 460 

Ser lie Trp Asn Asp lie Leu Ser Arg lie Ser Ser Thr Arg Lys Leu 
465 470 475 480 

Glu Glu Lys Ala Ser Val Phe Val Lys Asn Leu Val Val Lys Val Val 

485 490 495 

Arg Gin Phe Leu Asp lie Gly Lys Leu Phe Ser Asp Gly Tyr Glu Trp 

500 505 510 

Asp Asp Asn lie Pro Leu Met lie Gly Val Asp Gin lie Leu Arg Glu 

515 520 525 

Val lie Lys Ala Asn Met Cys Ala Arg Phe Ala Ser Ser Ala Leu Glu 

530 535 540 

Ser Ser Leu Val Thr Gly Phe lie Asp Ser Ala Ser Ala lie Thr Ser 
545 550 555 560 

Arg Leu Ala Val Gin Leu Ala Ala Arg Thr Phe Ser Val Phe Leu Glu 

565 570 575 

Glu Ser Val lie Glu Phe Val Val Ala Ala Ser Leu Arg Leu Ala lie 

580 585 590 

Gin Ala Phe Ala Asp Leu Ala Thr Leu Ala Ala Ser Ala Leu Thr Val 

595 600 605 

lie Gly lie Val lie Phe Val lie Gin Val Leu Gly Leu lie Leu Asp 

610 615 620 

Leu Ala Leu Gly Leu Gly Trp Tyr Asp His lie Phe Ser Pro Glu Asp 
625 630 635 640 

Leu Lys Lys Gin Val Leu Val Phe Arg Arg Glu Phe Ala Lys Ala Gly 

645 650 655 

Asn Val Asp Val Gly Val Ala Gin Pro Val Thr Pro Glu Glu lie Val 

660 665 670 

Ala lie Asn Val Phe Leu Gin Thr Glu Glu Asn Gly Glu Glu Lys Lys 

675 680 685 

Glu Glu Gly Ala Arg Lys Ser Lys lie Asp Phe Leu Gin Lys Tyr Phe 

690 695 700 

His Ser Thr Pro Leu Met Gly Lys Lys Ser Lys Phe Val Tyr lie Gin 
705 710 715 720 

Glu Ala Ala Gin Glu Tyr Leu Gly Gly Arg Thr Met Asn Ala Phe Gly 

725 730 735 

Gin Arg lie lie Thr Ala Ala Asp Asp Ser Asp Thr Thr Thr Thr Thr 

740 • 745 750 

Gin Glu Gly Arg Arg Asp Asp Glu Thr Val Thr Lys Lys Met Arg Ser 

755 760 765 

lie lie Thr Gly Gin Thr Leu Lys Asp Tyr Ser Ser Ala Val Asn Tyr 

770 775 780 

Asn Ala Ser Arg Leu Asp Tyr Val Gly Glu Glu Trp Val Arg Asn Thr 
785 790 795 800 

Ala Leu Lys Glu Glu Thr Arg Ser Asn Thr Thr Ser Asp Asn Leu Phe 

805 810 815 

Lys Lys Thr Val Ser Leu Ala Ser Met Ala Gly Ala Phe Leu Val Leu 

820 825 830 

Gly lie Gly Val Leu Val Ala Ser His lie Thr Leu Leu Arg Phe Thr 
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835 840 



Asn 


lie 
850 


Gly 


Leu 


Ala 


Phe 


Ala 
855 


Phe 


Leu 


Met 


Ser 


lie 


Ser 


Tyr 


He 


Asn 


865 










870 






Ser 


Asp 


Ala 


He 


Tyr 
885 


Arg 


Ser 


Thr 


Asp 


Pro 


Arg 


Arg 
900 


Val 


Gly 


Met 


Val 


Lys 


Tyr 


Asn 
915 


Met 


He 


Thr 


Asp 


Phe 
920 


Glu 


Ser 
930 


Asp 













845 



Ala 


Gly 


Leu 


Leu 
860 


Ala 


Phe 


He 


Ala 


Met 


Asn 


Ala 

875 


Met 


Gly 


Val 


Val 


Asn 
880 


Ala 


Leu 


Val 


Gly Asp 


He 


Lys 


Thr 




890 










895 




Gin 


Arg 


His 


Val 


Gly 


Val 


Gly 


Ala 


905 










910 






Val 


Ser 


Pro 


Met 


Leu 

925 


Asp 


Glu 


He 



<210> 204 

<211> 3135 

<212> DNA 

<213> SHRIMP 



<400> 204 

atggattcgt 

acaaacaaac 

agggcaagtc 

tgtataccca 

ttcgaatgga 

ctttctacaa 

aaagaagaag 

aatgtggaag 

ataattgttg 

ttctctaatc 

gctgctgaca 

tccgccctcc 

cagcgctcct 

ccgagtaatg 

gcgcaatata 

aacggccttg 

gaagtgaaag 

aaacatctcg 

aaaaaaatca 

tcgtgtccgc 

aatgaattct 

tctcaaaaaa 

atggagggaa 

tcattagcta 

ttttattttt 

tcagagacgg 

cctaatataa 

atcgaggaca 

caatgtatgg 

cgtggtaacc 

aatagtaaag 

ggaaattata 

gggatcgaac 

ttgtatggta 

aatagaatca 

gccgaaagtg 

gctgtttcat 

tggtggatcg 

aagaaatgtt 

aaaacttgtg 



tgactaatac 
caatcaccga 
ttttgaagtt 
agaacaagta 
gagatggaac 
aaaaatcgcc 
aatttgaaga 
atttcataaa 
atgacgaaga 
atctgaggcg 
ttgagaagaa 
agcaacagca 
catctgagaa 
gcttctcaga 
atggactcgt 
gtctgaaatg 
atgtgtggtg 
cattcgcaga 
ccgcctattt 
tt ttgaaaag 
caggagttaa 
accccaacat 
ttataaccca 
cggctatggc 
ctgtatattt 
ataaggtaca 
gtgctcacca 
gacctatccg 
catgcaagga 
agcacttgtc 
tagattcatt 
ca tctaagtt 
cagtgtctac 
aagtcgt tga 
gggtcttttt 
caaaaaatgc 
attttgagca 
aacttaa tga 
ctaaagtaaa 
ataaatacaa 



agtaacactc 
acaagatgtg 
atattctgtt 
cacgaatgtg 
aaaagtacac 
tcggtcctca 
agaatttgaa 
tggtgatgga 
agaagagaat 
ccagactgcc 
ggacaaaaac 
acaaaaactg 
ggtcacctcc 
acagaccgag 
ggagctagac 
cagtatggaa 
tcagcctaaa 
atcgcctata 
ttgtttagac 
tggatcaaac 
aaatgacgat 
tgaaattgta 
tagaaaagat 
attttgccac 
accattctct 
tattggttca 
gaacaataac 
aaataataat 
aagatgcaca 
acatagtgta 
gaaaaaattg 
ttgtgataga 
ttctttctca 
cgttatcaat 
taactctgag 
attcaaggat 
aaacaaatta 
cttgaacatt 
tgatgcagaa 
aactgccaag 



ctcgtgaatg 
gaaaatactc 
ttcatcaaag 
caagaaatat 
agatcagttt 
ccttcccctc 
gatgatgaag 
gaagattcag 
gaagagggag 
gccgctgccg 
cacgcagtta 
ctccagcaac 
acacccaaca 
ctctttgttt 
atcttaccca 
actcccccag 
actagctttg 
cttcaaaggc 
gattctgtag 
tttcgagtgt 
gacacgtctt 
tcaaaattaa 
ttgttcgaga 
ccaaaagcta 
aaaataactc 
gacgcgatct 
aacaataata 
ataagcagaa 
aacaattgca 
aaaggggaag 
agcagagtac 
agctctatgt 
tcggacagtt 
agttattctg 
gaaaaagata 
atacttgttc 
tcctctaaag 
atgtttgaga 
tctttaaagg 
agggcaatta 



accgtcttgg 
ttaaccttaa 
aaatgcagtc 
tcgaagatgg 
caccaagttc 
ctccatcgat 
aaatatacga 
aagaggaaga 
aaaacaagta 
ccgccgccgc 
gtgcgcatga 
agcaacagca 
aattcaacaa 
gtttcgatgt 
ttgttgctga 
tgaaaccgtg 
aaaatgatgc 
ctagagattt 
acattaaaaa 
ccgaatattc 
caaacacttg 
atattgaatt 
cgggcatttt 
gagttcgaaa 
gcaaagaaac 
tttctccccc 
acaataatac 
aaatgaccat 
ctaacggtaa 
atttctttaa 
tgattcccgc 
gccatagctt 
ttgaaaagac 

ataagactat 
acgaatgtaa 
atgggcatct 
aacacgtgga 
atatttttaa 
ttggagcaca 



aaatcatcga 
cagtctagaa 
ctattctgga 
actaattact 
ccctatacct 
gccttctatt 
aacagatgaa 
agaagaagat 
cgttctcgca 
agctgctgct 
ctacacactg 
acaacaccag 
gtttttactg 
ggataaaatt 
atacattatc 
caggagaaag 
tgtggaaga t 
ccctatccct 
cccctggggt 
gcgtcat ttt 
ctttatatac 
tgaggtaatg 
gagcgat tct 
tgttgcattg 
tataaaatgt 
aagtgataat 
cagtgtgaat 
cacaaactac 
ctatcccgat 
gattttaaat 
tcctccctct 
ct tttgtaga 
taaacttgtt 
ttcccataat 
cccctctaga 
taaagaacga 
atctaacaag 
agatttttac 
tgattt tgaa 
agaccct tct 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 
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acttctactc 
tttcattcaa 
tcgaggacaa 
tttgaaaata 
atgggaggtg 
agaggtgctg 
acatggaagg 
tacaattcaa 
tcagatttta 
gatgttaaca 
ccaacgagca 
ccaacagtga 
actactgctg 



cctctaaaaa 
aagatgaagc 
taatgtctgg 
agaacactaa 
ctgcagtgac 
tctctgtcac 
atgaacaaac 
aaaggaataa 
attggagaac 
agaagaggta 
gcagcagcag 
agaatagtaa 
tttga 



ggagaa tggt 
tacagtaagt 
tgttagatgt 
taataattgg 
aaagatttcc 
tacagcacct 
actaaaaaca 
tagggataat 
acccaatatc 
cgcagaagtc 
caacagcaac 
taataaacca 



atcactagga 
gcccttctcg 
gttatacgta 
gaacttgaga 
gatgaagatt 
aatgataagc 
aacactaaac 
aataaaataa 
tctattcaag 
gtggcgtcag 
agcagcagcc 
ttgtatattc 



ttattagtac 
acaaaacaat 
acaatagtgt 
ttagacacta 
tggaacaatt 
tacctgtagg 
gtaatagtct 
aaaatcgttc 
aatttaatgc 
ctgctccaaa 
ctcctctttc 
ctccccataa 



attatccgaa 
gctcttggga 
gttttcgggc 
tgtcatctct 
cacgcctgta 
ggcacatcag 
atatgactct 
attaaaacta 
aaataaagat 
gtcaccttca 
accgctctca 
aagaatgacc 



2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3135 



<210> 205 
<211> 1036 
<212> PRT 
<213> SHRIMP 



<400> 205 



Met 


Asp 


Ser 


Leu 


Thr 


Asn 


Thr 


Val 


Thr 


Leu 


Leu 


Val 


Asn 


Asp 


Arg 


Leu 


1 








5 










10 










15 




Gly 


Asn 


His 


Arg 


Thr 


Asn 


Lys 


Pro 


He 


Thr 


Glu 


Gin 


Asp 


Val 


Glu 


Asn 








20 










25 










30 






Thr 


Leu 


Asn 


Leu 


Asn 


Ser 


Leu 


Glu 


Arg 


Ala 


Ser 


Leu 


Leu 


Lys 


Lys 


Val 






35 










40 










45 








Phe 


He 


Lys 


Glu 


Met 


Gin 


Ser 


Tyr 


Ser 


Gly 


Cys 


He 


Pro 


Lys 


Asn 


Lys 




50 










55 










60 










Tyr 


Thr 


Asn 


Val 


Gin 


Glu 


He 


Phe 


Glu 


Asp 


Gly 


Leu 


He 


Thr 


Phe 


Glu 


65 










70 










75 










80 


Trp 


Arg 


Asp 


Gly 


Thr 


Lys 


Val 


His 


Arg 


Ser 


Val 


Ser 


Pro 


Ser 


Ser 


Pro 










85 










90 










95 




He 


Pro 


Leu 


Ser 


Thr 


Lys 


Lys 


Ser 


Pro 


Arg 


Ser 


Ser 


Pro 


Ser 


Pro 


Pro 








100 










105 










110 






Pro 


Ser 


Met 


Pro 


Ser 


He 


Lys 


Glu 


Glu 


Glu 


Phe 


Glu 


Glu 


Glu 


Phe 


Glu 






115 










120 










125 








Asp 


Asp 


Glu 


Glu 


He 


Tyr 


Glu 


Thr 


Asp 


Glu 


Asn 


Val 


Glu 


Asp 


Phe 


He 




130 










135 










140 










Asn 


Gly 


Asp 


Gly 


Glu 


Asp 


Ser 


Glu 


Glu 


Glu 


Glu 


Glu 


Glu 


Asp 


He 


He 


145 










150 










155 










160 


Val 


Asp 


Asp 


Glu 


Glu 


Glu 


Glu 


Asn 


Glu 


Glu 


Gly 


Glu 


Asn 


Lys 


Tyr 


Val 










165 










170 










175 




Leu 


Ala 


Phe 


Ser 


Asn 


His 


Leu 


Arg 


Arg 


Gin 


Thr 


Ala 


Ala 


Ala 


Ala 


Ala 








180 










185 










190 






Ala 


Ala 


Ala 


Ala 


Ala 


Ala 


Ala 


Ala 


Asp 


He 


Glu 


Lys 


Lys 


Asp 


Lys 


Asn 






195 










200 










205 








His 


Ala 


Val 


Ser 


Ala 


His 


Asp 


Tyr 


Thr 


Leu 


Ser 


Ala 


Leu 


Gin 


Gin 


Gin 




210 










215 










220 










Gin 


Gin 


Lys 


Leu 


Leu 


Gin 


Gin 


Gin 


Gin 


Gin 


Gin 


Gin 


His 


Gin 


Gin 


Arg 


225 










230 










235 










240 


Ser 


Ser 


Ser 


Glu 


Lys 


Val 


Thr 


Ser 


Thr 


Pro 


Asn 


Lys 


Phe 


Asn 


Lys 


Phe 










245 










250 










255 




Leu 


Leu 


Pro 


Ser 


Asn 


Gly 


Phe 


Ser 


Glu 


Gin 


Thr 


Glu 


Leu 


Phe 


Val 


Cys 








260 










265 










270 






Phe 


Asp 


Val 


Asp 


Lys 


He 


Ala 


Gin 


Tyr 


Asn 


Gly 


Leu 


Val 


Glu 


Leu 


Asp 






275 










280 










285 








He 


Leu 


Pro 


He 


Val 


Ala 


Glu 


Tyr 


He 


He 


Asn 


Gly 


Leu 


Gly 


Leu 


Lys 




290 










295 










300 










Cys 


Ser 


Met 


Pro 


Pro 


Val 


Lys 


Pro 


Cys 


Arg 


Arg 


Lys 


Glu 


Val 


Lys 


Asp 


305 










310 










315 










320 


Val 


Trp 


Cys 


Gin 


Pro 


Lys 


Thr 


Ser 


Phe 


Glu 


Asn 


Asp 


Ala 


Val 


Glu 


Asp 
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325 



Lys 


His 


Leu 


Ala 


Phe 


Ala 


Glu 


Ser 








340 










Phe 


Pro 


He 


Pro 


Lys 


Lys 


He 


Thr 






355 










360 


Val 


Asp 


He 


Lys 


Asn 


Pro 


Trp 


Gly 




370 










375 




Ser 


Asn 


Phe 


Arg 


Val 


Ser 


Glu 


Tyr 


385 










390 






Gly 


Val 


Lys 


Asn 


Asp 


Asp 


Asp 


Thr 










405 








Ser 


Gin 


Lys 


Asn 


Pro 


Asn 


He 


Glu 








420 










Phe 


Glu 


Val 


Met 


Met 


Glu 


Gly 


He 






435 










440 


Gly 


He 


Leu 


Ser 


Asp 


Ser 


Ser 


Leu 




450 










455 




Pro 


Lys 


Ala 


Arg 


Val 


Arg 


Asn 


Val 


465 










470 






Phe 


Ser 


Lys 


He 


Thr 


Arg 


Lys 


Glu 










485 








Lys 


Val 


His 


He 


Gly 


Ser 


Asp 


Ala 








500 










Pro 


Asn 


He 


Ser 


Ala 


His 


Gin 


Asn 






515 










520 


Thr 


Ser 


Val 


Asn 


He 


Glu 


Asp 


Arg 




530 










535 




Arg 


Lys 


Met 


Thr 


He 


Thr 


Asn 


Tyr 


545 










550 






Cys 


Thr 


Asn 


Asn 


Cys 


Thr 


Asn 


Gly 










565 








His 


Leu 


Ser 


His 


Ser 


Val 


Lys 


Gly 








580 










Asn 


Ser 


Lys 


Val 


Asp 


Ser 


Leu 


Lys 






595 










600 


Ala 


Pro 


Pro 


Ser 


Gly 


Asn 


Tyr 


Thr 




610 










615 




Met 


Cys 


His 


Ser 


Phe 


Phe 


Cys 


Arg 


625 










630 






Phe 


Ser 


Ser 


Asp 


Ser 


Phe 


Glu 


Lys 










645 








Val 


Val 


Asp 


Val 


He 


Asn 


Ser 


Tyr 








660 










Asn 


Arg 


He 


Arg 


Val 


Phe 


Phe 


Asn 






675 










680 


lie 


Pro 


Ser 


Arg 


Ala 


Glu 


Ser 


Ala 




690 










695 




Val 


His 


Glu 


Cys 


Asn 


Lys 


Glu 


Arg 


705 










710 






Lys 


Leu 


Ser 


Ser 


Lys 


Asp 


Gly 


His 










725 








Leu 


Asn 


Asp 


Leu 


Asn 


He 


Met 


Phe 








740 










Lys 


Lys 


Cys 


Ser 


Lys 


Val 


Asn 


Asp 






755 










760 


Asn 


Asp 


Phe 


Glu 


Lys 


Thr 


Cys 


Asp 




770 










775 




He 


He 


Gly 


Ala 


Gin 


Asp 


Pro 


Ser 


785 










790 






Asn 


Gly 


He 


Thr 


Arg 


He 


He 


Ser 



805 





330 










335 




Pro 


He 


Leu 


Gin 


Arg 


Pro 


Arg 


Asp 


345 










350 






Ala 


Tyr 


Phe 


Cys 


Leu 


Asp 


Asp 


Ser 










365 








Ser 


Cys 


Pro 


Leu 


Leu 


Lys 


Ser 


Gly 








380 










Ser 


Arg 


His 


Phe 


Asn 


Glu 


Phe 


Ser 






395 










400 


Ser 


Ser 


Asn 


Thr 


Cys 


Phe 


He 


Tyr 




410 










415 




He 


Val 


Ser 


Lys 


Leu 


Asn 


He 


Glu 


425 










430 






He 


Thr 


Lys 


Asp 


Leu 


Phe 


Glu 


Thr 










445 








Ala 


Thr 


Ala 


Met 


Ala 


Phe 


Cys 


His 








4 60 










Phe 


Tyr 


Phe 


Ser 


Val 


Tyr 


Leu 


Pro 






475 










480 


Thr 


He 


Lys 


Cys 


Ser 


Glu 


Thr 


Asp 




4 90 










495 




He 


Phe 


Ser 


Pro 


Pro 


Ser 


Asp 


Asn 


505 










510 






Asn 


Asn 


Asn 


Asn 


Asn 


Asn 


Asn 


Asn 










525 








Pro 


He 


Arg 


Asn 


Asn 


Asn 


He 


Ser 








540 










Gin 


Cys 


Met 


Ala 


Cys 


Lys 


Glu 


Arg 






555 










560 


Asn 


Tyr 


Pro 


Asp 


Arg 


Gly 


Asn 


Gin 




570 










575 




Glu 


Asp 


Phe 


Phe 


Lys 


He 


Leu 


Asn 


585 










590 






Lys 


Leu 


Ser 


Arg 


Val 


Leu 


He 


Pro 










605 








Ser 


Lys 


Phe 


Cys 


Asp 


Arg 


Ser 


Ser 








620 










Gly 


He 


Glu 


Pro 


Val 


Ser 


Thr 


Ser 






635 










640 


Thr 


Lys 


Leu 


Val 


Leu 


Tyr 


Gly 


Lys 




650 










655 




Ser 


Ala 


He 


Lys 


Thr 


Ser 


His 


Asn 


665 










670 






Ser 


Glu 


Glu 


Lys 


Asp 


Asn 


Lys 


Thr 










685 








Lys 


Asn 


Ala 


Phe 


Lys 


Asp 


He 


Leu 








700 










Ala 


Val 


Ser 


Tyr 


Phe 


Glu 


Gin 


Asn 






715 










720 


Leu 


Ser 


Asn 


Lys 


Trp 


Trp 


He 


Glu 




730 










735 




Glu 


Lys 


His 


Val 


Glu 


Asp 


Phe 


Tyr 


745 










750 






Ala 


Glu 


Ser 


Leu 


Lys 


Asp 


He 


Phe 










765 








Lys 


Tyr 


Lys 


Thr 


Ala 


Lys 


Arg 


Ala 








780 










Thr 


Ser 


Thr 


Pro 


Ser 


Lys 


Lys 


Glu 






795 










800 


Thr 


Leu 


Ser 


Glu 


Phe 


His 


Ser 


Lys 




810 










815 
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Asp 


Glu 


Ala 


Thr 


Val 


Ser 


Ala 


Leu 


Leu 


Asp 


Lys 


Thr 


Met 


Leu 


Leu 


Gly 








820 










825 










830 






Ser 


Arg 


Thr 


He 


Met 


Ser 


Gly 


Val 


Arg 


Cys 


Val 


He 


Arg 


Asn 


Asn 


Ser 






835 










840 










845 








Val 


Phe 


Ser 


Gly 


Phe 


Glu 


Asn 


Lys 


Asn 


Thr 


Asn 


Asn 


Asn 


Trp 


Glu 


Leu 




850 










855 










860 










Glu 


He 


Arg 


His 


Tyr 


Val 


He 


Ser 


Met 


Gly 


Gly 


Ala 


Ala 


Val 


Thr 


Lys 


865 










870 










875 










880 


He 


Ser 


Asp 


Glu 


Asp 


Leu 


Glu 


Gin 


Phe 


Thr 


Pro 


Val 


Arg 


Gly 


Ala 


Val 










885 










890 










895 




Ser 


Val 


Thr 


Thr 


Ala 


Pro 


Asn 


Asp 


Lys 


Leu 


Pro 


Val 


Gly 


Ala 


His 


Gin 








900 










905 










910 






Thr 


Trp 


Lys 


Asp 


Glu 


Gin 


Thr 


Leu 


Lys 


Thr 


Asn 


Thr 


Lys 


Arg 


Asn 


Ser 






915 










920 










925 








Leu 


Tyr 


Asp 


Ser 


Tyr 


Asn 


Ser 


Lys 


Arg 


Asn 


Asn 


Arg 


Asp 


Asn 


Asn 


Lys 




930 










935 










940 










He 


Lys 


Asn 


Arg 


Ser 


Leu 


Lys 


Leu 


Ser 


Asp 


Phe 


Asn 


Trp 


Arg 


Thr 


Pro 


945 










950 










955 










960 


Asn 


He 


Ser 


He 


Gin 


Glu 


Phe 


Asn 


Ala 


Asn 


Lys 


Asp 


Asp 


Val 


Asn 


Lys 










965 










970 










975 




Lys 


Arg 


Tyr 


Ala 


Glu 


Val 


Val 


Ala 


Ser 


Ala 


Ala 


Pro 


Lys 


Ser 


Pro 


Ser 








980 










985 










990 






Pro 


Thr 


Ser 


Ser 


Ser 


Ser 


Ser 


Asn 


Ser 


Asn 


Ser 


Ser 


Ser 


Pro 


Pro 


Leu 






995 










1000 








1005 






Ser 


Pro 


Leu 


Ser 


Pro 


Thr 


Val 


Lys 


Asn 


Ser 


Asn 


Asn 


Lys 


Pro 


Leu 


Tyr 




1010 








1015 








1020 








He 


Pro 


Pro 


His 


Lys 


Arg 


Met 


Thr 


Thr 


Thr 


Ala 


Val 











1025 1030 1035 



<210> 206 
<211> 1082 
<212> DNA 
<213> SHRIMP 



<400> 206 

atgaagatat 

taccacgtga 

ctcctccact 

aacttgtgtc 

cacttggacg 

ggcgcgaagc 

ccccacttaa 

cgcttggacg 

aagaacgtga 

gtgaagcagc 

cacttaaagg 

tgtctgttga 

gacgtaagag 

aagcagctcc 

aggaagaact 

aaggtgctct 

gtaagaggcg 

tttacgcacc 

aa 



gccagatttc 
agcagctcct 
taaaggtgcg 
tgctgaagaa 
tgaaaagcgc 
agctccccca 
aggtgcactt 
taagaggcgc 
aaagcgcgaa 
tcctccactt 
tgcacttgga 
agaagaacgt 
gcgcgaagca 
cccacttaaa 
tgtgtctgct 
tggacgtaag 
cgaagcagca 
tgttgctttg 



ttcccctacc 
ccacttaaag 
cttggacgta 
gaacgtgaaa 
gaagcagctc 
cttaaaggtg 
ggacgtaaga 
gaagcagaat 
gcagctcccc 
aaaggtgcgc 
cgtaagaggc 
gaaaagcgcg 
gctcccccac 
ggtgctcttg 
gaagaagaac 
aggcgcgaag 
gcagcagcta 
cctctacatg 



ctcaccctga 
gtgcacttgg 
agaggcgcga 
agcgcgaagc 
ccccacttaa 
cgcttggacg 
ggcgcgaagc 
ccttggagga 
cacttaaagg 
ttggacgtaa 
gcgaagcaga 
aagcagctcc 
ttaaaggtgc 
gacgtaagag 
gtgaaaagcg 
cagctccccc 
tgcctccccc 
gagtatggaa 



gcatcccatt 
acgtaaaagg 
agcagaatcc 
agctccccca 
aggtgcactt 
tgaaaagcgc 
agctccccca 
agaacttgtg 
tgcacttgga 
gaggcgcgaa 
a tccttggag 
cccacttaaa 
tcttggacgt 
gcgcgaagca 
cgaagcagct 
acttaaaggt 
tgaagacgat 
agcaccagaa 



ggagggcgtc 
cgtgaagcag 
ttggaggaag 
cttaaaggtg 
ggacgtaaga 
gaagcagctc 
cttaaaggtg 
tctgctgaag 
cgtaaaaggc 
gcagctcccc 
gaagaacttg 
ggtgctcttg 
aagaggcgcg 
gaatccttgg 
cccccactta 
gcacttggac 
ctcgacttct 
cctacaggtt 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1082 



<210> 207 
<2H> 361 
<212> PRT 
<213> SHRIMP 
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<220> 

<221> VARIANT 

<222> (1) . . . (361) 

<223> Xaa = Any Amino Acid 



<400> 207 



Met 


Lys 


lie 


Cys 


Gin 


He 


Ser 


Ser 


1 








5 








Leu 


Glu 


Gly 


Val 


Tyr 


His 


Val 


Lys 








20 










Leu 


Asp 


Val 


Lys 


Gly 


Val 


Lys 


Gin 






35 










40 


Asp 


Val 


Arg 


Gly 


Ala 


Lys 


Gin 


Asn 




50 










55 




Leu 


Lys 


Lys 


Asn 


Val 


Lys 


Ser 


Ala 


65 










70 






His 


Leu 


Asp 


Val 


Lys 


Ser 


Ala 


Lys 










85 








Leu 


Asp 


Val 


Arg 


Gly 


Ala 


Lys 


Gin 








100 










Asp 


Val 


Lys 


Ser 


Ala 


Lys 


Gin 


Leu 






115 










120 


Val 


Arg 


Gly 


Ala 


Lys 


Gin 


Leu 


Pro 




130 










135 




Arg 


Gly 


Ala 


Lys 


Gin 


Asn 


Pro 


Trp 


146 










150 






Lys 


Asn 


Val 


Lys 


Ser 


Ala 


Lys 


Gin 










165 








Asp 


Val 


Lys 


Gly 


Val 


Lys 


Gin 


Leu 








180 










Val 


Arg 


Gly 


Ala 


Lys 


Gin 


Leu 


Pro 






195 










200 


Arg 


Gly 


Ala 


Lys 


Gin 


Asn 


Pro 


Trp 




210 










215 




Lys 


Asn 


Val 


Lys 


Ser 


Ala 


Lys 


Gin 


225 










230 






Asp 


Val 


Arg 


Gly 


Ala 


Lys 


Gin 


Leu 










245 








Val 


Arg 


Gly 


Ala 


Lys 


Gin 


Leu 


Pro 








260 










Arg 


Gly 


Ala 


Lys 


Gin 


Asn 


Pro 


Trp 






275 










280 


Lys 


Asn 


Val 


Lys 


Ser 


Ala 


Lys 


Gin 




290 










295 




Asp 


Val 


Arg 


Gly 


Ala 


Lys 


Gin 


Leu 


305 










310 






Val 


Arg 


Gly 


Ala 


Lys 


Gin 


Gin 


Gin 










325 








He 


Ser 


Thr 


Ser 


Phe 


Thr 


His 


Leu 








340 










Gly 


Lys 


His 


Gin 


Asn 


Leu 


Gin 


Val 



355 360 



Pro 


Thr 


Leu 


Thr 


Leu 


Ser 


He 


Pro 




10 










15 




Gin 


Leu 


Leu 


His 


Leu 


Lys 


Val 


His 


25 










30 






Leu 


Leu 


His 


Leu 


Lys 


Val 


Arg 


Leu 










45 








Pro 


Trp 


Arg 


Lys 


Asn 


Leu 


Cys 


Leu 








60 










Lys 


Gin 


Leu 


Pro 


His 


Leu 


Lys 


Val 






75 










80 


Gin 


Leu 


Pro 


His 


Leu 


Lys 


Val 


His 




90 










95 




Leu 


Pro 


His 


Leu 


Lys 


Val 


Arg 


Leu 


105 










110 






Pro 


His 


Leu 


Lys 


Val 


His 


Leu 


Asp 










125 








His 


Leu 


Lys 


Val 


Arg 


Leu 


Asp 


Val 








140 










Arg 


Lys 


Asn 


Leu 


Cys 


Leu 


Leu 


Lys 






155 










160 


Leu 


Pro 


His 


Leu 


Lys 


Val 


His 


Leu 




170 










175 




Leu 


His 


Leu 


Lys 


Val 


Arg 


Leu 


Asp 


185 










190 






His 


Leu 


Lys 


Val 


His 


Leu 


Asp 


Val 










205 








Arg 


Lys 


Asn 


Leu 


Cys 


Leu 


Leu 


Lys 








220 










Leu 


Pro 


His 


Leu 


Lys 


Val 


Leu 


Leu 






235 










240 


Pro 


His 


Leu 


Lys 


Val 


Leu 


Leu 


Asp 




250 










255 




His 


Leu 


Lys 


Val 


Leu 


Leu 


Asp 


Val 


265 










270 






Arg 


Lys 


Asn 


Leu 


Cys 


Leu 


Leu 


Lys 










285 








Leu 


Pro 


His 


Leu 


Lys 


Val 


Leu 


Leu 








300 










Pro 


His 


Leu 


Lys 


Val 


His 


Leu 


Asp 






315 










320 


Gin 


Leu 


Cys 


Leu 


Pro 


Leu 


Lys 


Thr 




330 










335 




Leu 


Leu 


Cys 


Leu 


Tyr 


Met 


Glu 


Tyr 


345 










350 







Xaa 



<210> 208 
<211> 816 
<212> DNA 
<213> SHRIMP 



<400> 208 

atggacgtcg agttcggttt cttccacggt ctgctctcca aggcccttct cccagatgaa 60 
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aaacatcaac 
ggctgctgct 
caactaatag 
tctaatcttt 
tcccattcta 
ttgcctcatt 
cttcaaccca 
gacgacaatg 
aaaactattg 
ggtaatgaat 
tacgacactg 
agcacgtgca 
aaacaagatg 



ccgttataag 
cgttctgtgg 
acgtttg ttc 
caaccagttg 
gttttctgga 
ctttacgtgc 
tagaagaaga 
ttgatgatgg 
gagaaaaatc 
ttataaggat 
aagctaaatt 
catgtaagga 
gaatggataa 



gcgtctttgt 
aagaagagga 
ctttatagga 
ttctagacta 
tgtggtatat 
catttggaat 
ggatattggt 
tgaccaacaa 
aaccattgaa 
tcct ttaaga 
tgttaaccac 
acgtcttatt 
accaacggac 



gcggatgatt 
acaggagaga 
actgtctcat 
caaaaaacgt 
ccaagtttga 
aaacaacttc 
tataaggatt 
gaacaaatga 
ctgataggca 
gaaactgcgt 
aaggactcta 
gatttttctg 
aagtag 



ctagaaataa 
gcaatactgc 
ctattggtac 
cagacagtta 
agaaaacaac 
caaagttgta 
atgttgtttc 
ttattgatga 
tgtataacaa 
tgcacgcaca 
tacctctatt 
agagacaact 



gggagaggat 
gtgccttgaa 
aa ttatcaat 
tgcggcatta 
tgaagacgta 
tgaaaaaact 
aattgaagac 
agaatcttat 
taacaagttt 
atctctgagg 
ttatgaaaac 
acaacaacta 



120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
816 



<210> 209 

<211> 269 

<212> PRT 

<213> SHRIMP 



<400> 209 
Met Asp Val Glu 
1 

Leu Pro Asp Glu 
20 

Asp Ser Arg Asn 
35 

Arg Gly Thr Gly 
50 

Val Cys Ser Phe 
65 

Ser Asn Leu Ser 

Tyr Ala Ala Leu 
100 

Leu Lys Lys Thr 
115 

Trp Asn Lys Gin 
130 

Glu Glu Glu Asp 
145 

Asp Asp Asn Val 

Glu Glu Ser Tyr 
180 

Gly Met Tyr Asn 
195 

Leu Arg Glu Thr 
210 

Ala Lys Phe Val 
225 

Ser Thr Cys Thr 

Gin Leu Lys Gin 

260 



Phe Gly Phe Phe 
5 

Lys His Gin Pro 

Lys Gly Glu Asp 
40 

Glu Ser Asn Thr 
55 

lie Gly Thr Val 
70 

Thr Ser Cys Ser 
85 

Ser His Ser Ser 

Thr Glu Asp Val 
120 

Leu Pro Lys Leu 
135 

lie Gly Tyr Lys 
150 

Asp Asp Gly Asp 
165 

Lys Thr lie Gly 

Asn Asn Lys Phe 
200 

Ala Leu His Ala 
215 

Asn His Lys Asp 
230 

Cys Lys Glu Arg 
245 

Asp Gly Met Asp 



His Gly Leu Leu 
10 

Val lie Arg Arg 
25 

Gly Cys Cys Ser 

Ala Cys Leu Glu 
60 

Ser Ser lie Gly 
75 

Arg Leu Gin Lys 
90 

Phe Leu Asp Val 
105 

Leu Pro His Ser 

Tyr Glu Lys Thr 
140 

Asp Tyr Val Val 
155 

Gin Gin Glu Gin 
170 

Glu Lys Ser Thr 
185 

Gly Asn Glu Phe 

Gin Ser Leu Arg 
220 

Ser lie Pro Leu 
235 

Leu lie Asp Phe 
250 

Lys Pro Thr Asp 
265 



Ser Lys Ala Leu 
15 

Leu Cys Ala Asp 
30 

Phe Cys Gly Arg 
45 

Gin Leu lie Asp 

Thr lie lie Asn 
80 

Thr Ser Asp Ser 
95 

Val Tyr Pro Ser 
110 

Leu Arg Ala He 
125 

Leu Gin Pro He 

Ser He Glu Asp 
160 

Met lie He Asp 
175 

He Glu Leu He 
190 

He Arg He Pro 
205 

Tyr Asp Thr Glu 

Phe Tyr Glu Asn 
240 

Ser Gin Leu Gin 
255 

Lys 



<210> 210 
<211> 3813 
<212> DNA 
<213> SHRIMP 
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<400> 210 

atggtgctaa 

aaggaagatg 

attgcaagac 

tggaacgaaa 

gatatcatat 

gataaattgg 

gtggatgaaa 

aaggttatga 

acatctattt 

ctgaaaagcg 

cttccccacc 

gctggaacag 

tcctttagaa 

gaagaagaag 

ggacacaacg 

tatgactatt 

ggcacattca 

acagaattag 

tatggggagt 

gtggtagatt 

gtgtataaga 

ttggccgtta 

gaagagatgg 

cctgcagaag 

tctgtgttaa 

gtgttaaccg 

tttgaagcat 

aggaatactg 

ccccctagcc 

tccctgcaaa 

ggaggatttg 

tgtgatggag 

catgaaaaaa 

gtgcttcaga 

atctacgctg 

gggattgctc 

acggttgaag 

ctcaccacat 

gaaggactga 

ttaaggtctg 

aaaaatgaca 

tctttttcgt 

tttgctgcag 

ctcgacaaat 

agttatttga 

aaggctacag 

aactttggaa 

gccgctgtcg 

agccctaaag 

ggatgggcta 

gatgcatcat 

tcgaaatata 

ggatcttggg 

gatttgccta 

tcttccgttc 

gacgccgtgg 

atatttggag 

aaaactggcc 

gcagggtcgg 

cagactgaac 



ctctttcctg 
cggtgttggg 
ctccctcaat 
gtgggggaga 
ccttagtgga 
cagataaaac 
aattgacaat 
aaaaggaccc 
tattggacag 
ccgaaagagt 
atccgagcct 
tcatcgcatc 
aatggggtcc 
aagaagaaaa 
ggcaacatta 
tggcctacag 
atatggagga 
agtacaaaag 
ttttatgtta 
ctctagaaaa 
acactgctaa 
ggagtatggc 
aagaagaaga 
tagacttcct 
ataactttgt 
atgatactgt 
tatcttccca 
atatacaaag 
gtgtccgtgg 
cgtatggcgg 
aagataccga 
gaaaagtgct 
cagcagaaac 
ctgcaagtaa 
ttttgcgtga 
tcaaggataa 
gagtcaaggg 
atagtgcaga 
cagctgcaac 
atgcgaatct 
cctctcgttc 
cgtttgcaag 
agatgagaga 
tcgaagagac 
cagagaatcc 
attcagtatt 
aggttacaat 
ccctggtgca 
gtgtcatcag 
agccaaccaa 
tcttaactga 
aggcctacgc 
caacattcga 
aaggacaatc 
ttttgtccat 
tggggttggc 
tgcctcttgg 
tcattgtatt 
gcccgctaac 
aattcgaaga 



tacgactagg 
gaaccagttc 
agaatctttt 
aaaaattttc 
aagtgtatat 
atgtctaacc 
cttatcaggt 
caaaaaaata 
gttaaaaaca 
agaagtactt 
atttgataat 
agatatgcat 
ctgtggagat 
gaaacattcc 
tcataggccc 
gaatctccct 
tttacctttc 
gtttgtgcaa 
ctgtatcttc 
tagttccatg 
ttacaaaagg 
agaatttatc 
agaggaagaa 
ttcagtgcct 
ggagaacgat 
atcaggaaca 
tttattcctt 
aaccctattt 
aagcaatgtc 
tatagaagag 
catgtattcc 
aagtgtcacc 
ttttaaggat 
gaccggaaac 
aaatggcatt 
aacaaatcaa 
atttttcagc 
aaccttcaga 
agtggcagaa 
aggtctagaa 
tttattggaa 
ggacatgggc 
aacatacccc 
tataccagaa 
agaaagggct 
ggggaagaaa 
tgtagtaggg 
tgcgtccaga 
ttataaaatt 
gcacccgttc 
aaatggagca 
accaatctgt 
cgacccgcat 
cctctcctgc 
aggaaaggat 
gagcaaggca 
atttggtata 
ttcaatacta 
cctgaattgg 
cgggggagga 



cgcgtagcgt 
cccattttaa 
tctgcatcag 
gacatatctc 
gaacctgtat 
gcggctgcct 
agtgatggga 
gcagaatctc 
gccaagaaac 
catcggttga 
tttagtggag 
ttcaaattgg 
aaaactgaaa 
atatcaagat 
gaaagtgctt 
aatgagtaca 
cgccctttcg 
tcaacaaatc 
ggagcagatt 
atatcatttg 
ttggggaaga 
cgcactgaag 
gaagcggaag 
catttacgcc 
ctttctatat 
gatacggaca 
tcaagaatat 
tcaacgcacg 
aactttaata 
t tgcctgaaa 
ggagaggatg 
t tcaattgtg 
gatacggatt 
cttaacaaaa 
gagcgccccg 
ccccctcccc 
ggttttcgtg 
gatttaggcc 
acatccttct 
t tttcagagg 
gaaactaggg 
gtccaagtta 
gatgcagccc 
agtcaagtga 
ggcaaagaaa 
ctaggcaatg 
gcttctgtgg 
ggggcacatc 
gtggactttt 
agggaagaaa 
tatgtattcc 
ggaacaaaag 
tctgtattgc 
gataaaggga 
gttgcagagg 
a tttcagctg 
gctgctacac 
ctagtggtca 
tttggtgcaa 
aatcgttcaa 



ctagcaaggg 
agaaatcaaa 
tggaaaaaat 
agaatgaaga 
tttctaaatc 
ttgcagcact 
gtgtgcttca 
ttttaaataa 
ttctaagcag 
ataaactcaa 
gaaaaacatc 
ttgaacatat 
gcggggaaga 
tcgtgcttca 
ctgtttactt 
aattatcgtc 
cagtaccttc 
ttccccagct 
ggtacaaaca 
attctcagac 
aaagaaacgg 
cgcataaggc 
aagaagcgat 
gtaaaattcg 
tggtttctaa 
actttggttc 
tggatgaagt 
tgtctctgtc 
ataacgctgg 
atgtactagt 
ttgttgtcgt 
gtgataattt 
tagttgaacg 
aagcatattc 
gggacgattt 
ctgcaagaag 
acattttgga 
aaggcatagt 
ctgaaggttt 
acgccaaaac 
cattaagagc 
gtgccgattt 
ttgaacaaaa 
agaaactaaa 
ttaacgacac 
cagttacagt 
tggccgggtt 
tcaacgtcgt 
cttgtgcaga 
tagaccatgt 
ctgaagacgg 
atgctgctca 
cttgggtggc 
tgtccacttt 
ctatttttga 
taa taaataa 
gcctcaatcc 
tactgatagt 
agaattcagc 
aaatagtatt 



gaatttctct 
caacttgtca 
attcagggaa 
agaatggatg 
acttaaacct 
agcttctgcc 
acgtacaaca 
tgaaaaatgg 
acgaggtgca 
ggaggctcct 
agcagtatct 
ttttaaggtc 
agaagatgag 
atttatgaac 
ttgtgattat 
aatgcatcct 
aacttataag 
aagtttcgac 
cctgggggat 
attgagtggt 
aatagccgat 
attgacagca 
ggaccaggag 
tcaagctgtt 
cttcaagaat 
tagtggagaa 
gcacattctt 
ggataaatct 
gaacatttct 
cggtttgtcc 
atgggatggt 
tatccagctc 
aataagagat 
aaggaagaac 
tacagaaaag 
tgccaagata 
gacgagggcg 
aaaagagacc 
agctgaaagt 
ggttgtattc 
aaacaatact 
agatgctgaa 
cttgaaagat 
gaaaatagac 
tgaactgtca 
gttgatgaac 
tttaggtcca 
ggaccacacc 
tagaaacacc 
tatcgcacta 
aggacccaaa 
aggagaatgt 
aagcatgaaa 
aaaggcagtt 
ggttgcagag 
ccccttgttc 
atccaactgg 
tcgcttcttt 
taaaaggaaa 
ggcagaaaag 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 
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gacaacgcca atagtaaact tcaatcgagg aggaatgaaa ctgggcccat gagattagag 3660 

gagcttcctg ggcatgaaga tttgcgccca gttttcttcc ctgccacaac aaattattcc 3720 

aaatctgcca agattctggg ctacaaatct aaacccttca acgactttta tacaaaaata 3780 

ataaacacag acatcataaa aatggatagg taa 3813 



<210> 211 
<211> 1264 
<212> PRT 
<213> SHRIMP 



<400> 211 
Met Val Leu Thr 
1 

Gly Asn Phe Ser 
20 

Leu Lys Lys Ser 
35 

Ser Phe Ser Ala 
50 

Gly Gly Glu Lys 
65 

Asp lie lie Ser 

Ser Leu Lys Pro 
100 

Ala Phe Ala Ala 
115 

Ser Asp Gly Ser 
130 

Pro Lys Lys lie 
145 

lie Leu Leu Asp 

Gly Ala Leu Lys 
180 

Lys Leu Lys Glu 
195 

Phe Ser Gly Gly 
210 

Ser Asp Met His 

225 

Arg Lys Trp Gly 

Asp Glu Glu Glu 
260 

Val Leu Gin Phe 
275 

Glu Ser Ala Ser 
290 

Arg Asn Leu Pro 
305 

Phe Asn Met Glu 

Tyr Lys Thr Glu 
340 

Pro Gin Leu Ser 
355 

Gly Ala Asp Trp 
370 

Asn Ser Ser Met 
385 



Leu 


Ser 


Cys 


Thr 


.5 








Lys 


Glu 


Asp 


Ala 


Asn 


Asn 


Leu 


Ser 








40 


Ser 


Val 


Glu 


Lys 






55 




He 


Phe 


Asp 


He 




70 






Leu 


Val 


Glu 


Ser 


85 








Asp 


Lys 


Leu 


Ala 


Ser 


Ala 


Val 


Asp 








120 


Val 


Leu 


Gin 


Arg 






135 




Ala 


Glu 


Ser 


Leu 




150 






Arg 


Leu 


Lys 


Thr 


165 








Ser 


Ala 


Glu 


Arg 


Ala 


Pro 


Leu 


Pro 








200 


Lys 


Thr 


Ser 


Ala 






215 




Phe 


Lys 


Leu 


Val 




230 






Pro 


Cys 


Gly 


Asp 


245 








Glu 


Glu 


Glu 


Glu 


Met 


Asn 


Gly 


His 








280 


Val 


Tyr 


Phe 


Cys 






295 




Asn 


Glu 


Tyr 


Lys 




310 






Asp 


Leu 


Pro 


Phe 


325 








Leu 


Glu 


Tyr 


Lys 


Phe 


Asp 


Tyr 


Gly 








360 


Tyr 


Lys 


His 


Leu 






375 




He 


Ser 


Phe 


Asp 




390 







Thr Arg Arg Val 
10 

Val Leu Gly Asn 
25 

He Ala Arg Pro 

He Phe Arg Glu 
60 

Ser Gin Asn Glu 
75 

Val Tyr Glu Pro 
90 

Asp Lys Thr Cys 
105 

Glu Lys Leu Thr 

Thr Thr Lys Val 
140 

Leu Asn Asn Glu 
155 

Ala Lys Lys Leu 
170 

Val Glu Val Leu 
185 

His His Pro Ser 

Val Ser Ala Gly 
220 

Glu His He Phe 

235 

Lys Thr Glu Ser 
250 

Lys Lys His Ser 
265 

Asn Gly Gin His 

Asp Tyr Tyr Asp 
300 

Leu Ser Ser Met 
315 

Arg Pro Phe Ala 
330 

Arg Phe Val Gin 
345 

Glu Phe Leu Cys 

Gly Asp Val Val 
380 

Ser Gin Thr Leu 
395 



Ala Ser Ser Lys 
15 

Gin Phe Pro He 
30 

Pro Ser He Glu 
45 

Trp Asn Glu Ser 

Glu Glu Trp Met 
80 

Val Phe Ser Lys 
95 

Leu Thr Ala Ala 
110 

He Leu Ser Gly 
125 

Met Lys Lys Asp 

Lys Trp Thr Ser 
160 

Leu Ser Arg Arg 
175 

His Arg Leu Asn 
190 

Leu Phe Asp Asn 
205 

Thr Val He Ala 

Lys Val Ser Phe 
240 

Gly Glu Glu Glu 
255 

He Ser Arg Phe 
270 

Tyr His Arg Pro 

285 

Tyr Leu Ala Tyr 

His Pro Gly Thr 
320 

Val Pro Ser Thr 
335 

Ser Thr Asn Leu 
350 

Tyr Cys He Phe 
365 

Asp Ser Leu Glu 

Ser Gly Val Tyr 
400 
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Lys 


Asn 


Thr 


Ala 


Asn 


Tyr 


Lys 


Arg 


Leu 


Gly 


Lys 


Lys 


Arg 


Asn 


Gly 


He 










405 










410 










415 




Ala 


Asp 


Leu 


Ala 


Val 


Arg 


Ser 


Met 


Ala 


Glu 


Phe 


He 


Arg 


Thr 


Glu 


Ala 








420 










425 










430 






His 


Lys 


Ala 


Leu 


Thr 


Ala 


Glu 


Glu 


Met 


Glu 


Glu 


Glu 


Glu 


Glu 


Glu 


Glu 






435 










440 










445 








Glu 


Ala 


Glu 


Glu 


Glu 


Ala 


Met 


Asp 


Gin 


Glu 


Pro 


Ala 


Glu 


Val 


Asp 


Phe 




450 










455 










460 










Leu 


Ser 


Val 


Pro 


His 


Leu 


Arg 


Arg 


Lys 


He 


Arg 


Gin 


Ala 


Val 


Ser 


Val 


4 65 










470 










475 










480 


Leu 


Asn 


Asn 


Phe 


Val 


Glu 


Asn 


Asp 


Leu 


Ser 


He 


Leu 


Val 


Ser 


Asn 


Phe 










485 










490 










495 




Lys 


Asn 


Val 


Leu 


Thr 


Asp 


Asp 


Thr 


Val 


Ser 


Gly 


Thr 


Asp 


Thr 


Asp 


Asn 








500 










505 










510 






Phe 


Gly 


Ser 


Ser 


Gly 


Glu 


Phe 


Glu 


Ala 


Leu 


Ser 


Ser 


His 


Leu 


Phe 


Leu 






515 










520 










525 








Ser 


Arg 


He 


Leu 


Asp 


Glu 


Val 


His 


He 


Leu 


Arg 


Asn 


Thr 


Asp 


He 


Gin 




530 










535 










540 










Arg 


Thr 


Leu 


Phe 


Ser 


Thr 


His 


Val 


Ser 


Leu 


Ser 


Asp 


Lys 


Ser 


Pro 


Pro 


545 










550 










555 










560 


Ser 


Arg 


Val 


Arg 


Gly 


Ser 


Asn 


Val 


Asn 


Phe 


Asn 


Asn 


Asn 


Ala 


Gly 


Asn 










565 










570 










575 




lie 


Ser 


Ser 


Leu 


Gin 


Thr 


Tyr 


Gly 


Gly 


He 


Glu 


Glu 


Leu 


Pro 


Glu 


Asn 








580 










585 










590 






Val 


Leu 


Val 


Gly 


Leu 


Ser 


Gly 


Gly 


Phe 


Glu 


Asp 


Thr 


Asp 


Met 


Tyr 


Ser 






595 










600 










605 








Gly 


Glu 


Asp 


Val 


Val 


Val 


Val 


Trp 


Asp 


Gly 


Cys 


Asp 


Gly 


Gly 


Lys 


Val 




610 










615 










620 










Leu 


Ser 


Val 


Thr 


Phe 


Asn 


Cys 


Gly Asp 


Asn 


Phe 


He 


Gin 


Leu 


His 


Glu 


625 










630 










635 










640 


Lys 


Thr 


Ala 


Glu 


Thr 


Phe 


Lys 


Asp 


Asp 


Thr 


Asp 


Leu 


Val 


Glu 


Arg 


He 










645 










650 










655 




Arg 


Asp 


Val 


Leu 


Gin 


Thr 


Ala 


Ser 


Lys 


Thr 


Gly 


Asn 


Leu 


Asn 


Lys 


Lys 








660 










665 










670 






Ala 


Tyr 


Ser 


Arg 


Lys 


Asn 


He 


Tyr 


Ala 


Val 


Leu 


Arg 


Glu 


Asn 


Gly 


He 






675 










680 










685 








Glu 


Arg 


Pro 


Gly 


Asp 


Asp 


Phe 


Thr 


Glu 


Lys 


Gly 


He 


Ala 


Leu 


Lys 


Asp 




690 










695 










700 










Lys 


Thr 


Asn 


Gin 


Pro 


Pro 


Pro 


Pro 


Ala 


Arg 


Ser 


Ala 


Lys 


He 


Thr 


Val 


705 










710 










715 










720 


Glu 


Gly 


Val 


Lys 


Gly 


Phe 


Phe 


Ser 


Gly 


Phe 


Arg 


Asp 


He 


Thr 


Arg 


Ala 










725 










730 










735 




Leu 


Thr 


Thr 


Tyr 


Ser 


Ala 


Glu 


Thr 


Phe 


Arg 


Asp 


Leu 


Gly 


Gin 


Gly 


He 








740 










745 










750 






Val 


Lys 


Glu 


Thr 


Glu 


Gly 


Leu 


Thr 


Ala 


Ala 


Thr 


Val 


Ala 


Glu 


Thr 


Ser 






755 










760 










765 








Phe 


Ser 


Glu 


Gly 


Leu 


Ala 


Glu 


Ser 


Leu 


Arg 


Ser 


Asp 


Ala 


Asn 


Leu 


Gly 




770 










775 










780 










Leu 


Glu 


Phe 


Ser 


Glu 


Asp 


Ala 


Lys 


Thr 


Val 


Val 


Phe 


Lys 


Asn 


Asp 


Thr 


785 










790 










795 










800 


Ser 


Arg 


Ser 


Leu 


Leu 


Glu 


Glu 


Thr 


Arg 


Ala 


Leu 


Arg 


Ala 


Asn 


Asn 


Thr 










805 










810 










815 




Ser 


Phe 


Ser 


Ser 


Phe 


Ala 


Arg 


Asp 


Met 


Gly 


Val 


Gin 


Val 


Ser 


Ala 


Asp 








820 










825 










830 






Leu 


Asp 


Ala 


Glu 


Phe 


Ala 


Ala 


Glu 


Met 


Arg 


Glu 


Thr 


Tyr 


Pro 


Asp 


Ala 






835 










840 










845 








Ala 


Leu 


Glu 


Gin 


Asn 


Leu 


Lys 


Asp 


Leu 


Asp 


Lys 


Phe 


Glu 


Glu 


Thr 


He 




850 










855 










860 










Pro 


Glu 


Ser 


Gin 


Val 


Lys 


Lys 


Leu 


Lys 


Lys 


He 


Asp 


Ser 


Tyr 


Leu 


Thr 


865 










870 










875 










880 


Glu 


Asn 


Pro 


Glu 


Arg 


Ala 


Gly 


Lys 


Glu 


He 


Asn 


Asp 


Thr 


Glu 


Leu 


Ser 
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885 










890 










895 




Lys 


Ala 


Thr 


Asp 
900 


Ser 


Val 


Leu 


Gly 


Lys 
905 


Lys 


Leu 


Gly 


Asn 


Ala 
910 


Val 


Thr 


Val 


Leu 


Met 
915 


Asn 


Asn 


Phe 


Gly 


Lys 
920 


Val 


Thr 


He 


Val 


Val 
925 


Gly 


Ala 


Ser 


Val 


Val 
930 


Ala 


Gly 


Phe 


Leu 


Gly 
935 


Pro 


Ala 


Ala 


Val 


Val 
940 


His 


Ala 


Ser 


Arg 


Gly 


Ala 


His 


Leu 


Asn 


Val 


Val 


Asp 


His 


Thr 


Ser 


Pro 


Lys 


Gly 


Val 


He 


945 










950 










955 










960 


Ser 


Tyr 


Lys 


He 


Val 
965 


Asp 


Phe 


Ser 


Cys 


Ala 
970 


Asp 


Arg 


Asn 


Thr 


Gly 
975 


Trp 


Ala 


Lys 


Pro 


Thr 
980 


Lys 


His 


Pro 


Phe 


Arg 
985 


Glu 


Glu 


He 


Asp 


His 
990 


Val 


He 


Ala 


Leu 


Asp 


Ala 


Ser 


Phe 


Leu 


Thr 


Glu 


Asn 


Gly 


Ala 


Tyr 


Val 


Phe 


Pro 






995 










1000 








1005 






Glu 


Asp 


Gly 


Gly 


Pro 


Lys 


Ser 


Lys 


Tyr 


Lys 


Ala 


Tyr 


Ala 


Pro 


He 


Cys 




1010 








1015 








1020 








Gly 


Thr 


Lys 


Asp 


Ala 


Ala 


Gin 


Gly 


Glu 


Cys 


Gly 


Ser 


Trp 


Ala 


Thr 


Phe 


1025 








1030 








1035 








1040 


Asp 


Asp 


Pro 


His 


Ser 


Val 


Leu 


Pro 


Trp 


Val 


Ala 


Ser 


Met 


Lys 


Asp 


Leu 










1045 








1050 








1055 


Pro 


Lys 


Gly 


Gin 


Ser 


Leu 


Ser 


Cys 


Asp 


Lys 


Gly 


Met 


Ser 


Thr 


Leu 


Lys 








1060 








1065 








1070 




Ala 


Val 


Ser 


Ser 


Val 


Leu 


Leu 


Ser 


He 


Gly 


Lys 


Asp 


Val 


Ala 


Glu 


Ala 






1075 








1080 








1085 






He 


Phe 


Glu 


Val 


Ala 


Glu 


Asp 


Ala 


Val 


Val 


Gly 


Leu 


Ala 


Ser 


Lys 


Ala 




1090 








1095 








1100 








He 


Ser 


Ala 


Val 


He 


Asn 


Asn 


Pro 


Leu 


Phe 


He 


Phe 


Gly 


Val 


Pro 


Leu 


1105 








1110 








1115 








1120 


Gly 


Phe 


Gly 


He 


Ala 


Ala 


Thr 


Arg 


Leu 


Asn 


Pro 


Ser 


Asn 


Trp 


Lys 


Thr 










1125 








1130 








1135 


Gly 


Leu 


He 


Val 


Phe 


Ser 


He 


Leu 


Leu 


Val 


Val 


He 


Leu 


He 


Val 


Arg 








1140 








1145 








1150 




Phe 


Phe 


Ala 


Gly 


Ser 


Gly 


Pro 


Leu 


Thr 


Leu 


Asn 


Trp 


Phe 


Gly 


Ala 


Lys 






1155 








1160 








1165 






Asn 


Ser 


Ala 


Lys 


Arg 


Lys 


Gin 


Thr 


Glu 


Gin 


Phe 


Glu 


Asp 


Gly 


Gly 


Gly 




1170 








1175 








1180 








Asn 


Arg 


Ser 


Lys 


He 


Val 


Leu 


Ala 


Glu 




Asp 


Asn 


Ala 


Asn 


Ser 


Lys 


1185 








1190 








1195 








1200 


Leu 


Gin 


Ser 


Arg 


Arg 


Asn 


Glu 


Thr 


Gly 


Pro 


Met 


Arg 


Leu 


Glu 


Glu 


Leu 










1205 








1210 








1215 


Pro 


Gly 


His 


Glu 


Asp 


Leu 


Arg 


Pro 


Val 


Phe 


Phe 


Pro 


Ala 


Thr 


Thr 


Asn 








1220 








1225 








1230 




Tyr 


Ser 


Lys 


Ser 


Ala 


Lys 


He 


Leu 


Gly 


Tyr 


Lys 


Ser 


Lys 


Pro 


Phe 


Asn 



1235 1240 1245 



Asp Phe Tyr Thr Lys He He Asn Thr Asp He He Lys Met Asp Arg 
1250 1255 1260 



<210> 212 
<211> 3744 
<212> DNA 
<213> SHRIMP 



<400> 212 

atgggcgcgc 

tatcttgtta 

tctgactct t 

tgtcccttta 

ctaattaaca 

tttcgcta tt 



ctaccaacgc tgattttaca cgcacggtat caggagtagc ttcttctctc 60 

accctggagc tccttccgat agagaaaagt tagttttggc ctcttcttat 120 

ttgtgtacaa ctacaaggat gcagtggtga ccgctgaggc tcccaagtgg 180 

acgagccagc tcttcatgag cacatcatga acagacttga aaaagctggt 24 0 

gatctcgttt tgtgtgtaac cctgttaaat cggctggaga agtatgcgga 300 

ctggaggaag tactccccag aacttaattt tcccgattgg agcatcagag 360 
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ataaattggt 
aaagacgcca 
aagaatgcaa 
agatcagcgg 
gaggtaagaa 
catttctgta 
ttctggacga 
aatgaaacta 
gtaaaaaacc 
gctgccatca 
attaaaacat 
gggaatttgt 
aaagcaggca 
catttaatag 
cttgctcttt 
ctagaagtga 
gcctacaaga 
gctacaattt 
gtacctctcc 
gtggatatgg 
ttacatagaa 
agactaagag 
gaagaagagg 
gaagaagatg 
gagggtgaag 
tcctccactc 
aagggaaaaa 
aacgatgacc 
aaatatgtta 
gttacaaatc 
cacgcaaata 
tactatgaaa 
attgagcggg 
gtagactcta 
ccttcattat 
ttattcacaa 
gatcttggaa 
tgtctagtgg 
ctagaaaagg 
gaattgaacg 
ccagaacctg 
gccatggaaa 
ttttgcaaga 
tttattgacc 
aaaaaataca 
aataaatggg 
cctgttgatt 
ccttcacgga 
accgagagca 
gtgttggaag 
ttgggaggag 
gaagacgttc 
atagaacaac 
tttgcttcat 
aattctgagg 
caattattgg 
aacagtgtca 



tggtcatact 
tcgaacgcat 
tgaggagact 
gcatgattcg 
gaaggcaact 
aagaatcaac 
tcgttgaagc 
atgccatccc 
aacgcgacac 
aagaacatga 
gtctaacaga 
ccccggccca 
tatgggttat 
ggtacgaagc 
ttatatctca 
tgaaagaaag 
acaatactaa 
atggtcaaat 
ataaatcgag 
gtttggcaaa 
gaaaaatgca 
gtgaagaaga 
aagaagaagc 
atggtgagga 
atatgaatgg 
aacaacctcc 
ggttaaagga 
gccaagtgaa 
aaggtggaaa 
tactagattc 
ggcgatcaga 
gaaacgtatc 
caagagaaag 
acaaccctga 
caggggtaga 
gtcttgtcac 
gatttatcgt 
acggagatat 
gtacagtgaa 
tggccactgc 
gaatcaagca 
agggtcgagg 
tgctgaagtg 
ctgcgtcacg 
agaacattga 
tgggaactgg 
tttacacaat 
aaccaaatga 
gggacgcttc 
tcaacggaga 
aagaagaaga 
aagtacagga 
atcaacaggg 
taccagctga 
agggggaaaa 
atttaataga 
ccattcaaga 



tagaaatgct 
tcctgatcta 
tcgtgtatgg 
ctacgaaccc 
aaaaggaagt 
tcttctggct 
tgtcatccgg 
tgaagactct 
aaatcaagga 
aagcatagga 
ggagcaacga 
tctaacaaat 
tatgcagagt 
tcaacgactt 
gaggggaatt 
agctgcaaat 
cgagtctaat 
gtctgattgt 
gatgaacatg 
gatacgtgta 
agaaaaggca 
agaagaagaa 
tggaacatca 
ggaggaggag 
ggaaaataac 
ccaaaagcgt 
aagaga taat 
tgtggagagc 
atgtggagat 
agtgttccat 
aactggcgta 
aaaagacagt 
ggatgcttcc 
agatgtggaa 
agattctctc 
caatccaatt 
attaaataac 
gcctatgctt 
aaacacgaga 
tcaatctgca 
aatgttactc 
agggagatcg 
tttattcttc 
tgccacctta 
ctgggtaaag 
agaatacact 
tttgaaatac 
cgtgtattac 
atgtgaaagc 
cggacgtgtc 
agaggaaata 
cagtaatcta 
tggagaagaa 
taacgattcc 
aagtccagaa 
tagcgatgga 
atga 



gctcgttttg 
agagaaggtg 
cgagccttta 
ttttcggttg 
tatgagcgaa 
gatatttcaa 
tacaagaata 
tctataaact 
gatgattcta 
gaaaaaagga 
gaaatgattt 
ctggccgatg 
cttttaaagc 
ttaatgttca 
ggtgacgtgt 
gcaaaaataa 
aatttgggca 
aatgcaattc 
caagacattg 
gcggaagaaa 
gccagggagc 
gaggaggagg 
ggtgttaatg 
gaagaggaag 
tcaagaaaac 
cagcgaggta 
attggaggat 
atacaaaaac 
aatggactac 
t tcagaaagg 
tacaccacaa 
agtaataata 
tgtgccgaat 
caacgtatgc 
gctatcgaaa 
t tcaatgccg 
attgtaaaat 
ctagactcaa 
aaatttttca 
ggacaccaat 
cctgattgta 
gcccttcaca 
aa tattgacc 
tttaggcttg 
gatttactag 
aacattggac 
acaatgattg 
agtactattg 
taccgcccga 
ccttaccctt 
acaggcgtta 
ttcgatgtgg 
gaggaacttc 
tcctcacccg 
ccgtacaata 
agaaacaata 



gtacagtggc 
gtacaagtaa 
actggatagc 
gctgtgctct 
atgtactatt 
gagggggaag 
gacatgctcg 
tggagtggga 
cgttagaaaa 
aaaaacatat 
ttaagggagt 
cgatactggc 
aaatcaattt 
agttgtatat 
ttctgaatgg 
gagacatggt 
attattccca 
ctctatctat 
aaaagaccat 
atagagtagc 
gagcagctgc 
aggaagaggg 
gttctggtta 
aagaggatga 
gtaaaaacac 
aaaatgcgcc 
ttttattagc 
ttttgacggc 
cagaattgtt 
ggtctattct 
aggcaaactg 
caccccattc 
ctaacaaacg 
gggaattgat 
gagtacttca 
tgttaggtgc 
ttatgaatat 
gaggcaagac 
aaccaaatat 
acatgaacgc 
taatgaaact 
gacagaaatg 
cttcaaatgc 
atgatctttg 
accctgtaat 
gtgactcaaa 
atgatggtgt 
aaagagcgga 
ccttgtttga 
ccagtgaaac 
ttgacgacag 
aattattcga 
cttcagcaat 
cgcatattcc 
tttttgattc 
accctaaaag 



agcatcggcc 
acatgtggca 
ggaagcctcc 
atacgatcac 
tcttggtgac 
aagttctgat 
aacaatcagt 
ggatgttcta 
gaccttagaa 
cctagagttt 
tggaggaaaa 
taataatgcc 
ctctatactc 
gcctgcatta 
tgtgtttaac 
ttctcgagac 
attcgatatc 
taatattgga 
acaacacata 
tagacgtctt 
gagacagaga 
agaagaaatg 
tgatcaagaa 
agaagatagt 
tggtaatact 
catttcaacc 
cacaattcaa 
caggcaaaga 
aattgaaaag 
taacagcata 
tatttgtgat 
atctgaatgt 
cccttgtcct 
catggaccct 
aaatgagata 
tgaaaaagga 
gactattgct 
aaaaaacctt 
gactgcagca 
tggtcattgt 
gaaatccatc 
tgaccatgcc 
tgcagataca 
tagggatagg 
gaagggaaca 
cgtggccgcc 
aataagtgtt 
tgacttactc 
cgccagagcg 
cgtagaagac 
tactgaaata 
tatccctgaa 
atctgaagtg 
ctcatttggg 
tgccctcgac 
agtcgactgg 



420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3744 



<210> 213 
<211> 1245 
<212> PRT 
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<213> SHRIMP 
<400> 213 



Met 


Gly 


Ala 


Pro 


Thr 


Asn 


Ala 


Asp 


1 








5 








Ala 


Ser 


Ser 


Leu 


Tyr 


Leu 


Val 


Asn 








20 










Lys 


Leu 


Val 


Leu 


Ala 


Ser 


Ser 


Tyr 






35 










40 


Lys 


Asp 


Ala 


Val 


Val 


Thr 


Ala 


Glu 




50 










55 




Glu 


Pro 


Ala 


Leu 


His 


Glu 


His 


He 


65 










70 






Leu 


He 


Asn 


Arg 


Ser 


Arg 


Phe 


Val 










85 








Glu 


Val 


Cys 


Gly 


Phe 


Arg 


Tyr 


Ser 








100 










He 


Phe 


Pro 


He 


Gly Ala 


Ser 


Glu 






115 










120 


Asn 


Ala 


Ala 


Arg 


Phe 


Gly 


Thr 


Val 




130 










135 




Glu 


Arg 


He 


Pro 


Asp 


Leu 


Arg 


Glu 


145 










150 






Lys 


Asn 


Ala 


Met 


Arg 


Arg 


Leu 


Arg 










165 








Ala 


Glu 


Ala 


Ser 


Arg 


Ser 


Ala 


Gly 








180 










Val 


Gly 


Cys 


Ala 


Leu 


Tyr 


Asp 


His 






195 










200 


Gly 


Ser 


Tyr 


Glu 


Arg 


Asn 


Val 


Leu 




210 










215 




Glu 


Ser 


Thr 


Leu 


Leu 


Ala 


Asp 


He 


225 










230 






Phe 


Trp 


Thr 


He 


Val 


Glu 


Ala 


Val 










245 








Arg 


Thr 


He 


Ser 


Asn 


Glu 


Thr 


Asn 








260 










Asn 


Leu 


Glu 


Trp 


Glu 


Asp 


Val 


Leu 






275 










280 


Gin 


Gly 


Asp 


Asp 


Ser 


Thr 


Leu 


Glu 




290 










295 




Glu 


His 


Glu 


Ser 


He 


Gly 


Glu 


Lys 


305 










310 






Thr 


Cys 


Leu 


Thr 


Glu 


Glu 


Gin 


Arg 










325 








Gly 


Lys 


Gly 


Asn 


Leu 


Ser 


Pro 


Ala 








340 










He 


Leu 


Ala 


Asn 


Asn 


Ala 


Lys 


Ala 






355 










360 


Leu 


Leu 


Lys 


Gin 


He 


Asn 


Phe 


Ser 




370 










375 




Ala 


Gin 


Arg 


Leu 


Leu 


Met 


Phe 


Lys 


385 










390 






Leu 


Phe 


He 


Ser 


Gin 


Arg 


Gly 


He 










405 








Phe 


Asn 


Leu 


Glu 


Val 


Met 


Lys 


Glu 








420 










Asp 


Met 


Val 


Ser 


Arg 


Asp 


Ala 


Tyr 






435 










440 


Asn 


Leu 


Gly 


Asn 


Tyr 


Ser 


Gin 


Phe 




450 










455 





Phe 


Thr 


Arg 


Thr 


Val 


Ser 


Gly 


Val 




10 










15 




Pro 


Gly 


Ala 


Pro 


Ser 


Asp 


Arg 


Glu 


25 










30 






Ser 


Asp 


Ser 


Phe 


Val 


Tyr 


Asn 


Tyr 










45 








Ala 


Pro 


Lys 


Trp 


Cys 


Pro 


Phe 


Asn 








60 










Met 


Asn 


Arg 


Leu 


Glu 


Lys 


Ala 


Gly 






75 










80 


Cys 


Asn 


Pro 


Val 


Lys 


Ser 


Ala 


Gly 




90 










95 




Gly 


Gly 


Ser 


Thr 


Pro 


Gin 


Asn 


Leu 


105 










110 






He 


Asn 


Trp 


Leu 


Val 


He 


Leu 


Arg 










125 








Ala 


Ala 


Ser 


Ala 


Lys 


Asp 


Ala 


He 








140 










Gly 


Gly 


Thr 


Ser 


Lys 


His 


Val 


Ala 






155 










160 


Val 


Trp 


Arg 


Ala 


Phe 


Asn 


Trp 


He 




170 










175 




Met 


He 


Arg 


Tyr 


Glu 


Pro 


Phe 


Ser 


185 










190 






Glu 


Val 


Arg 


Arg 


Arg 


Gin 


Leu 


Lys 










205 








Phe 


Leu 


Gly 


Asp 


His 


Phe 


Cys 


Lys 








220 










Ser 


Arg 


Gly 


Gly 


Arg 


Ser 


Ser 


Asp 






235 










240 


He 


Arg 


Tyr 


Lys 


Asn 


Arg 


His 


Ala 




250 










255 




Ala 


He 


Pro 


Glu 


Asp 


Ser 


Ser 


He 


265 










270 






Val 


Lys 


Asn 


Gin 


Arg 


Asp 


Thr 


Asn 










285 








Lys 


Thr 


Leu 


Glu 


Ala 


Ala 


He 


Lys 








300 










Arg 


Lys 


Lys 


His 


He 


Phe 


He 


Lys 






315 










320 


Glu 


Met 


He 


Phe 


Lys 


Gly 


Val 


Gly 




330 










335 




His 


Leu 


Thr 


Asn 


Leu 


Ala 


Asp 


Ala 


345 










350 






Gly 


He 


Trp 


Val 


He 


Met 


Gin 


Ser 










365 








He 


Leu 


His 


Leu 


He 


Gly 


Tyr 


Glu 








380 










Leu 


Tyr 


Met 


Pro 


Ala 


Leu 


Leu 


Ala 






395 










400 


Gly 


Asp 


Val 


Phe 


Leu 


Asn 


Gly 


Val 




410 










415 




Arg 


Ala 


Ala 


Asn 


Ala 


Lys 


He 


Arg 


425 










430 






Lys 


Asn 


Asn 


Thr 


Asn 


Glu 


Ser 


Asn 










445 








Asp 


He 


Ala 


Thr 


He 


Tyr 


Gly 


Gin 



460 
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Met 


Ser 


Asp 


Cys 


Asn 


Ala 


He 


Pro 


Leu 


Ser 


He 


Asn 


He 


Gly 


Val 


Pro 


465 










470 










475 










480 


Leu 


His 


Lys 


Ser 


Arg 


Met 


Asn 


Met 


Gin 


Asp 


He 


Glu 


Lys 


Thr 


He 


Gin 










485 










490 










495 




His 


He 


Val 


Asp 


Met 


Gly 


Leu 


Ala 


Lys 


He 


Arg 


Val 


Ala 


Glu 


Glu 


Asn 








500 










505 










510 






Arg 


Val 


Ala 


Arg 


Arg 


Leu 


Leu 


His 


Arg 


Arg 


Lys 


Met 


Gin 


Glu 


Lys 


Ala 






515 










520 










525 








Ala 


Arg 


Glu 


Arg 


Ala 


Ala 


Ala 


Arg 


Gin 


Arg 


Arg 


Leu 


Arg 


Gly 


Glu 


Glu 




530 










535 










540 










Glu 


Glu 


Glu 


Glu 


Glu 


Glu 


Glu 


Glu 


Glu 


Glu 


Gly 


Glu 


Glu 


Met 


Glu 


Glu 


545 










550 










555 










560 


Glu 


Glu 


Glu 


Glu 


Ala 


Gly 


Thr 


Ser 


Gly 


Val 


Asn 


Gly 


Ser 


Gly 


Tyr 


Asp 










565 










570 










575 




Gin 


Glu 


Glu 


Glu 


Asp 


Asp 


Gly 


Glu 


Glu 


Glu 


Glu 


Glu 


Glu 


Glu 


Glu 


Glu 








580 










585 










590 






Glu 


Asp 


Glu 


Glu 


Asp 


Ser 


Glu 


Gly 


Glu 


Asp 


Met 


Asn 


Gly 


Glu 


Asn 


Asn 






595 










600 










605 








Ser 


Arg 


Lys 


Arg 


Lys 


Asn 


Thr 


Gly 


Asn 


Thr 


Ser 


Ser 


Thr 


Gin 


Gin 


Pro 




610 










615 










620 










Pro 


Gin 


Lys 


Arg 


Gin 


Arg 


Gly 


Lys 


Asn 


Ala 


Pro 


He 


Ser 


Thr 


Lys 


Gly 


625 










630 










635 










640 


Lys 


Arg 


Leu 


Lys 


Glu 


Arg 


Asp 


Asn 


He 


Gly 


Gly 


Phe 


Leu 


Leu 


Ala 


Thr 










645 










650 










655 




lie 


Gin 


Asn 


Asp 


Asp 


Arg 


Gin 


Val 


Asn 


Val 


Glu 


Ser 


He 


Gin 


Lys 


Leu 








660 










665 










670 






Leu 


Thr 


Ala 


Arg 


Gin 


Arg 


Lys 


Tyr 


Val 


Lys 


Gly 


Gly 


Lys 


Cys 


Gly 


Asp 






675 










680 










685 








Asn 


Gly 


Leu 


Pro 


Glu 


Leu 


Leu 


He 


Glu 


Lys 


Val 


Thr 


Asn 


Leu 


Leu 


Asp 




690 










695 










700 










Ser 


Val 


Phe 


His 


Phe 


Arg 


Lys 


Gly 


Ser 


He 


Leu 


Asn 


Ser 


He 


His 


Ala 


705 










710 










715 










720 


Asn 


Arg 


Arg 


Ser 


Glu 


Thr 


Gly Val 


Tyr 


Thr 


Thr 


Lys 


Ala 


Asn 


Cys 


He 










725 










730 










735 




Cys 


Asp 


Tyr 


Tyr 


Glu 


Arg 


Asn 


Val 


Ser 


Lys 


Asp 


Ser 


Ser 


Asn 


Asn 


Thr 








740 










745 










750 






Pro 


His 


Ser 


Ser 


Glu 


Cys 


He 


Glu 


Arg 


Ala 


Arg 


Glu 


Arg 


Asp 


Ala 


Ser 






755 










760 










765 








Cys 


Ala 


Glu 


Ser 


Asn 


Lys 


Arg 


Pro 


Cys 


Pro 


Val 


Asp 


Ser 


Asn 


Asn 


Pro 




770 










775 










780 










Glu 


Asp 


Val 


Glu 


Gin 


Arg 


Met 


Arg 


Glu 


Leu 


He 


Met 


Asp 


Pro 


Pro 


Ser 


785 










790 










795 










800 


Leu 


Ser 


Gly 


Val 


Glu 


Asp 


Ser 


Leu 


Ala 


He 


Glu 


Arg 


Val 


Leu 


Gin 


Asn 










805 










810 










815 




Glu 


He 


Leu 


Phe 


Thr 


Ser 


Leu 


Val 


Thr 


Asn 


Pro 


He 


Phe 


Asn 


Ala 


Val 








820 










825 










830 






Leu 


Gly 


Ala 


Glu 


Lys 


Gly 


Asp 


Leu 


Gly 


Arg 


Phe 


He 


Val 


Leu 


Asn 


Asn 






835 










840 










84 5 








He 


Val 


Lys 


Phe 


Met 


Asn 


Met 


Thr 


He 


Ala 


Cys 


Leu 


Val 


Asp 


Gly 


Asp 




850 










855 










860 










Met 


Pro 


Met 


Leu 


Leu 


Asp 


Ser 


Arg 


Gly 


Lys 


Thr 


Lys 


Asn 


Leu 


Leu 


Glu 


865 










870 










875 










880 


Lys 


Gly 


Thr 


Val 


Lys 


Asn 


Thr 


Arg 


Lys 


Phe 


Phe 


Lys 


Pro 


Asn 


Met 


Thr 










885 










890 










895 




Ala 


Ala 


Glu 


Leu 


Asn 


Val 


Ala 


Thr 


Ala 


Gin 


Ser 


Ala 


Gly 


His 


Gin 


Tyr 








900 










905 










910 






Met 


Asn 


Ala 


Gly 


His 


Cys 


Pro 


Glu 


Pro 


Gly 


He 


Lys 


Gin 


Met 


Leu 


Leu 






915 










920 










925 








Pro 


Asp 


Cys 


He 


Met 


Lys 


Leu 


Lys 


Ser 


He 


Ala 


Met 


Glu 


Lys 


Gly 


Arg 




930 










935 










940 










Gly 


Gly 


Arg 


Ser 


Ala 


Leu 


His 


Arg 


Gin 


Lys 


Cys 


Asp 


His 


Ala 


Phe 


Cys 
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945 










950 










955 










960 


Lys 


Met 


Leu 


Lys 


Cys 
965 


Leu 


Phe 


Phe 


Asn 


He 
970 


Asp 


Pro 


Ser 


Asn 


Ala 
975 


Ala 


Asp 


Thr 


Phe 


He 
980 


Asp 


Pro 


Ala 


Ser 


Arg 
985 


Ala 


Thr 


Leu 


Phe 


Arg 
990 


Leu 


Asp 


Asp 


Leu 


Cys 


Arg 


Asp 


Arg 


Lys 


Lys 


Tyr 


Lys 


Asn 


He 


Asp 


Trp 


Val 


Lys 






995 










1000 








1005 






Asp 


Leu 


Leu 


Asp 


Pro 


Val 


Met 


Lys 


Gly 


Thr 


Asn 


Lys 


Trp 


Val 


Gly 


Thr 




1010 








1015 








1020 








Gly 


Glu 


Tyr 


Thr 


Asn 


He 


Gly Arg 


Asp 


Ser 


Asn 


Val 


Ala 


Ala 


Pro 


Val 


1025 








1030 








1035 








1040 


Asp 


Phe 


Tyr 


Thr 


He 


Leu 


Lys 


Tyr 


Thr 


Met 


He 


Asp 


Asp 


Gly 


Val 


He 










1045 








1050 








1055 


Ser 


Val 


Pro 


Ser 


Arg 


Lys 


Pro 


Asn 


Asp 


Val 


Tyr 


Tyr 


Ser 


Thr 


He 


Glu 








1060 








1065 








1070 




Arg 


Ala 


Asp 


Asp 


Leu 


Leu 


Thr 


Glu 


Ser 


Arg 


Asp 


Ala 


Ser 


Cys 


Glu 


Ser 






1075 








1080 








1085 






Tyr 


Arg 


Pro 


Thr 


Leu 


Phe 


Asp 


Ala 


Arg 


Ala 


Val 


Leu 


Glu 


Val 


Asn 


Gly 




1090 








1095 








1100 








Asp 


Gly 


Arg 


Val 


Pro 


Tyr 


Pro 


Ser 


Ser 


Glu 


Thr 


Val 


Glu 


Asp 


Leu 


Gly 


1105 








1110 








1115 








1120 


Gly 


Glu 


Glu 


Glu 


Glu 


Glu 


Glu 


He 


Thr 


Gly 


Val 


He 


Asp 


Asp 


Ser 


Thr 










1125 








1130 








1135 


Glu 


He 


Glu 


Asp 


Val 


Gin 


Val 


Gin 


Asp 


Ser 


Asn 


Leu 


Phe 


Asp 


Val 


Glu 








1140 








1145 








1150 




Leu 


Phe 


Asp 


He 


Pro 


Glu 


He 


Glu 


Gin 


His 


Gin 


Gin 


Gly 


Gly Glu 


Glu 






1155 








1160 








1165 






Glu 


Glu 


Leu 


Pro 


Ser 


Ala 


He 


Ser 


Glu 


Val 


Phe 


Ala 


Ser 


Leu 


Pro 


Ala 




1170 








1175 








1180 








Asp 


Asn 


Asp 


Ser 


Ser 


Ser 


Pro 


Ala 


His 


He 


Pro 


Ser 


Phe 


Gly 


Asn 


Ser 


1185 








1190 








1195 








1200 


Glu 


Glu 


Gly 


Glu 


Lys 


Ser 


Pro 


Glu 


Pro 


Tyr 


Asn 


He 


Phe 


Asp 


Ser 


Ala 










1205 








1210 








1215 


Leu 


Asp 


Gin 


Leu 


Leu 


Asp 


Leu 


He 


Asp 


Ser 


Asp 


Gly 


Arg 


Asn 


Asn 


Asn 








1220 








1225 








1230 




Pro 


Lys 


Arg 


Val 


Asp 


Trp 


Asn 


Ser 


Val 


Thr 


He 


Gin 


Glu 









1235 1240 1245 



<210> 214 

<211> 2946 

<212> DNA 

<213> SHRIMP 



<400> 214 

atggaagaaa 

aatattacca 

gtcaacattt 

gaactgaata 

gaccacataa 

gatgataaaa 

tcgcttgca t 

tgcgtaaaaa 

gctggattaa 

ggtaaaagcg 

aataaaagag 

cttgacgatt 

gatataaatg 

ataataggac 

aaagtcaagg 

gaagatgaat 



caataaccct 
gaatagcgtc 
ctggagctcc 
ga tactggac 
cgagggatga 
atggactata 
ctttttccca 
ttccccacga 
gtgggccttt 
gtatagatgg 
caactga ta t 
ctgaaatgtc 
tg tccagttc 
aacatctgat 
aagaggaaaa 
atgaagaagg 



aaaagagtcg 
taattacgtc 
tctaactgct 
caaaactaac 
agatactggt 
tcagtctgcc 
tggagctgat 
cactcgcgct 
taagagatgg 
cctttcaggt 
tttacatttg 
tcgtactttt 
aaggcaagtg 
cgaatttaga 
tggtgagcat 
aggttgccta 



actggaacag 
cgagccttta 
gagaagaatc 
gtgttatgca 
acaataacac 
gttttaatgc 
ttggtttcca 
gaatctttac 
agcattaact 
tccatgttga 
gtgaataatg 
aaccaccaaa 
aaccagcgta 
accaagcaac 
gaagaaatga 
tcagatattg 



tgtcaccatt 
ctgacacgtg 
caagcgctat 
acccactctt 
taaaattcaa 
tggctctcga 
ataaaagtga 
tgaataatgt 
acaaggctgc 
cagtactaaa 
tttctgcctc 
agaaagttgg 
atttactcca 
ttgaacgcgc 
caagtgagga 
acgaggaaga 



caagaatgaa 
gtctcatttg 
cccagctaac 
taaattggag 
aatgtatata 
ttcgttcgtt 
aaacaaattc 
tggat tcccc 
aaacttgagc 
aaataacacc 
tgcacaacaa 
agtttgttat 
ccatcagaat 
acaaaataaa 
ggaagaagag 
tttctatgag 



60 
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780 
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900 

960 
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gatggttacg 
gaagatgaag 
acttgtggtg 
caaaattcaa 
aagggggagt 
agcctaggta 
atgaagggga 
cgtataaggt 
tcgagttgga 
aaggaagaag 
gtatctgaac 
catttggctc 
ctcaatttga 
ataccagaag 
gggctcctct 
ggcgcctttg 
agagatggga 
agaagtattc 
tctagttctt 
agcaagcgtg 
tcatcctcgg 
acgacaatca 
aaaaacatgt 
ttcaaggcgc 
actttattgg 
gacgtcaatg 
tctagggtgt 
aggaacaaac 
ggcgactttt 
caacaagata 
agagtaagcg 
actgtatctc 
cactctcttt 
ctataa 



atgaagaaga 
aggatgaaga 
ctaatggtgt 
aaaggaagaa 
gctctgctaa 
taaagccagt 
gtgaagctga 
cactactcaa 
aatgggctaa 
ttgaggttaa 
tccctagtaa 
tgattgaaga 
ttaaaactat 
aagtgctcag 
ctgtacaatt 
cgacgtcatg 
gtaacacgtg 
gcaacagagg 
cttgtggagg 
gtataaccct 
ccatgatctc 
tccaagattt 
ctgacaagag 
tcctaacatg 
catctggaga 
atgcccacca 
ataaaagagc 
cccagtttac 
cttccgagtt 
gctgcacaga 
gttcttccac 
cgaacgaaaa 
tcaacgctct 



gggcgatgac 
agaagaatat 
tattgattgt 
aaatggtaaa 
cactttatcc 
agggtgtccg 
cagctgttat 
taaatactct 
ccctcccgac 
gtttgaaatt 
tattaaagta 
ctttcttcca 
gacgagcatt 
ttttattcct 
tgatcgta ta 
cttgaacaac 
gctccacacg 
agtgaatatt 
gaataagggc 
gaaaccacca 
attgcctcag 
ctcagaagtt 
caaagacgtg 
cacagtcaac 
gggtgtggta 
gttccaggaa 
ccagcgccga 
tatagcagtc 
aaatccaacg 
agacgacgac 
cacagccgga 
gaggaactct 
atctgcaaaa 



aatagaacta 
gacgatgaag 
gaagacgatg 
aaaacaaaca 
tttgtggaaa 
cccccatcca 
aatacttgtc 
gttaaagatt 
cgacggttcg 
gaatgtgaaa 
tggttaaaag 
gctatgggtg 
ttctctgtta 
atagaatgga 
atagaagtga 
gcattcttct 
gaccttgtgc 
ggcggtaaca 
gattatggag 
cctgcagcga 
cccacgcgcc 
tctgggaaat 
tttaatgatg 
gataaaagta 
cgaagaaacc 
gaatgcggaa 
ggttcggcag 
tctgacgaag 
cactctcaac 
gttttagtgt 
gacagagttc 
gccgccctcg 
acaaagttgg 



gaaagaagaa 
aagatgaaga 
caatcatttt 
ttaaaaagcg 
aatacgttgg 
ctgaatttac 
agtccaccag 
tgatgcaggt 
tgctgtttga 
aatccgagta 
agacggcaaa 
ctgctacccc 
gagatattgt 
agacatctat 
tagatttaat 
tggaaagagg 
aactctccac 
acaacactgg 
tacgttgtgg 
tgactaattc 
agagcataga 
tgaggcttaa 
caatatacga 
gacgtaaaag 
tgatggtgag 
taaaaattgg 
tcagttccag 
acgacgactg 
tacttctatt 
ctgtagaaga 
ttgcaaagga 
ccgcactcac 
gagaaaatgg 



gaaaatggaa 

^g^ggc^gsa 

ccccaatgga 
gtcacggagg 
aaattgtaag 
ttccctattt 
aggggctagc 
aaacagccct 
taagaaaact 
ttttgatgtc 
aataataaaa 
aaaaattccc 
tggatttaaa 
ttctgcaatg 
gataactaat 
agtggtgccc 
ctccatattt 
tagcaattct 
attgagtata 
ttcttccccg 
tctttcgata 
tggattacag 
ctctggcgca 
gaaaagaagg 
tcagggcaat 
gggcggggct 
aagaagagta 
cgaagaagaa 
ccaacaacgg 
atataacaac 
tcttctctct 
catatcccgg 
acgtttcttc 



1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
2946 



<210> 215 

<211> 975 

<212> PRT 

<213> SHRIMP 



<400> 215 



Met 


Glu 


Glu 


Thr 


He 


Thr 


Leu 


Lys 


Glu 


Ser 


Thr 


Gly 


Thr 


Val 


Ser 


Pro 


1 








5 










10 










15 




Phe 


Lys 


Asn 


Glu 


Asn 


He 


Thr 


Arg 


He 


Asn 


Tyr 


Val 


Arg 


Ala 


Phe 


Thr 








20 










25 










30 






Asp 


Thr 


Trp 


Ser 


His 


Leu 


Val 


Asn 


He 


Ser 


Gly 


Ala 


Pro 


Leu 


Thr 


Ala 






35 










40 










45 








Glu 


Lys 


Asn 


Pro 


Ser 


Ala 


He 


Pro 


Ala 


Asn 


Glu 


Leu 


Asn 


Arg 


Tyr 


Trp 




50 










55 










60 










Thr 


Lys 


Thr 


Asn 


Val 


Leu 


Cys 


Asn 


Pro 


Leu 


Phe 


Lys 


Leu 


Glu 


Asp 


His 


65 










70 










75 










80 


He 


Thr 


Arg 


Asp 


Glu 


Asp 


Thr 


Gly 


Thr 


He 


Thr 


Leu 


Lys 


Phe 


Lys 


Met 










85 










90 










95 




Tyr 


He 


Asp 


Asp 


Lys 


Asn 


Gly 


Gin 


Ser 


Ala 


Vai 


Leu 


Met 


Leu 


Ala 


Leu 








100 










105 










110 






Asp 


Ser 


Phe 


Val 


Ser 


Leu 


Ala 


Ser 


Phe 


Ser 


His 


Gly 


Ala 


Asp 


Leu 


Val 






115 










120 










125 








Ser 


Asn 


Lys 


Ser 


Glu 


Asn 


Lys 


Phe 


Cys 


Val 


Lys 


He 


Pro 


His 


Asp 


Thr 




130 










135 










140 










Arg 


Ala 


Glu 


Ser 


Leu 


Leu 


Asn 


Asn 


Val 


Gly 


Phe 


Pro 


Ala 


Gly 


Leu 


Ser 


145 










150 










155 










160 
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Gly 


Pro 


Phe 


Lys 


Arg 


Trp 


Ser 


He 


Asn 


Tyr 










165 










170 


Gly 


Lys 


Ser 


Gly 


He 


Asp 


Gly 


Leu 


Ser 


Gly 








180 










185 




Lys 


Asn 


Asn 


Thr 


Asn 


Lys 


Arg 


Ala 


Thr 


Asp 






195 










200 






Asn 


Val 


Ser 


Ala 


Ser 


Ala 


Gin 


Gin 


Leu 


Asp 




210 










215 








Thr 


Phe 


Asn 


His 


Gin 


Lys 


Lys 


Val 


Gly 


Val 


225 










230 










Ser 


Ser 


Ser 


Arg 


Gin 


Val 


Asn 


Gin 


Arg 


Asn 










245 










250 


lie 


He 


Gly 


Gin 


His 


Leu 


He 


Glu 


Phe 


Arg 








2 60 










265 




Ala 


Gin 


Asn 


Lys 


Lys 


Val 


Lys 


Glu 


Glu 


Glu 






275 










280 






Met 


Thr 


Ser 


Glu 


Glu 


Glu 


Glu 


Glu 


Glu 


Asp 




290 










295 








Cys 


Leu 


Ser 


Asp 


He 


Asp 


Glu 


Glu 


Asp 


Phe 


305 










310 










Glu 


Glu 


Glu 


Gly 


Asp 


Asp 


Asn 


Arg 


Thr 


Arg 










325 










330 


Glu 


Asp 


Glu 


Glu 


Asp 


Glu 


Glu 


Glu 


Glu 


Tyr 








340 










345 




Glu 


Glu 


Ala 


Glu 


Thr 


Cys 


Gly 


Ala 


Asn 


Gly 






355 










360 






Asp 


Ala 


He 


He 


Phe 


Pro 


Asn 


Gly 


Gin 


Asn 




370 










375 








Gly 


Lys 


Lys 


Thr 


Asn 


He 


Lys 


Lys 


Arg 


Ser 


385 










390 










Ser 


Ala 


Asn 


Thr 


Leu 


Ser 


Phe 


Val 


Glu 


Lys 










405 










410 


Ser 


Leu 


Gly 


He 


Lys 


Pro 


Val 


Gly 


Cys 


Pro 








420 










425 




Thr 


Ser 


Leu 


Phe 


Met 


Lys 


Gly 


Ser 


Glu 


Ala 






435 










440 






Cys 


Gin 


Ser 


Thr 


Arg 


Gly 


Ala 


Ser 


Arg 


He 




450 










455 








Tyr 


Ser 


Val 


Lys 


Asp 


Leu 


Met 


Gin 


Val 


Asn 


465 










470 










Trp 


Ala 


Asn 


Pro 


Pro 


Asp 


Arg 


Arg 


Phe 


Val 










485 










490 


Lys 


Glu 


Glu 


Val 


Glu 


Val 


Lys 


Phe 


Glu 


He 








500 










505 




Tyr 


Phe 


Asp 


Val 


Val 


Ser 


Glu 


Leu 


Pro 


Ser 






515 










520 






Lys 


Glu 


Thr 


Ala 


Lys 


He 


He 


Lys 


His 


Leu 




530 










535 








Leu 


Pro 


Ala 


Met 


Gly 


Ala 


Ala 


Thr 


Pro 


Lys 


545 










550 










Lys 


Thr 


Met 


Thr 


Ser 


He 


Phe 


Ser 


Val 


Arg 










565 










570 


lie 


Pro 


Glu 


Glu 


Val 


Leu 


Ser 


Phe 


He 


Pro 








580 










585 




He 


Ser 


Ala 


Met 


Gly 


Leu 


Leu 


Ser 


Val 


Gin 






595 










600 






Val 


He 


Asp 


Leu 


Met 


He 


Thr 


Asn 


Gly 


Ala 




610 










615 








Asn 


Asn 


Ala 


Phe 


Phe 


Leu 


Glu 


Arg 


Gly 


Val 


625 










630 










Asn 


Thr 


Trp 


Leu 


His 


Thr 


Asp 


Leu 


Val 


Gin 
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Lys 


Ala 


Ala 


Asn 


Leu 


Ser 










175 




Ser 


Met 


Leu 


Thr 


Val 


Leu 








190 






He 


Leu 


His 


Leu 


Val 


Asn 






205 








Asp 


Ser 


Glu 


Met 


Ser 


Arg 




220 










Cys 


Tyr 


Asp 


He 


Asn 


Val 


235 










240 


Leu 


Leu 


His 


His 


Gin 


Asn 










255 




Thr 


Lys 


Gin 


Leu 


Glu 


Arg 








270 






Asn 


Gly 


Glu 


His 


Glu 


Glu 






285 








Glu 


Tyr 


Glu 


Glu 


Gly 


Gly 




300 










Tyr 


Glu 


Asp 


Gly 


Tyr 


Asp 


315 










320 


Lys 


Lys 


Lys 


Lys 


Met 


Glu 










335 




Asp 


Asp 


Glu 


Glu 


Asp 


Glu 








350 






Val 


He 


Asp 


Cys 


Glu 


Asp 






365 








Ser 


Lys 


Arg 


Lys 


Lys 


Asn 




380 










Arg 


Arg 


Lys 


Gly 


Glu 


Cys 


395 










400 


Tyr 


Val 


Gly 


Asn 


Cys 


Lys 










415 




Pro 


Pro 


Ser 


Thr 


Glu 


Phe 








430 






Asp 


Ser 


Cys 


Tyr 


Asn 


Thr 






445 








Arg 


Ser 


Leu 


Leu 


Asn 


Lys 




460 










Ser 


Pro 


Ser 


Ser 


Trp 


Lys 


475 










480 


Leu 


Phe 


Asp 


Lys 


Lys 


Thr 










495 




Glu 


Cys 


Glu 


Lys 


Ser 


Glu 








510 






Asn 


He 


Lys 


Val 


Trp 


Leu 






525 








Ala 


Leu 


He 


Glu 


Asp 


Phe 




540 










He 


Pro 


Leu 


Asn 


Leu 


He 


555 










560 


Asp 


He 


Val 


Gly 


Phe 


Lys 










575 




He 


Glu 


Trp 


Lys 


Thr 


Ser 








590 






Phe 


Asp 


Arg 


He 


He 


Glu 






605 








Phe 


Ala 


Thr 


Ser 


Cys 


Leu 




620 










Val 


Pro 


Arg 


Asp 


Gly 


Ser 


635 










640 


Leu 


Ser 


Thr 


Ser 


He 


Phe 
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Arg 


Ser 


He 


Arg 








660 


Gly 


Ser 


Asn 


Ser 






675 




Glv 

V3 J. _y 


Val 


Arg 


Cys 




690 






Pro 


Pro 


Pro 


Ala 


705 








Met 


He 


Ser 


Leu 


Thr 


Thr 


He 


He 








740 


Asn 


Gly 


Leu 


Gin 






755 




Asp 


Ala 


He 


Tvr" 
J. y i 




770 






Val 


Asn 


Asp 


Lys 


785 








Ser 


Gly 


Glu 


Gly 


Asp 


Val 


Asn 


Asp 








820 


Gly 


Gly 


Gly 


Ala 






835 




Ala 


Val 


Ser 


Ser 




850 






Ala 


Val 


Ser 


Asp 


865 








Ser 


Glu 


Leu 


Asn 


Gin 


Gin 


Asp 


Ser 








900 


Glu 


Tyr 


Asn 


Asn 






915 




Val 


Leu 


Ala 


Lys 




930 






Asn 


Ser 


Ala 


Ala 


945 








Leu 


Ser 


Ala 


Lys 



Asn 


Arg 


Gly 


Val 


Ser 


Ser 


Ser 


Ser 








680 


Gly 


Leu 


Ser 


He 






695 




Ala 


Met: 




Asn 




710 






Pro 


Gin 


Pro 


Thr 


725 








Gin 


Asp 


Phe 


Ser 


Lys 


Asn 


Met 


Ser 








760 


Asp 


Ser 


Gly 


Ala 






775 




Ser 


Arg 


Arg 


Lys 




790 






Val 


Val 


Arg 


Arg 


805 








Ala 


His 


Gin 


Phe 


Ser 


Arg 


Val 


Tyr 








840 


Arg 


Arg 


Arg 


Val 






855 




Glu 


Asp 


Asp 


Asp 




870 






Pro 


Thr 


His 


Ser 


885 








Cys 


Thr 


Glu 


Asp 


Arg 


Val 


Ser 


Gly 








920 


Asp 


Leu 


Leu 


Ser 






935 




Ala 


Leu 


Thr 


He 




950 






Thr 


Lys 


Leu 


Gly 



965 



650 

Asn He Gly Gly 
665 

Cys Gly Gly Asn 

Ser Lys Arg Gly 
700 

Ser Ser Ser Pro 
715 

Arg Gin Ser He 
730 

Glu Val Ser Gly 
745 

Asp Lys Ser Lys 

Phe Lys Ala Leu 
780 

Arg Lys Arg Arg 
795 

Asn Leu Met Val 
810 

Gin Glu Glu Cys 

825 

Lys Arg Ala Gin 

Arg Asn Lys Pro 
860 

Cys Glu Glu Glu 
875 

Gin Leu Leu Leu 
890 

Asp Asp Val Leu 
905 

Ser Ser Thr Thr 

Thr Val Ser Pro 
940 

Ser Arg His Ser 
955 

Glu Asn Gly Arg 
970 



655 

Asn Asn Asn Thr 
670 

Lys Gly Asp Tyr 
685 

He Thr Leu Lys 

Ser Ser Ser Ala 
720 

Asp Leu Ser He 
735 

Lys Leu Arg Leu 
750 

Asp Val Phe Asn 
765 

Leu Thr Cys Thr 

Thr Leu Leu Ala 
800 

Ser Gin Gly Asn 
815 

Gly He Lys He 
830 

Arg Arg Gly Ser 
845 

Gin Phe Thr He 

Gly Asp Phe Ser 
880 

Phe Gin Gin Arg 
895 

Val Ser Val Glu 
910 

Ala Gly Asp Arg 

925 

Asn Glu Lys Arg 

Leu Phe Asn Ala 
960 

Phe Phe Leu 
975 



<210> 216 
<211> 2280 
<212> ONA 
<213> SHRIMP 



<400> 216 

atgccaccta 

cgaaaaaagg 

aaattggacg 

a ttctgtcaa 

ggaaaaagag 

gaagggaaag 

gaagagaaag 

gttacaccta 

ccctctaaaa 

aaaatgatga 

gtaaggagtg 

gtcaaggaga 



aacataaacc 
aggatgacgc 
cgccaacttc 
ggagtggcga 
acactgatca 
aggaagggga 
aagaaaaaga 
ctacatctgt 
aaagaaagag 
ccgtgtctcg 
aaaatgcgcc 
aggctataca 



taatactgct 
cgaatccaga 
tagtaagaat 
ttgtgttgct 
agaaggtggt 
aggggaggaa 
tggagaggag 
atcgaagcgg 
aagtgacaca 
accattgaga 
acaatacgat 
gtacagaata 



ttaaaaaagc 
ttccaacgaa 
aggcaaaggc 
ggcgattgtt 
ggaagaggag 
cagcagcggg 
gaagaagagg 
gcaaaacaaa 
gaatcaaagg 
ggtgctataa 
tacgtgtcat 
aggtctcttt 



a tataattag 
acatgggtca 
gtaagat teg 
cagatttaga 
gaggtaatga 
aagaaaagga 
agaatgttga 
tgaaaaagca 
cattagcagt 
c ttcgggctc 
act tggcaga 
tagctaacct 



gaaccaacaa 
ggaggtctct 
tactagtaaa 
aaatgatgag 
agaagaagag 
agaacagagt 
agacgaacat 
catatttcct 
gcctgctgga 
tattttaggt 
cgaggcagtg 
tttaaaggcc 
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aataaaacca 
aagttcggag 
aaagctcttc 
gagattcagt 
acagagataa 
gaacgagtat 
ttaccgaaca 
actgtacgtg 
aaaatgtcat 
tctatctcca 
gttgaaaaaa 
ggagtgtttg 
cctttccaac 
cctacttacc 
gctcctcaac 
ccgcctcagc 
cctcctcccc 
cagcagcagc 
aatactgctc 
caacctccac 
agagcagaag 
agtattgatt 
gcgcaacgtc 
gcatttattg 
ttagatgtca 
tttgactggg 



aggcttttcc 
gaaaaagaac 
tggctaatag 
ggcaagaatt 
acaatttaat 
actttgaaca 
agaactttga 
gcctatcaat 
cgactatgga 
tgctggagaa 
tgataaagag 
tgcccaaaca 
ctcctccctt 
aaggatactt 
aattacaaca 
tgcagcagca 
ctactcctca 
cctttccggt 
cttctccacc 
ctacaccagt 
aagaaaacgc 
ttgatcttcc 
aagctgtcaa 
caccaattat 
atgaatccat 
atatggggtt 



tacttcatca 
aaacactttt 
ttgttgggca 
aagttcaaaa 
gagcagtatc 
acttgctact 
taagaatttt 
acctcgttat 
tcttttggtt 
atgtgcagct 
tgcagtggag 
acaacaacag 
tcctcttcct 
aaatccttat 
acagtacccg 
gcagcagcag 
acaacaatca 
tcaactcatt 
tatttcccgt 
tctacccaag 
tactgatatg 
tcccgctact 
ggaaacggga 
tcgtcaaccg 
ctatggctca 
agacgattta 



tctttattgt 
gttgtcacaa 
atttcccata 
atattgaaat 
gtggaggatc 
gtgtgtaata 
gtatctgtag 
tttagagcta 
ggaggaggga 
ggtgttttag 
gaaacaagtc 
caacgtcagc 
ccaccacagg 
tatcaatata 
ctgtactttt 
cagcaaccac 
cccagtaata 
tctagtcccc 
gtaagatttg 
cctactcctc 
tcctttactg 
ccagggagga 
gtcagaggag 
cgtacaccag 
gacattgaac 
aatggggatg 



catctgaaca 
atgtgggtgc 
ggaaagatat 
ctcttaacga 
gtatacagcg 
acctgtttgg 
cttctgataa 
taaataataa 
tgaggagtaa 
caagagcttc 
aagcatttaa 
aacaacaaca 
cacctt tcca 
accagtatta 
taggcaacca 
cacagccgcc 
ttcctcctcc 
ctcctcctcc 
actctcgttc 
ttcctcctcc 
atatagactc 
acgttgaaga 
aagaagaaga 
gaaattttag 
cagcagcagc 
aaccatatga 



cigggsagaaa 
tgaattggtg 
acggtcagga 
tggaaatgca 
aactgtgaaa 
tacacgtata 
ctctaacgcc 
cgtgtgggta 
atcagagcat 
agcaagaccg 
tttatcgact 
acaatttccc 
ggttcaacaa 
taacccatat 
atcacagccg 
caataatatt 
ccagcagcag 
tcctatacct 
tactacccct 
gtctacagca 
tgagcttggc 
gataataaaa 
agaagaggag 
agatgaactt 
agcagcagcg 
atttgaataa 



780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 



<210> 217 
<211> 757 
<212> PRT 
<213> SHRIMP 



<400> 217 



Met 


Pro 


Pro 


Lys 


His 


Lys 


Pro 


Asn 


1 








5 








Arg 


Asn 


Gin 


Gin 


Arg 


Lys 


Lys 


Glu 








20 










Arg 


Asn 


Met 


Gly 


Gin 


Glu 


Val 


Ser 






35 










40 


Lys 


Asn 


Arg 


Gin 


Arg 


Arg 


Lys 


lie 




50 










55 




Ser 


Gly 


Asp 


Cys 


Val 


Ala 


Gly 


Asp 


65 










70 






Gly 


Lys 


Arg 


Asp 


Thr 


Asp 


Gin 


Glu 










85 








Glu 


Glu 


Glu 


Glu 


Glu 


Gly 


Lys 


Glu 








100 










Arg 


Glu 


Glu 


Lys 


Glu 


Glu 


Gin 


Ser 






115 










120 


Glu 


Glu 


Glu 


Glu 


Glu 


Glu 


Asn 


Val 




130 










135 




Thr 


Ser 


Val 


Ser 


Lys 


Arg 


Ala 


Lys 


145 










150 






Pro 


Ser 


Lys 


Lys 


Arg 


Lys 


Arg 


Ser 










165 








Ala 


Gly 


Lys 


Met 


Met 


Thr 


Val 


Ser 








180 










Ser 


Gly 


Ser 


lie 


Leu 


Gly 


Val 


Arg 






195 










200 


Tyr 


Val 


Ser 


Tyr 


Leu 


Ala 


Asp 


Glu 



Thr 


Ala 


Leu 


Lys 


Lys 


His 


He 


He 




10 










15 




Asp 


Asp 


Ala 


Glu 


Ser 


Arg 


Phe 


Gin 


25 










30 






Lys 


Leu 


Asp 


Ala 


Pro 


Thr 


Ser 


Ser 










45 








Arg 


Thr 


Ser 


Lys 


He 


Leu 


Ser 


Arg 








60 










Cys 


Ser 


Asp 


Leu 


Glu 


Asn 


Asp 


Glu 






75 










80 


Gly 


Gly 


Gly 


Arg 


Gly 


Gly 


Gly 


Asn 




90 










95 




Glu 


Gly 


Glu 


Gly 


Glu 


Glu 


Gin 


Gin 


105 










110 






Glu 


Glu 


Lys 


Glu 


Glu 


Lys 


Asp 


Gly 










125 








Glu 


Asp 


Glu 


His 


Val 


Thr 


Pro 


Thr 








140 










Gin 


Met 


Lys 


Lys 


His 


He 


Phe 


Pro 






155 










160 


Asp 


Thr 


Glu 


Ser 


Lys 


Ala 


Val 


Pro 




170 










175 




Arg 


Pro 


Leu 


Arg 


Gly 


Ala 


He 


Thr 


185 










190 






Ser 


Glu 


Asn 


Ala 


Pro 


Gin 


Tyr 


Asp 










205 








Ala 


Val 


Val 


Lys 


Glu 


Lys 


Ala 


He 
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210 










215 










220 










Gin 


Tyr 


Arg 


He 


Arg 


Ser 


Leu 


Leu 


Ala 


Asn 


Leu 


Leu 


Lys 


Ala 


Asn 


Lys 


225 










230 










235 










240 


Thr 


Lys 


Ala 


Phe 


Pro 


Thr 


Ser 


Ser 


Ser 


Leu 


Leu 


Ser 


Ser 


Glu 


Gin 


Gly 










245 










250 










255 




Lys 


Lys 


Lys 


Phe 


Gly 


Gly 


Lys 


Arg 


Thr 


Asn 


Thr 


Phe 


Val 


Val 


Thr 


Asn 








260 










265 










270 






Val 


Gly 


Ala 


Glu 


Leu 


Val 


Lys 


Ala 


Leu 


Leu 


Ala 


Asn 


Ser 


Cys 


Trp 


Ala 






275 










280 










285 








lie 


Ser 


His 


Arg 


Lys 


Asp 


He 


Arg 


Ser 


Gly 


Glu 


He 


Gin 


Trp 


Gin 


Glu 




290 










295 










300 










Leu 


Ser 


Ser 


Lys 


He 


Leu 


Lys 


Ser 


Leu 


Asn 


Asp 


Gly 


Asn 


Ala 


Thr 


Glu 


305 










310 










315 










320 


He 


Asn 


Asn 


Leu 


Met 


Ser 


Ser 


He 


Val 


Glu 


Asp 


Arg 


He 


Gin 


Arg 


Thr 










325 










330 










335 




Val 


Lys 


Glu 


Arg 


Val 


Tyr 


Phe 


Glu 


Gin 


Leu 


Ala 


Thr 


Val 


Cys 


Asn 


Asn 








340 










345 










350 






Leu 


Phe 


Gly 


Thr 


Arg 


He 


Leu 


Pro 


Asn 


Lys 


Asn 


Phe 


Asp 


Lys 


Asn 


Phe 






355 










360 










365 








Val 


Ser 


Val 


Ala 


Ser 


Asp 


Asn 


Ser 


Asn 


Ala 


Thr 


Val 


Arg 


Gly 


Leu 


Ser 




370 










375 










380 










He 


Pro 


Arg 


Tyr 


Phe 


Arg 


Ala 


He 


Asn 


Asn 


Asn 


Val 


Trp 


Val 


Lys 


Met 


385 










390 










395 










400 


Ser 


Ser 


Thr 


Met 


Asp 


Leu 


Leu 


Val 


Gly 


Gly 


Gly 


Met 


Arg 


Ser 


Lys 


Ser 










4 05 










410 










415 




Glu 


His 


Ser 


He 


Ser 


Met 


Leu 


Glu 


Lys 


Cys 


Ala 


Ala 


Gly 


Val 


Leu 


Ala 








420 










425 










430 






Arg 


Ala 


Ser 


Ala 


Arg 


Pro 


Val 


Glu 


Lys 


Met 


He 


Lys 


Ser 


Ala 


Val 


Glu 






435 










440 










445 








Glu 


Thr 


Ser 


Gin 


Ala 


Phe 


Asn 


Leu 


Ser 


Thr 


Gly 


Val 


Phe 


Val 


Pro 


Lys 




450 










455 










460 










Gin 


Gin 


Gin 


Gin 


Gin 


Arg 


Gin 


Gin 


Gin 


Gin 


Gin 


Gin 


Phe 


Pro 


Pro 


Phe 


4 65 










470 










475 










480 


Gin 


Pro 


Pro 


Pro 


Phe 


Pro 


Leu 


Pro 


Pro 


Pro 


Gin 


Ala 


Pro 


Phe 


Gin 


Val 










485 










490 










495 




Gin 


Gin 


Pro 


Thr 


Tyr 


Gin 


Gly 


Tyr 


Leu 


Asn 


Pro 


Tyr 


Tyr 


Gin 


Tyr 


Asn 








500 










505 










510 






Gin 


Tyr 


Tyr 


Asn 


Pro 


Tyr 


Ala 


Pro 


Gin 


Gin 


Leu 


Gin 


Gin 


Gin 


Tyr 


Pro 






515 










520 










525 








Leu 


Tyr 


Phe 


Leu 


Gly 


Asn 


Gin 


Ser 


Gin 


Pro 


Pro 


Pro 


Gin 


Leu 


Gin 


Gin 




530 










535 










540 










Gin 


Gin 


Gin 


Gin 


Gin 


Gin 


Pro 


Pro 


Gin 


Pro 


Pro 


Asn 


Asn 


He 


Pro 


Pro 


545 










550 










555 










560 


Pro 


Pro 


Thr 


Pro 


Gin 


Gin 


Gin 


Ser 


Pro 


Ser 


Asn 


He 


Pro 


Pro 


Pro 


Gin 










565 










570 










575 




Gin 


Gin 


Gin 


Gin 


Gin 


Pro 


Phe 


Pro 


Val 


Gin 


Leu 


He 


Ser 


Ser 


Pro 


Pro 








580 










585 










590 






Pro 


Pro 


Pro 


Pro 


He 


Pro 


Asn 


Thr 


Ala 


Pro 


Ser 


Pro 


Pro 


He 


Ser 


Arg 






595 










600 










605 








Val 


Arg 


Phe 


Asp 


Ser 


Arg 


Ser 


Thr 


Thr 


Pro 


Gin 


Pro 


Pro 


Pro 


Thr 


Pro 




610 










615 










620 










Val 


Leu 


Pro 


Lys 


Pro 


Thr 


Pro 


Leu 


Pro 


Pro 


Pro 


Ser 


Thr 


Ala 


Arg 


Ala 



625 630 635 640 

Glu Glu Glu Asn Ala Thr Asp Met Ser Phe Thr Asp He Asp Ser Glu 

645 650 655 

Leu Gly Ser He Asp Phe Asp Leu Pro Pro Ala Thr Pro Gly Arg Asn 

660 665 670 

Val Glu Glu He He Lys Ala Gin Arg Gin Ala Val Lys Glu Thr Gly 

675 680 685 

Val Arg Gly Glu Glu Glu Glu Glu Glu Glu Ala Phe He Ala Pro He 
690 695 700 
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lie Arg Gin Pro Arg Thr Pro Gly Asn Phe Arg Asp Glu Leu Leu Asp 
705 710 715 720 

Val Asn Glu Ser He Tyr Gly Ser Asp He Glu Pro Ala Ala Ala Ala 

725 730 735 

Ala Ala Phe Asp Trp Asp Met Gly Leu Asp Asp Leu Asn Gly Asp Glu 

740 745 750 

Pro Tyr Glu Phe Glu 
755 



<210> 218 
<211> 3177 
<212> DNA 
<213> SHRIMP 



<400> 218 

atggaaggtg 

gacctaatgg 

ggagaaagtg 

gatgaacttt 

tctctattaa 

gttccattta 

aaaaaacaag 

gtcatgaaaa 

gtggaggaga 

atacgcattt 

atgaaaagtg 

cataaggagt 

gctagaacgt 

atttgggaaa 

aaccactaca 

gaatatgtgc 

gatatagtgt 

agagtagagg 

gcttcgctct 

agaaaactag 

gcaatcgcca 

gcggcgatgg 

atattgagtg 

aaaccaaaat 

gtacatgctt 

cctgattgta 

aaactaaagg 

agatgtcacg 

ccagaataca 

tgcttctata 

gactttgcac 

gttaaactct 

acccagtgtt 

gtggatgtac 

ttccaggatg 

gatactggtc 

aattttgtac 

agaactcccg 

ctagcgctag 

aacgacatat 

tacttaaaca 

agtatacctc 

ttaggattat 

ccagaatgtg 

ttatcgggta 



taacatccat 
gaggcagaaa 
gtgatcttct 
tggagggaga 
tgaagaaagc 
caactcttga 
atgggccagc 
atacgcgctt 
agaatgcgtt 
tcttactcct 
tcatttatga 
ggtacatgtg 
ttagtgtttt 
atatggcctt 
ataagggtca 
cccaagaatt 
acagatactc 
agccaaagaa 
tagaaaatac 
gatctttcat 
aatgcatggt 
tgtaccaacc 
ggaataccac 
tcttcaactg 
tcatggacac 
ctataaaaca 
acgtttttag 
aattttcatc 
ttgcagccaa 
ctgtggaact 
tagacaagga 
ttaagatggt 
tactacccaa 
ctcctccagt 
tagaagatgc 
ccggactcca 
ctccaaatga 
aaaagaagat 
ccaactggca 
ctgtaatcca 
gtgtttttgg 
taattgcaac 
gggcgaaaat 
tgttccatag 
tcagacctaa 



agttgcagct 
taataagagg 
ggaggccata 
aacagttgtg 
cctcgacatg 
ccaaaacgaa 
atacaaggat 
gatgttggat 
gtgtgaggcg 
tgttgccatc 
agatgtggta 
gcccttctct 
atttggttct 
ctccgtaaca 
tcttcgcatg 
acacca tatt 
tatctacaat 
atccgtaatg 
accactgtcc 
cctcgaacat 
tagccagaca 
atctgacgaa 
caataaaaca 
ttctcatgaa 
ttttgaagca 
cctaatgtat 
gtgcccatgt 
tttattcgag 
caaact tctc 
tcttcaggcc 
ttcaaaattt 
ggagggagaa 
tgtatgtgac 
attgccgtac 
tgaatttgat 
taaatggccc 
agaggtgaca 
tcgaggatgg 
ctcgactgat 
caacaccagg 
gggttttgat 
ctatttttat 
gtttgtcaag 
agcccattct 
ccaaggtgca 



gctgtcccag 
tccacctatg 
ctggacatct 
at taatccaa 
aacatcaaga 
caagaattca 
ctaattcatc 
gaaattatac 
tacgccgaga 
aaacgagatc 
gtttctctca 
gctttgcaag 
gatatgcatg 
gaagcatttt 
tacgctgcca 
ctttttggaa 
gttccctatc 
tcgccttctg 
ctggtatcta 
gaaaacgctg 
ctgcaagaag 
gtggaggtaa 
tgtgggt tgt 
aacatgaaag 
cgacaagaaa 
g tgtacgaaa 
tgtggagaat 
agggcgatt t 
atcaccgaac 
gaattcatgg 
actgtggtgg 
actggtgata 
cagcctaatg 
cctcctcccg 
gaccctccca 
atgagattgt 
aattgtaggc 
aacccagagt 
cgtatgccag 
gagaggtttc 
gatagagatt 
gtctatgaaa 
aatcttatag 
tttgtccttc 
aaaatggaaa 



aagttgcaat 
aacgaattgt 
gcaaccgcaa 
ccggtctttt 
tgaaaagtaa 
ttggacattt 
gtatttatag 
gtggaaatgc 
tgatatcaga 
agaacaagaa 
acactttgaa 
tggggacaaa 
aaggaagaaa 
taagtggtcc 
ggcccgtgta 
caaaaatagc 
ttttggcagc 
gtctaataat 
gacacggcat 
aaaatatgca 
agtcttgggg 
tacaagcaca 
gctacgcgga 
cctcttacga 
cgtgttctgc 
aggtctcggc 
atatggtaca 
tagcaggaga 
tcatcaagcg 
agatgtgtaa 
ataatagatt 
gcaaatgttc 
agatggaaga 
aacaactgga 
ccgatgagct 
cctgcggttt 
aggctgtttc 
ctccggaggg 
agaatatgaa 
agaatagggt 
ttgaacaagt 
aattgaacca 
gggaaatggt 
attgtgtaga 
ttgttaaaca 



cttgatcacc 
gggtatagtg 
ttcttatagg 
gaaagagata 
cgacgaccca 
aaagagttgc 
tggcatgttt 
aggcgatgca 
tatggatttg 
gcaccgccac 
ggatgtgttt 
aataagagac 
caatgacagg 
atccaccaac 
tgacgctatg 
caagatgatt 
agatactgaa 
ctccccgaac 
accttcagcc 
tctagaagaa 
agaaagtcaa 
cgttacaaag 
tttggacatg 
ttatttcccc 
aaaactatgc 
tggtagtgaa 
gtttataggt 
aaatgtcgac 
agcagagaaa 
gatggacaag 
cagaccgccc 
cctgatctgt 
cattgtaaca 
agactattac 
agtaagatat 
cttggcgtca 
tatactaaaa 
caaggttctt 
ggggcttctc 
taaagtccat 
tgtaggtgtg 
tgagagtgca 
gttggagcgg 
taggcgtgcc 
ggtaaatatt 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 
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gtacgtcaaa 
aggacattag 
aaatgtaaga 
agcggtagaa 
aaccctgacc 
gctgctttag 
gatagttatt 
atggaagatg 



acatgacatc 
aatggaaggt 
cacctaacat 
gagatgggta 
atatactaat 
aaaaggtgta 
tgtccaaaca 
gttttgaatt 



tgaatcaata 
tgagaaagaa 
aaaattaggt 
ccctacagtt 
tgaccataaa 
taattgtact 
aagaggagga 
tgatgtacat 



aaggatccag 
ggacaggaaa 
ggatgtatta 
ttttgttgga 
ctattgtatt 
ctatgttgtc 
ggagaagaag 
actaaaacag 



tgttcactgt 
ttaagactgt 
ccatgacttg 
tatgtgaaga 
ctgattgtaa 
tagctttaag 
aagagattga 
cggtgcctac 



ggatgaaaag 
aaaatgccct 
ttatgattgt 
tgaaataact 
gtctacaaag 
aaaatgtagt 
gatatatgta 
aaaataa 



2760 
2820 
2880 
2940 
3000 
3060 
3120 
3177 



<210> 219 
<211> 1050 
<212> PRT 
<213> SHRIMP 



<400> 219 
Met Glu Gly Val 
1 

lie Leu lie Thr 
20 

Tyr Glu Arg lie 
35 

Ala lie Leu Asp 
50 

Glu Gly Glu Thr 
65 

Ser Leu Leu Met 

Asn Asp Asp Pro 
100 

Phe He Gly His 
115 

Lys Asp Leu He 
130 

Thr Arg Leu Met 
1A5 

Val Glu Glu Lys 

Asp Met Asp Leu 
180 

Asp Gin Asn Lys 
195 

Val Val Val Ser 
210 

Tyr Met Trp Pro 
225 

Ala Arg Thr Phe 

Asn Asn Asp Arg 
260 

Phe Leu Ser Gly 
275 

Arg Met Tyr Ala 
290 

Gin Glu Leu His 

305 

Asp He Val Tyr 

Ala Asp Thr Glu 
340 

Ser Gly Leu He 
355 



Thr Ser He Val 
5 

Asp Leu Met Gly 

Val Gly He Val 
40 

He Cys Asn Arg 
55 

Val Val He Asn 
70 

Lys Lys Ala Leu 
85 

Val Pro Phe Thr 

Leu Lys Ser Cys 
120 

His Arg He Tyr 
135 

Leu Asp Glu He 
150 

Asn Ala Leu Cys 
165 

He Arg He Phe 

Lys His Arg His 
200 

Leu Asn Thr Leu 
215 

Phe Ser Ala Leu 
230 

Ser Val Leu Phe 
245 

He Trp Glu Asn 

Pro Ser Thr Asn 
280 

Ala Arg Pro Val 

295 

His He Leu Phe 
310 

Arg Tyr Ser He 
325 

Arg Val Glu Glu 

He Ser Pro Asn 
360 



Ala Ala Ala Val 
10 

Gly Arg Asn Asn 
25 

Gly Glu Ser Gly 

Asn Ser Tyr Arg 
60 

Pro Thr Gly Leu 
75 

Asp Met Asn He 
90 

Thr Leu Asp Gin 
105 

Lys Lys Gin Asp 

Ser Gly Met Phe 
140 

He Arg Gly Asn 
155 

Glu Ala Tyr Ala 
170 

Leu Leu Leu Val 
185 

Met Lys Ser Val 

Lys Asp Val Phe 
220 

Gin Val Gly Thr 

235 

Gly Ser Asp Met 
250 

Met Ala Phe Ser 
265 

Asn His Tyr Asn 

Tyr Asp Ala Met 
300 

Gly Thr Lys He 
315 

Tyr Asn Val Pro 
330 

Pro Lys Lys Ser 
345 

Ala Ser Leu Leu 



Pro 


Glu 


Val 


Ala 






15 




Lys 


Arg 


Ser 


Thr 




30 






Asp 


Leu 


Leu 


Glu 


45 








Asp 


Glu 


Leu 


Leu 


Leu 


Lys 


Glu 


He 








80 


Lys 


Met 


Lys 


Ser 






95 




Asn 


Glu 


Gin 


Glu 




110 






Gly 


Pro 


Ala 


Tyr 


125 








Val 


Met 


Lys 


Asn 


Ala 


Gly 


Asp 


Ala 








160 


Glu 


Met 


He 


Ser 






175 




Ala 


He 


Lys 


Arg 




190 






He 


Tyr 


Glu 


Asp 


205 








His 


Lys 


Glu 


Trp 


Lys 


He 


Arg 


Asp 








240 


His 


Glu 


Gly 


Arg 






255 




Val 


Thr 


Glu 


Ala 




270 






Lys 


Gly 


His 


Leu 


285 








Glu 


Tyr 


Val 


Pro 


Ala 


Lys 


Met 


He 








320 


Tyr 


Leu 


Leu 


Ala 






335 




Val 


Met 


Ser 


Pro 




350 






Glu 


Asn 


Thr 


Pro 



365 
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Leu 


Ser 


Leu 


Val 


Ser 


Arg 


His 


Gly 


He 


Pro 


Ser 


Ala 


Arg 


Lys 


Leu 


Gly 




370 










375 










380 










Ser 


Phe 


He 


His 


Glu 


Asn 


Ala 


Glu 


Asn 


Met 


His 


Leu 


Glu 


Glu 


Ala 


He 


385 










390 










395 










400 


Ala 


Lys 


Cys 


Met 


Val 


Ser 


Gin 


Thr 


Leu 


Gin 


Glu 


Glu 


Ser 


Trp 


Gly 


Glu 










405 










410 










415 




Ser 


Gin 


Ala 


Ala 


Met 


Val 


Tyr 


Gin 


Pro 


Ser 


Asp 


Glu 


Val 


Glu 


Val 


He 








420 










425 










430 






Gin 


Ala 


His 


Val 


Thr 


Lys 


He 


Leu 


Ser 


Gly 


Asn 


Thr 


Thr 


Asn 


Lys 


Thr 






435 










440 










445 








Cys 


Gly 


Leu 


Cys 


Tyr 


Ala 


Asp 


Leu 


Asp 


Met 


Lys 


Pro 


Lys 


Phe 


Phe 


Asn 




450 










455 










460 










Cys 


Ser 


His 


Glu 


Asn 


Met 


Lys 


Ala 


Ser 


Tyr 


Asp 


Tyr 


Phe 


Pro 


Val 


His 


465 










470 










475 










480 


Ala 


Phe 


Met 


Asp 


Thr 


Phe 


Glu 


Ala 


Arg 


Gin 


Glu 


Thr 


Cys 


Ser 


Ala 


Lys 










485 










490 










495 




Leu 


Cys 


Pro 


Asp 


Cys 


Thr 


He 


Lys 


His 


Leu 


Met 


Tyr 


Val 


Tyr 


Glu 


Lys 








500 










505 










510 






Val 


Ser 


Ala 


Gly 


Ser 


Glu 


Lys 


Leu 


Lys 


Asp 


Val 


Phe 


Arg 


Cys 


Pro 


Cys 






515 










520 










525 








Cys 


Gly 


Glu 


Tyr 


Met 


Val 


Gin 


Phe 


He 


Gly 


Arg 


Cys 


His 


Glu 


Phe 


Ser 




530 










535 










540 










Ser 


Leu 


Phe 


Glu 


Arg 


Ala 


He 


Leu 


Ala 


Gly 


Glu 


Asn 


Val 


Asp 


Pro 


Glu 


545 










550 










555 










560 


Tyr 


lie 


Ala 


Ala 


Asn 


Lys 


Leu 


Leu 


He 


Thr 


Glu 


Leu 


He 


Lys 


Arg 


Ala 










565 










570 










575 




Glu 


Lys 


Cys 


Phe 


Tyr 


Thr 


Val 


Glu 


Leu 


Leu 


Gin 


Ala 


Glu 


Phe 


Met 


Glu 








580 










585 










590 






Met 


Cys 


Lys 


Met 


Asp 


Lys 


Asp 


Phe 


Ala 


Leu 


Asp 


Lys 


Asp 


Ser 


Lys 


Phe 






595 










600 










605 








Thr 


Val 


Val 


Asp 


Asn 


Arg 


Phe 


Arg 


Pro 


Pro 


Val 


Lys 


Leu 


Phe 


Lys 


Met 




610 










615 










620 










Val 


Glu 


Gly 


Glu 


Thr 


Gly 


Asp 


Ser 


Lys 


Cys 


Ser 


Leu 


He 


Cys 


Thr 


Gin 


625 










630 










635 










640 


Cys 


Leu 


Leu 


Pro 


Asn 


Val 


Cys 


Asp 


Gin 


Pro 


Asn 


Glu 


Met 


Glu 


Asp 


He 










645 










650 










655 




Val 


Thr 


Val 


Asp 


Val 


Pro 


Pro 


Pro 


Val 


Leu 


Pro 


Tyr 


Pro 


Pro 


Pro 


Glu 








660 










665 










670 






Gin 


Leu 


Glu 


Asp 


Tyr 


Tyr 


Phe 


Gin 


Asp 


Val 


Glu 


Asp 


Ala 


Glu 


Phe 


Asp 






675 










680 










685 








Asp 


Pro 


Pro 


Thr 


Asp 


Glu 


Leu 


Val 


Arg 


Tyr 


Asp 


Thr 


Gly 


Pro 


Gly 


Leu 




690 










695 










700 










His 


Lys 


Trp 


Pro 


Met 


Arg 


Leu 


Ser 


Cys 


Gly 


Phe 


Leu 


Asn 


Phe 


Val 


Pro 


705 










710 










715 










720 


Pro 


Asn 


Glu 


Glu 


Val 


Thr 


Asn 


Cys 


Arg 


Gin 


Ala 


Val 


Ser 


He 


Leu 


Lys 










725 










730 










735 




Arg 


Thr 


Pro 


Glu 


Lys 


Lys 


He 


Arg 


Gly 


Trp 


Asn 


Pro 


Glu 


Ser 


Pro 


Glu 








7 4 0 










745 










750 






Gly 


Lys 


Val 


Leu 


Leu 


Ala 


Asn 


Trp 


His 


Ser 


Thr 


Asp 


Arg 


Met 


Pro 


Glu 






755 










760 










765 








Asn 


Met 


Lys 


Gly 


Leu 


Leu 


Asn 


Asp 


He 


Ser 


Val 


He 


His 


Asn 


Thr 


Arg 




770 










775 










780 








Glu 


Arg 


Phe 


Gin 


Asn 


Arg 


Val 


Lys 


Val 


His 


Tyr 


Leu 


Asn 


Ser 


Val 


Phe 


785 










790 










795 










800 


Gly 


Gly 


Phe 


Asp 


Asp 


Arg 


Asp 


Phe 


Glu 


Gin 


Val 


Val 


Gly 


Val 


Ser 


He 










805 










810 










815 




Pro 


Leu 


He 


Ala 


Thr 


Tyr 


Phe 


Tyr 


Val 


Tyr 


Glu 


Lys 


Leu 


Asn 


His 


Glu 








820 










825 










830 






Ser 


Ala 


Leu 


Gly 


Leu 


Trp 


Ala 


Lys 


Met 


Phe 


Val 


Lys 


Asn 


Leu 


He 


Gly 






835 










840 










845 








Glu 


Met 


Val 


Leu 


Glu 


Arg 


Pro 


Glu 


Cys 


Val 


Phe 


His 


Arg 


Ala 


His 


Ser 
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850 










855 










860 










Phe 


Val 


Leu 


His 


Cys 


Val 


Asp 


Arg 


Arg 


Ala 


Leu 


Ser 


Gly 


He 


Arg 


Pro 


865 










870 










875 










880 


Asn 


Gin 


Gly 


Ala 


Lys 


Met 


Glu 


He 


Val 


Lys 


Gin 


Val 


Asn 


He 


Val 


Arg 










885 










890 










895 




Gin 


Asn 


Met 


Thr 


Ser 


Glu 


Ser 


He 


Lys 


Asp 


Pro 


Val 


Phe 


Thr 


Val 


Asp 








900 










905 










910 






Glu 


Lys 


Arg 


Thr 


Leu 


Glu 


Trp 


Lys 


Val 


Glu 


Lys 


Glu 


Gly 


Gin 


Glu 


He 






915 










920 










925 








Lys 


Thr 


Val 


Lys 


Cys 


Pro 


Lys 


Cys 


Lys 


Thr 


Pro 


Asn 


He 


Lys 


Leu 


Gly 




930 










935 










940 










Gly 


Cys 


lie 


Thr 


Met 


Thr 


Cys 


Tyr 


Asp 


Cys 


Ser 


Gly 


Arg 


Arg 


Asp 


Gly 


945 










950 










955 










960 


Tyr 


Pro 


Thr 


Val 


Phe 


Cys 


Trp 


He 


Cys 


Glu 


Asp 


Glu 


He 


Thr 


Asn 


Pro 










965 










970 










975 




Asp 


His 


He 


Leu 


He 


Asp 


His 


Lys 


Leu 


Lys 


Asp 


Cys 


Lys 


Ser 


Thr 


Lys 








980 










985 










990 






Ala 


Ala 


Leu 


Glu 


Lys 


Val 


Tyr 


Asn 


Cys 


Thr 


Leu 


Cys 


Cys 


Leu 


Ala 


Leu 






995 










1000 








1005 






Arg 


Lys 


Cys 


Ser 


Asp 


Ser 


Tyr 


Leu 


Ser 


Lys 


Gin 


Arg 


Gly 


Gly 


Gly 


Glu 




1010 








1015 








1020 








Glu 


Glu 


Glu 


He 


Glu 


He 


Tyr 


Val 


Met 


Glu 


Asp 


Gly 


Phe 


Glu 


Phe 


Asp 


1025 








1030 








1035 








1040 


Val 


His 


Thr 


Lys 


Thr 


Ala 


Val 


Pro 


Thr 


Lys 















1045 1050 



<210> 220 
<2H> 2547 
<212> DNA 
<213> SHRIMP 



<400> 220 

atgggttcta 

agtcttgaaa 

cccaagcttg 

cttcccgcta 

aaaattgttg 

cacaaaaaca 

atccaccctg 

atggccaatg 

tgtttcggac 

agaaagaaga 

attcacggat 

tttacccatg 

tgcttccttc 

gcggcaatca 

aaaggaagcc 

atctttaacg 

gccatctata 

gccggtaatg 

ttcatggaga 

ggcctttccg 

ggcaaaggta 

atcgaaactg 

gaaaatgtcg 

tcggaggaaa 

tctcccatcc 

atcatgacca 

cgcatttcca 

ttcttcaaac 



accagcaaca 
agataatcaa 
acaagaacgc 
ccatctcctt 
accaccctga 
ccaaagagtg 
gaactggtaa 
ctgaaattct 
tgaggacgct 
gaatcttggt 
cagacatcaa 
cttcccccac 
tgggccttca 
tctccaagac 
ccatttcgtc 
tctctgtgaa 
tttctgattg 
aagatt tgag 
gagtgaaggc 
acgtccatgg 
aagaggtggt 
gaacacctta 
gcatcatcaa 
ctgcagtgtg 
ct tccctaag 
gaaatctcga 
atatgaaaac 
tcagaatccc 



atcattcatc 
gaggattgaa 
aattaaccct 
ccaagaaatg 
ttttggaaaa 
gaatagtttt 
accagcatca 
cgatgctgtt 
agaatactct 
tgagcgccct 
atctgtcatt 
actgtttaat 
agatgatagc 
tgctggagga 
atggagtggt 
aaaggttagc 
gcatctggac 
gacgagggat 
tgggaagaat 
cgaagagttt 
gaaggcacgt 
tgtgtgcttt 
gtcttcaaat 
caatttggct 
gccctatgtt 
caaggtgatt 
taggccaatg 
cttcgaatct 



tcaaagagga 
aatgcctgtc 
caagaacttg 
gacgattttc 
ctggcaggaa 
agtgcaacaa 
gtagttaatg 
atcgattaca 
tatttgatca 
caggacatga 
gaaacatatg 
tgtggaacag 
attgagggta 
ctcggcatcc 
acccaccctg 
cagggaggag 
gtgaaggact 
cttttcccag 
tggtccctga 
aaggcgttgt 
gcattattcg 
aaggatacca 
ttgtgcactg 
tctatcgcag 
gattaccggg 
gatgtcaatt 
ggattgggtg 
gaagaagcgg 



atggcactaa 
ttccagtcaa 
catctcacat 
tggccgatta 
gattcatctg 
ctcagaaatt 
atacctacta 
aaatggatta 
agattggttc 
ttatgcgtgt 
atctcatgtc 
tcacacccca 
tttatgatac 
actttcatga 
gtctcatggc 
acaagaggag 
ttattgactg 
ctatctgggt 
tgtgccccca 
acgagaaata 
accaaattaa 
tcaatagaaa 
aaattgtcca 
tcaacaagtt 
agatgaagag 
tctatgcagt 
tgcagggact 
cactaattaa 



gcaagaaatt 
ccagtatgtg 
catggaccgt 
tgcaaagaca 
ttcgaacatc 
gaggcacgca 
tgaaaatgtt 
tctcttcacc 
ccccactgat 
ggctgtcggc 
gaggcactat 
actttcctcg 
tcttaaggag 
tttgagggca 
gttcctccaa 
aggagctgca 
cagaaagaat 
atctgatctc 
cgagtgccct 
tgaggctgaa 
ttctgcacgt 
gtctaaccaa 
gtacagtgat 
tgtgaagtat 
ggttgtaaaa 
tgacaagacc 
agcagatttg 
caagaggatt 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 
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tttgaaacta 
gaaacatatg 
gggaaggaaa 
gagcaattga 
atgcctactg 
tccaacatgt 
attagagagc 
agtggtggat 
actgtatggg 
gttgaccaag 
tcaatgacta 
ggcgcagcta 
gaagctccct 
aagtcctcta 
ggtgcttgtg 



tatactatgg 
agctgtttga 
atattaagaa 
gaagggacat 
catccactgc 
ataatcgtaa 
tcataaaact 
ctattcagac 
aaattaatcc 
ctcaatccct 
tgtatgcatg 
gagctgtcca 
ctcctgttgc 
ttgttgttcc 
aaatgtgttc 



tgccttggaa 
gggtagtcct 
tagggacata 
tatgaagtat 
acagatcctg 
tgtactttca 
gggagaatgg 
gcttcctaat 
tcgtactact 
caacttgttt 
ggagaagggg 
gttcacagtc 
agctttttct 
agatcctgct 
ttcctag 



gcttcatgtg 
ttgagcaaag 
tatttcaact 
ggtgttcaca 
ggcaactctg 
ggatcattcc 
aattcagtaa 
atccctaaat 
ttggacatgg 
gtggaagaac 
atcaagactc 
gacaagaatg 
gctcctgtcc 
gctgctcttt 



aaattgccaa 
ggattttcca 
ctttgccaat 
attcaatgtt 
aatcctttga 
aagtggtgaa 
ctaaacagag 
cgaccaagga 
ctattcagag 
ccgaactcag 
tctattatct 
tactccaaga 
gagaagaaga 
tatgttctat 



agaaaaggga 
atttgacatg 
tcacgattgg 
tgttgctccc 
gcctttaacg 
cgaatatgtg 
gattatggca 
actattcaaa 
aggtatgttt 
caaggtgcgg 
acgcacaaag 
agtcaagaag 
digsigga.ga.ag 
caataaccct 



1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2547 



<210> 221 
<211> 842 
<212> PRT 
<213> SHRIMP 



<400> 221 



Met 


Gly 


Ser 


Asn 


Gin 


Gin 


Gin 


Ser 


Phe 


He 


Ser 


Lys 


Arg 


Asn 


Gly 


Thr 


1 








5 










10 










15 




Lys 


Gin 


Glu 


He 


Ser 


Leu 


Glu 


Lys 


He 


He 


Lys 


Arg 


He 


Glu 


Asn 


Ala 








20 










25 










30 






Cys 


Leu 


Pro 


Val 


Asn 


Gin 


Tyr 


Val 


Pro 


Lys 


Leu 


Asp 


Lys 


Asn 


Ala 


He 






35 










40 










45 








Asn 


Pro 


Gin 


Glu 


Leu 


Ala 


Ser 


His 


He 


Met 


Asp 


Arg 


Leu 


Pro 


Ala 


Thr 




50 










55 










60 










lie 


Ser 


Phe 


Gin 


Glu 


Met 


Asp 


Asp 


Phe 


Leu 


Ala 


Asp 


Tyr 


Ala 


Lys 


Thr 


65 










70 










75 










80 


Lys 


He 


Val 


Asp 


His 


Pro 


Asp 


Phe 


Gly 


Lys 


Leu 


Ala 


Gly 


Arg 


Phe 


He 










85 










90 










95 




Cys 


Ser 


Asn 


He 


His 


Lys 


Asn 


Thr 


Lys 


Glu 


Trp 


Asn 


Ser 


Phe 


Ser 


Ala 








100 










105 










110 






Thr 


Thr 


Gin 


Lys 


Leu 


Arg 


His 


Ala 


He 


His 


Pro 


Gly 


Thr 


Gly 


Lys 


Pro 






115 










120 










125 








Ala 


Ser 


Val 


Val 


Asn 


Asp 


Thr 


Tyr 


Tyr 


Glu 


Asn 


Val 


Met 


Ala 


Asn 


Ala 




130 










135 










140 










Glu 


He 


Leu 


Asp 


Ala 


Val 


He 


Asp 


Tyr 


Lys 


Met 


Asp 


Tyr 


Leu 


Phe 


Thr 


145 










150 










155 










160 


Cys 


Phe 


Gly 


Leu 


Arg 


Thr 


Leu 


Glu 


Tyr 


Ser 


Tyr 


Leu 


He 


Lys 


He 


Gly 










165 










170 










175 




Ser 


Pro 


Thr 


Asp 


Arg 


Lys 


Lys 


Arg 


He 


Leu 


Val 


Glu 


Arg 


Pro 


Gin 


Asp 








180 










185 










190 






Met 


He 


Met 


Arg 


Val 


Ala 


Val 


Gly 


He 


His 


Gly 


Ser 


Asp 


He 


Lys 


Ser 






195 










200 










205 








Val 


He 


Glu 


Thr 


Tyr 


Asp 


Leu 


Met 


Ser 


Arg 


His 


Tyr 


Phe 


Thr 


His 


Asp 




210 










215 










220 










Thr 


Leu 


Phe 


Asn 


Cys 


Gly 


Thr 


Val 


Thr 


Pro 


Gin 


Leu 


Ser 


Ser 


Cys 


Phe 


225 










230 










235 










240 


Leu 


Leu 


Gly 


Leu 


Gin 


Asp 


Asp 


Ser 


He 


Glu 


Gly 


He 


Tyr 


Asp 


Thr 


Leu 










245 










250 










255 




Lys 


Glu 


Ala 


Ala 


He 


He 


Ser 


Lys 


Thr 


Ala 


Gly 


Gly 


Leu 


Gly 


He 


His 








260 










265 










270 






Phe 


His 


Asp 


Leu 


Arg 


Ala 


Lys 


Gly 


Ser 


Pro 


He 


Ser 


Ser 


Trp 


Ser 


Gly 






275 










280 










285 








Thr 


His 


Pro 


Gly 


Leu 


Met 


Ala 


Phe 


Leu 


Gin 


He 


Phe 


Asn 


Val 


Ser 


Val 




290 










295 










300 










Lys 


Lys 


Val 


Ser 


Gin 


Gly 


Gly 


Asp 


Lys 


Arg 


Arg 


Gly 


Ala 


Ala 


Ala 


He 
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305 










310 










315 










320 


Tyr 


He 


Ser 


Asp 


Trp 


His 


Leu 


Asp 


Val 


Lys 


Asp 


Phe 


He 


Asp 


Cys 


Arg 










325 










330 










335 




Lys 


Asn 


Ala 


Gly 


Asn 


Glu 


Asp 


Leu 


Arg 


Thr 


Arg 


Asp 


Leu 


Phe 


Pro 


Ala 








340 










345 










350 






lie 


Trp 


Val 


Ser 


Asp 


Leu 


Phe 


Met 


Glu 


Arg 


Val 


Lys 


Ala 


Gly 


Lys 


Asn 






355 










360 










365 








Trp 


Ser 


Leu 


Met 


Cys 


Phe 


Cys 


Pro 


Gly 


Leu 


Ser 


Asp 


Val 


His 


Gly 


Glu 




370 










375 










380 










Glu 


Phe 


Lys 


Ala 


Leu 


Tyr 


Glu 


Lys 


Tyr 


Glu 


Ala 


Glu 


Gly 


Lys 


Gly 


Lys 


385 










390 










395 










400 


Glu 


Val 


Val 


Lys 


Ala 


Arg 


Ala 


Leu 


Phe 


Asp 


Gin 


He 


Asn 


Ser 


Ala 


Arg 










405 










410 










415 




lie 


Glu 


Thr 


Gly 


Thr 


Pro 


Tyr 


Val 


Cys 


Phe 


Lys 


Asp 


Thr 


He 


Asn 


Arg 








420 










425 










430 






Lys 


Ser 


Asn 


Gin 


Glu 


Asn 


Val 


Gly 


He 


He 


Lys 


Ser 


Ser 


Asn 


Leu 


Cys 






435 










440 










445 








Thr 


Glu 


lie 


Val 


Gin 


Tyr 


Ser 


Asp 


Ser 


Glu 


Glu 


Thr 


Ala 


Val 


Cys 


Asn 




450 










455 










460 










Leu 


Ala 


Ser 


He 


Ala 


Val 


Asn 


Lys 


Phe 


Val 


Lys 


Tyr 


Ser 


Pro 


He 


Pro 


465 










470 










475 










480 


Ser 


Leu 


Arg 


Pro 


Tyr 


Val 


Asp 


Tyr 


Arg 


Glu 


Met 


Lys 


Arg 


Val 


Val 


Lys 










485 










490 










495 




lie 


Met 


Thr 


Arg 


Asn 


Leu 


Asp 


Lys 


Val 


He 


Asp 


Val 


Asn 


Phe 


Tyr 


Ala 








500 










505 










510 






Val 


Asp 


Lys 


Thr 


Arg 


He 


Ser 


Asn 


Met 


Lys 


Trp 


Met 


Gly 


Leu 


Gly 


Val 






515 










520 










525 








Gin 


Gly 


Leu 


Ala 


Asp 


Leu 


Phe 


Phe 


Lys 


Leu 


Arg 


He 


Pro 


Phe 


Glu 


Ser 




530 










535 










540 










Glu 


Glu 


Ala 


Ala 


Leu 


He 


Asn 


Lys 


Arg 


He 


Phe 


Glu 


Thr 


He 


Tyr 


Tyr 


545 










550 










555 










560 


Gly 


Ala 


Leu 


Glu 


Ala 


Ser 


Cys 


Glu 


He 


Ala 


Lys 


Glu 


Lys 


Gly 


Glu 


Thr 










565 










570 










575 




Tyr 


Glu 


Leu 


Phe 


Glu 


Gly 


Ser 


Pro 


Leu 


Ser 


Lys 


Gly 


He 


Phe 


Gin 


Phe 








580 










585 










590 






Asp 


Met 


Gly 


Lys 


Glu 


Asn 


He 


Lys 


Asn 


Arg 


Asp 


He 


Tyr 


Phe 


Asn 


Ser 






595 










600 










605 








Leu 


Pro 


He 


His 


Asp 


Trp 


Glu 


Gin 


Leu 


Arg 


Arg 


Asp 


He 


Met 


Lys 


Tyr 




610 










615 










620 










Gly 


Val 


His 


Asn 


Ser 


Met 


Phe 


Val 


Ala 


Pro 


Met 


Pro 


Thr 


Ala 


Ser 


Thr 


625 










630 










635 










640 


Ala 


Gin 


He 


Leu 


Gly 


Asn 


Ser 


Glu 


Ser 


Phe 


Glu 


Pro 


Leu 


Thr 


Ser 


Asn 










645 










650 










655 




Met 


Tyr 


Asn 


Arg 


Asn 


Val 


Leu 


Ser 


Gly 


Ser 


Phe 


Gin 


Val 


Val 


Asn 


Glu 








660 










665 










670 






Tyr 


Val 


He 


Arg 


Glu 


Leu 


He 


Lys 


Leu 


Gly 


Glu 


Trp 


Asn 


Ser 


Val 


Thr 






675 










680 










685 








Lys 


Gin 


Arg 


He 


Met 


Ala 


Ser 


Gly 


Gly 


Ser 


He 


Gin 


Thr 


Leu 


Pro 


Asn 




690 










695 










700 










He 


Pro 


Lys 


Ser 


Thr 


Lys 


Glu 


Leu 


Phe 


Lys 


Thr 


Val 


Trp 


Glu 


He 


Asn 


705 










710 










715 










720 


Pro 


Arg 


Thr 


Thr 


Leu 


Asp 


Met 


Ala 


He 


Gin 


Arg 


Gly 


Met 


Phe 


Val 


Asp 










725 










730 










735 




Gin 


Ala 


Gin 


Ser 


Leu 


Asn 


Leu 


Phe 


Val 


Glu 


Glu 


Pro 


Glu 


Leu 


Ser 


Lys 








740 










745 










750 






Val 


Arg 


Ser 


Met 


Thr 


Met 


Tyr 


Ala 


Trp 


Glu 


Lys 


Gly 


He 


Lys 


Thr 


Leu 






755 










760 










765 








Tyr 


Tyr 


Leu 


Arg 


Thr 


Lys 


Gly 


Ala 


Ala 


Arg 


Ala 


Val 


Gin 


Phe 


Thr 


Val 




770 










775 










780 










Asp 


Lys 


Asn 


Val 


Leu 


Gin 


Glu 


Val 


Lys 


Lys 


Glu 


Ala 


Pro 


Ser 


Pro 


Val 


785 










790 










795 










800 
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Ala Ala Phe Ser Ala Pro Val Arg Glu Glu Glu Glu Glu Lys Lys Ser 

805 810 815 

Ser He Val Val Pro Asp Pro Ala Ala Ala Leu Leu Cys Ser He Asn 

820 825 830 

Asn Pro Gly Ala Cys Glu Met Cys Ser Ser 
835 840 



<210> 222 
<211> 222 
<212> DNA 
<213> SHRIMP 



<400> 222 

atggtcatac gactctgctt cttggaaagt atcacgtgct ttgtatatgg catcatggca 60 

ccactatctt tggacacaaa taccgattat ttatcccaca aaaaggatac taataaaaaa 120 

atacagatgc agataaactt tattccttat tcaaatatgc acgtatatat agcaggtgtg 180 

tacacatttc atgaaaaaaa ggggttaaca tatcaacaat at 222 



<210> 223 
<211> 74 
<212> PRT 
<213> SHRIMP 



<400> 223 

Met Val He Arg Leu Cys Phe Leu Glu Ser He Thr Cys Phe Val Tyr 

15 10 15 

Gly He Met Ala Pro Leu Ser Leu Asp Thr Asn Thr Asp Tyr Leu Ser 

20 25 30 

His Lys Lys Asp Thr Asn Lys Lys He Gin Met Gin He Asn Phe He 

35 40 45 

Pro Tyr Ser Asn Met His Val Tyr He Ala Gly Val Tyr Thr Phe His 

50 55 60 

Glu Lys Lys Gly Leu Thr Tyr Gin Gin Tyr 
65 70 



<210> 224 
<2H> 1848 
<212> DNA 
<213> SHRIMP 



<400> 224 

atggaa taca 

gagttaaaac 

gtcatgggtg 

tttgttgtac 

caagaa ttgg 

cctggacatt 

caagaagaag 

aaggacttta 

gaaaaaaaac 

acaatgaacg 

aaggatatga 

atagccaatc 

gtggcgggcg 

tcgacactag 

gagttggcta 

acaaaagttg 

ctagaaacac 

cttttcaaag 



ttggagagaa 
taagatcgtc 
tttatgaagc 
atgtgaagaa 
tagaagatga 
tattcaaaga 
acgtgcgtaa 
aattcagtca 
acgattgtat 
tcacgcacgc 
tggtcccatt 
ctattttcaa 
tttttgcaag 
cctacgctgt 
ggaaaaattt 
aaaaggaact 
aagtagtaaa 
tctctgagga 



aaacaataac 
attcctgatg 
tatcgaatct 
agataaacaa 
ctctttaaga 
tgatgctggg 
aatcaataca 
attatacagg 
gataataagt 
tttaaagtca 
ggttgaaatt 
gagccattct 
gagcgcacac 
tatcttatac 
acaaataaaa 
agaaacacaa 
gaaagaggag 
acaaaaagaa 



cctgttagta 
atcgggaaga 
ataagacaaa 
ctcaaattcg 
attgaacgga 
cacgttactg 
atagtcaagg 
tacatgagca 
tccgatttct 
atggagagaa 
tgtcaccgta 
tcacattgct 
ccttctgctg 
agtgatgaaa 
gaggaactag 
gtagtaaaga 
tacaaaaact 
tctctcagaa 



atgaaagtgt 
aaacaag taa 
gcgaat tg tc 
caagagggtt 
taagttgtgc 
acgaggaatg 
aaaaattaaa 
atagtctttc 
taatcggt tt 
ctatgcagaa 
ccca t tacaa 
tgattg t tec 
caagcattga 
agcaacgcca 
aaaaccaggt 
ccacaaaagt 
cgtatatcga 
atgtacacaa 



atcagaaaag 
atatgagcaa 
cgaagacaca 
aaaaagatta 
ccctcctgaa 
gcttgcaacg 
acgaaaagac 
tgaagcagta 
aggtttcagt 
acatggtttc 
aggagaatat 
cttgtttatg 
aatgtacctt 
gatacgcgaa 
cgaaaagacc 
tgaaaaggaa 
aactgaacaa 
gaaatcttcc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 
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aatgcgacct 
ttctacttgt 
ttgaggtaca 
ctcatcatgg 
caaaacaagt 
agtgatgaag 
atgaatgatg 
cttgatgaag 
atggagaaca 
gaagctcgta 
caaaagaagc 
aggacctggg 
aacactcaag 



tcagatacga 
tgtgcaggac 
ttttctcccc 
ccgtgactgg 
tcaaggtatt 
atttgaaggg 
gcaccgtcta 
aaaatatgac 
catcaaaaat 
cgagaaacaa 
gagaaaatgg 
aaaacgacaa 
aacaacagcg 



cagcggctct 
agacaaaagt 
cataaacaag 
ctgtgacgca 
aaaatgtaac 
tattgtacaa 
tcaagatggg 
ccaatttgaa 
tgtcactagg 
gaccgaaaag 
agaacagtct 
tgaatttgat 
tgtaaagaga 



tgtctcgtct 
ggttcctttg 
aaaagggata 
cctatcgctt 
cgatcaagta 
aaagtgacag 
cagaggatag 
attgaacaac 
tacaacaagg 
tggtttgaga 
accagtgaac 
agcgacgtag 
catgccattt 



tctcgatatc 
aaactgcaac 
ctgcaggtat 
gcaacgacag 
tagttttcca 
gttctgatat 
acatctcttc 
aaaggaagct 
aaagacattt 
aggtaaagaa 
aggagcaaag 
aagaagaaga 
ctgtgtaa 



ctctacagaa 
agaaaatggt 
gaggcccaga 
tatcaaacac 
gacccctcca 
ccgaatcttt 
gcctcaagaa 
ccattccatg 
gaccacaaag 
gagggaggaa 
gggagtaaaa 
agatggaaac 



1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1848 



<210> 225 

<211> 611 

<212> PRT 

<213> SHRIMP 



<400> 225 



Met 


Glu 


Tyr 


He 


Gly 


Glu 


Lys 


Asn 


Asn 


Asn 


Pro 


Val 


Ser 


Asn 


Glu 


Ser 


1 








5 










10 










15 




Val 


Ser 


Glu 


Lys 


Glu 


Leu 


Lys 


Leu 


Arg 


Ser 


Ser 


Phe 


Leu 


Met 


He 


Gly 








20 










25 










30 






Lys 


Lys 


Thr 


Ser 


Lys 


Tyr 


Glu 


Gin 


Val 


Met 


Gly 


Val 


Tyr 


Glu 


Ala 


He 






35 










40 










45 








Glu 


Ser 


He 


Arg 


Gin 


Ser 


Glu 


Leu 


Ser 


Glu 


Asp 


Thr 


Phe 


Val 


Val 


His 




50 










55 










60 










Val 


Lys 


Lys 


Asp 


Lys 


Gin 


Leu 


Lys 


Phe 


Arg 


Leu 


Lys 


Arg 


Leu 


Gin 


Glu 


65 










70 










75 










80 


Leu 


Val 


Glu 


Asp 


Asp 


Ser 


Leu 


Arg 


He 


Glu 


Arg 


He 


Ser 


Cys 


Ala 


Pro 










85 










90 










95 




Pro 


Glu 


Pro 


Gly 


His 


Leu 


Phe 


Lys 


Asp 


Asp 


Ala 


Gly 


His 


Val 


Thr 


Asp 








100 










105 










110 






Glu 


Glu 


Trp 


Leu 


Ala 


Thr 


Gin 


Glu 


Glu 


Asp 


Val 


Arg 


Lys 


He 


Asn 


Thr 






115 










120 










125 








He 


Val 


Lys 


Glu 


Lys 


Leu 


Lys 


Arg 


Lys 


Asp 


Lys 


Asp 


Phe 


Lys 


Phe 


Ser 




130 










135 










140 










Gin 


Leu 


Tyr 


Arg 


Tyr 


Met 


Ser 


Asn 


Ser 


Leu 


Ser 


Glu 


Ala 


Val 


Glu 


Lys 


145 










150 










155 










160 


Lys 


His 


Asp 


Cys 


Met 


He 


He 


Ser 


Ser 


Asp 


Phe 


Leu 


He 


Gly 


Leu 


Gly 










165 










170 










175 




Phe 


Ser 


Thr 


Met 


Asn 


Val 


Thr 


His 


Ala 


Leu 


Lys 


Ser 


Met 


Glu 


Arg 


Thr 








180 










185 










190 






Met 


Gin 


Lys 


His 


Gly 


Phe 


Lys 


Asp 


Met 


Met 


Val 


Pro 


Leu 


Val 


Glu 


He 






195 










200 










205 








Cys 


His 


Arg 


Thr 


His 


Tyr 


Lys 


Gly 


Glu 


Tyr 


He 


Ala 


Asn 


Pro 


He 


Phe 




210 










215 










220 










Lys 


Ser 


His 


Ser 


Ser 


His 


Cys 


Leu 


He 


Val 


Pro 


Leu 


Phe 


Met 


Val 


Ala 


225 










230 










235 










240 


Gly 


Val 


Phe 


Ala 


Arg 


Ser 


Ala 


His 


Pro 


Ser 


Ala 


Ala 


Ser 


He 


Glu 


Met 










245 










250 










255 




Tyr 


Leu 


Ser 


Thr 


Leu 


Ala 


Tyr 


Ala 


Val 


He 


Lys 


Asp 


Glu 


Lys 


Gin 


Arg 








260 










265 










270 






Gin 


He 


Arg 


Glu 


Glu 


Leu 


Ala 


Arg 


Lys 


Asn 


Leu 


Gin 


He 


Lys 


Glu 


Glu 






275 










280 










285 








Leu 


Glu 


Asn 


Gin 


Val 


Glu 


Lys 


Thr 


Thr 


Lys 


Val 


Glu 


Lys 


Glu 


Leu 


Glu 




290 










295 










300 










Thr 


Gin 


Val 


Val 


Lys 


Thr 


Thr 


Lys 


Val 


Glu 


Lys 


Glu 


Leu 


Glu 


Thr 


Gin 


305 










310 










315 










320 


Val 


Val 


Lys 


Lys 


Glu 


Glu 


Tyr 


Lys 


Asn 


Ser 


Tyr 


He 


Glu 


Thr 


Glu 


Gin 



wo 01/38351 



315 



PCT/USOO/28888 











325 










330 










335 




Leu 


Phe 


Lys 


Val 
340 


Ser 


Glu 


Glu 


Gin 


Lys 
345 


Glu 


Ser 


Leu 


Arg 


Asn 
350 


Val 


His 


Lys 


Lys 


Ser 
355 


Ser 


Asn 


Ala 


Thr 


Phe 
360 


Arg 


Tyr 


Asp 


Ser 


Gly 

365 


Ser 


Cys 


Leu 


Val 


Phe 
370 


Ser 


He 


Ser 


Ser 


Thr 

375 


Glu 


Phe 


Tyr 


Leu 


Leu 
380 


Cys 


Arg 


Thr 


Asp 


Lys 


Ser 


Gly 


Ser 


Phe 


Glu 


Thr 


Ala 


Thr 


Glu 


Asn 


Gly 


Leu 


Arg 


Tyr 


He 


385 










390 










395 










400 


Phe 


Ser 


Pro 


He 


Asn 
405 


Lys 


Lys 


Arg 


Asp 


Thr 
410 


Ala 


Gly 


Met 


Arg 


Pro 
415 


Arg 


Leu 


He 


Met 


Ala 
420 


Val 


Thr 


Gly 


Cys 


Asp 
425 


Ala 


Pro 


He 


Ala 


Cys 
430 


Asn 


Asp 


Ser 


He 


Lys 
435 


His 


Gin 


Asn 


Lys 


Phe 
440 


Lys 


Val 


Leu 


Lys 


Cys 
445 


Asn 


Arg 


Ser 


Ser 


He 
450 


Val 


Phe 


Gin 


Thr 


Pro 
455 


Pro 


Ser 


Asp 


Glu 


Asp 
460 


Leu 


Lys 


Gly 


He 


Val 


Gin 


Lys 


Val 


Thr 


Gly 


Ser 


Asp 


He 


Arg 


He 


Phe 


Met 


Asn 


Asp 


Gly 


465 










470 










475 










480 


Thr 


Val 


Tyr 


Gin 


Asp 
485 


Gly 


Gin 


Arg 


He 


Asp 
490 


He 


Ser 


Ser 


Pro 


Gin 
495 


Glu 


Leu 


Asp 


Glu 


Glu 
500 


Asn 


Met 


Thr 


Gin 


Phe 
505 


Glu 


He 


Glu 


Gin 


Gin 
510 


Arg 


Lys 


Leu 


His 


Ser 
515 


Met 


Met 


Glu 


Asn 


Thr 

520 


Ser 


Lys 


He 


Val 


Thr 

525 


Arg 


Tyr 


Asn 


Lys 


Glu 
530 


Arg 


His 


Leu 


Thr 


Thr 
535 


Lys 


Glu 


Ala 


Arg 


Thr 
540 


Arg 


Asn 


Lys 


Thr 


Glu 


Lys 


Trp 


Phe 


Glu 


Lys 


Val 


Lys 


Lys 


Arg 


Glu 


Glu 


Gin 


Lys 


Lys 


Arg 


545 










550 










555 










560 


Glu 


Asn 


Gly 


Glu 


Gin 
565 


Ser 


Thr 


Ser 


Glu 


Gin 
570 


Glu 


Gin 


Arg 


Gly 


Val 

575 


Lys 


Arg 


Thr 


Trp 


Glu 
580 


Asn 


Asp 


Asn 


Glu 


Phe 
585 


Asp 


Ser 


Asp 


Val 


Glu 
590 


Glu 


Glu 


Glu 


Asp 


Gly 
595 


Asn 


Asn 


Thr 


Gin 


Glu 
600 


Gin 


Gin 


Arg 


Val 


Lys 
605 


Arg 


His 


Ala 


lie 


Ser 
610 


Val 





























<210> 226 
<211> 885 
<212> DNA 
<213> SHRIMP 



<400> 226 

atggagtata 

gagttacagc 

gaagtaatga 

gttttcattg 

ttaagagagc 

cctgagaaag 

gcttctcaag 

ttgtttagag 

atacctttcg 

tttagtgctg 

ggtctgaaat 

gatattgttg 

gtgtatctag 

tatctgttgg 

tgcatgcgat 



taggggaaca 
tacgctcttc 
gtacatttga 
tacaattaaa 
tcacagggga 
gacacgttat 
acaaagacgt 
ggttttattt 
gggaaaagga 
atga tgtcat 
atacatgggt 
taaaccccat 
ggtacatgtt 
cgttagcgtt 
tatgtgagga 



aaaactgata 
ttttctaatg 
agcagttgaa 
ggaaaataaa 
caattccctc 
acttaaaaac 
acaggaagta 
cagtccaact 
acgttttgtt 
ggaaaaatta 
gcctgtggcc 
tttcaagagt 
ttcccgtaa t 
tgctat tgat 
tatttctgag 



aatctcctgg 
attggagaaa 
actatccgaa 
cacattactt 
aagatagaat 
acgtctacaa 
aataagctcg 
tacaggtata 
gttagcacag 
atagcgatag 
gaagtgtgcc 
tatcattccc 
gttcaacccc 
t tgtacggca 
gtgaaaaggg 



atgagacacc 
aacaatatga 
agagtgaatt 
ttgaaggggg 
cattattatc 
ctactgatga 
taaaggaaaa 
taacaaaaag 
attttttgat 
agggaaatat 
atctcaaaaa 
attgcctagt 
catctctaga 
gggaagaaat 
ggtaa 



tgaagaagac 
aaagtatgaa 
tagagacggc 
actgaaagaa 
ctctattaag 
cgagtggctt 
gacacggatg 
tcttcctcaa 
tggtctcggt 
gcggaaaagt 
atacaagggt 
tattccactg 
agtggagacg 
gcgcaagtct 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

885 
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<210> 227 
<211> 290 
<212> PRT 
<213> SHRIMP 



<400> 227 



Met 


Glu 


Tyr 


He 


Gly 


Glu 


Gin 


Lys 


1 








5 








Pro 


Glu 


Glu 


Asp 


Glu 


Leu 


Gin 


Leu 








20 










Glu 


Lys 


Gin 


Tyr 


Glu 


Lys 


Tyr 


Glu 






35 










40 


val 


Glu 


Thr 


He 


Arg 


Lys 


Ser 


Glu 




50 










55 




Gin 


Leu 


Lys 


Glu 


Asn 


Lys 


His 


He 


65 










70 






Leu 


Arg 


Glu 


Leu 


Thr 


Gly 


Asp 


Asn 










85 








Ser 


Ser 


He 


Lys 


Pro 


Glu 


Lys 


Gly 








100 










Thr 


Thr 


Thr 


Asp 


Asp 


Glu 


Trp 


Leu 






115 










120 


Glu 


Val 


Asn 


Lys 


Leu 


Val 


Lys 


Glu 




130 










135 




Phe 


Tyr 


Phe 


Ser 


Pro 


Tyr 


Tyr 


He 


145 










150 






Phe 


Gly 


Glu 


Lys 


Glu 


Arg 


Phe 


Val 










165 








Leu 


Gly 


Phe 


Ser 


Ala 


Asp 


Asp 


Val 








180 










Gly 


Asn 


Met 


Arg 


Lys 


Ser 


Gly 


Leu 






195 










200 


Glu 


Val 


Cys 


His 


Leu 


Lys 


Lys 


Tyr 




210 










215 




He 


Phe 


Lys 


Ser 


Tyr 


His 


Ser 


His 


225 










230 






Leu 


Gly 


Tyr 


Met 


Phe 


Ser 


Arg 


Asn 










245 








Glu 


Thr 


Tyr 


Leu 


Leu 


Ala 


Phe 


Ala 








260 










Met 


Arg 


Lys 


Ser 


Cys 


Met 


Arg 


Leu 






275 










280 



Arg Gly 
290 



Leu 


He 


Asn 


Leu 


Leu 


Asp 


Glu 


Thr 




10 










15 




Arg 


Ser 


Ser 


Phe 


Leu 


Met 


He 


Gly 


25 










30 






Glu 


Val 


Met 


Ser 


Thr 


Phe 


Glu 


Ala 










45 








Phe 


Arg 


Asp 


Gly 


Val 


Phe 


He 


Val 








60 










Thr 


Phe 


Glu 


Gly 


Gly 


Leu 


Lys 


Glu 






75 










80 


Ser 


Leu 


Lys 


He 


Glu 


Ser 


Leu 


Leu 




90 










95 




His 


Val 


He 


Leu 


Lys 


Asn 


Thr 


Ser 


105 










110 






Ala 


Ser 


Gin 


Asp 


Lys 


Asp 


Val 


Gin 










125 








Lys 


Thr 


Arg 


Met 


Leu 


Phe 


Arg 


Gly 








140 










Thr 


Lys 


Ser 


Leu 


Pro 


Gin 


He 


Pro 






155 










160 


Val 


Ser 


Thr 


Asp 


Phe 


Leu 


He 


Gly 




170 










175 




Met 


Glu 


Lys 


Leu 


He 


Ala 


He 


Glu 


185 










190 






Lys 


Tyr 


Thr 


Trp 


Val 


Pro 


Val 


Ala 










205 








Lys 


Gly 


Asp 


He 


Val 


Val 


Asn 


Pro 








220 










Cys 


Leu 


Val 


He 


Pro 


Leu 


Val 


Tyr 






235 










240 


Val 


Gin 


Pro 


Pro 


Ser 


Leu 


Glu 


Val 




250 










255 




He 


Asp 


Leu 


Tyr 


Gly 


Arg 


Glu 


Glu 


265 










270 






Cys 


Glu 


Asp 


He 


Ser 


Glu 


Val 


Lys 



285 



<210> 228 
<211> 2769 
<212> DNA 
<213> SHRIMP 



<400> 228 

atggagtcaa 

gatcaagttt 

aaggccaagt 

agtccacaca 

actcctccag 

cttatgggca 

tttctggaga 

tcggtactgt 

tgtgatgggt 



tcaaggaaca 
atgaatttga 
ttgctgctgg 
cggaaactgt 
aaatcacacc 
aggtgccagc 
atgacgaaga 
tcaatagtgt 
gtgtctcaaa 



gcagcagcag 
ggatactact 
aagaagaatg 
ggaagaagtt 
cgccgagaag 
tcatcttgac 
tgaggacgaa 
gatgaactct 
ggtcaagtcg 



cagccaac tg 
actacttctg 
gtctctaagc 
gtaggagagg 
aagcagcaat 
gtgagtgttc 
gaattggaga 
ggacgcacag 
gcatt tgaag 



tcactttctc 
ccaagaaacc 
agaggaggaa 
aagaagaaca 
ctcttcaaga 
tggccaaatc 
agaataagaa 
aactgtcccc 
gaaaggatct 



tgaggaacca 
aactccttcc 
cactatcagg 
gcagcagcag 
actggacgct 
tgttgctgaa 
ggcgcagaaa 
ttccaccttt 
ggttagcaac 



60 

120 

180 

240 

300 

360 

420 

480 
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attgtcaagg 
ctggccaatc 
acactcaaca 
gctgtatatg 
gataagtcac 
ggattggtca 
accactaacc 
ttcaagaacg 
gtatttggag 
gacgaagatg 
tcagttggaa 
gaggaaaagg 
gacgacgcct 
gctgctgaag 
gtagaggatg 
gatgaggatg 
gagtgcgaat 
agaccatctg 
ctatcaaaga 
atcaccaact 
actgaggctg 
gatgaggagg 
gtggacaatc 
gtagattatg 
agcaagaaga 
gctactgtcc 
aacactactc 
gggcaacata 
aatcttctaa 
cttattgttg 
aatgaggata 
gcctttaaga 
actgggcctt 
a tcatggagc 
gcatacaaac 
atctggcacg 
attcctgatt 
ttggattag 



ttgagggaga 
tattccttgg 
agaaggccct 
tggacgagat 
tgtttaagga 
tgtcaactcc 
aaattgtgga 
acaagtggtt 
aagaggatga 
aaaacgagga 
atgtcatcaa 
aagagggtga 
catcagtgtg 
aagaagagga 
aagatgacaa 
atgatggtgc 
tcgacgagag 
cttcagtgac 
acagatcaac 
cattcaacat 
ctgaggattg 
atcagggact 
ttattatctt 
ataagagttt 
gtgctcttat 
atacactggc 
cttatggaag 
acgacatcaa 
agagggaagt 
acggaggcgt 
tttttggaag 
atgttgccaa 
tcaagcaata 
cttgtgaggg 
aagcactcca 
agatgatcac 
tttataaata 



agctgtgaag 
ttgtatgaat 
tgataaggga 
ggatttggag 
acttggagga 
cagttcttcg 
agaaagtgtc 
ggccaagagg 
cgaaaagtca 
ggaagttgat 
gaatcttatt 
gcagtcagag 
ttcttcttcc 
agaggatgag 
ggaaagtgtt 
aacttcccag 
cgatgggaac 
caaatatcct 
tcctgtctgt 
gtccactatc 
tgtgcacatc 
tgatctgcaa 
ttctatgggt 
ctggtcctct 
taatgctcta 
tattccattc 
ctacaagacc 
caacaattgg 
tgttgagaac 
gggagaaaat 
cattgaagcg 
gaaatgtgac 
cctcattgat 
ggatgaaact 
csgggcaaag 
cagggatatg 
tgttcaggta 



aaaactgcaa 
ctccagttcc 
ggccctctct 
aagaagaggc 
aactatattg 
tctaccaaga 
actgaatcta 
gagagtaacc 
gcatatgcat 
tacgattaca 
agaaaaacta 
gaagaagaag 
tcttcgtctt 
gaggataagg 
atcagcagta 
tgttcagaag 
cctctctacc 
caatctgagg 
ctatctctat 
ctcaagtctc 
gagtctacaa 
aacagccagt 
tccactacat 
ttcgacaagt 
tgtgaagata 
tgtgaatcta 
ttcagaatta 
agatctgaca 
aagacacaca 
agtggcaact 
aattcaatgt 
ctcatccaga 
tacaagtata 
acagcacatg 
attacagcaa 
aatacaactt 
tcacctgtaa 



tcgccactga 
atgagcacgt 
tcacattgaa 
agatatttgg 
atagtgcaat 
aggcaggaac 
tgagaaatgg 
tcaagtcact 
acagcgacag 
acaatgaaac 
tcggtctcag 
aagactccga 
catctgtcac 
ataaggatac 
gcagtgaaga 
ttgtgtttgg 
tggcttcaga 
aagagatgga 
gcagacactc 
ttaagttgtt 
agaagaagga 
actactcagt 
acgaatcttc 
ctgtgaagcc 
atgtgactgc 
tgcccattaa 
gtttgccagg 
tgtacactaa 
gccgcagata 
act tgaaggt 
ctgccaagac 
ctacaaccaa 
at tcggccag 
agatgaagag 
gctccatctc 
acaatagcat 
atgtatcacc 



tactacaaaa 
gactattgaa 
gctctctgat 
aagtaatgga 
caagtcaact 
acatttcaag 
atgttgttgc 
gaacaatact 
tgaggatgaa 
tattgaaagt 
tgatgtagag 
tgatgatgat 
tgttgtagct 
agcaacagta 
ttcagaagaa 
ggatgtaaca 
caacagcttc 
tgtttctctc 
ctctggatgc 
ccctgccgga 
tgaggatgaa 
acttgtcgat 
aatggtggag 
ttattgtgag 
caaggtatac 
ccacattaat 
gaacttttct 
gatggttgaa 
cgt tcgcaac 
gcacgagaat 
tgctgctgcc 
cgacatcttg 
aaagaatatt 
ggctcaagac 
gctgaggggc 
gtttatgtat 
attatatatg 



600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2769 



<210> 229 

<211> 922 

<212> PRT 

<213> SHRIMP 



<400> 229 



Met 


Glu 


Ser 


lie 


Lys 


Glu 


Gin 


Gin 


Gin 


Gin 


Gin 


Pro 


Thr 


Val 


Thr 


Phe 


1 








5 










10 










15 




Ser 


Glu 


Glu 


Pro 

20 


Asp 


Gin 


Val 


Tyr 


Glu 
25 


Phe 


Glu 


Asp 


Thr 


Thr 
30 


Thr 


Thr 


Ser 


Ala 


Lys 
35 


Lys 


Pro 


Thr 


Pro 


Ser 
40 


Lys 


Ala 


Lys 


Phe 


Ala 
45 


Ala 


Gly 


Arg 


Arg 


Met 
50 


Val 


Ser 


Lys 


Gin 


Arg 
55 


Arg 


Asn 


Thr 


He 


Arg 
60 


Ser 


Pro 


His 


Thr 


Glu 


Thr 


Val 


Glu 


Glu 


Val 


Val 


Gly 


Glu 


Glu 


Glu 


Glu 


Gin 


Gin 


Gin 


Gin 


65 










70 










75 










80 


Thr 


Pro 


Pro 


Glu 


lie 
85 


Thr 


Pro 


Ala 


Glu 


Lys 
90 


Lys 


Gin 


Gin 


Ser 


Leu 
95 


Gin 


Glu 


Leu 


Asp 


Ala 
100 


Leu 


Met 


Gly 


Lys 


Val 
105 


Pro 


Ala 


His 


Leu 


Asp 
110 


Val 


Ser 


Val 


Leu 


Ala 
115 


Lys 


Ser 


Val 


Ala 


Glu 
120 


Phe 


Leu 


Glu 


Asn 


Asp 
125 


Glu 


Asp 


Glu 
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Asp 


Glu 


Glu 


Leu 


Glu 


Lys 


Asn 


Lys 


Lys 


Ala 


Gin 


Lys 


Ser 


Val 


Leu 


Phe 




130 










135 










140 










Asn 


Ser 


Val 


Met 


Asn 


Ser 


Gly Arg 


Thr 


Glu 


Leu 


Ser 


Pro 


Ser 


Thr 


Phe 


145 










150 










155 










160 


Cys 


Asp 


Gly 


Cys 


Val 


Ser 


Lys 


Val 


Lys 


Ser 


Ala 


Phe 


Glu 


Gly 


Lys 


Asp 










165 










170 










175 




Leu 


Val 


Ser 


Asn 


He 


Val 


Lys 


Val 


Glu 


Gly 


Glu 


Ala 


Val 


Lys 


Lys 


Thr 








180 










185 










190 






Ala 


He 


Ala 


Thr 


Asp 


Thr 


Thr 


Lys 


Leu 


Ala 


Asn 


Leu 


Phe 


Leu 


Gly 


Cys 






195 










200 










205 








Met 


Asn 


Leu 


Gin 


Phe 


His 


Glu 


His 


Val 


Thr 


He 


Glu 


Thr 


Leu 


Asn 


Lys 




210 










215 










220 










Lys 


Ala 


Leu 


Asp 


Lys 


Gly 


Gly 


Pro 


Leu 


Phe 


Thr 


Leu 


Lys 


Leu 


Ser 


Asp 


225 










230 










235 










240 


Ala 


Val 


Tyr 


Val 


Asp 


Glu 


Met 


Asp 


Leu 


Glu 


Lys 


Lys 


Arg 


Gin 


He 


Phe 










245 










250 










255 




Gly 


Ser 


Asn 


Gly 


Asp 


Lys 


Ser 


Leu 


Phe 


Lys 


Glu 


Leu 


Gly 


Gly 


Asn 


Tyr 








260 










265 










270 






lie 


Asp 


Ser 


Ala 


He 


Lys 


Ser 


Thr 


Gly 


Leu 


Val 


Met 


Ser 


Thr 


Pro 


Ser 






275 










280 










285 








Ser 


Ser 


Ser 


Thr 


Lys 


Lys 


Ala 


Gly 


Thr 


His 


Phe 


Lys 


Thr 


Thr 


Asn 


Gin 




290 










295 










300 










He 


Val 


Glu 


Glu 


Ser 


Val 


Thr 


Glu 


Ser 


Met 


Arg 


Asn 


Gly 


Cys 


Cys 


Cys 


305 










310 










315 










320 


Phe 


Lys 


Asn 


Asp 


Lys 


Trp 


Leu 


Ala 


Lys 


Arg 


Glu 


Ser 


Asn 


Leu 


Lys 


Ser 










325 










330 










335 




Leu 


Asn 


Asn 


Thr 


Val 


Phe 


Gly 


Glu 


Glu 


Asp 


Asp 


Glu 


Lys 


Ser 


Ala 


Tyr 








340 










345 










350 






Ala 


Tyr 


Ser 


Asp 


Ser 


Glu 


Asp 


Glu 


Asp 


Glu 


Asp 


Glu 


Asn 


Glu 


Glu 


Glu 






355 










360 










365 








Val 


Asp 


Tyr 


Asp 


Tyr 


Asn 


Asn 


Glu 


Thr 


He 


Glu 


Ser 


Ser 


Val 


Gly 


Asn 




370 










375 










380 










Val 


He 


Lys 


Asn 


Leu 


He 


Arg 


Lys 


Thr 


He 


Gly 


Leu 


Ser 


Asp 


Val 


Glu 


385 










390 










395 










400 


Glu 


Glu 


Lys 


Glu 


Glu 


Gly 


Glu 


Gin 


Ser 


Glu 


Glu 


Glu 


Glu 


Glu 


Asp 


Ser 










405 










410 










415 




Asp 


Asp 


Asp 


Asp 


Asp 


Asp 


Ala 


Ser 


Ser 


Val 


Cys 


Ser 


Ser 


Ser 


Ser 


Ser 








420 










425 










430 






Ser 


Ser 


Ser 


Val 


Thr 


Val 


Val 


Ala 


Ala 


Ala 


Glu 


Glu 


Glu 


Glu 


Glu 


Glu 






435 










440 










445 








Asp 


Glu 


Glu 


Asp 


Lys 


Asp 


Lys 


Asp 


Thr 


Ala 


Thr 


Val 


Val 


Glu 


Asp 


Glu 




450 










455 










460 










Asp 


Asp 


Lys 


Glu 


Ser 


Val 


He 


Ser 


Ser 


Ser 


Ser 


Glu 


Asp 


Ser 


Glu 


Glu 


465 










470 










475 










480 


Asp 


Glu 


Asp 


Asp 


Asp 


Gly 


Ala 


Thr 


Ser 


Gin 


Cys 


Ser 


Glu 


Val 


Val 


Phe 










485 










490 










495 




Gly 


Asp 


Val 


Thr 


Glu 


Cys 


Glu 


Phe 


Asp 


Glu 


Ser 


Asp 


Gly 


Asn 


Pro 


Leu 








500 










505 










510 






Tyr 


Leu 


Ala 


Ser 


Asp 


Asn 


Ser 


Phe 


Arg 


Pro 


Ser 


Ala 


Ser 


Val 


Thr 


Lys 






515 










520 










525 








Tyr 


Pro 


Gin 


Ser 


Glu 


Glu 


Glu 


Met 


Asp 


Val 


Ser 


Leu 


Leu 


Ser 


Lys 


Asn 




530 










535 










540 










Arg 


Ser 


Thr 


Pro 


Val 


Cys 


Leu 


Ser 


Leu 


Cys 


Arg 


His 


Ser 


Ser 


Gly 


Cys 


545 










550 










555 










560 


He 


Thr 


Asn 


Ser 


Phe 


Asn 


Met 


Ser 


Thr 


He 


Leu 


Lys 


Ser 


Leu 


Lys 


Leu 










565 










570 










575 




Phe 


Pro 


Ala 


Gly 


Thr 


Glu 


Ala 


Ala 


Glu 


Asp 


Cys 


Val 


His 


He 


Glu 


Ser 








580 










585 










590 






Thr 


Lys 


Lys 


Lys 


Asp 


Glu 


Asp 


Glu 


Asp 


Glu 


Glu 


Asp 


Gin 


Gly 


Leu 


Asp 






595 










600 










605 








Leu 


Gin 


Asn 


Ser 


Gin 


Tyr 


Tyr 


Ser 


Val 


Leu 


Val 


Asp 


Val 


Asp 


Asn 


Leu 
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610 615 620 



lie 


lie 


Phe 


Ser 


Met 


Gly 


Ser 


Thr 


Thr 


Tyr 


Glu 


Ser 


Ser 


Met 


Val 


Glu 


62 5 










630 










635 










640 


Val 


Asp 


Tyr 


Asp 


Lys 


Ser 


Phe 


Trp 


Ser 


Ser 


Phe 


Asp 


Lys 


Ser 


Val 


Lys 










645 










650 










655 




Pro 


Tyr 


Cys 


Glu 


Ser 


Lys 


Lys 


Ser 


Ala 


Leu 


He 


Asn 


Ala 


Leu 


Cys 


Glu 








660 










665 










670 






Asp 


Asn 


Val 


Thr 


Ala 


Lys 


Val 


Tyr 


Ala 


Thr 


Val 


His 


Thr 


Leu 


Ala 


He 






675 










680 










685 








Pro 


Phe 


Cys 


Glu 


Ser 


Met 


Pro 


He 


Asn 


His 


He 


Asn 


Asn 


Thr 


Thr 


Pro 




690 










695 










700 










Tyr 


Gly 


Ser 


Tyr 


Lys 


Thr 


Phe 


Arg 


He 


Ser 


Leu 


Pro 


Gly 


Asn 


Phe 


Ser 


705 










710 










715 










720 


Gly 


Gin 


His 


Asn 


Asp 


He 


Asn 


Asn 


Asn 


Trp 


Arg 


Ser 


Asp 


Met 


Tyr 


Thr 










725 










730 










735 




Lys 


Met 


Val 


Glu 


Asn 


Leu 


Leu 


Lys 


Arg 


Glu 


Val 


Val 


Glu 


Asn 


Lys 


Thr 








740 










745 










750 






His 


Ser 


Arg 


Arg 


Tyr 


Val 


Arg 


Asn 


Leu 


He 


Val 


Asp 


Gly 


Gly 


Val 


Gly 






755 










760 










765 








Glu 


Asn 


Ser 


Gly 


Asn 


Tyr 


Leu 


Lys 


Val 


His 


Glu 


Asn 


Asn 


Glu 


Asp 


He 




770 










775 










780 










Phe 


Gly 


Ser 


He 


Glu 


Ala 


Asn 


Ser 


Met 


Ser 


Ala 


Lys 


Thr 


Ala 


Ala 


Ala 


785 










790 










795 










800 


Ala 


Phe 


Lys 


Asn 


Val 


Ala 


Lys 


Lys 


Cys 


Asp 


Leu 


He 


Gin 


Thr 


Thr 


Thr 










805 










810 










815 




Asn 


Asp 


He 


Leu 


Thr 


Gly 


Pro 


Phe 


Lys 


Gin 


Tyr 


Leu 


He 


Asp 


Tyr 


Lys 








820 










825 










830 






Tyr 


Asn 


Ser 


Ala 


Arg 


Lys 


Asn 


He 


He 


Met 


Glu 


Pro 


Cys 


Glu 


Gly 


Asp 






835 










840 










845 








Glu 


Thr 


Thr 


Ala 


His 


Glu 


Met 


Lys 


Arg 


Ala 


Gin 


Asp 


Ala 


Tyr 


Lys 


Gin 




850 










855 










860 










Ala 


Leu 


His 


Arg 


Ala 


Lys 


He 


Thr 


Ala 


Ser 


Ser 


He 


Ser 


Leu 


Arg 


Gly 


865 










870 










875 










880 


lie 


Trp 


His 


Glu 


Met 


He 


Thr 


Arg 


Asp 


Met 


Asn 


Thr 


Thr 


Tyr 


Asn 


Ser 










885 










890 










895 




Met 


Phe 


Met 


Tyr 


He 


Pro 


Asp 


Phe 


Tyr 


Lys 


Tyr 


Val 


Gin 


Val 


Ser 


Pro 








900 










905 










910 






Val 


Asn 


Val 


Ser 


Pro 


Leu 


Tyr 


Met 


Leu 


Asp 


















915 










920 



















<210> 230 
<211> 846 
<212> DNA 
<213> SHRIMP 



<400> 230 

atgtctcctg 

ataattagta 

gaagaagaac 

ccattttctc 

tcaaagggtg 

caggacgtcc 

accgacaagt 

tacaaatcgg 

agatatgttc 

acttttatac 

cctcgtccaa 

agctgttctt 

ttccaagtgt 

tgggttgaac 



ttatctccca 
ctgccaacct 
aacaggaagt 
cagcactcac 
tgtgcttaaa 
ctccagaact 
tcctttcatc 
ctctaagcac 
ccttaaacga 
cagaaaagac 
ttacagctaa 
cttcttcttc 
ct tcatccat 
aaaatgagct 



acaatcttcc 
tagagttcta 
tgaaccagaa 
aa tctgcata 
taggaaaaag 
agcaaatgcg 
ccactgttct 
aacaagaaat 
agataatgtg 
gggaagaaga 
attatgttgt 
tgcgtcctcc 
at ttttcaag 
ggatgcagaa 



ccttctgcta 
ggtgtcaaaa 
attattgaac 
tacatcaatg 
ataaaaccta 
tccagttatc 
atatgcaact 
ggagatcagc 
gtagtccaga 
attttgtggt 
cttctggaaa 
tccagcaacg 
aaagaagagt 
tcacctgttc 



ccagcacggc 
ataaggaaga 
cagctaccga 
cgaatcgcat 
cttcaacaat 
ttgtacaaac 
ataacgtgaa 
ccttgatgag 
aaggaacata 
tctctcatta 
ctataaactc 
ccccagggcc 
gttgtcccct 
ttgtacttct 



tgcagccaga 
gaaagatgag 
tt ttgagata 
acacattaat 
aaacaaaaac 
tgaacacgta 
cgacggggaa 
aaagtcggtc 
ttatgggact 
caagaagtct 
gttcaacggc 
tattgaagaa 
gcaaa tgaag 
aatgttagcg 
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660 

720 

780 

840 
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eta tga 

<210> 231 
<211> 281 
<212> PRT 
<213> SHRIMP 



<400> 231 



Met 


Ser 


Pro 


Val 


He 


Ser 


Gin 


Gin 


1 








5 








Ala 


Ala 


Ala 


Arg 


He 


He 


Ser 


Thr 








20 










Lys 


Asn 


Lys 


Glu 


Glu 


Lys 


Asp 


Glu 






35 










40 


Pro 


Glu 


He 


He 


Glu 


Pro 


Ala 


Thr 




50 










55 




Ala 


Leu 


Thr 


He 


Cys 


He 


Tyr 


He 


65 










70 






Ser 


Lys 


Gly 


Val 


Cys 


Leu 


Asn 


Arg 










85 








He 


Asn 


Lys 


Asn 


Gin 


Asp 


Val 


Pro 








100 










Tyr 


Leu 


Val 


Gin 


Thr 


Glu 


His 


Val 






115 










120 


Cys 


Ser 


He 


Cys 


Asn 


Tyr 


Asn 


Val 




130 










135 




Leu 


Ser 


Thr 


Thr 


Arg 


Asn 


Gly 


Asp 


145 










150 






Arg 


Tyr 


Val 


Pro 


Leu 


Asn 


Glu 


Asp 










165 








Tyr 


Tyr 


Gly 


Thr 


Thr 


Phe 


He 


Pro 








180 










Trp 


Phe 


Ser 


His 


Tyr 


Lys 


Lys 


Ser 






195 










200 


Cys 


Cys 


Leu 


Leu 


Glu 


Thr 


He 


Asn 




210 










215 




Ser 


Ser 


Ser 


Ala 


Ser 


Ser 


Ser 


Ser 


225 










230 






Phe 


Gin 


Val 


Ser 


Ser 


Ser 


He 


Phe 










245 








Leu 


Gin 


Met 


Lys 


Trp 


Val 


Glu 


Gin 








260 










Val 


Leu 


Val 


Leu 


Leu 


Met 


Leu 


Ala 






275 










280 



846 



Ser 


Ser 


Pro 


Ser 


Ala 


Thr 


Ser 


Thr 




10 










15 




Ala 


Asn 


Leu 


Arg 


Val 


Leu 


Gly 


Val 


25 










30 






Glu 


Glu 


Glu 


Gin 


Gin 


Glu 


Val 


Glu 










45 








Asp 


Phe 


Glu 


He 


Pro 


Phe 


Ser 


Pro 








60 










Asn 


Ala 


Asn 


Arg 


He 


His 


He 


Asn 






75 










80 


Lys 


Lys 


He 


Lys 


Pro 


Thr 


Ser 


Thr 




90 










95 




Pro 


Glu 


Leu 


Ala 


Asn 


Ala 


Ser 


Ser 


105 










110 






Thr 


Asp 


Lys 


Phe 


Leu 


Ser 


Ser 


His 










125 








Asn 


Asp 


Gly 


Glu 


Tyr 


Lys 


Ser 


Ala 








140 










Gin 


Pro 


Leu 


Met 


Arg 


Lys 


Ser 


Val 






155 










160 


Asn 


Val 


Val 


Val 


Gin 


Lys 


Gly 


Thr 




170 










175 




Glu 


Lys 


Thr 


Gly 


Arg 


Arg 


He 


Leu 


185 










190 






Pro 


Arg 


Pro 


He 


Thr 


Ala 


Lys 


Leu 










205 








Ser 


Phe 


Asn 


Gly 


Ser 


Cys 


Ser 


Ser 








220 










Asn 


Ala 


Pro 


Gly 


Pro 


He 


Glu 


Glu 






235 










240 


Phe 


Lys 


Lys 


Glu 


Glu 


Cys 


Cys 


Pro 




250 










255 




Asn 


Glu 


Leu 


Asp 


Ala 


Glu 


Ser 


Pro 


265 










270 







Leu 



<210> 232 
<211> 1470 
<212> DNA 
<213> SHRIMP 



<400> 232 

atggaatctg 

aaaaagatac 

ggatatggag 

ttggcaagat 

agtgcgggaa 

tgtgggttta 

tataaaataa 

gccatctgtg 

actaaagaag 



taagagacgt 
agcgcctaaa 
gagcaagact 
gtctctataa 
atattatagt 
atgtcaataa 
caggattaaa 
atttgttcct 
tagggaaaaa 



taaattttac 
caaggataaa 
catggatgtg 
ctgcgatgga 
atattcttta 
tcgcctacga 
tttccctgaa 
ggaggtggct 
acaacaacaa 



acttttatga 
ggttggagga 
cgcttcacgg 
gaatatacaa 
gcctt tataa 
atgggaaaaa 
actgtaaaat 
gctcttcaag 
catttcaacg 



atgtattggc 
cttcaataaa 
gcaggaaatc 
ctcttagact 
tgggaatcag 
taatagacag 
gcacatgtga 
aacaccccgc 
aattcgggtc 



agagaaggcg 
cgctgaaatt 
aatggacgaa 
cgtgggttca 
aggcgagtgt 
agagttgttt 
tggagtacga 
ctggcatgaa 
ccagtaccct 



60 

120 

180 

240 

300 

360 

420 

480 

540 
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ggcacaaaat 
gaagagaaga 
tcagatttgt 
aatatggaag 
tctaggtggg 
ataggctcca 
gatttagttc 
aggttcaatg 
cctgtacttt 
aaatccctct 
tttgattcga 
gatagtctat 
aaggaaaata 
gggaaacact 
ataacaccat 
ggtcctagaa 



tcaataagcg 
ctatggaaca 
acgtagaagc 
gaagagtagg 
aaaagaaacc 
aaaaaattgt 
ctctagttcg 
attctactga 
ctaaaataat 
ccaaactagc 
atgattatgt 
caaaaaatca 
aaacaacacc 
tgttcccttt 
ggcaattaaa 
aacgacctcg 



gcacaaacta 
agttctagcc 
acctatacaa 
tgccatgtat 
agggtataaa 
tgatgacgag 
attggcccca 
tatattaatg 
atggcaaaac 
caaatggaaa 
cattgaacat 
ccttccaaat 
cttaatatgt 
agaggaagtt 
ggttgtccag 
cacacagtaa 



tccactaaaa 
tttagtgaag 
tacgtggtta 
aatttgtcga 
aatgactttt 
agctttattt 
tcaaacgaag 
gacagtattg 
gtttctgcac 
tggaatggta 
aaacgtcaac 
tttagtaaaa 
tggaattata 
tgcggattct 
aaaaagagag 



taattcagca 
gaactgccgc 
atatgtatag 
gagttttaat 
atagtaagtg 
tcaccgatct 
atatacagag 
acgtgaggga 
gtctaaaatt 
tggtgtcaac 
ttgcagcgga 
caataacgga 
tttttgaatt 
atgaggcgag 
gaaggcagat 



aatgttttca 
tagtggattt 
agctatcagt 
actcttatgt 
tgagatgtac 
cataactggg 
agatgttata 
tgttgttttg 
gcgcaacaat 
acacgacaac 
tataatgtcc 
gtacgatgaa 
gtcccctatg 
tttacctctc 
ggtcatctac 



600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1470 



<210> 233 
<211> 487 
<212> PRT 
<213> SHRIMP 



<400> 233 



Met 


Glu 


Ser 


Val 


Arg 


Asp 


Val 


Lys 


Phe 


Tyr 


Thr 


Phe 


Met 


Asn 


Val 


Leu 


1 








5 










10 










15 




Ala 


Glu 


Lys 


Ala 


Lys 


Lys 


He 


Gin 


Arg 


Leu 


Asn 


Lys 


Asp 


Lys 


Gly 


Trp 








20 










25 










30 






Arg 


Thr 


Ser 


He 


Asn 


Ala 


Glu 


He 


Gly 


Tyr 


Gly 


Gly 


Ala 


Arg 


Leu 


Met 






35 










40 










45 








Asp 


Val 


Arg 


Phe 


Thr 


Gly 


Arg 


Lys 


Ser 


Met 


Asp 


Glu 


Leu 


Ala 


Arg 


Cys 




50 










55 










60 










Leu 


Tyr 


Asn 


Cys 


Asp 


Gly 


Glu 


Tyr 


Thr 


Thr 


Leu 


Arg 


Leu 


Val 


Gly 


Ser 


65 










70 










75 










80 


Ser 


Ala 


Gly 


Asn 


He 


He 


Val 


Tyr 


Ser 


Leu 


Ala 


Phe 


He 


Met 


Gly 


He 










85 










90 










95 




Arg 


Gly 


Glu 


Cys 


Cys 


Gly 


Phe 


Asn 


Val 


Asn 


Asn 


Arg 


Leu 


Arg 


Met 


Gly 








100 










105 










110 






Lys 


He 


He 


Asp 


Arg 


Glu 


Leu 


Phe 


Tyr 


Lys 


He 


Thr 


Gin 


Phe 


Pro 


Glu 






115 










120 










125 








Thr 


Val 


Lys 


Cys 


Thr 


Cys 


Asp 


Gly 


Val 


Arg 


Ala 


He 


Cys 


Asp 


Leu 


Phe 




130 










135 










140 










Leu 


Glu 


Val 


Ala 


Ala 


Leu 


Gin 


Glu 


His 


Pro 


Ala 


Trp 


His 


Glu 


Thr 


Lys 


145 










150 










155 










160 


Glu 


Val 


Gly 


Lys 


Lys 


Gin 


Gin 


Gin 


His 


Phe 


Asn 


Glu 


Phe 


Gly 


Ser 


Gin 










165 










170 










175 




Tyr 


Pro 


Gly 


Thr 


Lys 


Phe 


Asn 


Lys 


Arg 


His 


Lys 


Leu 


Ser 


Thr 


Lys 


He 








180 










185 










190 






lie 


Gin 


Gin 


Met 


Phe 


Ser 


Glu 


Glu 


Lys 


Thr 


Met 


Glu 


Gin 


Val 


Leu 


Ala 






195 










200 










205 








Phe 


Ser 


Glu 


Gly 


Thr 


Ala 


Ala 


Ser 


Gly 


Phe 


Ser 


Asp 


Leu 


Tyr 


Val 


Glu 




210 










215 










220 










Ala 


Pro 


He 


Gin 


Tyr 


Val 


Val 


Asn 


Met 


Tyr 


Arg 


Ala 


He 


Ser 


Asn 


Met 


225 










230 










235 










240 


Glu 


Gly 


Arg 


Val 


Gly 


Ala 


Met 


Tyr 


Asn 


Leu 


Ser 


Arg 


Val 


Leu 


He 


Leu 










245 










250 










255 




Leu 


Cys 


Ser 


Arg 


Trp 


Glu 


Lys 


Lys 


Pro 


Gly 


Tyr 


Lys 


Asn 


Asp 


Phe 


Tyr 








260 










265 










270 






Ser 


Lys 


Cys 


Glu 


Met 


Tyr 


He 


Gly 


Ser 


Lys 


Lys 


He 


Val 


Asp 


Asp 


Glu 






275 










280 










285 








Ser 


Phe 


He 


Phe 


Thr 


Asp 


Leu 


He 


Thr 


Gly 


Asp 


Leu 


Val 


Pro 


Leu 


Val 



290 



295 



300 
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Arg 


Leu 


Ala 


Pro 


Ser 


Asn 


Glu 


Asp 


He 


Gin 


Arg 


Asp 


Val 


He 


Arg 


Phe 


305 










310 










315 










320 


Asn 


Asp 


Ser 


Thr 


Asp 


He 


Leu 


Met 


Asp 


Ser 


He 


Asp 


Val 


Arg 


Asp 


Val 










325 










330 










335 




Val 


Leu 


Pro 


Val 


Leu 


Ser 


Lys 


He 


He 


Trp 


Gin 


Asn 


Val 


Ser 


Ala 


Arg 








340 










345 










350 






Leu 


Lys 


Leu 


Arg 


Asn 


Asn 


Lys 


Ser 


Leu 


Ser 


Lys 


Leu 


Ala 


Lys 


Trp 


Lys 






355 










360 










365 








Trp 


Asn 


Gly 


Met 


Val 


Ser 


Thr 


His 


Asp 


Asn 


Phe 


Asp 


Ser 


Asn 


Asp 


Tyr 




370 










375 










380 










Val 


He 


Glu 


His 


Lys 


Arg 


Gin 


Leu 


Ala 


Ala 


Asp 


He 


Met 


Ser 


Asp 


Ser 


385 










390 










395 










400 


Leu 


Ser 


Lys 


Asn 


His 


Leu 


Pro 


Asn 


Phe 


Ser 


Lys 


Thr 


He 


Thr 


Glu 


Tyr 










405 










410 










415 




Asp 


Glu 


Lys 


Glu 


Asn 


Lys 


Thr 


Thr 


Pro 


Leu 


He 


Cys 


Trp 


Asn 


Tyr 


He 








420 










425 










430 






Phe 


Glu 


Leu 


Ser 


Pro 


Met 


Gly 


Lys 


His 


Leu 


Phe 


Pro 


Leu 


Glu 


Glu 


Val 






435 










440 










445 








Cys 


Gly 


Phe 


Tyr 


Glu 


Ala 


Ser 


Leu 


Pro 


Leu 


He 


Thr 


Pro 


Trp 


Gin 


Leu 




450 










455 










460 










Lys 


Val 


Val 


Gin 


Lys 


Lys 


Arg 


Gly 


Arg 


Gin 


Met 


Val 


He 


Tyr 


Gly 


Pro 


4 65 










470 










475 










480 


Arg 


Lys 


Arg 


Pro 


Arg 


Thr 


Gin 





















485 



<210> 234 
<211> 4698 
<212> DNA 
<213> SHRIMP 



<400> 234 

atgtcaacaa 

aatagccgt t 

ggcgatagcg 

acccttgcaa 

gtaaaaatac 

ata tctacaa 

tcatctctga 

gagaggtt ta 

tctttagaga 

catgatgaga 

aata tatccc 

ttgaaagacg 

acctctagat 

aaggtgtgga 

gcaaagaaaa 

gcttcaaact 

actgaatt tg 

ttaaagga tt 

ctttatgaac 

acgaccgaac 

ggagatgtat 

caaaaaact t 

aaaaacccag 

gagaaaacag 

aataaaacac 

tttcttacat 

ttaaaattaa 

atatattata 

gtgggactt t 



cgcaaacgca 
tagct tgcct 
aaatttctct 
acctcaccga 
acccgtctgt 
aatctctaaa 
cagaacaaaa 
tcgaaggcga 
gattggcagg 
aagaaatacg 
atctgaatca 
cctatggtgt 
tcacagggaa 
taaacacccc 
ttctctatca 
ccaagtcagt 
gtatgcacta 
tggttacacg 
caacagcaac 
aaaaggaaaa 
cttccaccct 
ctaccgatac 
ttacttgctt 
actggataga 
tcgttgtcga 
tcggtcaaaa 
ttctgagaga 
ataatgtaaa 
tctcgggcgg 



aacaatagaa 
attggctgaa 
accgcttgtt 
ggggcgttta 
gtttattata 
tgcattagtg 
atttcctcga 
aattaacaac 
gcttttgcct 
gctagatctt 
agaagaatgg 
gttttatgcc 
attattacaa 
cctccagata 
acatccctct 
ttcttccatt 
ctttacggcc 
gtattatcaa 
agcaacagca 
aatcgcacaa 
ttctaaggaa 
caacattgtc 
ctttgggcct 
aatgcacctt 
cagaaaaagt 
taatactaca 
cgattctggg 
tctagaaggg 
atctgctatt 



cggcccctgc 
gggctacagc 
aatgcgggaa 
ggaagtgaaa 
gagacgaata 
gaaaaaagag 
ggaggaggag 
atcaagctga 
gttgtcatta 
aaaggaaacg 
gaaatggaaa 
catcacggca 
caccctgtga 
gcatttcctg 
cttacaaggc 
atcataccag 
cagtgctttt 
ctatcattta 
gcctcctcct 
tcgattttat 
tacgatgaaa 
ccttcagggg 
caatacactt 
tttttaacct 
aacgtgtcgg 
gcctttatac 
gaatcttctg 
agggaattct 
acggtaggag 



ctggtaagaa 
aacaacaaca 
cttttgcgtg 
cagaaaatgc 
aagagatgac 
cacgagaagc 
gatgttattc 
acatggaaga 
atattaagga 
acggcatgga 
gactttcttc 
tacttgatat 
tatttcgtct 
acaccagtaa 
tacgtgattt 
agttgtcaaa 
ttggaaaaaa 
aaaataaacc 
cctcctcaac 
cgtctaaagg 
atagaaaacg 
cacctacttc 
caatcatgga 
ctcttaacga 
aaatacacga 
cagatgttga 
ctgctataat 
caaatgtctc 
atattgctcg 



caacgaagac 
acaacaagat 
ttatgattct 
aaaaataagg 
tattgaagaa 
acgacgattt 
tcgtaaaaat 
aactgcttcg 
ctggacaatg 
agaacttgtc 
ctcaatagta 
tgttcttacg 
aatggatgtg 
gaacccaaat 
aaatgacatg 
a t ttaactcg 
taccaattct 
tcaacccaaa 
cgcctcattg 
aaaatcttta 
tacaaagagg 
catttccatg 
ctgtatatcg 
cgctgaacat 
ttctggaagg 
tattcctacg 
agcatctctg 
tgatgctgtt 
agagattgca 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 
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tccatataca 
atactggccg 
gtggtgggtc 
ccagtgaaag 
cttcttccat 
caactaacag 
cacaaatttt 
tatgcagtta 
cctcccttta 
cagagtattt 
tcttctccat 
cctgtaccaa 
cctctaatat 
ttttcttctg 
agaaatgtga 
ggtgaattgg 
gatatggaaa 
tcttcttcat 
gttagcgcat 
aaacaagtca 
ctctcgtctg 
atgaggatct 
agtaaagtgg 
gatactggat 
tcaggcgaag 
attgctattg 
agagcgatcc 
aatgtgcctg 
tttaggaatg 
atgaaaatta 
gttgcactag 
aatgggatta 
gggagacttt 
ggtgctcttg 
aataccattg 
aataataaca 
agagactctt 
gaccgtataa 
tttattgctt 
ctcaacaaga 
aatccgcgcc 
gcaacaagag 
actcaattgg 
aaatccccag 
attgctgcta 
attgttaaca 
ttttatataa 
gctgt taata 
tcacagacat 
agaaatataa 



atattggaag 
gaagaagaag 
tttatcggtc 
tggatgagac 
tttttgattg 
attatatagt 
tattattcaa 
ttcattcggc 
ttgtagaaaa 
tgaatggaga 
catcctcctc 
caggacggtt 
cctctaagga 
aaaccatttc 
catatggtca 
tgtcgtactt 
ttggtgatat 
tcgttgatgg 
tcctccctgg 
aatattcggt 
ctgtgcttta 
ataaggaatg 
gcacagtggc 
tacccacaac 
actcaaaaat 
cagccgacca 
aacaacaaca 
cccgtttcga 
ttattgctac 
ttaaagaagt 
atcctagaaa 
ctattgatac 
ttataaatga 
gaactggtag 
gaagtactac 
atactgttcc 
ctctattcac 
agacaaacat 
caaaggcaca 
ttccagaaga 
atatccttac 
attcggctat 
aggaactttt 
cttatgt ttc 
tacgagagaa 
ttgatgagct 
aaatgcccaa 
tagcagcaac 
ctgttttata 
ccatgtga 



agaatctaac 
ttatggacgg 
gtgtttagaa 
cgcttggatg 
cgttttgaaa 
acgtgaagtg 
gaagcccgtg 
agccgataat 
tgatacgtct 
taatgtagcc 
cagatattca 
tttaaagatg 
acaaaaacaa 
aaaatcgttc 
gaatatcgca 
taaagttgta 
taataaccat 
tgtacggaca 
tacgacttct 
aaaggaattg 
tgaagcctca 
tgttagtcct 
aggaactgga 
tctagaaagg 
ggaagtaatt 
actcagaggc 
acagcagcaa 
gcctacgttc 
tattatatct 
gtgcgagcac 
ggccattgaa 
tggaaatgaa 
tgaagctagt 
aagggtgcct 
tcaacaacaa 
cgaggatata 
atctaaactt 
ttcagaaaca 
gaaacctatc 
cttgaacttt 
tgacatgcaa 
cgtcatgcaa 
atctcaatgg 
aagtgtaaaa 
ttcagaaaaa 
aggtatagtg 
tgtgtgggct 
tccccttatt 
tgactcttct 



tgcgattcca 
cagtacagat 
acaatgacaa 
tacatgcacc 
tcgggcgtat 
ctactaaaat 
atggatctaa 
ggaggcgtgt 
ctcagatact 
gaaaacttaa 
tcagaaaccc 
aataaggatc 
caacagcagc 
ttaaattatg 
gaagatggat 
aagaatactg 
caagacgaca 
agtttttcgg 
cagcccacaa 
ggcatggaaa 
aaaactaaat 
ctatctacag 
ttcgaattta 
ttgtcaccca 
gcagaatctg 
agtaataata 
caacaagaaa 
acagagatcg 
agaatgatga 
gtgtctcata 
gaaatactgg 
gggtatggat 
gaagaagctg 
gtagaattac 
caacagcagc 
aaggttcaca 
ttgaattaca 
ctaaagaaat 
ccatggaaac 
tctcctgcac 
tggctcaact 
agtttccaag 
aataatattg 
ttggaatggt 
tcaaaaattg 
gctgttgcca 
tctagggatt 
aataatatat 
cttgctctta 



tactattccc 
ggtacgatcc 
gaaatattat 
agcaggtact 
gggctgtaaa 
atacagccga 
ttgcaaagat 
gccttgcttt 
atacactgac 
agtctgccac 
ctattagggt 
tggaactttt 
aaacaactgc 
taccacctaa 
tccttggatt 
aacgcgatga 
ctggatccct 
ttgatgggaa 
atcttcctgt 
ctgtattctt 
cgacacaaca 
ggaggatcga 
tttggaaagt 
agattccttc 
gtaaaggtgt 
tagtaggtgg 
cacaagcggt 
aactatttct 
tgcttgtcag 
ttatggtcga 
aaaggataac 
ctttacgcta 
cagcggccat 
gttcaatatt 
agcgtcaaca 
atgaacaaat 
ttagggatga 
acagtagaat 
acaccaagga 
agaatctgtt 
gtatttccat 
agcaagcgga 
tatctcaagt 
taaataatga 
taatgggtgt 
ggtcaatagt 
ggaagaacct 
cgaggggtat 
ttgctgccga 



aggggagcct 
cattaactgc 
gagaggacag 
tcaagttgtt 
ggaggctaga 
tcctgatcaa 
tgttacacat 
ccctagagat 
cgatacacct 
ttcagtcgct 
agttaatctt 
cattaatgta 
cactgctcca 
atctctaaca 
gaaaaataaa 
tggtataaag 
ttcttcttct 
aattgaacat 
tcatgcatct 
tgaacccctt 
tttaagccca 
tattttcccc 
tcttcagtat 
tgtaccaatt 
acaaaacatc 
tggaacaaga 
tgttccagtt 
tcagaacaaa 
caacgaagag 
tggattatat 
agctgaacaa 
tgcttcttcg 
cggcggtgga 
ggacaaatta 
gaggcaggct 
gcaaaaaatt 
tggaagaaag 
cccatcatac 
caatatcaac 
tgttccagtt 
catcgaaact 
caagactaca 
aactgatgaa 
agctagccgt 
tcaggggaag 
agacgttgat 
gatttattat 
catggctgct 
gcaagctaca 



1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4698 



<210> 235 
<211> 1551 
<212> PRT 
<213> SHRIMP 



<400> 235 

Met Ser Thr Thr Gin Thr Gin Thr lie Glu Arg Pro Leu Pro Gly Lys 

15 10 15 

Asn Asn Glu Asp Asn Ser Arg Leu Ala Cys Leu Leu Ala Glu Gly Leu 
20 25 30 
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Gin Gin Gin Gin Gin Gin Gin Asp Gly Asp Ser Glu lie Ser Leu Pro 

35 40 45 

Leu Val Asn Ala Gly Thr Phe Ala Cys Tyr Asp Ser Thr Leu Ala Asn 

50 55 60 

Leu Thr Glu Gly Arg Leu Gly Ser Glu Thr Glu Asn Ala Lys lie Arg 
65 70 75 80 



Val 


Lys 


He 


His 


Pro 


Ser 


Val 


Phe 


He 


He 


Glu 


Thr 


Asn 


Lys 


Glu 


Met 








85 










90 










95 




Thr 


lie 


Glu 


Glu 


He 


Ser 


Thr 


Lys 


Ser 


Leu 


Asn 


Ala 


Leu 


Val 


Glu 


Lys 








100 










105 










110 






Arg 


Ala 


Arg 


Glu 


Ala 


Arg 


Arg 


Phe 


Ser 


Ser 


Leu 


Thr 


Glu 


Gin 


Lys 


Phe 






115 










120 










125 








Pro 


Arg 


Gly 


Gly 


Gly 


Gly 


Cys 


Arg 


Lys 


Asn 


Glu 


Arg 


Phe 


He 


Glu 


Gly 




130 










135 










140 










Glu 


He 


Asn 


Asn 


He 


Lys 


Leu 


Asn 


Met 


Glu 


Glu 


Thr 


Ala 


Ser 


Ser 


Leu 


145 










150 










155 










160 


Glu 


Arg 


Leu 


Ala 


Gly 


Leu 


Leu 


Pro 


Val 


Val 


He 


Asn 


He 


Lys 


Asp 


Trp 










165 










170 










175 




Thr 


Met 


His 


Asp 


Glu 


Lys 


Glu 


He 


Arg 


Leu 


Asp 


Leu 


Lys 


Gly 


Asn 


Asp 








180 










185 










190 






Gly 


Met 


Glu 


Glu 


Leu 


Val 


Asn 


He 


Ser 


His 


Leu 


Asn 


Gin 


Glu 


Glu 


Trp 






195 










200 










205 








Glu 


Met 


Glu 


Arg 


Leu 


Ser 


Ser 


Ser 


He 


Val 


Leu 


Lys 


Asp 


Ala 


Tyr 


Gly 




210 










215 










220 










Val 


Phe 


Tyr 


Ala 


His 


His 


Gly 


He 


Leu 


Asp 


He 


Val 


Leu 


Thr 


Thr 


Ser 


225 










230 










235 










240 


Arg 


Phe 


Thr 


Gly 


Lys 


Leu 


Leu 


Gin 


His 


Pro 


Val 


He 


Phe 


Arg 


Leu 


Met 










245 










250 










255 




Asp 


Val 


Lys 


Val 


Trp 


He 


Asn 


Thr 


Pro 


Leu 


Gin 


He 


Ala 


Phe 


Pro 


Asp 








260 










265 










270 






Thr 


Ser 


Lys 


Asn 


Pro 


Asn 


Ala 


Lys 


Lys 


He 


Leu 


Tyr 


Gin 


His 


Pro 


Ser 






275 










280 










285 








Leu 


Thr 


Arg 


Leu 


Arg 


Asp 


Leu 


Asn 


Asp 


Met 


Asn 


Ser 


Lys 


Ser 


Val 


Ser 




290 










295 










300 










Ser 


He 


He 


He 


Pro 


Glu 


Leu 


Ser 


Lys 


Phe 


Asn 


Ser 


Thr 


Glu 


Phe 


Gly 


305 










310 










315 










320 


Met 


His 


Tyr 


Phe 


Thr 


Ala 


Gin 


Cys 


Phe 


Phe 


Gly 


Lys 


Asn 


Thr 


Asn 


Ser 










325 










330 










335 




Leu 


Lys 


Asp 


Leu 


Val 


Thr 


Arg 


Tyr 


Tyr 


Gin 


Leu 


Ser 


Phe 


Lys 


Asn 


Lys 








340 










345 










350 






Pro 


Gin 


Pro 


Lys 


Leu 


Tyr 


Glu 


Pro 


Thr 


Ala 


Thr 


Ala 


Thr 


Ala 


Ala 


Ser 






355 










360 










365 








Ser 


Ser 


Ser 


Ser 


Thr 


Ala 


Ser 


Leu 


Thr 


Thr 


Glu 


Gin 


Lys 


Glu 


Lys 


He 




370 










375 










380 










Ala 


Gin 


Ser 


He 


Leu 


Ser 


Ser 


Lys 


Gly 


Lys 


Ser 


Leu 


Gly 


Asp 


Val 


Ser 


385 










390 










395 










400 


Ser 


Thr 


Leu 


Ser 


Lys 


Glu 


Tyr 


Asp 


Glu 


Asn 


Arg 


Lys 


Arg 


Thr 


Lys 


Arg 










405 










410 










415 




Gin 


Lys 


Thr 


Ser 


Thr 


Asp 


Thr 


Asn 


He 


Val 


Pro 


Ser 


Gly 


Ala 


Pro 


Thr 








420 










425 










430 






Ser 


He 


Ser 


Met 


Lys 


Asn 


Pro 


Val 


Thr 


Cys 


Phe 


Phe 


Gly 


Pro 


Gin 


Tyr 






435 










440 










445 








Thr 


Ser 


He 


Met 


Asp 


Cys 


He 


Ser 


Glu 


Lys 


Thr 


Asp 


Trp 


He 


Glu 


Met 




450 










455 










460 










His 


Leu 


Phe 


Leu 


Thr 


Ser 


Leu 


Asn 


Asp 


Ala 


Glu 


His 


Asn 


Lys 


Thr 


Leu 


465 










470 










475 










480 


Val 


Val 


Asp 


Arg 


Lys 


Ser 


Asn 


Val 


Ser 


Glu 


He 


His 


Asp 


Ser 


Gly 


Arg 










485 










490 










495 




Phe 


Leu 


Thr 


Phe 


Gly 


Gin 


Asn 


Asn 


Thr 


Thr 


Ala 


Phe 


He 


Pro 


Asp 


Val 








500 










505 










510 






Asp 


He 


Pro 


Thr 


Leu 


Lys 


Leu 


He 


Leu 


Arg 


Asp 


Asp 


Ser 


Gly 


Glu 


Ser 
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515 










520 










525 








Ser 


Ala 


Ala 


He 


He 


Ala 


Ser 


Leu 


He 


Tyr 


Tyr 


Asn 


Asn 


Val 


Asn 


Leu 




530 










535 










540 










Glu 


Gly 


Arg 


Glu 


Phe 


Ser 


Asn 


Val 


Ser 


Asp 


Ala 


Val 


Val 


Gly 


Leu 


Phe 


545 










550 










555 










560 


Ser 


Gly 


Gly 


Ser 


Ala 


He 


Thr 


Val 


Gly 


Asp 


He 


Ala 


Arg 


Glu 


He 


Ala 










565 










570 










575 




Ser 


lie 


Tyr 


Asn 


He 


Gly 


Arg 


Glu 


Ser 


Asn 


Cys 


Asp 


Ser 


He 


Leu 


Phe 








580 










585 










590 






Pro 


Gly 


Glu 


Pro 


He 


Leu 


Ala 


Gly 


Arg 


Arg 


Ser 


Tyr 


Gly 


Arg 


Gin 


Tyr 






595 










600 










605 








Arg 


Trp 


Tyr 


Asp 


Pro 


He 


Asn 


Cys 


Val 


Val 


Gly 


Arg 


Ser 


Cys 


Leu 


Glu 




610 










615 










620 










Thr 


Met 


Thr 


Arg 


Asn 


He 


Met 


Arg 


Gly 


Gin 


Pro 


Val 


Lys 


Val 


Asp 


Glu 


625 










630 










635 










640 


Thr 


Ala 


Trp 


Met 


Tyr 


Met 


His 


Gin 


Gin 


Val 


Leu 


Gin 


Val 


Val 


Leu 


Leu 










645 










650 










655 




Pro 


Phe 


Phe 


Asp 


Cys 


Val 


Leu 


Lys 


Ser 


Gly 


Val 


Trp 


Ala 


Val 


Lys 


Glu 








660 










665 










670 






Ala 


Arg 


Gin 


Leu 


Thr 


Asp 


Tyr 


He 


Val 


Arg 


Glu 


Val 


Leu 


Leu 


Lys 


Tyr 






675 










680 










685 








Thr 


Ala 


Asp 


Pro 


Asp 


Gin 


His 


Lys 


Phe 


Leu 


Leu 


Phe 


Lys 


Lys 


Pro 


Val 




690 










695 










700 










Met 


Asp 


Leu 


He 


Ala 


Lys 


He 


Val 


Thr 


His 


Tyr 


Ala 


Val 


He 


His 


Ser 


705 










710 










715 










720 


Ala 


Ala 


Asp 


Asn 


Gly 


Gly 


Val 


Cys 


Leu 


Ala 


Phe 


Pro 


Arg 


Asp 


Pro 


Pro 










725 










730 










735 




Phe 


He 


Val 


Glu 


Asn 


Asp 


Thr 


Ser 


Leu 


Arg 


Tyr 


Tyr 


Thr 


Leu 


Thr 


Asp 








740 










745 










750 






Thr 


Pro 


Gin 


Ser 


He 


Leu 


Asn 


Gly 


Asp 


Asn 


Val 


Ala 


Glu 


Asn 


Leu 


Lys 






755 










760 










765 








Ser 


Ala 


Thr 


Ser 


Val 


Ala 


Ser 


Ser 


Pro 


Ser 


Ser 


Ser 


Ser 


Arg 


Tyr 


Ser 




770 










775 










780 










Ser 


Glu 


Thr 


Pro 


He 


Arg 


Val 


Val 


Asn 


Leu 


Pro 


Val 


Pro 


Thr 


Gly 


Arg 


785 










790 










795 










800 


Phe 


Leu 


Lys 


Met 


Asn 


Lys 


Asp 


Leu 


Glu 


Leu 


Phe 


He 


Asn 


Val 


Pro 


Leu 










805 










810 










815 




lie 


Ser 


Ser 


Lys 


Glu 


Gin 


Lys 


Gin 


Gin 


Gin 


Gin 


Gin 


Thr 


Thr 


Ala 


Thr 








820 










825 










830 






Ala 


Pro 


Phe 


Ser 


Ser 


Glu 


Thr 


He 


Ser 


Lys 


Ser 


Phe 


Leu 


Asn 


Tyr 


Val 






835 










840 










845 








Pro 


Pro 


Lys 


Ser 


Leu 


Thr 


Arg 


Asn 


Val 


Thr 


Tyr 


Gly 


Gin 


Asn 


He 


Ala 




850 










855 










860 










Glu 


Asp 


Gly 


Phe 


Leu 


Gly 


Leu 


Lys 


Asn 


Lys 


Gly 


Glu 


Leu 


Val 


Ser 


Tyr 


865 










870 










875 










880 


Phe 


Lys 


Val 


Val 


Lys 


Asn 


Thr 


Glu 


Arg 


Asp 


Asp 


Gly 


He 


Lys 


Asp 


Met 










885 










890 










895 




Glu 


He 


Gly 


Asp 


He 


Asn 


Asn 


His 


Gin 


Asp 


Asp 


Thr 


Gly 


Ser 


Leu 


Ser 








900 










905 










910 






Ser 


Ser 


Ser 


Ser 


Ser 


Phe 


Val 


Asp 


Gly 


Val 


Arg 


Thr 


Ser 


Phe 


Ser 


Val 






915 










920 










925 








Asp 


Gly 


Lys 


He 


Glu 


His 


Val 


Ser 


Ala 


Phe 


Leu 


Pro 


Gly 


Thr 


Thr 


Ser 




930 










935 










940 










Gin 


Pro 


Thr 


Asn 


Leu 


Pro 


Val 


His 


Ala 


Ser 


Lys 


Gin 


Val 


Lys 


Tyr 


Ser 


945 










950 










955 










960 


Val 


Lys 


Glu 


Leu 


Gly 


Met 


Val 


Phe 


Phe 


Glu 


Pro 


Leu 


Leu 


Ser 


Ser 


Ala 










965 










970 










975 




Val 


Leu 


Tyr 


Glu 


Ala 


Ser 


Lys 


Thr 


Lys 


Ser 


Thr 


Gin 


His 


Leu 


Ser 


Pro 








980 










985 










990 






Met 


Arg 


He 


Tyr 


Lys 


Glu 


Cys 


Val 


Ser 


Pro 


Leu 


Ser 


Thr 


Gly 


Arg 


He 



995 1000 1005 
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Asp lie Phe Pro Ser Lys Val Gly Thr Vai Ala Gly Thr Gly Phe Glu 

1010 1015 1020 

Phe lie Trp Lys Val Leu Gin Tyr Asp Thr Gly Leu Pro Thr Thr Leu 
1025 1030 1035 1040 

Glu Arg Leu Ser Pro Lys lie Pro Ser Val Pro lie Ser Gly Glu Asp 

1045 1050 1055 

Ser Lys Met Glu Val He Ala Glu Ser Gly Lys Gly Val Gin Asn lie 

1060 1065 1070 

He Ala He Ala Ala Asp Gin Leu Arg Gly Ser Asn Asn lie Val Gly 

1075 1080 1085 

Gly Gly Thr Arg Arg Ala He Gin Gin Gin Gin Gin Gin Gin Gin Gin 

1090 1095 1100 

Glu Thr Gin Ala Val Val Pro Val Asn Val Pro Ala Arg Phe Glu Pro 
1105 1110 1115 1120 

Thr Phe Thr Glu He Glu Leu Phe Leu Gin Asn Lys Phe Arg Asn Val 

1125 1130 1135 

He Ala Thr He He Ser Arg Met Met Met Leu Val Ser Asn Glu Glu 

1140 1145 1150 

Met Lys He He Lys Glu Val Cys Glu His Val Ser His He Met Val 

1155 1160 1165 

Asp Gly Val Asp Pro Arg Lys Ala He Glu Glu He Arg He Thr Ala 

1170 1175 1180 

Glu Gin Asn Gly He Thr He Asp Thr Gly Asn Glu Gly Tyr Gly Ser 
1185 1190 1195 1200 

Leu Arg Tyr Ala Ser Ser Gly Arg Leu Phe He Asn Asp Glu Ala Ser 

1205 1210 1215 

Glu Glu Ala Ala Ala Ala He Gly Gly Gly Gly Ala Leu Gly Thr Gly 

1220 1225 1230 

Arg Arg Val Pro Val Glu Leu Arg Ser He Leu Asp Lys Leu Asn Thr 

1235 1240 1245 

He Gly Ser Thr Thr Gin Gin Gin Gin Gin Gin Gin Arg Gin Gin Arg 

1250 1255 1260 

Gin Ala Asn Asn Asn Asn Thr Val Pro Glu Asp He Lys Val His Asn 
1265 1270 1275 1280 

Glu Gin Met Gin Lys He Arg Asp Ser Ser Leu Phe Thr Ser Lys Leu 

1285 1290 1295 

Leu Asn Tyr He Arg Asp Asp Gly Arg Lys Asp Arg He Lys Thr Asn 

1300 1305 1310 

He Ser Glu Thr Leu Lys Lys Tyr Ser Arg He Pro Ser Tyr Phe He 

1315 1320 1325 

Ala Ser Lys Ala Gin Lys Pro He Pro Trp Lys His Thr Lys Asp Asn 

1330 1335 1340 

He Asn Leu Asn Lys He Pro Glu Asp Leu Asn Phe Ser Pro Ala Gin 
1345 1350 1355 1360 

Asn Leu Phe Val Pro Val Asn Pro Arg His He Leu Thr Asp Met Gin 

1365 1370 1375 

Trp Leu Asn Cys He Ser He He Glu Thr Ala Thr Arg Asp Ser Ala 

1380 1385 1390 

He Val Met Gin Ser Phe Gin Glu Gin Ala Asp Lys Thr Thr Thr Gin 

1395 1400 1405 

Leu Glu Glu Leu Leu Ser Gin Trp Asn Asn He Val Ser Gin Val Thr 

1410 1415 1420 

Asp Glu Lys Ser Pro Ala Tyr Val Ser Ser Vai Lys Leu Glu Trp Leu 
1425 1430 1435 1440 

Asn Asn Glu Ala Ser Arg He Ala Ala He Arg Glu Asn Ser Glu Lys 

1445 1450 1455 

Ser Lys He Val Met Gly Val Gin Gly Lys He Val Asn He Asp Glu 

1460 1465 1470 

Leu Gly He Val Ala Val Ala Arg Ser He Val Asp Vai Asp Phe Tyr 

1475 1480 1485 

He Lys Met Pro Asn Val Trp Ala Ser Arg Asp Trp Lys Asn Leu He 
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1490 1495 
Tyr Tyr Ala Val Asn lie Ala Ala 
1505 1510 
Arg Gly lie Met Ala Ala Ser Gin 
1525 

Leu Ala Leu lie Ala Ala Glu Gin 
1540 



1500 

Thr Pro Leu lie Asn Asn lie Ser 

1515 1520 
Thr Ser Val Leu Tyr Asp Ser Ser 

1530 1535 
Ala Thr Arg Asn lie Thr Met 
1545 1550 



<210> 236 
<211> 669 
<212> DNA 
<213> SHRIMP 



<400> 236 

atggaatgga 

gcagtaatac 

agaggagaaa 

gtacaacaag 

aataacccct 

cctcctgata 

agagtagatg 

aaacaatata 

tataatagct 

gctgccatca 

aaaatattgg 

atccaataa 



taaaccaacg 
gttcggcagc 
aggcatggac 
attatactga 
tt ttaaagag 
tagaggatga 
gagccaatat 
tatctgataa 
ccaagaataa 
tgaaacaccc 
gtatcggagc 



gacaagtaga 
taagcaagca 
gacttcagca 
tgatgatatt 
atatgcaaaa 
tgtcattata 
ctatttcgcc 
agtgctgtgt 
gtggacgtat 
cgactacaag 
atcggaaaaa 



gaggatcttt 
ctggctattg 
gcagcagcag 
acacaagtgt 
cttatagata 
cacactagag 
ataattgacg 
ggttctctca 
ggatgtaacc 
gaagagacta 
ctgaacatta 



tcaacacgta 
aaaaacacgc 
cttcttctaa 
ctattgcaaa 
atttagcaat 
atgcctccaa 
gtgatttatg 
accgggaaaa 
taaactttga 
catctacaaa 
cccactattt 



tacagggaat 
agcagaaaga 
ttttaataat 
cagtgttttg 
atcttcttta 
ctctacagtc 
tgtataccct 
ggcactgt tc 
tatcgttgac 
acatatacgt 
aaactacttt 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

669 



<210> 237 
<211> 218 
<212> PRT 
<213> SHRIMP 



<400> 237 



Met 


Glu 


Trp 


He 


1 








Tyr 


Thr 


Gly 


Asn 








20 


Lys 


His 


Ala 


Ala 






35 




Ala 


Ala 


Ala 


Ala 




50 






Asp 


Asp 


Asp 


He 


65 








Pro 


Phe 


Leu 


Lys 


Ser 


Leu 


Pro 


Pro 








100 


Asn 


Ser 


Thr 


Val 






115 




Asp 


Gly 


Asp 


Leu 




130 






Leu 


Cys 


Gly 


Ser 


145 








Lys 


Asn 


Lys 


Trp 


Ala 


Ala 


He 


Met 








180 


Lys 


His 


He 


Arg 






195 




lie 


Thr 


His 


Tyr 



Asn Gin Arg Thr 

5 

Ala Val He Arg 

Glu Arg Arg Gly 
40 

Ser Ser Asn Phe 
55 

Thr Gin Val Ser 
70 

Arg Tyr Ala Lys 

85 

Asp He Glu Asp 

Arg Val Asp Gly 
120 

Cys Val Tyr Pro 
135 

Leu Asn Arg Glu 
150 

Thr Tyr Gly Cys 
165 

Lys His Pro Asp 

Lys He Leu Gly 
200 

Leu Asn Tyr Phe 



Ser Arg Glu Asp 
10 

Ser Ala Ala Lys 
25 

Glu Lys Ala Trp 

Asn Asn Val Gin 
60 

He Ala Asn Ser 
75 

Leu He Asp Asn 
90 

Asp Val He He 
105 

Ala Asn He Tyr 

Lys Gin Tyr He 
140 

Lys Ala Leu Phe 
155 

Asn Leu Asn Phe 
170 

Tyr Lys Glu Glu 
185 

He Gly Ala Ser 
He Gin 



Leu Phe Asn Thr 
15 

Gin Ala He Glu 
30 

Thr Thr Ser Ala 
45 

Gin Asp Tyr Thr 

Val Leu Asn Asn 
80 

Leu Ala He Ser 
95 

His Thr Arg Asp 
110 

Phe Ala He He 
125 

Ser Asp Lys Val 

Tyr Asn Ser Ser 
160 

Asp He Val Asp 

175 

Thr Thr Ser Thr 
190 

Glu Lys Leu Asn 
205 
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210 215 



<210> 238 
<211> 6948 
<212> DNA 
<213> SHRIMP 

<400> 238 

atgagcgata cagggcagat ggaagaaaat 
gatgaagaag aggaagaaac tggtagtagt 
gacgccacgt actccatgta cactggagaa 
cctaaagaaa gaagtgtaca aagggtgcaa 
caacgttcta atgttcggac acggagacct 
gatcgatcgc gagaacggtc agaaaaactt 
gaacggcaaa agaaatatac tccaaaggta 
tttcgtgaag gaggaagaaa attcaaggcg 
gcaaccaatg ttcttgaaag ggaagaaatt 
gaaattacag gagatactat attaggtggg 
gaggatgaat taactataca acattcatct 
gttgtcagtt ctcctatacc gccaaagccc 
gaagatatag tggggaaaaa tattagccag 
gacgaagaag acgaacattt gtacgaagaa 
gcagcagcag cagcttccac aagacctccc 
gttgttgctg ttgcagacga aaccttgaag 
gaggccgagg aacaaagagc ggccgcagtt 
gcggccgctg ctgctgctgc tgctgctgct 
gaaagagaag cagaggaaca aagagcggcc 
cgggaagcag acgaaaaaaa ggaaagagaa 
gctgctgccg ctgctgccca acgggaagca 
caaagagcgg ccgctgctgc tgaaagggaa 
gaacaaatgc gcataaagga agaggagagg 
aaacgtcgag aactagcagc caaggaggaa 
gagcaacttg aaaaattgaa tttccagttg 
gaacaaatgt tagaagaaga gaaggcctca 
gcgatccaag caatagaata tgaggatgaa 
ttagttccta tggatacgga tttgtacgga 
gtacagaata atacattaac ttctgcattt 
caattggttg ctcaatccct tgaaaaatcc 
aaacatcttc ctgtggatga tcctgctttt 
tctttaaaga atctgggaaa tgtttacaaa 
agggaccttt ttaaatcgaa ggcattaatt 
cgtctaacag atgtatcgac agatctcagg 
ttggatgtat ccgttaacat acgctctggt 
gacacggcat tagcagatca agttccttcg 
gccttattgc cattttcatc tgcaggtttg 
tacaatgaaa tagtgaacca actaagcagt 
aatattgtcc caacattgaa ccaaatcaaa 
agctctcgtc aatatgctat tgaagaaagg 
gaaataagaa aattcctcgc tataatgaac 
tggacggatg aaagacaacg ctctat tgct 
gataaaatta aagagagtgt tgctacacta 
agtaatcccc tcctgcacaa atcctcagtt 
aacgacttta gaaattttct cgatatccaa 
tcaggccaaa atattgatga cctttcactg 
ttgtgcctcg aattatccat aattttagac 
ttgcctgcaa ttacttaccc cgcaggggaa 
gttgacatta gacaagaaat tggtaagaac 
acactgtcag aggcgttaca aatttttcag 
caacagcagc tgttacaaca acaacaggac 
caaatagaag aacaacaacg ggttcaggaa 



aggcctgcta cccagaaaag gagacctgga 60 
aatgttccat attatgccaa ctttggcgat 120 
ggaaaaaggg gtaaatttgt attagagcca 180 
aaaccaccta aagaaaagga ggaaagggaa 24 0 
ggtcaagagt ttgaacagaa agtgctacaa 300 
gggcaaaatt tggcagagaa aggattgcaa 360 
gcacaaacaa tgacaaaaaa aataatcagg 420 
ccgcaacagc agacatctga caaaggtgca 4 80 
gagatggctg cagaaagaga acaaccagta 540 
ctaggagaag aagatgacga agatatggga 600 
atggctgtat cacaacccgt tcaacaaatc 660 
actaggcccg ctcctgatat tcccatacaa 720 
ttaccaccat taccacttga tgattatgag 780 
gtgaatgatt tcttagtggc accaccaaca 840 
aggcctaata ttcctcctcc acctcctcct 900 
aacttggctt caattgcagc cttggaaaag 960 
gaaagggaaa gagaagtaga ggaacaaaga 1020 
gccgcccaac gggaagcaga cgaaaaaagg 1080 
gctgctgctg ctgctgccgc tgctgcccaa 114 0 
gtagaggaac aaagagcggc cgctgctgct 12 00 
gacgaaaaaa gggaaagaga agtagaggaa 12 60 
atacttgccc aacaacttca agaaatgaaa 1320 
cggaaagaac tagcagataa ggaggaagaa 1380 
gaaaagcgtc aagaaatatt agctaaagaa 144 0 
ggtacagaaa tcacgtccaa aagagcactc 1500 
cgctcacggt ccgcagccag tgcacagtta 1560 
cttcctcagg cagtcgaacc tcaaggacag 1620 
aaaatgtacg atctcaataa gaaattagaa 1680 
gaagacgtga acaaaacaaa cgaacagaac 17 40 
gctaaagcca ttgaaaaatt aactagtcaa 1800 
atgcagagaa taataacaga gagggatttt 18 60 
agagttctcg gggtctattt tacattaaaa 1920 
acagataaag aatcaaggga tctggaggtg 1980 
gctaatgatc tcaatacaat actggaaagg 204 0 
ggaacattat acactaaatt tacagaggca 2100 
aggattgaaa taagtaacag atcaagatct 2160 
gatactaatt ttactaatag ttccgacaag 2220 
ataaatgagg ctatgaatat tttgaaagaa 2280 
attgatgtca ccaatctatt aacagtttca 2340 
gtgtattctg atgtgtcccg aatggattct 2400 
agtaaaatat ccccttattt taaaggcgat 24 60 
gacagtattt cctctcagat aaaatcaaac 2520 
cacgatatca atacaacctc aagaatacgt 2580 
ttatcatctc cagacttttt aaatgctgtt 2640 
ggaggttctc aatttactta tgatgtcctt 2700 
gcatcaaaaa ccactgaaaa ggttacagaa 2760 
gtgatccata aaaatgcttt gagtttaaat 2820 
acatctatgg aagaaagtgg ttcattggct 2880 
atatcagatt ctagcgctga acttagtcgc 2940 
caacaacagc aacaacaaca gcaacaattc 3000 
caacaaaatc aacaacaatt attacaacaa 3060 
cagcagcaac agcagcaaag ggaccaacaa 3120 
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caacaggaac 
caaagggaac 
ttgcaacaac 
gacgatggtg 
cgtaaggttg 
cgagaagaaa 
gccacccctc 
acaacccagt 
gagaggattg 
attataacta 
ttcaatcaaa 
tcagcttctt 
agcatgcaag 
gcaattactc 
gcggctgagc 
aaagtggatc 
agtacttcca 
cataatgaaa 
gccaatctta 
ctccgcacat 
ttccgccaat 
aaggcactgg 
agtgtcaaaa 
atatttgcta 
aaatttt taa 
ctgtgtgaag 
cccttagccg 
gacacgtcaa 
gtggcttatg 
tcgggtctgg 
tcccaccggt 
caactacaaa 
tgtgttgccg 
caatccatac 
gcagaaaaaa 
gcaaaggaac 
aagttggcga 
aataaagccg 
attccaattg 
gtaattaaac 
tcttcagcct 
attttagatg 
tcaggatatg 
aaattccctg 
gaaagcgtat 
ctcttaatac 
agcagcagca 
caagttgttg 
agacgtgatg 
aaaggagtat 
cttaaaactt 
agacctatat 
ctcgaaggat 
tcatccttga 
cttgccacaa 
ggacaaaaac 
caacaacatc 
gatcaagcta 
tccatgagaa 
agggatgggg 
ataaaccctg 



aacagcaaag 
aacaacagca 
agcaacaatt 
atgaagaaag 
cagaatcagt 
ataacttcca 
cttcacctat 
gtttttcaaa 
tagaagtatg 
ccgcacaaac 
ttgaagagtt 
ctactaatcc 
tttgtgagtt 
tggccaatgc 
gagaaattgc 
taagttcagg 
aacaagaatt 
tggatattgg 
ggctcgtgaa 
ggctaggaga 
acattgatcg 
gtgtttcttt 
atgatgcatc 
tcgacggagg 
ggcaatctat 
cgacaatagc 
ataaagtaga 
ccggagacat 
tgaacctatg 
caaacaaaaa 
ttggatctta 
aacagaatat 
ccatggcagc 
tcttcccact 
atgtcgatat 
taaattcata 
gagtgtgcag 
agagtttagt 
atgataccaa 
catggatggt 
atttctcgtt 
catgggcttg 
aaaatcatat 
ccgacggacg 
tggaagattt 
cgccagatcc 
gcagcaagga 
gacaagaaca 
caaatattga 
caatttcagt 
ccatcatcaa 
ttgatcatca 
cagatttaaa 
ctctggcctc 
aacaagtaga 
ttaatgaaaa 
aatattctga 
gtaatggtgg 
aggcaaaact 
aaaattcatc 
gagcactttt 



ggaacaacag 
gcaacagcag 
tcagcaatta 
agaggaaaga 
agcgacaaaa 
atctaaaata 
catgaacttg 
actaacagct 
caagagtaag 
agaaatggaa 
ttcaaacaat 
tcctcctcct 
ggatgatcaa 
tgtacttcaa 
tctaaagcta 
agctactggt 
ggaagatttc 
attaattttg 
acctcatgga 
aacagatata 
caattggaag 
atcagatacg 
aaaatcagta 
aatgatttct 
gacagataaa 
gtctcttttc 
accacgcctt 
gtttcgtata 
caataatcgt 
agtcgtggcc 
tgatgtcaca 
ctctttactt 
cttcactgcc 
agaaggggga 
ggttcaagaa 
ctacaactgg 
gatgattatc 
agacactaat 
aactctggca 
ttcgttcaag 
tcaaaatatc 
cgcgcccggt 
caaattaaaa 
aggcactgtg 
taccgacttt 
tttactttct 
taatacaatt 
agcaccacca 
accaataata 
actggatga t 
tgatgccata 
aacacagaaa 
gaatggacaa 
cgattggaat 
gaaggaagaa 
attatcattt 
aaggggagga 
catagatgat 
ggcattagcc 
agctaaaccg 
attacgtaga 



cagcaacagc 
caacagagtg 
ctacaacaac 
gaagaagggg 
tatactgctg 
gca tcagcaa 
acaaaattga 
gaaaataata 
aatctcaatt 
gatcgagtga 
gtaaaacaac 
tctactcctt 
cgtaccctag 
actacaaaat 
gagaatggga 
gtttccgacc 
attgcagaag 
gatgccaaga 
ataaa tgtgc 
ttggatgaag 
gtggaagaac 
ttggcacaca 
ccatttgcgt 
agcctttcaa 
gaggttgctc 
acggcatgta 
caagaaaaat 
cgagtttgtc 
ataaactata 
ggtaaaacaa 
tatgattttt 
ttagagaagg 
gacccttctc 
gagat tgtta 
ttgtggaagg 
ttcatataca 
ggtatagtta 
gctctttctg 
ataaatat tg 
caaatgttta 
caacaacaaa 
aagttaacaa 
aaagatga tc 
gtcgaaggat 
tcaattgaag 
tccatgt teg 
a ttggaaagg 
a taaacacgt 
gggacgcctt 
ttcaatgagg 
cgagaaatag 
aatatacact 
cgt tcaggac 
ctaccttctc 
gaagaaaaga 
gtcgtgaata 
ggaatgaaga 
atagaact ta 
gttactaata 
tcaaactttg 
ggaggaggag 



agcaaaggga 
accagtttcg 
aaggaagaag 
ctgaaaagga 
acttgactac 
aattgggaac 
gagaggaata 
gtattatgcg 
taatgggaaa 
agaatatatt 
aacaacaaca 
ctactactcc 
aaaaggctgc 
ccgcttcagc 
aaacatctat 
aacaaaaatg 
aaaactttgt 
agaacgatcc 
agtctttccc 
atactgtaca 
atgagcgtga 
tcaaggacta 
tgaacactct 
gaacagcctt 
aaggacctgt 
gtaaccttct 
tagcggccgc 
atctcatgta 
cgttaaatgt 
ctaaagggca 
cagtattgta 
gtttcaatgc 
tctccatctc 
ttgaaaaaca 
aaactgcact 
gtaaggatac 
aggctgttct 
atattttcaa 
tggtgttcac 
gacagaaaga 
aacatcaaca 
aggcagcaca 
ttttatgggg 
gggcccaaca 
taaacgcacc 
gtaaaggaaa 
gaggattaat 
cgtcggatac 
atagtgtaat 
atagtccaga 
ggcaacgcat 
acagttcacc 
aatcttgggc 
tggagctttt 
gcgaaaggga 
aagctatcgg 
ggtatcagca 
tgaatagtaa 
aaattgcagc 
gcaatagatt 
taagaggagg 



acaacaacaa 
acaacaatta 
aagagggggt 
tgattgtgtg 
cttattccaa 
ccttgtcttt 
ttccacattc 
tattttcccc 
atacttgtac 
gtctggtatt 
acaagctgct 
tcctgttaca 
catagtagag 
tccttccacg 
ccgtatggaa 
gatcgacgaa 
agaaactgca 
gacccgtgat 
atattacgta 
tcctgaatat 
agatacatta 
ctattctccc 
attgtacaac 
tatttaccgt 
tcggtctcaa 
tcgatcgtct 
tgctgctgta 
caattttata 
attgagagcg 
tacatcttct 
caagattctt 
atgggaatcg 
agatgccgac 
tgaaaacgat 
cacactcatg 
cgatatggaa 
cagattaaca 
gctacctgtg 
tttgaataat 
cgggggagtt 
aactgcttcc 
cgtgtttatt 
tgcttcaatg 
atacaataac 
tgcctctgga 
tggtggaagc 
tttaaaccgc 
taaaaaaata 
taaggcaagt 
agatttcgcc 
gacttataca 
taaaattatt 
tccttcttca 
atatagagaa 
agaagataaa 
aactatccaa 
acactctgct 
agatgctact 
agcagcagca 
ggatgaagca 
acaaacaccc 



3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 
5400 
5460 
5520 
5580 
5640 
5700 
5760 
5820 
5880 
5940 
6000 
6060 
6120 
6180 
6240 
6300 
6360 
6420 
6480 
6540 
6600 
6660 
6720 
6780 
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cagagttcaa tgctaacaat gttccgtcct ggacaaactg gtggcaatag tagttggtgg 6840 
actactaata caccccttat tcaacgcaca actagtgttg ggaataattt agttgtgctt 6900 
gtacccaacc tattggattc acaccctcct acatttaatt aggaataa 6948 

<210> 239 
<211> 2293 
<212> PRT 
<213> SHRIMP 



<400> 239 
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Thr 


Gly 


Gin 
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Glu 


Glu 


Asn 


Arg 
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Ala 


Thr 


Gin 


Lys 


1 








5 










10 










15 




Arg 


Arg 


Pro 


Gly 


Asp 


Glu 


Glu 


Glu 


Glu 


Glu 


Thr 


Gly 


Ser 


Ser 


Asn 


Val 








20 










25 










30 






Pro 


Tyr 


Tyr 


Ala 


Asn 


Phe 


Gly 


Asp 


Asp 


Ala 


Thr 


Tyr 


Ser 


Met 


Tyr 


Thr 






35 










40 










45 








Gly 


Glu 


Gly 


Lys 


Arg 


Gly 


Lys 


Phe 


Val 


Leu 


Glu 


Pro 


Pro 


Lys 


Glu 


Arg 




50 










55 










60 










Ser 


Val 


Gin 


Arg 


Val 


Gin 


Lys 


Pro 


Pro 


Lys 


Glu 


Lys 


Glu 


Glu 


Arg 


Glu 


65 










70 










75 










80 


Gin 


Arg 


Ser 


Asn 


Val 


Arg 


Thr 


Arg 


Arg 


Pro 


Gly 


Gin 


Glu 


Phe 


Glu 


Gin 










85 










90 










95 




Lys 


Val 


Leu 


Gin 


Asp 


Arg 


Ser 


Arg 


Glu 


Arg 


Ser 


Glu 


Lys 


Leu 


Gly 


Gin 








100 










105 










110 






Asn 


Leu 


Ala 


Glu 


Lys 


Gly 


Leu 


Gin 


Glu 


Arg 


Gin 


Lys 


Lys 


Tyr 


Thr 


Pro 






115 










120 










125 








Lys 


Val 


Ala 


Gin 


Thr 


Met 


Thr 


Lys 


Lys 


He 


He 


Arg 


Phe 


Arg 


Glu 


Gly 




130 










135 










140 










Gly 


Arg 


Lys 


Phe 


Lys 


Ala 


Pro 


Gin 


Gin 


Gin 


Thr 


Ser 


Asp 


Lys 


Gly 


Ala 


145 










150 










155 










160 


Ala 


Thr 


Asn 


Val 


Leu 


Glu 


Arg 


Glu 


Glu 


He 


Glu 


Met 


Ala 


Ala 


Glu 


Arg 










165 










170 










175 




Glu 


Gin 


Pro 


Val 


Glu 


He 


Thr 


Gly 


Asp 


Thr 


He 


Leu 


Gly 


Gly 


Leu 


Gly 








180 










185 










190 






Glu 


Glu 


Asp 


Asp 


Glu 


Asp 


Met 


Gly 


Glu 


Asp 


Glu 


Leu 


Thr 


He 


Gin 


His 






195 










200 










205 








Ser 


Ser 


Met 


Ala 


Val 


Ser 


Gin 


Pro 


Val 


Gin 


Gin 


He 


Val 


Val 


Ser 


Ser 




210 










215 










220 










Pro 


He 


Pro 


Pro 


Lys 


Pro 


Trp 


Ala 


Pro 


Asp 


He 


Pro 


He 


Gin 


Glu 


Asp 


225 










230 










235 










240 


He 


Val 


Gly 


Lys 


Asn 


He 


Ser 


Gin 


Leu 


Pro 


Pro 


Leu 


Pro 


Leu 


Asp 


Asp 










245 










250 










255 




Tyr 


Glu 


Asp 


Glu 


Glu 


Asp 


Glu 


His 


Leu 


Tyr 


Glu 


Glu 


Val 


Asn 


Asp 


Phe 








260 










265 










270 






Leu 


Val 


Ala 


Pro 


Pro 


Thr 


Ala 


Ala 


Ala 


Ala 


Ala 


Ser 


Trp 


Pro 


Arg 


Pro 






275 










280 










285 








Asn 


He 


Pro 


Pro 


Pro 


Pro 


Pro 


Pro 


Val 


Val 


Ala 


Val 


Ala 


Asp 


Glu 


Thr 




290 










295 










300 










Leu 


Lys 


Asn 


Leu 


Ala 


Ser 


He 


Ala 


Ala 


Leu 


Glu 


Lys 


Glu 


Ala 


Glu 


Glu 


305 










310 










315 










320 


Gin 


Arg 


Ala 


Ala 


Ala 


Val 


Glu 


Arg 


Glu 


Arg 


Glu 


Val 


Glu 


Glu 


Gin 


Arg 










325 










330 










335 




Ala 


Ala 


Ala 


Ala 


Ala 


Ala 


Ala 


Ala 


Ala 


Ala 


Ala 


Ala 


Gin 


Arg 


Glu 


Ala 








340 










345 










350 






Asp 


Glu 


Lys 


Arg 


Glu 


Arg 


Glu 


Ala 


Glu 


Glu 


Gin 


Arg 


Ala 


Ala 


Ala 


Ala 






355 










360 










365 








Ala 


Ala 


Ala 


Ala 


Ala 


Ala 


Ala 


Gin 


Arg 


Glu 


Ala 


Asp 


Glu 


Lys 


Lys 


Glu 




370 










375 










380 










Arg 


Glu 


Val 


Glu 


Glu 


Gin 


Arg 


Ala 


Ala 


Ala 


Ala 


Ala 


Ala 


Ala 


Ala 


Ala 


385 










390 










395 










400 


Ala 


Ala 


Gin 


Arg 


Glu 


Ala 


Asp 


Glu 


Lys 


Arg 


Glu 


Arg 


Glu 


Val 


Glu 


Glu 
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405 










410 










415 




Gin 


Arg 


Ala 


Ala 


Ala 


Ala 


Ala 


Glu 


Arg 


Glu 


He 


Leu 


Ala 


Gin 


Gin 


Leu 








420 










425 










430 






Gin 


Glu 


Met 


Lys 


Glu 


Gin 


Met 


Arg 


He 


Lys 


Glu 


Glu 


Glu 


Arg 


Arg 


Lys 






435 










440 










445 








Glu 


Leu 


Ala 


Asp 


Lys 


Glu 


Glu 


Glu 


Lys 


Arg 


Arg 


Glu 


Leu 


Ala 


Ala 


Lys 




450 










455 










460 










Glu 


Glu 


Glu 


Lys 


Arg 


Gin 


Glu 


He 


Leu 


Ala 


Lys 


Glu 


Glu 


Gin 


Leu 


Glu 


465 










470 










475 










480 


Lys 


Leu 


Asn 


Phe 


Gin 


Leu 


Gly 


Thr 


Glu 


He 


Thr 


Ser 


Lys 


Arg 


Ala 


Leu 










485 










490 










495 




Glu 


Gin 


Met 


Leu 


Glu 


Glu 


Glu 


Lys 


Ala 


Ser 


Arg 


Ser 


Arg 


Ser 


Ala 


Ala 








500 










505 










510 






Ser 


Ala 


Gin 


Leu 


Ala 


He 


Gin 


Ala 


He 


Glu 


Tyr 


Glu 


Asp 


Glu 


Leu 


Pro 






515 










520 










525 








Gin 


Ala 


Val 


Glu 


Pro 


Gin 


Gly 


Gin 


Leu 


Val 


Pro 


Met 


Asp 


Thr 


Asp 


Leu 




530 










535 










540 










Tyr 


Gly 


Lys 


Met 


Tyr 


Asp 


Leu 


Asn 


Lys 


Lys 


Leu 


Glu 


Val 


Gin 


Asn 


Asn 


545 










550 










555 










560 


Thr 


Leu 


Thr 


Ser 


Ala 


Phe 


Glu 


Asp 


Val 


Asn 


Lys 


Thr 


Asn 


Glu 


Gin 


Asn 










565 










570 










575 




Gin 


Leu 


Val 


Ala 


Gin 


Ser 


Leu 


Glu 


Lys 


Ser 


Ala 


Lys 


Ala 


He 


Glu 


Lys 








580 










585 










590 






Leu 


Thr 


Ser 


Gin 


Lys 


His 


Leu 


Pro 


Val 


Asp 


Asp 


Pro 


Ala 


Phe 


Met 


Gin 






595 










600 










605 








Arg 


He 


He 


Thr 


Glu 


Arg 


Asp 


Phe 


Ser 


Leu 


Lys 


Asn 


Leu 


Gly 


Asn 


Val 




610 










615 










620 










Tyr 


Lys 


Arg 


Val 


Leu 


Gly 


Val 


Tyr 


Phe 


Thr 


Leu 


Lys 


Arg 


Asp 


Leu 


Phe 


625 










630 










635 










640 


Lys 


Ser 


Lys 


Ala 


Leu 


He 


Thr 


Asp 


Lys 


Glu 


Ser 


Arg 


Asp 


Leu 


Glu 


Val 










645 










650 










655 




Arg 


Leu 


Thr 


Asp 


Val 


Ser 


Thr 


Asp 


Leu 


Arg 


Ala 


Asn 


Asp 


Leu 


Asn 


Thr 








660 










665 










670 






lie 


Arg 


Leu 


Asp 


Val 


Ser 


Val 


Asn 


He 


Arg 


Ser 


Gly 


Gly 


Thr 


Leu 


Tyr 






675 










680 










685 








Thr 


Lys 


Phe 


Thr 


Glu 


Ala 


Asp 


Thr 


Ala 


Asp 


Gin 


Val 


Pro 


Ser 


Arg 


He 




690 










695 










700 










Glu 


He 


Ser 


Asn 


Arg 


Ser 


Arg 


Ser 


Ala 


Leu 


Leu 


Pro 


Phe 


Ser 


Ser 


Ala 


705 










710 










715 










720 


Gly 


Leu 


Asp 


Thr 


Asn 


Phe 


Thr 


Asn 


Ser 


Ser 


Asp 


Lys 


Tyr 


Asn 


Glu 


He 










725 










730 










735 




Val 


Asn 


Gin 


Leu 


Ser 


Ser 


He 


Asn 


Glu 


Ala 


Met 


Asn 


He 


Leu 


Lys 


Glu 








740 










745 










750 






Asn 


He 


Val 


Pro 


Thr 


Leu 


Asn 


Gin 


He 


Lys 


He 


Asp 


Val 


Thr 


Asn 


Leu 






755 










760 










765 








Leu 


Thr 


Val 


Ser 


Ser 


Ser 


Arg 


Gin 


Tyr 


Ala 


He 


Glu 


Glu 


Arg 


Val 


Tyr 




770 










775 










780 










Ser 


Asp 


Val 


Ser 


Arg 


Met 


Asp 


Ser 


Glu 


He 


Arg 


Lys 


Phe 


Leu 


Ala 


He 


785 










790 










795 










800 


Met 


Asn 


Ser 


Lys 


He 


Ser 


Pro 


Tyr 


Phe 


Lys 


Gly 


Asp 


Trp 


Thr 


Asp 


Glu 










805 










810 










815 




Arg 


Gin 


Arg 


Ser 


He 


Ala 


Asp 


Ser 


He 


Ser 


Ser 


Gin 


He 


Lys 


Ser 


Asn 








820 










825 










830 






Asp 


Lys 


He 


Lys 


Glu 


Ser 


Val 


Ala 


Thr 


Leu 


His 


Asp 


He 


Asn 


Thr 


Thr 






835 










840 










845 








Ser 


Arg 


He 


Arg 


Ser 


Asn 


Pro 


Leu 


Leu 


His 


Lys 


Ser 


Ser 


Val 


Leu 


Ser 




850 










855 










860 










Ser 


Pro 


Asp 


Phe 


Leu 


Asn 


Ala 


Val 


Asn 


Asp 


Phe 


Arg 


Asn 


Phe 


Leu 


Asp 


865 










870 










875 










880 


He 


Gin 


Gly 


Gly 


Ser 


Gin 


Phe 


Thr 


Tyr 


Asp 


Val 


Leu 


Ser 


Gly 


Gin 


Asn 



885 890 895 
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He 


Asp 


Asp 


Leu 
900 


Ser 


Leu 


Ala 


Ser 


Lys 
905 


Thr 


Thr 


Glu 


Lys 


Val 
910 


Thr 


Glu 


Leu 


Cys 


Leu 
915 


Glu 


Leu 


Ser 


He 


He 
920 


Leu 


Asp 


Val 


He 


His 
925 


Lys 


Asn 


Ala 


Leu 


Ser 
930 


Leu 


Asn 


Leu 


Pro 


Ala 
935 


He 


Thr 


Tyr 


Pro 


Ala 
940 


Gly 


Glu 


Thr 


Ser 


Met 


Glu 


Glu 


Ser 


Gly 


Ser 


Leu 


Ala 


Val 


Asp 


He 


Arg 


Gin 


Glu 


He 


Gly 


945 










950 










955 










960 


Lys 


Asn 


He 


Ser 


Asp 
965 


Ser 


Ser 


Ala 


Glu 


Leu 
97 0 


Ser 


Arg 


Thr 


Leu 


Ser 
975 


Glu 


Ala 


Leu 


Gin 


He 
980 


Phe 


Gin 


Gin 


Gin 


Gin 
985 


Gin 


Gin 


Gin 


Gin 


Gin 
990 


Gin 


Phe 


Gin 


Gin 


Gin 


Leu 


Leu 


Gin 


Gin 


Gin 


Gin 


Asp 


Gin 


Gin 


Asn 


Gin 


Gin 


Gin 






995 










1000 








1005 






Leu 


Leu 


Gin 


Gin 


Gin 


He 


Glu 


Glu 


Gin 


Gin 


Arg 


Val 


Gin 


Glu 


Gin 


Gin 




1010 








1015 








1020 








Gin 


Gin 


Gin 


Gin 


Arg 


Asp 


Gin 


Gin 


Gin 


Gin 


Glu 


Gin 


Gin 


Gin 


Arg 


Glu 


1025 








1030 








1035 








1040 


Gin 


Gin 


Gin 


Gin 


Gin 


Gin 


Gin 


Arg 


Glu 


Gin 


Gin 


Gin 


Gin 


Arg 


Glu 


Gin 










1045 








1050 








1055 


Gin 


Gin 


Gin 


Gin 


Gin 


Gin 


Gin 


Gin 


Ser 


Asp 


Gin 


Phe 


Arg 


Gin 


Gin 


Leu 








1060 








1065 








1070 




Leu 


Gin 


Gin 


Gin 


Gin 


Gin 


Phe 


Gin 


Gin 


Leu 


Leu 


Gin 


Gin 


Gin 


Gly 


Arg 






1075 








1080 








1085 






Arg 


Arg 


Gly 


Gly 


Asp 


Asp 


Gly 


Asp 


Glu 


Glu 


Arg 


Glu 


Glu 


Arg 


Glu 


Glu 




1090 








1095 








1100 








Gly 


Ala 


Glu 


Lys 


Asp 


Asp 


Cys 


Val 


Arg 


Lys 


Val 


Ala 


Glu 


Ser 


Val 


Ala 


1105 








1110 








1115 








1120 


Thr 


Lys 


Tyr 


Thr 


Ala 


Asp 


Leu 


Thr 


Thr 


Leu 


Phe 


Gin 


Arg 


Glu 


Glu 


Asn 










1125 








1130 








1135 


Asn 


Phe 


Gin 


Ser 


Lys 


He 


Ala 


Ser 


Ala 


Lys 


Leu 


Gly 


Thr 


Leu 


Val 


Phe 








1140 








1145 








1150 




Ala 


Thr 


Pro 


Pro 


Ser 


Pro 


He 


Met 


Asn 


Leu 


Thr 


Lys 


Leu 


Arg 


Glu 


Glu 






1155 








1160 








1165 






Tyr 


Ser 


Thr 


Phe 


Thr 


Thr 


Gin 


Cys 


Phe 


Ser 


Lys 


Leu 


Thr 


Ala 


Glu 


Asn 




1170 








1175 








1180 








Asn 


Ser 


He 


Met 


Arg 


He 


Phe 


Pro 


Glu 


Arg 


He 


Val 


Glu 


Val 


Cys 


Lys 


1185 








1190 








1195 








1200 


Ser 


Lys 


Asn 


Leu 


Asn 


Leu 


Met 


Gly 


Lys 


Tyr 


Leu 


Tyr 


He 


He 


Thr 


Thr 










1205 








1210 








1215 


Ala 


Gin 


Thr 


Glu 


Met 


Glu 


Asp 


Arg 


Val 


Lys 


Asn 


He 


Leu 


Ser 


Gly 


He 








1220 








1225 








1230 




Phe 


Asn 


Gin 


He 


Glu 


Glu 


Phe 


Ser 


Asn 


Asn 


Val 


Lys 


Gin 


Gin 


Gin 


Gin 






1235 








1240 








1245 






Gin 


Gin 


Ala 


Ala 


Ser 


Ala 


Ser 


Ser 


Thr 


Asn 


Pro 


Pro 


Pro 


Pro 


Ser 


Thr 




1250 








1255 








1260 








Pro 


Ser 


Thr 


Thr 


Pro 


Pro 


Val 


Thr 


Ser 


Met 


Gin 


Val 


Cys 


Glu 


Leu 


Asp 


1265 








1270 








1275 








1280 


Asp 


Gin 


Arg 


Thr 


Leu 


Glu 


Lys 


Ala 


Ala 


He 


Val 


Glu 


Ala 


He 


Thr 


Leu 










1285 








1290 








1295 


Ala 


Asn 


Ala 


Val 


Leu 


Gin 


Thr 


Thr 


Lys 


Ser 


Ala 


Ser 


Ala 


Pro 


Ser 


Thr 








1300 








1305 








1310 




Ala 


Ala 


Glu 


Arg 


Glu 


He 


Ala 


Leu 


Lys 


Leu 


Glu 


Asn 


Gly 


Lys 


Thr 


Ser 






1315 








1320 








1325 






He 


Arg 


Met 


Glu 


Lys 


Val 


Asp 


Leu 


Ser 


Ser 


Gly 


Ala 


Thr 


Gly 


Val 


Ser 




1330 








1335 








1340 








Asp 


Gin 


Gin 


Lys 


Trp 


He 


Asp 


Glu 


Ser 


Thr 


Ser 


Lys 


Gin 


Glu 


Leu 


Glu 


1345 








1350 








1355 








1360 


Asp 


Phe 


He 


Ala 


Glu 


Glu 


Asn 


Phe 


Val 


Glu 


Thr 


Ala 


His 


Asn 


Glu 


Met 










1365 








1370 








1375 


Asp 


He 


Gly 


Leu 


He 


Leu 


Asp 


Ala 


Lys 


Lys 


Asn 


Asp 


Pro 


Thr 


Arg 


Asp 
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1380 1385 1390 

Ala Asn Leu Arg Leu Val Lys Pro His Gly lie Asn Val Gin Ser Phe 

1395 1400 1405 

Pro Tyr Tyr Val Leu Arg Thr Trp Leu Gly Glu Thr Asp lie Leu Asp 

1410 1415 1420 

Glu Asp Thr Val His Pro Glu Tyr Phe Arg Gin Tyr lie Asp Arg Asn 
1425 1430 1435 1440 

Trp Lys Val Glu Glu His Glu Arg Glu Asp Thr Leu Lys Ala Leu Gly 

1445 1450 1455 

Val Ser Leu Ser Asp Thr Leu Ala His lie Lys Asp Tyr Tyr Ser Pro 

1460 1465 1470 

Ser Val Lys Asn Asp Ala Ser Lys Ser Val Pro Phe Ala Leu Asn Thr 

1475 1480 1485 

Leu Leu Tyr Asn He Phe Ala He Asp Gly Gly Met He Ser Ser Leu 

1490 1495 1500 

Ser Arg Thr Ala Phe He Tyr Arg Lys Phe Leu Arg Gin Ser Met Thr 
1505 1510 1515 1520 

Asp Lys Glu Val Ala Gin Gly Pro Val Arg Ser Gin Leu Cys Glu Ala 

1525 1530 1535 

Thr He Ala Ser Leu Phe Thr Ala Cys Ser Asn Leu Leu Arg Ser Ser 

1540 1545 1550 

Pro Leu Ala Asp Lys Val Glu Pro Arg Leu Gin Glu Lys Leu Ala Ala 

1555 1560 1565 

Ala Ala Ala Val Asp Thr Ser Thr Gly Asp Met Phe Arg He Arg Val 

1570 1575 1580 

Cys His Leu Met Tyr Asn Phe He Val Ala Tyr Val Asn Leu Cys Asn 
1585 1590 1595 1600 

Asn Arg He Asn Tyr Thr Leu Asn Val Leu Arg Ala Ser Gly Leu Ala 

1605 1610 1615 

Asn Lys Lys Val Val Ala Gly Lys Thr Thr Lys Gly His Thr Ser Ser 

1620 1625 1630 

Ser His Arg Phe Gly Ser Tyr Asp Val Thr Tyr Asp Phe Ser Val Leu 

1635 1640 1645 

Tyr Lys He Leu Gin Leu Gin Lys Gin Asn He Ser Leu Leu Leu Glu 

1650 1655 1660 

Lys Gly Phe Asn Ala Trp Glu Ser Cys Val Ala Ala Met Ala Ala Phe 
1665 1670 1675 1680 

Thr Ala Asp Pro Ser Leu Ser He Ser Asp Ala Asp Gin Ser He Leu 

1685 1690 1695 

Phe Pro Leu Glu Gly Gly Glu He Val He Glu Lys His Glu Asn Asp 

1700 1705 1710 

Ala Glu Lys Asn Val Asp Met Val Gin Glu Leu Trp Lys Glu Thr Ala 

1715 1720 1725 

Leu Thr Leu Met Ala Lys Glu Leu Asn Ser Tyr Tyr Asn Trp Phe He 

1730 1735 1740 

Tyr Ser Lys Asp Thr Asp Met Glu Lys Leu Ala. Arg Val Cys Arg Met 
1745 1750 1755 1760 

He He Gly He Val Lys Ala Val Leu Arg Leu Thr Asn Lys Ala Glu 

1765 1770 1775 

Ser Leu Val Asp Thr Asn Ala Leu Ser Asp He Phe Lys Leu Pro Val 

1780 1785 1790 

He Pro He Asp Asp Thr Lys Thr Leu Ala He Asn He Val Val Phe 

1795 1800 1805 

Thr Leu Asn Asn Val He Lys Pro Trp Met Val Ser Phe Lys Gin Met 

1810 1815 1820 

Phe Arg Gin Lys Asp Gly Gly Val Ser Ser Ala Tyr Phe Ser Phe Gin 
1825 1830 1835 1840 

Asn He Gin Gin Gin Lys His Gin Gin Thr Ala Ser He Leu Asp Ala 

1845 1850 1855 

Trp Ala Cys Ala Pro Gly Lys Leu Thr Lys Ala Ala His Val Phe He 
1860 1865 1870 
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Ser Gly Tyr Glu Asn His lie Lys Leu Lys Lys Asp Asp Leu Leu Trp 

1875 1880 1885 

Gly Ala Ser Met Lys Phe Pro Ala Asp Gly Arg Gly Thr Val Val Glu 

1890 1895 1900 

Gly Trp Ala Gin Gin Tyr Asn Asn Glu Ser Val Leu Glu Asp Phe Thr 
1905 1910 1915 1920 

Asp Phe Ser lie Glu Val Asn Ala Pro Ala Ser Gly Leu Leu lie Pro 

1925 1930 1935 

Pro Asp Pro Leu Leu Ser Ser Met Phe Gly Lys Gly Asn Gly Gly Ser 

1940 1945 1950 

Ser Ser Ser Ser Ser Lys Asp Asn Thr lie lie Gly Lys Gly Gly Leu 

1955 1960 1965 

lie Leu Asn Arg Gin Val Val Gly Gin Glu Gin Ala Pro Pro lie Asn 

1970 1975 1980 

Thr Ser Ser Asp Thr Lys Lys lie Arg Arg Asp Ala Asn lie Glu Pro 
1985 1990 1995 2000 

lie He Gly Thr Pro Tyr Ser Val He Lys Ala Ser Lys Gly Val Ser 

2005 2010 2015 

lie Ser Val Leu Asp Asp Phe Asn Glu Asp Ser Pro Glu Asp Phe Ala 

2020 2025 2030 

Leu Lys Thr Ser He He Asn Asp Ala He Arg Glu He Gly Gin Arg 

2035 2040 2045 

Met Thr Tyr Trp He Phe Asp His Gin Thr Gin Lys Asn He His Tyr 

2050 2055 2060 

Ser Ser Pro Lys He He Gly Ser Asp Leu Lys Asn Gly Gin Arg Ser 
2065 2070 2075 2080 

Gly Gin Ser Trp Ala Pro Ser Ser Ser Ser Leu Thr Leu Ala Ser Asp 

2085 2090 2095 

Trp Asn Leu Pro Ser Leu Glu Leu Leu Tyr Arg Glu Leu Ala Thr Lys 

2100 2105 2110 

Gin Val Glu Lys Glu Glu Glu Glu Lys Ser Glu Glu Asp Lys Gly Gin 

2115 2120 2125 

Lys Leu Asn Glu Lys Leu Ser Phe Val Val Asn Lys Ala He Gly Thr 

2130 2135 2140 

He Gin Gin Gin His Gin Tyr Ser Gly Gly Gly Met Lys Arg Tyr Gin 
2145 2150 2155 2160 

Gin His Ser Ala Asp Gin Asn Gly Gly He Asp Asp He Glu Leu Met 

2165 2170 2175 

Asn Ser Lys Asp Ala Thr Ser Met Arg Lys Ala Lys Leu Ala Val Thr 

2180 2185 2190 

Asn Lys He Ala Ala Ala Ala Ala Arg Asp Gly Glu Asn Ser Ser Ala 

2195 2200 2205 

Lys Pro Ser Asn Phe Gly Asn Arg Leu Asp Glu Ala He Asn Pro Gly 

2210 2215 2220 

Ala Leu Leu Leu Arg Arg Gly Gly Gly Val Arg Gly Gly Gin Thr Pro 
2225 2230 2235 2240 

Gin Ser Ser Met Leu Thr Met Phe Arg Pro Gly Gin Thr Gly Gly Asn 

2245 2250 2255 

Ser Ser Trp Trp Thr Thr Asn Thr Pro Leu He Gin Arg Thr Thr Ser 

2260 2265 2270 

Val Gly Asn Asn Leu Val Val Leu Val Pro Asn Leu Leu Asp Ser His 

2275 2280 2285 

Pro Pro Thr Phe Asn 
2290 



<210> 240 
<211> 4311 
<212> DNA 
<213> SHRIMP 
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<400> 240 

atggattttg 

gtgaagaaag 

aaggttatta 

gatggacaaa 

aagagcactg 

agtgtgcgca 

aaagtgtttt 

aagttaaacg 

aaacacaacg 

gattgtttta 

ggaaatacat 

gtaccagttt 

gccaagcttc 

tacatcactt 

tgtcacaccg 

catgtggccg 

gagtatgaaa 

ttttctaacg 

gatgttactg 

gacaggcgtc 

tcatcgtcat 

gttgtggagg 

ggttcagaag 

agaagggtta 

agtatcaata 

ccccaaactg 

aatattttgt 

atgagcagtt 

agtatccgtc 

ttgtttgatg 

tctgaaatat 

ggacaaaaga 

acggaatcta 

acagtcatac 

cgcaagcgta 

atgatgaaaa 

tctacaccag 

ctcagtttca 

atattcggtc 

aagacttt tg 

tatgagaggg 

gaaaatgaaa 

ccgatcgctt 

aacaccccta 

aggacatatg 

attaagagag 

ctgtggtacc 

aacaacaata 

gaggaggata 

gggagaaaga 

aaaagtttgt 

ataagctgga 

acccttgatg 

ggaacagatg 

gtaggcccct 

ggcaaattgt 

gattcat taa 

ttgcatttgt 

tataatattt 

aagatctcat 



aaggaactac 
ttgcagagca 
aggacctgga 
aactggacaa 
cagataattc 
gaaccaagat 
ctgtcgagga 
ccaactgttc 
tctcctcatt 
tgcttgaaac 
atgcagtttt 
tcgacatgaa 
ctgttctgtg 
caattgagac 
aagatttgga 
tgtacgacta 
acacggctag 
aatgtgcaat 
ttaataatga 
tttttgatgc 
cgtcttcctc 
gggaggaaga 
atgctcttgt 
agaatgaacc 
acgaagatga 
aagctacagt 
atgaaactat 
accgtgtatt 
gccgtccaac 
aggttgtcac 
ttggaggaat 
agagtctgta 
atgtgaataa 
ctaacgtatt 
cctctagtgc 
atgatgaagg 
aacaacaaca 
taaaaatgcc 
aacgtctttg 
aagatcttga 
aggtaactcg 
tggatgtttt 
actttgaaga 
agcgcaaaaa 
aagaatacca 
aattggaagg 
tgactgtaat 
atgataacga 
aaatggaaga 
agggtcaaaa 
gtgaaaattc 
tgaacaagaa 
aggtctcaag 
ataaatatga 
ttagtatacc 
acgttaacac 
acgatgagga 
ctaagcatga 
ctcctagctc 
ctggtatgag 



cagttctacc 
ttcctttgcc 
aggggagagg 
ggctatgttg 
cattgaatcg 
tgaaagtgaa 
tttagaaatc 
aagaattgga 
tgagaacaac 
aacctatcct 
cttgactggt 
tgcaggtatt 
catgtttgga 
gcagtctttt 
gaggaagaag 
cagtggagac 
ttggtgcaac 
tagtgatatg 
agaacatgaa 
tagtccttcc 
tccaacagtt 
agaggaagaa 
tatgaagaaa 
tgcactcaaa 
tggtgatgat 
gaaaaattcc 
gcaaatttct 
caccaacaaa 
catttggact 
atcaattgtg 
gtttaacgcc 
tggaaacaag 
tgcactattt 
ctccttcaaa 
ttcatcatct 
ggaaaaggtt 
acaagctggt 
acgcagtaat 
tggactgtct 
aaatgaaatc 
cttgtatgag 
gtctcaccaa 
agtactggag 
caagtatgca 
tgcgtttagc 
agacaactat 
gaccaggaac 
agaaaccgat 
aagtatggac 
attcaacagt 
gatggtagtt 
aatccctccc 
gtttgtgttt 
aactgttttt 
tgttgatttt 
catagacgga 
tgttgattct 
cacctttttc 
ttctctcact 
caataataat 



ccctcaaaaa 
aatcttcatg 
aagaaaatgt 
gacgatatta 
accatcaagg 
gccaagaaca 
tactccaagg 
ggcaagtatg 
aacaaccaag 
cttgttgggt 
gttgggctag 
caaaccctaa 
cgtacagaat 
gacgaagagg 
cgcatgaatg 
gggaaagaac 
gcaaacggtg 
aatgacttgt 
gaacgttcta 
cccatcaaga 
cctcctccta 
attgatgaag 
ttagccaagc 
attacttctg 
gacgatgccg 
ttcatggccc 
cttgctaaaa 
ctccaagagt 
gaagaaagtc 
gcacatcaga 
aactctacca 
cacatttctt 
gcgtgggtga 
atggcatcag 
aacgatgaac 
gcacaagaat 
catgacaagg 
gtcaatggct 
gatgcttcca 
atgaggagct 
aaatgcaggt 
ggggaattgt 
aatattaaga 
aaggtactgg 
aagtttgtac 
acccatgacg 
aggatttgcg 
attgtggagg 
gtagaacaac 
attggggatc 
tctattgcaa 
ggtttcttga 
agcgatgtca 
ggagtcagta 
tcaagcgcag 
aagggcattt 
actacaacag 
aatattaata 
gaaaagaaaa 
ggcatgtgtg 



tgtcccagtt 
acaaggctac 
ctaccccaaa 
tcaacgagta 
aaattgaaaa 
gtgtaacttc 
ggcgagtgtg 
cagtgtcaat 
ttttctctga 
tcgaaacttc 
aaagatctct 
acatgactgg 
atgacaactt 
aaaatgatgc 
acgcaccagc 
aattgctcta 
tggtcacatc 
gctgttttgc 
tgaatattgt 
cggaagaaga 
caccatacga 
acgaaagtag 
tttctactat 
ggggtaacaa 
ttgacgctac 
caaacgacga 
tttgtaataa 
gtttgaatac 
aacaatttgc 
tggctcaaga 
acattaaggg 
cctcgtgctt 
aatcgaaact 
aaaagccctc 
accaagaacc 
catcatctcc 
aaactatcaa 
cggcttcata 
gcacatttaa 
cattcactag 
ctcaagctgt 
ttgccgagt t 
gttggagcct 
tgagcgttaa 
caatgttctt 
ttcacttttc 
atgtgctcca 
aagaggagga 
agaagcaagt 
aagtcattag 
ttaatagttt 
aggattctag 
aaatcaa tag 
cgcgtgtgga 
gactagataa 
tgacaatttc 
acaagctaga 
agaataaggt 
agacaaaat t 
tacaaactcc 



gtattcatca 
tcttgcatca 
gtcctcttct 
tcaggccgtt 
tgtacttgaa 
cagcccagaa 
caaaggtctc 
gagtatcaaa 
agaacccagg 
tacagaagat 
acctaaatat 
tttgaggatg 
ggaagat ttt 
cagaatgagg 
gattacacct 
tatgataacc 
tgacagtgga 
tgactgcatc 
ggtcgaatct 
tggagaaaat 
aggtaacgca 
caagtatgaa 
gaaacaaatg 
tagtagcagt 
tgcattatgc 
gagaactgaa 
tcca tcatct 
catggacgat 
taagggtttg 
tatttgcaag 
taaatatgaa 
caaaaccaac 
cca ttctggc 
aaaaatgaag 
a tcaacaaaa 
ttcttcatcg 
tttaattccc 
tttgtctgaa 
gagaatgtgc 
actgactaga 
agatattgag 
ct tggaggac 
agaaaacgtt 
tgctattcgt 
gttcaatctg 
ttccacttgc 
gtacatcaac 
aggagaagga 
tcgcaaggga 
aaaa tttgtg 
gatctctgga 
cacaatcaat 
gaaaatcaat 
ttcacatatt 
ggcctcatgt 
acccaaatat 
gaaggatatt 
tct tccattc 
caataggaag 
ttctagttca 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 
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aattcagtct 
tgctcccttt 
aggaacatgt 
catagacacg 
attattgata 
agcgccttta 
agtgatttga 
ttgacccatg 
aactatacat 
gctgctgctg 
gctgctgcaa 
attagaggaa 



cttccgtctc 
cttctacaaa 
ggaggtgtgg 
ctgtaaaact 
ggaa taggaa 
ccggcgagtc 
acctatgtat 
aagagtacca 
ttaagagtat 
ctgatgatga 
tccaagactt 
agtacaagaa 



gtctattgta 
gaaaaagagc 
atttgttgta 
tgtagctgaa 
gattagattt 
tacatatttg 
ttatacttca 
ggacggaaat 
caaatcctgc 
tggttctact 
tatgcatgta 
atctttggca 



gctccttcat 
atctggaacg 
ccacccaaac 
actgcaccta 
aacggtctaa 
ctcaataaga 
tctttaaatg 
gctttggatg 
tcgtccaaag 
acttctacct 
atgattaaga 
aagaaaacaa 



cttctgttct 
agaacatgtt 
tttgcagttt 
aaacaaagtt 
agaaggtatg 
atatgactgc 
acccattgta 
actatggtgc 
atgaaaccgc 
cttcttcaac 
aaattgatgc 
agaagcatta 



ggctctatct 
tttgacatct 
tattgttaac 
gtgtaggaat 
caagagtgtt 
aacttcacct 
tacttgtaaa 
agtttttgta 
tgacgacaat 
tgatactgat 
catgaaagac 
a 



3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4311 



<210> 241 
<211> 1430 
<212> PRT 
<213> SHRIMP 



<400> 241 
Met Asp Phe Glu 
1 

Lys Ser Val Lys 
20 

Lys Ala Thr Leu 
35 

Lys Lys Met Ser 
50 

Lys Ala Met Leu 
65 

Thr Ala Asp Asn 

Leu Glu Ser Val 
100 

Val Thr Ser Ser 
115 

Tyr Ser Lys Gly 
130 

Ser Arg lie Gly 
145 

Asn Val Ser Ser 

Pro Arg Asp Cys 
180 

Glu Thr Ser Thr 
195 

Val Gly Leu Glu 
210 

Asn Ala Gly lie 
225 

Leu Pro Val Leu 

Asp Phe Tyr lie 
260 

Asn Asp Ala Arg 
275 

Arg Met Asn Asp 
290 

Tyr Ser Gly Asp 
305 

Glu Asn Thr Ala 



Gly Thr Thr Ser 

5 

Lys Val Ala Glu 

Ala Ser Lys Val 
40 

Thr Pro Lys Ser 
55 

Asp Asp lie lie 
70 

Ser He Glu Ser 
85 

Arg Arg Thr Lys 

Pro Glu Lys Val 
120 

Arg Val Cys Lys 
135 

Gly Lys Tyr Ala 
150 

Phe Glu Asn Asn 
165 

Phe Met Leu Glu 

Glu Asp Gly Asn 
200 

Arg Ser Leu Pro 
215 

Gin Thr Leu Asn 
230 

Cys Met Phe Gly 
245 

Thr Ser He Glu 

Met Arg Cys His 
280 

Ala Pro Ala He 

295 

Gly Lys Glu Gin 
310 

Ser Trp Cys Asn 
325 



Ser Thr Pro Ser 
10 

His Ser Phe Ala 
25 

He Lys Asp Leu 

Ser Ser Asp Gly 
60 

Asn Glu Tyr Gin 
75 

Thr He Lys Glu 
90 

He Glu Ser Glu 
105 

Phe Ser Val Glu 

Gly Leu Lys Leu 
140 

Val Ser Met Ser 
155 

Asn Asn Gin Val 
170 

Thr Thr Tyr Pro 
185 

Thr Tyr Ala Val 

Lys Tyr Val Pro 
220 

Met Thr Gly Leu 
235 

Arg Thr Glu Tyr 

250 

Thr Gin Ser Phe 
265 

Thr Glu Asp Leu 

Thr Pro His Val 
300 

Leu Leu Tyr Met 
315 

Ala Asn Gly Val 
330 



Lys 


Met 


Ser 


Gin 






15 




Asn 


Leu 


His 


Asp 




30 






Glu 


Glv 


Glu 


Arg 


45 








Gin 


Lys 


Leu 


Asp 


Ala 


Val 


Lys 


Ser 








80 


He 


Glu 


Asn 


Val 






95 




Ala 


Lys 


Asn 


Ser 




110 






Asp 


Leu 


Glu 


He 


125 








Asn 


Ala 


Asn 


Cys 


He 


Lys 


Lys 


His 








160 


Phe 


Ser 


Glu 


Glu 






175 




Leu 


Val 


Gly 


Phe 




190 






Phe 


Leu 


Thr 


Gly 


205 








Val 


Phe 


Asp 


Met 


Arg 


Met 


Ala 


Lys 








240 


Asp 


Asn 


Leu 


Glu 






255 




Asp 


Glu 


Glu 


Glu 




270 






Glu 


Arg 


Lys 


Lys 


285 








Ala 


Val 


Tyr 


Asp 


He 


Thr 


Glu 


Tyr 








320 


Val 


Thr 


Ser 


Asp 






335 
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Ser Gly Phe Ser Asn Glu 
340 

Cys Phe Ala Asp Cys lie 
355 

Glu Arg Ser Met Asn lie 
370 

Asp Ser Pro He Lys Thr 
385 390 



Ser 


Ser 


Ser 


Pro 


Thr 


Val 










405 




Val 


Val 


Glu 


Gly 


Glu 


Glu 








420 






Ser 


Lys 


Tyr 


Glu 


Gly 


Ser 






435 








Lys 


Leu 


Ser 


Thr 


Met 


Lys 




450 










Leu 


Lys 


He 


Thr 


Ser 


Gly 


465 










470 


Glu 


Asp 


Asp 


Gly 


Asp 


Asp 










485 




Pro 


Gin 


Thr 


Glu 


Ala 


Thr 








500 






Glu 


Arg 


Thr 


Glu 


Asn 


He 






515 








Lys 


He 


Cys 


Asn 


Asn 


Pro 




530 










Asn 


Lys 


Leu 


Gin 


Glu 


Cys 


545 










550 


Arg 


Pro 


Thr 


He 


Trp 


Thr 










565 




Leu 


Phe 


Asp 


Glu 


Val 


Val 








580 






Asp 


He 


Cys 


Lys 


Ser 


Glu 






595 








Thr 


Asn 


He 


Lys 


Gly 


Lys 




610 










Asn 


Lys 


His 


Ser 


Ser 


Cys 


625 










630 


Asn 


Ala 


Leu 


Phe 


Ala 


Trp 










645 




He 


Pro 


Asn 


Val 


Phe 


Ser 








660 






Met 


Lys 


Arg 


Lys 


Arg 


Thr 






675 








Gin 


Glu 


Pro 


Ser 


Thr 


Lys 




690 










Ala 


Gin 


Glu 


Ser 


Ser 


Ser 


705 










710 


Gin 


Gin 


Ala 


Gly 


His 


Asp 










725 




Phe 


He 


Lys 


Met 


Pro 


Arg 








740 






Ser 


Glu 


He 


Phe 


Gly 


Gin 






755 








Thr 


Phe 


Lys 


Arg 


Met 


Cys 




770 










Met 


Arg 


Ser 


Ser 


Phe 


Thr 


785 










790 


Arg 


Leu 


Tyr 


Glu 


Lys 


Cys 










805 




Glu 


Met 


Asp 


Val 


Leu 


Ser 



Cys Ala He Ser Asp 
345 

Asp Val Thr Val Asn 
360 

Val Val Glu Ser Asp 
375 

Glu Glu Asp Gly Glu 
395 

Pro Pro Pro Thr Pro 
410 

Glu Glu Glu Glu He 
425 

Glu Asp Ala Leu Val 
440 

Gin Met Arg Arg Val 
455 

Gly Asn Asn Ser Ser 
475 

Asp Asp Ala Val Asp 
490 

Val Lys Asn Ser Phe 

505 

Leu Tyr Glu Thr Met 
520 

Ser Ser Met Ser Ser 
535 

Leu Asn Thr Met Asp 
555 

Glu Glu Ser Gin Gin 
570 

Thr Ser He Val Ala 
585 

He Phe Gly Gly Met 
600 

Tyr Glu Gly Gin Lys 
615 

Phe Lys Thr Asn Thr 
635 

Val Lys Ser Lys Leu 
650 

Phe Lys Met Ala Ser 
665 

Ser Ser Ala Ser Ser 
680 

Met Met Lys Asn Asp 
695 

Pro Ser Ser Ser Ser 
715 

Lys Glu Thr He Asn 
730 

Ser Asn Val Asn Gly 

745 

Arg Leu Cys Gly Leu 
760 

Lys Thr Phe Glu Asp 
775 

Arg Leu Thr Arg Tyr 
795 

Arg Ser Gin Ala Val 
810 

His Gin Gly Glu Leu 



Met Asn Asp Leu Cys 
350 

Asn Glu Glu His Glu 
365 

Arg Arg Leu Phe Asp 
380 

Asn Ser Ser Ser Ser 
400 

Tyr Glu Gly Asn Ala 
415 

Asp Glu Asp Glu Ser 
430 

Met Lys Lys Leu Ala 
445 

Lys Asn Glu Pro Ala 
460 

Ser Ser He Asn Asn 
480 

Ala Thr Ala Leu Cys 
495 

Met Ala Pro Asn Asp 
510 

Gin He Ser Leu Ala 
525 

Tyr Arg Val Phe Thr 
540 

Asp Ser He Arg Arg 
560 

Phe Ala Lys Gly Leu 

575 

His Gin Met Ala Gin 
590 

Phe Asn Ala Asn Ser 
605 

Lys Ser Leu Tyr Gly 
620 

Glu Ser Asn Val Asn 
640 

His Ser Gly Thr Val 
655 

Glu Lys Pro Ser Lys 
670 

Ser Asn Asp Glu His 
685 

Glu Gly Glu Lys Val 
700 

Thr Pro Glu Gin Gin 
720 

Leu He Pro Leu Ser 
735 

Ser Ala Ser Tyr Leu 
750 

Ser Asp Ala Ser Ser 
765 

Leu Glu Asn Glu He 
780 

Glu Arg Glu Val Thr 
800 

Asp He Glu Glu Asn 
815 

Phe Ala Glu Phe Leu 
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820 










825 










830 






Glu 


Asp 


Pro 
835 


He 


Ala 


Tyr 


Phe 


Glu 
840 


Glu 


Val 


Leu 


Glu 


Asn 
84 5 


He 


Lys 


Ser 


Trp 


Ser 


Leu 


Glu 


Asn 


Val 


Asn 


Thr 


Pro 


Lys 


Arg 


Lys 


Asn 


Lys 


Tyr 


Ala 


850 










855 










860 










Lys 


Val 


Leu 


Val 


Ser 


Val 


Asn 


Ala 


He 


Arg 


Arg 


Thr 


Tyr 


Glu 


Glu 


Tyr 


865 










870 










875 










o o 


His 


Ala 


Phe 


Ser 


Lys 
885 


Phe 


Val 


Pro 


Met 


Phe 
890 


Leu 


Phe 


Asn 


Leu 


He 
895 


Lys 


Arg 


Glu 


Leu 


Glu 


Gly 


Asp 


Asn 


Tyr 


Thr 


His 


Asp 


Val 


His 


Phe 


Ser 


Ser 




1 


900 










905 










910 






Thr 


Cys 


Leu 


Trp 


Tyr 


Leu 


Thr 


Val 


Met 


Thr 


Arg 


Asn 


Arg 


He 


Cys 


Asp 




915 










920 










925 








Val 


Leu 

930 


Gin 


Tyr 


He 


Asn 


Asn 
935 


Asn 


Asn 


Asn 


Asp 


Asn 
940 


Glu 


Glu 


Thr 


Asp 


He 


Val 


Glu 


Glu 


Glu 


Glu 


Glu 


Gly 


Glu 


Gly 


Glu 


Glu 


Asp 


Lys 


Met 


Glu 


945 










950 










955 










960 


Glu 


Ser 


Met 


Asp 


Val 
965 


Glu 


Gin 


Gin 


Lys 


Gin 
970 


Val 


Arg 


Lys 


Gly 


Gly 
975 


Arg 


Lys 


Lys 


Gly Gin 


Lys 


Phe 


Asn 


Ser 


He 


Gly 


Asp 


Gin 


val 


He 


Arg 


Lys 




980 










985 










990 






Phe 


Val 


Lys 


Ser 


Leu 


Cys 


Glu 


Asn 


Ser 


Met 


Val 


Val 


Ser 


He 


Ala 


He 






995 










1000 








1005 






Asn 


Ser 


Leu 


He 


Ser 


Gly 


He 


Ser 


Trp 


Met 


Asn 


Lys 


Lys 


He 


Pro 


Pro 




1010 








1015 








1020 








Gly 


Phe 


Leu 


Lys 


Asp 


Ser 


Ser 


Thr 


He 


Asn 


Thr 


Leu 


Asp 


Glu 


Val 


Ser 


1025 






1030 








1035 








104C 


Arg 


Phe 


Val 


Phe 


Ser 


Asp 


Val 


Lys 


He 


Asn 


Arg 


Lys 


He 


Asn 


Gly 


Thr 








1045 








1050 








1055 


Asp 


Asp 


Lys 


Tyr 


Glu 


Thr 


Val 


Phe 


Gly 


Val 


Ser 


Thr 


Arg 


Val 


Asp 


Ser 








1060 








1065 








1070 




His 


He 


Val 


Gly 


Pro 


Phe 


Ser 


He 


Pro 


Val 


Asp 


Phe 


Ser 


Ser 


Ala 


Gly 






1075 








1080 








1085 






Leu 


Asp 


Lys 


Ala 


Ser 


Cys 


Gly 


Lys 


Leu 


Tyr 


Val 


Asn 


Thr 


He 


Asp 


Gly 




1090 








1095 








1100 








Lys 


Gly 


He 


Leu 


Thr 


He 


Ser 


Pro 


Lys 


Tyr 


Asp 


Ser 


Leu 


Asn 


Asp 


Glu 


1105 








1110 








1115 








112C 


Asp 


Val 


Asp 


Ser 


Thr 


Thr 


Thr 


Asp 


Lys 


Leu 


Glu 


Lys 


Asp 


He 


Leu 


His 








1125 








1130 








1135 


Leu 


Ser 


Lys 


His 


Asp 


Thr 


Phe 


Phe 


Asn 


He 


Asn 


Lys 


Asn 


Lys 


Val 


Leu 






1140 








1145 








1150 




Pro 


Phe 


Tyr 


Asn 


He 


Ser 


Pro 


Ser 


Ser 


Ser 


Leu 


Thr 


Glu 


Lys 


Lys 


Lys 






1155 








1160 








1165 






Thr 


Lys 


Phe 


Asn 


Arg 


Lys 


Lys 


He 


Ser 


Ser 


Gly 


Met 


Ser 


Asn 


Asn 


Asn 




1170 








1175 








1180 








Gly 


Met 


Cys 


Val 


Gin 


Thr 


Pro 


Ser 


Ser 


Ser 


Asn 


Ser 


Val 


Ser 


Ser 


Val 


1185 






1190 








1195 








1201 


Ser 


Ser 


He 


Val 


Ala 


Pro 


Ser 


Ser 


Ser 


Val 


Leu 


Ala 


Leu 


Ser 


Cys 


Ser 










1205 








1210 








1215 


Leu 


Ser 


Ser 


Thr 


Lys 


Lys 


Lys 


Ser 


He 


Trp 


Asn 


Glu 


Asn 


Met 


Phe 


Leu 








1220 








1225 








1230 




Thr 


Ser 


Arg 


Asn 


Met 


Trp Arg Cys Gly 


Phe 


Val 


Val 


Pro 


Pro 


Lys 


Leu 






1235 








1240 








1245 






Cys 


Ser 


Phe 


He 


Val 


Asn 


His 


Arg 


His 


Ala 


Val 


Lys 


Leu 


Val 


Ala 


Glu 




1250 








1255 








1260 









Thr Ala Pro Lys Thr Lys Leu Cys Arg Asn He He Asp Arg Asn Arg 
1265 1270 1275 1280 

Lys He Arg Phe Asn Gly Leu Lys Lys Val Cys Lys Ser Val Ser Ala 

1285 1290 1295 



Phe Thr Gly Glu Ser Thr Tyr Leu Leu Asn Lys Asn Met Thr Ala Thr 
1300 1305 1310 
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Ser Pro Ser Asp Leu Asn Leu Cys lie Tyr Thr Ser Ser Leu Asn Asp 

1315 1320 1325 

Pro Leu Tyr Thr Cys Lys Leu Thr His Glu Glu Tyr Gin Asp Gly Asn 

1330 1335 1340 

Ala Leu Asp Asp Tyr Gly Ala Val Phe Val Asn Tyr Thr Phe Lys Ser 
1345 1350 1355 1360 

lie Lys Ser Cys Ser Ser Lys Asp Glu Thr Ala Asp Asp Asn Ala Ala 

1365 1370 1375 

Ala Ala Asp Asp Asp Gly Ser Thr Thr Ser Thr Ser Ser Ser Thr Asp 

1380 1385 1390 

Thr Asp Ala Ala Ala lie Gin Asp Phe Met His Val Met lie Lys Lys 

1395 1400 1405 

lie Asp Ala Met Lys Asp lie Arg Gly Lys Tyr Lys Lys Ser Leu Ala 

1410 1415 1420 

Lys Lys Thr Lys Lys His 
1425 1430 



<210> 242 
<211> 909 
<212> DNA 
<213> SHRIMP 



<400> 242 

atggtgtcgt 

gtttt tacaa 

gacgttattg 

ggtgacgacg 

cttggacgca 

cgcaaaaagc 

aagcgtgccg 

gccgcacctc 

cctccacctg 

cctgaggatg 

gatgaagaag 

gaagaggatg 

gatga tttct 

ttctaccgca 

cgcaaaaagc 

aagcgttaa 



ctattaccca 
ctgaaggagc 
accccgacca 
acgacgacga 
aaaagcgtgc 
gtgccgcacc 
ca.cct:ccacc 
cacctgagga 
aggatgaaga 
aagaagagga 
aggatgagtt 
agttctaccg 
accgcaaaaa 
aaaagcgtgc 
gtgccgcacc 



cctctctctg 
tagtgtgaga 
ccgctgccaa 
tgacgatgga 
cgcacctcca 
tccacctgag 
tgaggatgaa 
tgaagaagag 
agaggatgag 
tgagttctac 
ctaccgcaaa 
caaaaagcgt 
gcgtgccgca 
cgcacctcca 
tccacctgag 



ttgttcgtcg 
gtgaaacggt 
gggcgactgt 
ggaactttcg 
cctgaggatg 
gatgaagaag 
gaagaggatg 
gatgagttct 
ttctaccgca 
cgcaaaaagc 
aagcgtgccg 
gccgcacctc 
cctccacctg 
cctgaggatg 
gatgaagaag 



tggctgtagt 
gtgctgttag 
gccgcaggtc 
atacagtagg 
aagaagagga 
aggatgattt 
agttctaccg 
accgcaaaaa 
aaaagcgtgc 
gtgccgcacc 
cacctccacc 
cacctgagga 
aggatgaaga 
aagaagagga 
aggatgattt 



agcttccgtc 
cccgtgcccc 
tactcgagga 
gtctggtata 
tgatttctac 
ctaccgcaaa 
caaaaagcgt 
gcgtgccgca 
cgcacctcca 
tccacctgag 
tgaggatgaa 
tgaagaagag 
agaggatgat 
tgatttctac 
ctaccgcaaa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

909 



<210> 243 

<211> 302 

<212> PRT 

<213> SHRIMP 



<400> 243 



Met 


Val 


Ser 


Ser 


1 








Val 


Ala 


Ser 


Val 








20 


Arg 


Cys 


Ala 


Val 






35 




Cys 


Gin 


Gly 


Arg 




50 






Asp 


Asp 


Asp 


Asp 


65 








Leu 


Gly 


Arg 


Lys 


Asp 


Asp 


Phe 


Tyr 








100 



lie Thr His Leu 
5 

Val Phe Thr Thr 

Ser Pro Cys Pro 
40 

Leu Cys Arg Arg 
55 

Asp Gly Gly Thr 
70 

Lys Arg Ala Ala 
85 

Arg Lys Lys Arg 



Ser Leu Leu Phe 
10 

Glu Gly Ala Ser 
25 

Asp Val lie Asp 

Ser Thr Arg Gly 
60 

Phe Asp Thr Val 
75 

Pro Pro Pro Glu 
90 

Ala Ala Pro Pro 
105 



Val Val Ala Val 
15 

Val Arg Val Lys 
30 

Pro Asp His Arg 
45 

Gly Asp Asp Asp 

Gly Ser Gly lie 
80 

Asp Glu Glu Glu 

95 

Pro Glu Asp Glu 
110 
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Glu 


Glu 


Asp 
115 


Asp 


Phe 


Tyr 


Arg 


Lys 

120 


Lys 


Arg 


Ala 


Ala 


Pro 
125 


Pro 


Pro 


Glu 


Asp 


Glu 
130 


Glu 


Glu 


Asp 


Glu 


Phe 
135 


Tyr 


Arg 


Lys 


Lys 


Arg 
140 


Ala 


Ala 


Pro 


Pro 


Pro 


Glu 


Asp 


Glu 


Glu 


Glu 


Asp 


Glu 


Phe 


Tyr 


Arg 


Lys 


Lys 


Arg 


Ala 


Ala 


145 










150 










155 










160 


Pro 


Pro 


Pro 


Glu 


Asp 
165 


Glu 


Glu 


Glu 


Asp 


Glu 
170 


Phe 


Tyr 


Arg 


Lys 


Lys 
175 


Arg 


Ala 


Ala 


Pro 


Pro 
180 


Pro 


Glu 


Asp 


Glu 


Glu 
185 


Glu 


Asp 


Glu 


Phe 


Tyr 
190 


Arg 


Lys 


Lys 


Arg 


Ala 
195 


Ala 


Pro 


Pro 


Pro 


Glu 
200 


Asp 


Glu 


Glu 


Glu 


Asp 
205 


Glu 


Phe 


Tyr 


Arg 


Lys 
210 


Lys 


Arg 


Ala 


Ala 


Pro 
215 


Pro 


Pro 


Glu 


Asp 


Glu 

220 


Glu 


Glu 


Asp 


Glu 


Phe 


Tyr 


Arg 


Lys 


Lys 


Arg 


Ala 


Ala 


Pro 


Pro 


Pro 


Glu 


Asp 


Glu 


Glu 


Glu 


225 










230 










235 










240 


Asp 


Asp 


Phe 


Tyr 


Arg 
245 


Lys 


Lys 


Arg 


Ala 


Ala 
250 


Pro 


Pro 


Pro 


Glu 


Asp 
255 


Glu 


Glu 


Glu 


Asp 


Asp 
260 


Phe 


Tyr 


Arg 


Lys 


Lys 
265 


Arg 


Ala 


Ala 


Pro 


Pro 
270 


Pro 


Glu 


Asp 


Glu 


Glu 
275 


Glu 


Asp 


Asp 


Phe 


Tyr 
280 


Arg 


Lys 


Lys 


Arg 


Ala 
285 


Ala 


Pro 


Pro 


Pro 


Glu 
290 


Asp 


Glu 


Glu 


Glu 


Asp 
295 


Asp 


Phe 


Tyr 


Arg 


Lys 
300 


Lys 


Arg 







<210> 244 
<211> 1119 
<212> DNA 
<213> SHRIMP 



<400> 244 

atggataact 

cagttggata 

aatagtgaac 

gagatacaac 

gatgatatca 

ataattgaag 

atagcggaca 

agacactcta 

tgtgacgact 

aaattcatta 

ataattttca 

gtaaaagaag 

gacaacctca 

tttgatcacg 

tacttgaaaa 

ttgacttttc 

tctgatagag 

caactcccca 

tctctttctg 



tgaaagggga 
gatctatatt 
aaacagctaa 
aattcaagga 
tctacatttt 
cgagggaata 
tgttaacctt 
gtgaacaagc 
tgtgcaa tea 
ctttccatgc 
aagatgtcat 
gtgtggccat 
aggataatta 
ggggacattt 
cactcatggg 
taacagtgta 
agatcaataa 
aacctaaaca 
tccaaaaaca 



atttgttgcg 
ggtatttgtt 
aaaggaaggt 
cgaaataaac 
tgatcatgga 
ctctaaaccg 
gacttttatg 
cactaatagt 
attaccaaaa 
gaacctatac 
acaacaactt 
tagatgtgcg 
caatacattt 
tgaagaagta 
attaaaatgc 
cactaatgtt 
ttatcttgtc 
ataccgtccc 
tctgagtggg 



cttaaaacag 
gatgttgttg 
ctagcaacta 
aacaaatata 
gggagtttca 
ctgagggaat 
actgtgtaca 
ataaatgtca 
gaagaggaag 
atgctggaca 
cacgtgattt 
aaacagatga 
tcaaacattt 
aaacacaaag 
atgttcaaac 
atagcagaat 
caacttgtaa 
ctcagtttga 
tttctttag 



acctcaccca 
gtagattata 
gagtggcaaa 
atgctctaac 
aaagagcaaa 
tagagtgcat 
ccaatatcat 
ccctcggacg 
aagaggaaga 
cacgcctaaa 
tgcaaaagga 
acgaaataag 
tgaatgaaat 
ccataactct 
gcatatccga 
ttataaacgc 
catgtaacga 
tagataacat 



ttacaaaaca 
tgttatagta 
gcaagccaca 
aaatactttg 
acataaggcc 
gtttacgcgt 
tactgaattt 
tcttttcttg 
tttgaaacag 
gaaagat ttg 
tacctatgct 
tcaatacagg 
tgtctacatt 
gactagaaat 
aatgttgtca 
tagcaatatt 
attgtgcaac 
agcttatttt 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1119 



<210> 245 
<211> 368 
<212> PRT 
<213> SHRIMP 



<400> 245 

Met Asp Asn Leu Lys Gly Glu Phe Val Lys Thr Asp Leu Thr His Tyr 

15 10 15 

Lys Thr Gin Leu Asp Arg Ser lie Leu Val Phe Val Asp Val Val Gly 
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20 










25 










30 






Arg 


Leu 


Tyr 


Val 


He 


Val 


Asn 


Ser 


Glu 


Gin 


Thr 


Ala 


Lys 


Lys 


Glu 


Gly 






35 










40 










45 








Leu 


Ala 


Thr 


Arg 


Val 


Ala 


Lys 


Gin 


Ala 


Thr 


Glu 


He 


Gin 


Gin 


Phe 


Lys 




50 










55 










60 










Asp 


Glu 


He 


Asn 


Asn 


Lys 


Tyr 


Asn 


Ala 


Leu 


Thr 


Asn 


Thr 


Leu 


Asp 


Asp 


65 










70 










75 










80 


lie 


He 


Tyr 


He 


Phe 


Asp 


His 


Gly 


Gly 


Ser 


Phe 


Lys 


Arg 


Ala 


Lys 


His 










85 










90 










95 




Lys 


Ala 


He 


He 


Glu 


Ala 


Arg 


Glu 


Tyr 


Ser 


Lys 


Pro 


Leu 


Arg 


Glu 


Leu 








100 










105 










110 






Glu 


Cys 


Met 


Phe 


Thr 


Arg 


He 


Ala 


Asp 


Met 


Leu 


Thr 


Leu 


Thr 


Phe 


Met 






115 










120 










125 








Thr 


Val 


Tyr 


Thr 


Asn 


He 


He 


Thr 


Glu 


Phe 


Arg 


His 


Ser 


Ser 


Glu 


Gin 




130 










135 










140 










Ala 


Thr 


Asn 


Ser 


He 


Asn 


Val 


Thr 


Leu 


Gly 


Arg 


Leu 


Phe 


Leu 


Cys 


Asp 


145 










150 










155 










160 


Asp 


Leu 


Cys 


Asn 


Gin 


Leu 


Pro 


Lys 


Glu 


Glu 


Glu 


Glu 


Glu 


Glu 


Asp 


Leu 










165 










170 










175 




Lys 


Gin 


Lys 


Phe 


He 


Thr 


Phe 


His 


Ala 


Asn 


Leu 


Tyr 


Met 


Leu 


Asp 


Thr 








180 










185 










190 






Arg 


Leu 


Lys 


Lys 


Asp 


Leu 


He 


He 


Phe 


Lys 


Asp 


Val 


He 


Gin 


Gin 


Leu 






195 










200 










205 








His 


Val 


He 


Leu 


Gin 


Lys 


Asp 


Thr 


Tyr 


Ala 


Val 


Lys 


Glu 


Gly 


Val 


Ala 




210 










215 










220 










lie 


Arg 


Cys 


Ala 


Lys 


Gin 


Met 


Asn 


Glu 


He 


Ser 


Gin 


Tyr 


Arg 


Asp 


Asn 


225 










230 










235 










240 


Leu 


Lys 


Asp 


Asn 


Tyr 


Asn 


Thr 


Phe 


Ser 


Asn 


He 


Leu 


Asn 


Glu 


He 


Val 










245 










250 










255 




Tyr 


He 


Phe 


Asp 


His 


Gly 


Gly 


His 


Phe 


Glu 


Glu 


Val 


Lys 


His 


Lys 


Ala 








260 










265 










270 






He 


Thr 


Leu 


Thr 


Arg 


Asn 


Tyr 


Leu 


Lys 


Thr 


Leu 


Met 


Gly 


Leu 


Lys 


Cys 






275 










280 










285 








Met 


Phe 


Lys 


Arg 


He 


Ser 


Glu 


Met 


Leu 


Ser 


Leu 


Thr 


Phe 


Leu 


Thr 


Val 




290 










295 










300 










Tyr 


Thr 


Asn 


Val 


He 


Ala 


Glu 


Phe 


He 


Asn 


Asn 


He 


Ser 


Asp 


Arg 


Glu 


305 










310 










315 










320 


lie 


Asn 


Asn 


Tyr 


Leu 


Val 


Gin 


Leu 


Val 


Thr 


Cys 


Asn 


Glu 


Leu 


Cys 


Asn 










325 










330 










335 




Gin 


Leu 


Pro 


Lys 


Pro 


Lys 


Gin 


Tyr 


Arg 


Pro 


Leu 


Ser 


Leu 


He 


Asp 


Asn 








340 










345 










350 






He 


Ala 


Tyr 


Phe 


Ser 


Leu 


Ser 


Val 


Gin 


Lys 


His 


Leu 


Ser 


Gly 


Phe 


Leu 






355 










360 










365 









<210> 246 
<211> 1545 
<212> DNA 
<213> SHRIMP 

<400> 246 

atggatagtt gttgcctgat atcgaggata acaccagaac tggctggcaa gttaacctgg 60 

atcttcatac cagaaaacaa cttcaagatt gtccagaact cactcccaga cgaccaagtt 120 

atctcccaat tcagatattt cgaccataga cattgctata cgtttatgga gattttgatg 180 

gcaaacatta aaatccaaga caggaaacaa aacaccacag ccatatgtga attgacaact 240 

ggaagagaag gacttttatg tagaagaacc atacctgtat ttttgggttc agaggaaaaa 300 

cgagaagagt tattggggaa tctccctgaa ggtgcagaaa ttttcaggcc tagagaagtt 360 

atgcaagtaa ttggtactct cttggacaag aaactagaaa ttgacgacgg tatagcttct 420 

gtaaaggctg ccctctgtgc tggttcatca tcgttatacc taatcatgag ccacatagtg 480 

aaaatgacct tttctgctat cacaaacatg aaggatataa acgaagaata tttcgtagac 540 

tttatatttc gtcataaaca attcctcaac cctgaattct tcaagcacct tatatctttg 600 
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ctcaagaatt 
atgctatgga 
tccagcgata 
ctccgtattt 
ccgcctccgg 
attaatgcag 
cctcctatac 
atcctgtcaa 
tgcaataaat 
tgcccacctt 
ccttcgtttg 
acgttagaaa 
ggctacacta 
agatataagg 
tacataaaaa 
gtatttgaga 



ccaggaagga 
ccctttccaa 
gtgattacgg 
ttagggaacg 
aggtgtataa 
gcgatgaaat 
ctaagatggt 
agattttgaa 
gcaataggat 
gcagaaagaa 
tgcaaaaagg 
aattctgttc 
ttgagtctaa 
aactgtgggc 
ttgaagacat 
ccgtctgcaa 



acatgttgcc 
gatgaggttc 
catctgtgaa 
aaaatgctgc 
ttgggatgga 
tatcggcatg 
tgtacgaaga 
gttcaggcag 
tttcaggctc 
aagtgttgca 
aacaaaacgt 
ctgggaaaga 
gtggcagaac 
ctttgtgaac 
cgaccagtta 
aataaagagc 



catctagtaa 
acagaaatgg 
aaatgtgcac 
gatagatgtt 
aaaataaccc 
ctaaactcaa 
gtggtggacg 
gcaaatatcc 
actatcttag 
gtaaatacac 
ctacgagtgg 
ttcaatactg 
tgggaatctt 
aaacaggaaa 
ttgaggagta 
agagatggtt 



gacgtctaga 
aagaaaacta 
gaaaaactcc 
gccgtcttta 
aacaatccaa 
atgataaggg 
gtgtctacgg 
ccacgtgtct 
ggcctacaag 
aacagaaagg 
ataccggtag 
aagttttgct 
ttctgggtta 
tatcttccat 
tcttgcaaga 
tgtga 



acactttctc 
cttcccaatc 
caaatacaag 
tcaccaacaa 
taaaggctac 
aaaaacattc 
gcaaggaact 
attcgtgaca 
aaacatcctt 
agaaaataaa 
caacaagaac 
cccttggctt 
ttcgagtacc 
gaaagactcc 
ccagaagggt 



660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1545 



<210> 247 
<211> 514 
<212> PRT 
<213> SHRIMP 



<400> 247 



Met 


Asp 


Ser 


Cys 


Cys 


Leu 


He 


Ser 


Arg 


He 


Thr 


Pro 


Glu 


Leu 


Ala 


Gly 


1 








5 










10 










15 




Lys 


Leu 


Thr 


Trp 


He 


Phe 


He 


Pro 


Glu 


Asn 


Asn 


Phe 


Lys 


He 


Val 


Gin 








20 










25 










30 






Asn 


Ser 


Leu 


Pro 


Asp 


Asp 


Gin 


Val 


He 


Ser 


Gin 


Phe 


Arg 


Tyr 


Phe 


Asp 






35 










40 










45 








His 


Arg 


His 


Cys 


Tyr 


Thr 


Phe 


Met 


Glu 


He 


Leu 


Met 


Ala 


Asn 


He 


Lys 




50 










55 










60 










He 


Gin 


Asp 


Arg 


Lys 


Gin 


Asn 


Thr 


Thr 


Ala 


He 


Cys 


Glu 


Leu 


Thr 


Thr 


65 










70 










75 










80 


Gly 


Arg 


Glu 


Gly 


Leu 


Leu 


Cys 


Arg 


Arg 


Thr 


He 


Pro 


Val 


Phe 


Leu 


Gly 










85 










90 










95 




Ser 


Glu 


Glu 


Lys 


Arg 


Glu 


Glu 


Leu 


Leu 


Gly 


Asn 


Leu 


Pro 


Glu 


Gly 


Ala 








100 










105 










110 






Glu 


He 


Phe 


Arg 


Pro 


Arg 


Glu 


Val 


Met 


Gin 


Val 


He 


Gly 


Thr 


Leu 


Leu 






115 










120 










125 








Asp 


Lys 


Lys 


Leu 


Glu 


He 


Asp 


Asp 


Gly 


He 


Ala 


Ser 


Val 


Lys 


Ala 


Ala 




130 










135 










140 










Leu 


Cys 


Ala 


Gly 


Ser 


Ser 


Ser 


Leu 


Tyr 


Leu 


He 


Met 


Ser 


His 


He 


Val 


145 










150 










155 










160 


Lys 


Met 


Thr 


Phe 


Ser 


Ala 


He 


Thr 


Asn 


Met 


Lys 


Asp 


He 


Asn 


Glu 


Glu 










165 










170 










175 




Tyr 


Phe 


Val 


Asp 


Phe 


He 


Phe 


Arg 


His 


Lys 


Gin 


Phe 


Leu 


Asn 


Pro 


Glu 








180 










185 










190 






Phe 


Phe 


Lys 


His 


Leu 


He 


Ser 


Leu 


Leu 


Lys 


Asn 


Ser 


Arg 


Lys 


Glu 


His 






195 










200 










205 








Val 


Ala 


His 


Leu 


Val 


Arg 


Arg 


Leu 


Glu 


His 


Phe 


Leu 


Met 


Leu 


Trp 


Thr 




210 










215 










220 










Leu 


Ser 


Lys 


Met 


Arg 


Phe 


Thr 


Glu 


Met 


Glu 


Glu 


Asn 


Tyr 


Phe 


Pro 


He 


225 










230 










235 










240 


Ser 


Ser 


Asp 


Ser 


Asp 


Tyr 


Gly 


He 


Cys 


Glu 


Lys 


Cys 


Ala 


Arg 


Lys 


Thr 










245 










250 










255 




Pro 


Lys 


Tyr 


Lys 


Leu 


Arg 


He 


Phe 


Arg 


Glu 


Arg 


Lys 


Cys 


Cys 


Asp 


Arg 








260 










265 










270 






Cys 


Cys 


Arg 


Leu 


Tyr 


His 


Gin 


Gin 


Pro 


Pro 


Pro 


Glu 


Val 


Tyr 


Asn 


Trp 






275 










280 










285 








Asp 


Gly 


Lys 


He 


Thr 


Gin 


Gin 


Ser 


Asn 


Lys 


Gly 


Tyr 


He 


Asn 


Ala 


Gly 




290 










295 










300 
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Asp 


Glu 


He 


He 


Gly 


Met 


Leu 


Asn 


Ser 


Asn 


Asp 


Lys 


Gly 


Lys 


Thr 


Phe 


305 










310 










315 










320 


Pro 


Pro 


He 


Pro 


Lys 


Met 


Val 


Val 


Arg 


Arg 


Val 


Val 


Asp 


Gly 


Val 


Tyr 










325 










330 










335 




Gly 


Gin 


Gly 


Thr 


He 


Leu 


Ser 


Lys 


He 


Leu 


Lys 


Phe 


Arg 


Gin 


Ala 


Asn 








340 










345 










350 






He 


Pro 


Thr 


Cys 


Leu 


Phe 


Val 


Thr 


Cys 


Asn 


Lys 


Cys 


Asn 


Arg 


He 


Phe 






355 










360 










365 








Arg 


Leu 


Thr 


He 


Leu 


Gly 


Pro 


Thr 


Arg 


Asn 


He 


Leu 


Cys 


Pro 


Pro 


Cys 




370 










375 










380 










Arg 


Lys 


Lys 


Ser 


Val 


Ala 


Val 


Asn 


Thr 


Gin 


Gin 


Lys 


Gly 


Glu 


Asn 


Lys 


386 










390 










395 










400 


Pro 


Ser 


Phe 


Val 


Gin 


Lys 


Gly 


Thr 


Lys 


Arg 


Leu 


Arg 


Val 


Asp 


Thr 












405 










410 










415 




Ser 


Asn 


Lys 


Asn 


Thr 


Leu 


Glu 


Lys 


Phe 


Cys 


Ser 


Trp 


Glu 


Arg 


Phe 


Asn 








420 










425 










430 






Thr 


Glu 


Val 


Leu 


Leu 


Pro 


Trp 


Leu 


Gly 


Tyr 


Thr 


He 


Glu 


Ser 


Lys 


Trp 






435 










440 










445 








Gin 


Asn 


Trp 


Glu 


Ser 


Phe 


Leu 


Gly 


Tyr 


Ser 


Ser 


Thr 


Arg 


Tyr 


Lys 


Glu 




450 










455 










460 










Leu 


Trp 


Ala 


Phe 


Val 


Asn 


Lys 


Gin 


Glu 


He 


Ser 


Ser 


Met 


Lys 


Asp 


Ser 


465 










470 










475 










480 


Tyr 


He 


Lys 


He 


Glu 


Asp 


He 


Asp 


Gin 


Leu 


Leu 


Arg 


Ser 


He 


Leu 


Gin 










485 










490 










495 




Asp 


Gin 


Lys 


Gly 


Val 


Phe 


Glu 


Thr 


Val 


Cys 


Lys 


He 


Lys 


Ser 


Arg 


Asp 








500 










505 










510 







Gly Leu 



<210> 248 
<211> 1242 
<212> DNA 
<213> SHRIMP 



<400> 248 

atggagtcaa 

gctgaagaaa 

tttgacaaat 

agagaaaacc 

tacaaaaaag 

aaagattggg 

tttgcatcct 

cagattccag 

aacgtctacg 

tttaacgctg 

atgcaaagca 

ttctttagtg 

ctcacctcct 

atgctgttga 

actgaagccg 

ggaatgaact 

gaaatgggac 

tccctcgaga 

gtcatggctt 

ctcccaattc 

gaagacggcg 

<210> 249 
<211> 409 
<212> PRT 



tcaaactgtt 
tcaagtcaga 
tcaccaaaaa 
ccaaccgttt 
aggttgcctg 
agaaactcac 
ctgacggaat 
aagcgaggag 
gagaactgat 
cacaacactt 
ataacgattt 
gtgcattcgc 
ccaatgagtt 
aaaagggttt 
tccgaattga 
gcaagttgat 
tagaaaagca 
ataagaccaa 
ctatcaataa 
tgactgcacc 
tcggggacta 



caccgttgct 
atataaaacc 
actcattatg 
tgtatcccgc 
tttttggaca 
tcaagatgag 
tgtaattgaa 
tttctttgac 
tgatagactg 
ccccgccatc 
ggcggaacta 
atccattttc 
catttctagg 
tgttgatacc 
acaagaattt 
gagccagtac 
ctataatgtt 
cttttttgaa 
gatcaagaag 
tcctccagtc 
catcagttat 



ggtctgaata 
gaggaggaaa 
caagtaga ta 
cccattgtcc 
ttggaagaga 
aaggatttca 
aatcttacaa 
ttccaagttg 
gtgcccgacg 
aagaagaagg 
attgttgcct 
tggatcaaga 
gacgaaggtc 
ccatcaagag 
ctcacagttt 
attgaatttg 
accaacccct 
aagagag teg 
gaccaacaaa 
tcttcctcat 
gacgattttt 



tggagcaagc 
agaggattgc 
cgtctaaaca 
atgaaga tct 
ttgatttcga 
ttctccagat 
cacgtcttcg 
gaatggagag 
aaaaagacaa 
agcagtgggc 
ttgctgcagt 
acaggggtat 
ttcatcgcga 
aaaggattct 
ccctgcctgt 
tggcaga taa 
tcccattcat 
ccgagtatca 
cccaagaaac 
catccgaaca 
ag 



caaccaagtg 
ccaggaagtg 
cttacttaca 
ctgggaaatg 
aagggatcct 
tctggcgttc 
tcaagtggcg 
tattcatggc 
ggctatcttg 
tattaattgg 
tgaaggaatc 
tttgcctggt 
ctttgcatgc 
tgaaattgtc 
taaattagtg 
actattggtt 
ggacaatatt 
acgtgcccag 
tggttctcct 
agaagatgtt 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1242 
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<213> SHRIMP 
<400> 249 

Met Glu Ser He Lys Leu Phe Thr Val Ala Gin Met Glu Gin Ala Asn 

15 10 15 

Gin Val Ala Glu Glu He Lys Ser Glu Tyr Lys Thr Glu Glu Glu Lys 

20 25 30 

Arg He Ala Gin Glu Val Phe Asp Lys Phe Thr Lys Lys Leu He Met 

35 40 45 

Gin Val Asp Thr Ser Lys His Leu Leu Thr Arg Glu Asn Pro Asn Arg 

50 55 60 

Phe Val Ser Arg Pro He Val His Glu Asp Leu Trp Glu Met Tyr Lys 
65 70 75 80 

Lys Glu Val Ala Cys Phe Trp Thr Leu Glu Glu He Asp Phe Glu Arg 

85 90 95 

Asp Pro Lys Asp Trp Glu Lys Leu Thr Gin Asp Glu Lys Asp Phe He 

100 105 110 

Leu Gin He Leu Ala Phe Phe Ala Ser Ser Asp Gly He Val He Glu 

115 120 125 

Asn Leu Thr Thr Arg Leu Arg Gin Val Ala Gin He Pro Glu Ala Arg 

130 135 140 

Ser Phe Phe Asp Phe Gin Val Gly Met Glu Ser He His Gly Asn Val 
145 150 155 160 

Tyr Gly Glu Leu He Asp Arg Leu Val Pro Asp Glu Lys Asp Lys Ala 

165 170 175 

He Leu Phe Asn Ala Ala Gin His Phe Pro Ala He Lys Lys Lys Glu 

180 185 190 

Gin Trp Ala He Asn Trp Met Gin Ser Asn Asn Asp Leu Ala Glu Leu 

195 200 205 

He Val Ala Phe Ala Ala Val Glu Gly He Phe Phe Ser Gly Ala Phe 

210 215 220 

Ala Ser He Phe Trp He Lys Asn Arg Gly He Leu Pro Gly Leu Thr 
225 230 235 240 

Ser Ser Asn Glu Phe He Ser Arg Asp Glu Gly Leu His Arg Asp Phe 

245 250 255 

Ala Cys Met Leu Leu Lys Lys Gly Phe Val Asp Thr Pro Ser Arg Glu 

260 265 270 

Arg He He Val Thr Glu Ala Val Arg He Glu Gin Glu Phe Leu Thr 

275 280 285 

Val Ser Leu Pro Val Lys Leu Val Gly Met Asn Cys Lys Leu Met Ser 

290 295 300 

Gin Tyr He Glu Phe Val Ala Asp Lys Leu Leu Val Glu Met Gly Leu 
305 310 315 320 

Glu Lys His Tyr Asn Val Thr Asn Pro Phe Pro Phe Met Asp Asn He 

325 330 335 

Ser Leu Glu Asn Lys Thr Asn Phe Phe Glu Lys Arg Val Ala Glu Tyr 

340 345 350 

Gin Arg Ala Gin Val Met Ala Ser He Asn Lys He Lys Lys Asp Gin 

355 360 365 

Gin Thr Gin Glu Thr Gly Ser Pro Leu Pro He Leu Thr Ala Pro Pro 

370 375 380 

Pro Val Ser Ser Ser Ser Ser Glu Gin Glu Asp Val Glu Asp Gly Val 
385 390 395 400 

Gly Asp Tyr He Ser Tyr Asp Asp Phe 
405 



<210> 250 

<211> 915 

<212> DNA 

<213> SHRIMP 
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<400> 250 

atgaatttac 

tctaccatct 

gaagaggtgg 

ggtgtggatc 

aaaggtcctt 

gaggattgta 

gtcttttccc 

aaccctggta 

aaatactctt 

tatcaagaga 

gctcgaggcc 

tttgctctag 

attgaaaatc 

taccaacaca 

gccctctaca 

atgtcttgga 



ttccaatatt 
gccttttacc 
tataccctac 
ttttgaccct 
gtccaaggat 
aagggtttag 
tttgttgggt 
tagtaactga 
ctaaaat ttt 
ggattaagat 
acttggctcc 
agaatgcagt 
gtgcaagaac 
agaagaagga 
agattgtgta 
aataa 



cctgacaacc 
tgatgggaag 
agatgtgtgt 
ctgtatagga 
caataacgct 
aaaggggttc 
agacagagat 
tgatgaagat 
tggtaaagga 
attaaagtct 
ggctggagat 
acctcaaata 
tacgccaggt 
atatctagac 
cgacaagaat 



ttttttgttg 
aaacaaccct 
gggccaaagg 
ggtaaaaaca 
gtcgttgaac 
cgaattcctg 
atgcacgcca 
ttggtagatt 
ttcaatccga 
cattttaaca 
tttttcctcg 
cagaaccata 
gccgcgtggg 
aagaagaaga 
aacaaggaat 



cggtagatgc 
tggtttttga 
gagctggcga 
atggaggtga 
gagattactc 
gcactgacca 
agtgggtgcg 
ctggtattag 
gacctcttta 
agaggacggg 
cttcagagag 
acaatggtga 
ctgagactgg 
agtacatccc 
tgttccgtgt 



atgctcttgc 
ttctgtatta 
attattcact 
atggtcagga 
ccttgacgag 
ttttcatact 
caacaaaata 
gactaaattt 
ctccctcgac 
taatttcttt 
atgggcaact 
atggaaagat 
accaatattt 
tatccctcat 
acagagtgat 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

915 



<210> 251 
<211> 298 
<212> PRT 
<213> SHRIMP 



<400> 251 



Met 


Asn 


Leu 


Leu 


Pro 


lie 


Phe 


Leu 


Thr 


Thr 


Phe 


Phe 


Val 


Ala 


Val 


Asp 


1 








5 










10 










15 




Ala 


Cys 


Ser 


Cys 


Ser 


Thr 


He 


Cys 


Leu 


Leu 


Pro 


Asp 


Gly 


Lys 


Lys 


Gin 








20 










25 










30 






Pro 


Leu 


Val 


Phe 


Asp 


Ser 


Val 


Leu 


Glu 


Glu 


Val 


Val 


Tyr 


Pro 


Thr 


Asp 






35 










40 










45 








Val 


Cys 


Gly 


Pro 


Lys 


Gly 


Ala 


Gly 


Glu 


Leu 


Phe 


Thr 


Gly 


Val 


Asp 


Leu 




50 










55 










60 










Leu 


Thr 


Leu 


Cys 


lie 


Gly 


Gly 


Lys 


Asn 


Asn 


Gly 


Gly 


Glu 


Trp 


Ser 


Gly 


65 










70 










75 










80 


Lys 


Gly 


Pro 


Cys 


Pro 


Arg 


He 


Asn 


Asn 


Ala 


Val 


Val 


Glu 


Arg 


Asp 


Tyr 










85 










90 










95 




Ser 


Leu 


Asp 


Glu 


Glu 


Asp 


Cys 


Lys 


Gly 


Phe 


Arg 


Lys 


Gly 


Phe 


Arg 


He 








100 










105 










110 






Pro 


Gly 


Thr 


Asp 


His 


Phe 


His 


Thr 


Val 


Phe 


Ser 


Leu 


Cys 


Trp 


Val 


Asp 






115 










120 










125 








Arg 


Asp 


Met 


His 


Ala 


Lys 


Trp 


Val 


Arg 


Asn 


Lys 


He 


Asn 


Pro 


Gly 


He 




130 










135 










140 










Val 


Thr 


Asp 


Asp 


Glu 


Asp 


Leu 


Val 


Asp 


Ser 


Gly 


He 


Arg 


Thr 


Lys 


Phe 


145 










150 










155 










160 


Lys 


Tyr 


Ser 


Ser 


Lys 


lie 


Phe 


Gly 


Lys 


Gly 


Phe 


Asn 


Pro 


Arg 


Pro 


Lys 










165 










170 










175 




Leu 


Asp 


Tyr 


Gin 


Glu 


Arg 


He 


Lys 


He 


Leu 


Lys 


Ser 


His 


Phe 


Asn 


Lys 








180 










185 










190 






Arg 


Thr 


Gly 


Asn 


Phe 


Phe 


Arg 


His 


Leu 


Ala 


Pro 


Ala 


Gly 


Asp 


Phe 


Phe 






195 










200 










205 








Leu 


Ala 


Ser 


Trp 


Ala 


Thr 


Phe 


Ala 


Leu 


Glu 


Asn 


Ala 


Val 


Pro 


Gin 


He 




210 










215 










220 










Gin 


Asn 


His 


Asn 


Asn 


Gly 


Glu 


Trp 


Lys 


Asp 


He 


Glu 


Asn 


Arg 


Ala 


Arg 


225 










230 










235 










240 


Thr 


Thr 


Pro 


Gly 


Ala 


Ala 


Trp 


Ala 


Glu 


Thr 


Gly 


Pro 


He 


Phe 


Tyr 


Gin 










245 










250 










255 




His 


Lys 


Lys 


Lys 


Glu 


Tyr 


Leu 


Asp 


Lys 


Lys 


Lys 


Lys 


Tyr 


He 


Pro 


He 








260 










265 










270 






Pro 


His 


Ala 


Leu 


Tyr 


Lys 


He 


Val 


Tyr 


Asp 


Lys 


Asn 


Asn 


Lys 


Glu 


Leu 






275 










280 










285 
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Phe Arg Val Gin Ser Asp Met Ser Trp Lys 
290 295 



<210> 252 
<211> 789 
<212> DNA 
<213> SHRIMP 



<400> 252 

atgtcgttgg 

acctctgtat 

tacataattt 

ttcgatgaag 

tgcgcacaga 

atgggggaag 

gttaatttta 

gatactacta 

ccttttttca 

catgataacc 

cataaagctc 

actctcgatt 

gactatattg 

ctccgttga 



ctgtgacaga 
atcacacccc 
ctcgtcgatt 
aaacaggagc 
aaggaggaat 
taacgcgcct 
aacatgggca 
ctgttactcc 
cacgcactct 
caaaaagatc 
cttctgttcc 
gttcaatgga 
ctaacgaccc 



agattacggg 
tctattaggt 
ca tgaactac 
agttatagct 
atatgatgca 
atttactatt 
gacaaataat 
tcctgttact 
catatccgag 
ttcatttaag 
taccttcccc 
gattttttca 
cgtaccattt 



cacaatgaaa 
gcagaccatg 
acaaatttat 
aatatctgtc 
ccagaagatg 
ataggaggta 
cctgcctatg 
cctcctccat 
tcgtcttcag 
gtgtatgata 
cctaaaacct 
cgagacaggg 
ttagtggatg 



agttgatcaa 
taatgaaatc 
taaaacaagt 
tgttaaaaat 
ttgcattctt 
ggcccaatat 
gttatctcac 
ctccagctgc 
ttgaccatta 
ttcacgcaga 
cgtttgaaat 
atgttttaga 
ttgtgcaccg 



acggttacaa 60 
catatcagac 120 
tgaatatgtt 180 
cctagaaaga 240 
caattctaag 300 
gacggtgcgg 360 
agatgataat 4 20 
aagaagatcc 480 
tgtattgatg 540 
aacctttccc 600 
ttctgacgtg 660 
caatgttcac 720 
tggatctagt 780 
789 



<210> 253 
<211> 260 
<212> PRT 
<213> SHRIMP 



<400> 253 



Met 


Ser 


Leu 


Ala 


Val 


Thr 


Glu 


Asp 


Tyr 


Gly 


His 


Asn 


Glu 


Lys 


Leu 


He 


1 








5 










10 










15 




Lys 


Arg 


Leu 


Gin 


Thr 


Ser 


Val 


Tyr 


His 


Thr 


Pro 


Leu 


Leu 


Gly 


Ala 


Asp 








20 










25 










30 






His 


Val 


Met 


Lys 


Ser 


He 


Ser 


Asp 


Tyr 


He 


He 


Ser 


Arg 


Arg 


Phe 


Met 






35 










40 










45 








Asn 


Tyr 


Thr 


Asn 


Leu 


Leu 


Lys 


Gin 


Val 


Glu 


Tyr 


Val 


Phe 


Asp 


Glu 


Glu 




50 










55 










60 










Thr 


Gly 


Ala 


Val 


He 


Ala 


Asn 


He 


Cys 


Leu 


Leu 


Lys 


He 


Arg 


Cys 


Ala 


65 










70 










75 










80 


Gin 


Lys 


Gly 


Gly 


He 


Tyr 


Asp 


Ala 


Pro 


Glu 


Asp 


Val 


Ala 


Phe 


Phe 


Asn 










85 










90 










95 




Ser 


Lys 


Met 


Gly 


Glu 


Val 


Thr 


Arg 


Leu 


Phe 


Thr 


He 


He 


Gly 


Gly 


Arg 








100 










105 










110 






Pro 


Asn 


Met 


Thr 


Val 


Arg 


Val 


Asn 


Phe 


Lys 


His 


Gly 


Gin 


Thr 


Asn 


Asn 






115 










120 










125 








Pro 


Ala 


Tyr 


Gly 


Tyr 


Leu 


Thr 


Asp 


Asp 


Asn 


Asp 


Thr 


Thr 


Thr 


Val 


Thr 




130 










135 










140 










Pro 


Pro 


Val 


Thr 


Pro 


Pro 


Pro 


Ser 


Pro 


Ala 


Ala 


Arg 


Arg 


Ser 


Pro 


Phe 


145 










150 










155 










160 


Phe 


Thr 


Arg 


Thr 


Leu 


He 


Ser 


Glu 


Ser 


Ser 


Ser 


Val 


Asp 


His 


Tyr 


Val 










165 










170 










175 




Leu 


Met 


His 


Asp 


Asn 


Pro 


Lys 


Arg 


Ser 


Ser 


Phe 


Lys 


Val 


Tyr 


Asp 


He 








180 










185 










190 






His 


Ala 


Glu 


Thr 


Phe 


Pro 


His 


Lys 


Ala 


Pro 


Ser 


Val 


Pro 


Thr 


Phe 


Pro 






195 










200 










205 








Pro 


Lys 


Thr 


Ser 


Phe 


Glu 


He 


Ser 


Asp 


Val 


Thr 


Leu 


Asp 


Cys 


Ser 


Met 




210 










215 










220 










Glu 


He 


Phe 


Ser 


Arg 


Asp 


Arg 


Asp 


Val 


Leu 


Asp 


Asn 


Val 


His 


Asp 


Tyr 


225 










230 










235 










240 
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lie Ala Asn Asp Pro Val Pro Phe Leu Val Asp Val Val His Arg Gly 
245 250 255 

Ser Ser Leu Arg 
260 



<210> 254 
<211> 2571 
<212> DNA 
<213> SHRIMP 



<400> 254 

atggcatcca 

ccacaaccca 

gctgtggata 

ctttctagta 

aaacatcttc 

cctattgaaa 

atatcattat 

tccaacgcgt 

cgtaaaaagt 

catggtcctc 

aacaggctgg 

ataattaata 

atatcgaggg 

gacagtaatc 

aatacatttt 

ttagtgcata 

tcattatcgg 

aaggtgattt 

tattctaagg 

acgtttttgc 

attatatgta 

ctccctctct 

tttagagatt 

cagtctgctt 

gatgatgatg 

cccttaataa 

agagacaagc 

aatacactct 

gagaatgttt 

ggaacattat 

acaacacctt 

ggtctttaca 

gagtttgtcc 

tcctacgata 

tacgaaatag 

actatgaaca 

caaaagaata 

tctgaatgtg 

tgcggtcatt 

aatggcagcc 

ctcggaaaaa 

aaaaatgtcg 

agatatgaat 



cttcttcatc 
aaaagaaaaa 
aaaaggtggt 
ccgaactgca 
tagacgaagg 
caatatacga 
caatgttgat 
gcgttcaaaa 
atctgatatt 
tttctcacat 
tgcgttactt 
gtgtcttgtg 
gtgttttcat 
ggccacaaaa 
catccagaat 
actttgatta 
acagcgtcaa 
ccgactggaa 
ggttgatgac 
gcgccttgaa 
tactgaaaaa 
tcaaacactt 
acagtctcat 
cacttttcat 
aaggaggaaa 
gccaacttat 
actttactcc 
tgtgccttat 
tgcatatagc 
ccagcgaaca 
taatgatcgc 
agcccaaaat 
tcttaaaggg 
ttaacatcat 
cagcagcagg 
cgtggaactt 
ggcagaaatt 
caatatgctt 
gcttccacaa 
gcgcaagagg 
gactaggggt 
tgccttcggt 
ttcttgtagg 



aactaagaaa 
atcaaagaaa 
aaatctcata 
aatgataact 
agaatttaac 
aatactctct 
ccacatactt 
tgttttgggg 
ggaagactta 
gctccattct 
tagcacgtat 
gggaattgat 
tgtgtcgtac 
tactccttca 
ctcagtgtac 
cggttccaaa 
cacttttgtg 
aatgccttat 
attatcagtg 
ggaaagggaa 
gtctttgttc 
tttaaaaatt 
gtgcaaaact 
ggaagaattc 
gggacatacc 
ttcacaccca 
cctcatgcac 
aataaatgga 
gattgaaaat 
aattgaaaaa 
cttggcgagg 
aaaggtccgt 
cctaaaggaa 
aaaggatgca 
actgcgaggc 
tttcaccaaa 
tgtaaagatt 
ggatagtctg 
cgtctgttgg 
aggaggaata 
tgccgactat 
agaagaagga 
tggagtgtgg 



agagtacatg 
gtactaccat 
cacaagatat 
gaatgtaatg 
cctactataa 
tcttctgctg 
ttcttcgctg 
aacgactata 
ctgaacggcg 
tcaatcccta 
gatggaga tg 
aaaagtgttc 
gtaccgatgc 
caaaatatgt 
agaaacaata 
gatgcatcat 
aggttgtaca 
gggttcttta 
tctgaatata 
attttaatct 
ggatttaatt 
gaagaggtta 
caaaaagatt 
ccttggct tg 
ctattaacat 
attttaaaat 
ctcgccaaca 
gctaaaccag 
gttaactatg 
atggtcaa tg 
gagaatattg 
ttcggttctt 
tcagttgca t 
gtaatggaca 
aataactgcg 
aattcaacca 
gtggatggt a 
gacggggatc 
ttatccttga 
aaatgcccgt 
gatattgaaa 
agaagggaa t 
acaaatgaaa 



aagaagatga 
ttcctgttga 
tagatcaaga 
gtgcgcgaga 
ttgaagtagt 
acgacaagaa 
ataagggtac 
aagtggaatt 
tttcaaatca 
ttgtacaaga 
ctcaattcga 
tcaacgaatt 
gtgtacgaac 
cagcactagg 
cctttaaaaa 
catcatctcc 
ccaactatga 
agaaaacttt 
cactcaaaaa 
ataaaatgga 
tcaggtgttt 
aacatatagc 
tgcagagttt 
caaaaacttg 
ttgctatagt 
cgttagtgaa 
cgtctataat 
agt tcataaa 
gagtcatcac 
taagaagaat 
tactcgctca 
caaagaggct 
atttggaaac 
acagtctttt 
accc tgaagc 
aatgggcaag 
tgaataggac 
ttccttcagg 
taaggatgag 
Gctgcagaca 
cagaggaaga 
ggaggaagat 
taaaactata 



aaatctcatc 
caagtataga 
aaaggaccac 
agatctgctt 
atcatccatg 
gtttgtacag 
tatgtgggta 
tgaaaatata 
ttggtctgaa 
catgttattg 
tatatcattc 
gacacaattg 
accttcaaag 
tatgaaactc 
actaaccgag 
tcctcctcct 
catattctta 
tgacgtcctt 
agagttggtt 
gaagagagac 
aaaacaatta 
acgttttgtc 
ccctgccata 
gatcgacgac 
gcacagatat 
tactacatgt 
gtaccaatgc 
caagttcaac 
tgaattgaga 
gatggataat 
gctttttgac 
aaggatacca 
gaggaatata 
tgaagaggag 
agacgagaag 
ctctattttc 
atatgaagac 
gagaacaacg 
cgggccgaat 
agtcacctgc 
acgtgacacg 
tggtgttgac 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2571 



<210> 255 
<211> 846 
<212> PRT 
<213> SHRIMP 
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<400> 255 



Met 


Ala 


Ser 


Thr 


Ser 


Ser 


Ser 


Thr 


Lys 


Lys 


Arg 


Val 


His 


Glu 


Glu 


Asp 


1 








5 










10 










15 




Glu 


Asn 


Leu 


He 


Pro 


Gin 


Pro 


Lys 


Lys 


Lys 


Lys 


Ser 


Lys 


Lys 


Val 


Leu 








20 










25 










30 






Pro 


Phe 


Pro 


Val 


Asp 


Lys 


Tyr 


Arg 


Ala 


Val 


Asp 


Lys 


Lys 


Val 


Val 


Asn 






35 










40 










45 








Leu 


He 


His 


Lys 


He 


Leu 


Asp 


Gin 


Glu 


Lys 


Asp 


His 


Leu 


Ser 


Ser 


Thr 




50 










55 










60 










Glu 


Leu 


Gin 


Met 


He 


Thr 


Glu 


Cys 


Asn 


Gly 


Ala 


Arg 


Glu 


Asp 


Leu 


Leu 


65 










70 










75 










80 


Lys 


His 


Leu 


Leu 


Asp 


Glu 


Gly 


Glu 


Phe 


Asn 


Pro 


Thr 


He 


He 


Glu 


Val 










85 










90 










95 




Val 


Ser 


Ser 


Met 


Pro 


He 


Glu 


Thr 


He 


Tyr 


Glu 


He 


Leu 


Ser 


Ser 


Ser 








100 










105 










HO 






Ala 


Asp 


Asp 


Lys 


Lys 


Phe 


Val 


Gin 


He 


Ser 


Leu 


Ser 


Met 


Leu 


He 


His 






115 










120 










125 








He 


Leu 


Phe 


Phe 


Ala 


Asp 


Lys 


Gly 


Thr 


Met 


Trp 


Val 


Ser 


Asn 


Ala 


Cys 




130 










135 










140 










Val 


Gin 


Asn 


Val 


Leu 


Gly Asn 


Asp 


Tyr 


Lys 


Val 


Glu 


Phe 


Glu 


Asn 


He 


145 










150 










155 










160 


Arg 


Lys 


Lys 


Tyr 


Leu 


He 


Asp 


Leu 


Leu 


Asn 


Gly 


Val 


Ser 


Asn 


His 


Trp 










165 










170 










175 




Ser 


Glu 


His 


Gly 


Pro 


Leu 


Ser 


His 


Met 


Leu 


His 


Ser 


Ser 


He 


Pro 


He 








180 










185 










190 






Val 


Gin 


Asp 


Met 


Leu 


Leu 


Asn 


Arg 


Leu 


Val 


Arg 


Tyr 


Phe 


Ser 


Thr 


Tyr 






195 










200 










205 








Asp 


Gly 


Asp 


Ala 


Gin 


Phe 


Asp 


He 


Ser 


Phe 


He 


He 


Asn 


Ser 


Val 


Leu 




210 










215 










220 










Trp 


Gly 


He 


Asp 


Lys 


Ser 


Val 


Leu 


Asn 


Glu 


Leu 


Thr 


Gin 


Leu 


He 


Ser 


225 










230 










235 










240 


Arg 


Gly 


Val 


Phe 


He 


Val 


Ser 


Tyr 


Val 


Pro 


Met 


Arg 


Val 


Arg 


Thr 


Pro 










245 










250 










255 




Ser 


Lys 


Asp 


Ser 


Asn 


Arg 


Pro 


Gin 


Asn 


Thr 


Pro 


Ser 


Gin 


Asn 


Met 


Ser 








260 










265 










270 






Ala 


Leu 


Gly 


Met 


Lys 


Leu 


Asn 


Thr 


Phe 


Ser 


Ser 


Arg 


He 


Ser 


Val 


Tyr 






275 










280 










285 








Arg 


Asn 


Asn 


Thr 


Phe 


Lys 


Lys 


Leu 


Thr 


Glu 


Leu 


Val 


His 


Asn 


Phe 


Asp 




290 










295 










300 










Tyr 


Gly 


Ser 


Lys 


Asp 


Ala 


Ser 


Ser 


Ser 


Ser 


Pro 


Pro 


Pro 


Pro 


Ser 


Leu 


305 










310 










315 










320 


Ser 


Asp 


Ser 


Val 


Asn 


Thr 


Phe 


Val 


Arg 


Leu 


Tyr 


Thr 


Asn 


Tyr 


Asp 


He 










325 










330 










335 




Phe 


Leu 


Lys 


Val 


He 


Ser 


Asp 


Trp 


Lys 


Met 


Pro 


Tyr 


Gly 


Phe 


Phe 


Lys 








340 










345 










350 






Lys 


Thr 


Phe 


Asp 


Val 


Lys 


Lys 


Gly 


Leu 


Met 


Thr 


Leu 


Ser 


Val 


Ser 


Glu 






355 










360 










365 








Tyr 


Thr 


Leu 


Lys 


Lys 


Glu 


Leu 


Val 


Thr 


Phe 


Leu 


Arg 


Ala 


Leu 


Lys 


Glu 




370 










375 










380 










Arg 


Glu 


Tie 


Leu 


He 


Tyr 


Lys 


Met 


Glu 


Lys 


Arg 


Asp 


He 


He 


Cys 


He 


385 










390 










395 










400 


Leu 


Lys 


Lys 


Ser 


Leu 


Phe 


Gly 


Phe 


Asn 


Phe 


Arg 


Cys 


Leu 


Lys 


Gin 


Leu 










405 










410 










415 




Leu 


Pro 


Leu 


Phe 


Lys 


His 


Phe 


Leu 


Lys 


He 


Glu 


Glu 


Val 


Lys 


His 


He 








420 










425 










430 






Ala 


Arg 


Phe 


Val 


Phe 


Arg 


Asp 


Tyr 


Ser 


Leu 


Met 


Cys 


Lys 


Thr 


Gin 


Lys 






435 










440 










445 








Asp 


Leu 


Gin 


Ser 


Phe 


Pro 


Ala 


He 


Gin 


Ser 


Ala 


Ser 


Leu 


Phe 


Met 


Glu 




450 










455 










460 










Glu 


Phe 


Pro 


Trp 


Leu 


Ala 


Lys 


Thr 


Trp 


He 


Asp 


Asp 


Asp 


Asp 


Asp 


Glu 


465 










470 










475 










480 



wo 01/38351 



349 



PCT/USOO/28888 



Gly 


Gly 


Lys 


Gly 


His 
485 


Thr 


Leu 


Leu 


Thr 


Phe 
490 


Ala 


He 


Val 


His 


Arg 
495 


Tyr 


Pro 


Leu 


lie 


Ser 
500 


Gin 


Leu 


He 


Ser 


His 
505 


Pro 


He 


Leu 


Lys 


Ser 
510 


Leu 


Val 


Asn 


Thr 


Thr 

515 


Cys 


Arg 


Asp 


Lys 


His 
520 


Phe 


Thr 


Pro 


Leu 


Met 

525 


His 


Leu 


Ala 


Asn 


Thr 
530 


Ser 


He 


Met 


Tyr 


Gin 
535 


Cys 


Asn 


Thr 


Leu 


Leu 
540 


Cys 


Leu 


He 


He 


Asn 


Gly 


Ala 


Lys 


Pro 


Glu 


Phe 


He 


Asn 


Lys 


Phe 


Asn 


Glu 


Asn 


Val 


Leu 


545 










550 










555 










560 


His 


lie 


Ala 


He 


Glu 

565 


Asn 


Val 


Asn 


Tyr 


Gly 
570 


Val 


He 


Thr 


Glu 


Leu 

575 


Arg 


Gly 


Thr 


Leu 


Ser 
580 


Ser 


Glu 


Gin 


He 


Glu 
585 


Lys 


Met 


Val 


Asn 


Val 
590 


Arg 


Arg 


Met 


Met 


Asp 
595 


Asn 


Thr 


Thr 


Pro 


Leu 
600 


Met 


He 


Ala 


Arg 


Glu 
605 


Asn 


He 


Val 


Leu 


Ala 
610 


Gin 


Leu 


Phe 


Asp 


Gly 

615 


Lys 


Pro 


Lys 


He 


Lys 
620 


Val 


Arg 


Phe 


Gly 


Ser 


Ser 


Lys 


Arg 


Leu 


Arg 


He 


Pro 


Glu 


Phe 


Val 


Leu 


Leu 


Lys 


Gly 


Leu 


625 










630 










635 










640 


Lys 


Glu 


Ser 


Val 


Ala 
645 


Tyr 


Leu 


Glu 


Thr 


Arg 
650 


Asn 


He 


Ser 


Tyr 


Asp 
655 


He 


Asn 


lie 


lie 


Lys 
660 


Asp 


Ala 


Val 


Met 


Asp 
665 


Asn 


Ser 


Leu 


Phe 


Glu 
670 


Glu 


Glu 


Tyr 


Glu 


lie 
675 


Ala 


Ala 


Ala 


Gly 


Leu 
680 


Arg 


Gly 


Asn 


Asn 


Cys 
685 


Asp 


Pro 


Glu 


Ala 


Asp 
690 


Glu 


Lys 


Thr 


Met 


Asn 
695 


Thr 


Trp 


Asn 


Phe 


Phe 
700 


Thr 


Lys 


Asn 


Ser 


Thr 


Lys 


Trp 


Ala 


Ser 


Ser 


He 


Phe 


Gin 


Lys 


Asn 


Arg 


Gin 


Lys 


Phe 


Val 


705 










710 










715 










720 


Lys 


lie 


Val 


Asp 


Gly 
725 


Met 


Asn 


Arg 


Thr 


Tyr 
730 


Glu 


Asp 


Ser 


Glu 


Cys 
735 


Ala 


lie 


Cys 


Leu 


Asp 
740 


Ser 


Leu 


Asp 


Gly 


Asp 
745 


Leu 


Pro 


Ser 


Gly 


Arg 
750 


Thr 


Thr 


Cys 


Gly 


His 
755 


Cys 


Phe 


His 


Asn 


Val 
760 


Cys 


Trp 


Leu 


Ser 


Leu 
765 


He 


.Arg 


Met 


Ser 


Gly 
770 


Pro 


Asn 


Asn 


Gly 


Ser 
775 


Arg 


Arg 


Gly 


Gly 


He 
780 


Lys 


Cys 


Pro 


Ser 


Cys 


Arg 


Gin 


Val 


Thr 


Cys 


Leu 


Gly 


Lys 


Arg 


Leu 


Gly 


Val 


Ala 


Asp 


Tyr 


785 










790 










795 










800 


Asp 


lie 


Glu 


Thr 


Glu 
805 


Glu 


Glu 


Arg 


Asp 


Thr 
810 


Lys 


Asn 


Val 


Val 


Pro 
815 


Ser 


Val 


Glu 


Glu 


Gly 
820 


Arg 


Arg 


Glu 


Trp 


Arg 
825 


Lys 


He 


Gly 


Val 


Asp 
830 


Arg 


Tyr 


Glu 


Phe 


Leu 
835 


Val 


Gly 


Gly 


Val 


Trp 
840 


Thr 


Asn 


Glu 


He 


Lys 
845 


Leu 







<210> 256 
<211> 930 
<212> DNA 
<213> SHRIMP 



<400> 256 

atggacaatc 

gtcggctgct 

gacagaacta 

aggaacggga 

aaggacctgg 

acccctcaag 

cccaggccag 



ttatcaccaa 
ccatgacta t 
ctacttgca t 
gcaacaccga 
acatctctga 
tcacaaaact 
ttgccaatac 



cgacaacatt 
tgggctcgca 
ttcttgctca 
atccagtttc 
acccatgctc 
ggtaacattt 
acctcaacaa 



atcctcgtca 
ctggccatga 
ccgtgggaga 
atcagccacg 
aaatctacca 
tctggtccaa 
caaccaacaa 



ctttcctgag 
acatgctcgt 
agaataagaa 
tccggttcaa 
cttacgatct 
cctatgctag 
gtacaaataa 



tggattggca 60 
gaagtgcatc 120 
taagaagaac 180 
cactccagat 240 
ggccaatgtt 300 
tccgcctaca 360 
agaggaagaa 420 
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agtgtctata 
ctgccaacac 
aacggacgac 
gaggagggag 
caacaatctc 
aacaccctaa 
acgtttaaga 
actacaattg 
aggaaattgt 



tgccaatgtc 
cgccgccatc 
atctgaagct 
aggatgataa 
catctatcca 
ccaggagcag 
aattcccacc 
tgtcttccac 
catttaagag 



gagctgctcg 
tccacctaga 
tccttcaaac 
tgtggaagaa 
gaagtgtatc 
tagtaacaac 
taacaataat 
accttctcct 
gaaacaataa 



tcgtcatttt 
agcaatggcg 
ccaccttctc 
catgtctacg 
caggaattga 
aacaacaatg 
aacatgtggg 
acctttattc 



cttctgacaa 
gtgattacgt 
ccatcttcaa 
aatacgtgcc 
aggagatgaa 
ctccacgtat 
agaatcatgt 
cttcacctaa 



tagtcttcct 
gtcatatgta 
tatcaagaat 
agaagtacct 
acacaagaaa 
aacccaagtt 
gtatggaaac 
aagtatcata 



480 
540 
600 
660 
720 
780 
840 
900 
930 



<210> 257 
<211> 305 
<212> PRT 
<213> SHRIMP 



<400> 257 



Met 


Asp 


Asn 


Leu 


He 


Thr 


Asn 


Asp 


Asn 


He 


He 


Leu 


Val 


Thr 


Phe 


Leu 


1 








5 










10 










15 




Ser 


Gly 


Leu 


Ala 


Val 


Gly 


Cys 


Ser 


Met 


Thr 


He 


Gly 


Leu 


Ala 


Met 


Asn 








20 










25 










30 






Met 


Leu 


Val 


Lys 


Cys 


He 


Asp 


Arg 


Thr 


Thr 


Thr 


Cys 


He 


Ser 


Cys 


Ser 






35 










40 










45 








Pro 


Trp 


Glu 


Lys 


Asn 


Lys 


Asn 


Lys 


Lys 


Asn 


Arg 


Asn 


Gly 


Ser 


Asn 


Thr 




50 










55 










60 










Glu 


Ser 


Ser 


Phe 


He 


Ser 


His 


Val 


Arg 


Phe 


Asn 


Thr 


Pro 


Asp 


Lys 


Asp 


65 










70 










75 










80 


Leu 


Asp 


He 


Ser 


Glu 


Pro 


Met 


Leu 


Lys 


Ser 


Thr 


Thr 


Tyr 


Asp 


Leu 


Ala 










85 










90 










95 




Asn 


Val 


Thr 


Pro 


Gin 


Val 


Thr 


Lys 


Leu 


Val 


Thr 


Phe 


Ser 


Gly 


Pro 


Thr 








100 










105 










110 






Tyr 


Asp 


Pro 


Thr 


Pro 


Arg 


Pro 


Val 


Ala 


Asn 


Thr 


Pro 


Gin 


Gin 


Gin 


Pro 






115 










120 










125 








Thr 


Ser 


Thr 


Asn 


Lys 


Glu 


Glu 


Glu 


Ser 


Val 


Tyr 


Met 


Pro 


Met 


Ser 


Ser 




130 










135 










140 










Cys 


Ser 


Ser 


Ser 


Phe 


Ser 


Ser 


Asp 


Asn 


Ser 


Leu 


Pro 


Leu 


Pro 


Thr 


Pro 


145 










150 










155 










160 


Pro 


Pro 


Ser 


Pro 


Pro 


Arg 


Ser 


Asn 


Gly 


Gly 


Asp 


Tyr 


Val 


Ser 


Tyr 


Val 










165 










170 










175 




Asn 


Gly 


Arg 


His 


Leu 


Lys 


Leu 


Pro 


Ser 


Asn 


Pro 


Pro 


Ser 


Pro 


He 


Phe 








180 










185 










190 






Asn 


He 


Lys 


Asn 


Glu 


Glu 


Gly 


Glu 


Asp 


Asp 


Asn 


Val 


Glu 


Glu 


His 


Val 






195 










200 










205 








Tyr 


Glu 


Tyr 


Val 


Pro 


Glu 


Val 


Pro 


Gin 


Gin 


Ser 


Pro 


Ser 


He 


Gin 


Lys 




210 










215 










220 










Cys 


He 


Gin 


Glu 


Leu 


Lys 


Glu 


Met 


Lys 


His 


Lys 


Lys 


Asn 


Thr 


Leu 


Thr 


225 










230 










235 










240 


Arg 


Ser 


Ser 


Ser 


Asn 


Asn 


Asn 


Asn 


Asn 


Ala 


Pro 


Arg 


He 


Thr 


Gin 


Val 










245 










250 










255 




Thr 


Phe 


Lys 


Lys 


Phe 


Pro 


Pro 


Asn 


Asn 


Asn 


Asn 


Met 


Trp 


Glu 


Asn 


His 








260 










265 










270 






Val 


Tyr 


Gly 


Asn 


Thr 


Thr 


He 


Val 


Ser 


Ser 


Thr 


Pro 


Ser 


Pro 


Thr 


Phe 






275 










280 










285 








He 


Pro 


Ser 


Pro 


Lys 


Ser 


He 


He 


Arg 


Lys 


Leu 


Ser 


Phe 


Lys 


Arg 


Lys 




290 










295 










300 











Gin 
305 



<210> 258 
<211> 549 
<212> DNA 
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<213> SHRIMP 



<400> 258 

atgggagaaa 

tccagaaaca 

tttctccagt 

tgtcattctg 

gaagaggacc 

gtcgaacaag 

gtgtccttgc 

agggaagaat 

atcccaagac 

acggtataa 



gtattttcga 
agaagttatt 
gtcggatgat 
taaacgtgcc 
gtgccaataa 
gtgaattatg 
ataaggactt 
ctcggtctat 
cagaagggac 



tgctgtaagt 
gcgggaactc 
agattttcac 
agatgttgtg 
ccccgggcta 
catcatcaac 
atttggtgaa 
ccacctatat 
aaatactgtg 



ctggcaacaa 
aagaatatgc 
ttttctggtg 
cctaatacaa 
tacgattcta 
aagtcaagcg 
gatatcttgg 
ctggaggctg 
aactacacta 



ataatcctaa 
ggaaagattt 
acattataga 
tatttgcagt 
ttgaaggagt 
ttcacgagtt 
atggaataga 
ggcagagtat 
tcgttttttc 



aaagtcaaac 
tccttcaacc 
taaacattac 
tttcttacct 
atgtataaca 
caatattctg 
aactgcatca 
cagaacccca 
taaccaagtg 



60 

120 

180 

240 

300 

360 

420 

480 

540 

549 



<210> 259 
<211> 180 
<212> PRT 
<213> SHRIMP 



<400> 259 



Met 


Gly 


Glu 


Ser 


He 


Phe 


Asp 


Ala 


Val 


Ser 


Leu 


Ala 


Thr 


Asn 


Asn 


Pro 


1 








5 










10 










15 




Lys 


Lys 


Ser 


Asn 


Ser 


Arg 


Asn 


Lys 


Lys 


Leu 


Leu 


Arg 


Glu 


Leu 


Lys 


Asn 








20 










25 










30 






Met 


Arg 


Lys 


Asp 


Phe 


Pro 


Ser 


Thr 


Phe 


Leu 


Gin 


Cys 


Arg 


Met 


He 


Asp 






35 










40 










45 








Phe 


His 


Phe 


Ser 


Gly 


Asp 


He 


He 


Asp 


Lys 


His 


Tyr 


Cys 


His 


Ser 


Val 




50 










55 










60 










Asn 


Val 


Pro 


Asp 


Val 


Val 


Pro 


Asn 


Thr 


He 


Phe 


Ala 


Val 


Phe 


Leu 


Pro 


65 










70 










75 










80 


Glu 


Glu 


Asp 


Arg 


Ala 


Asn 


Asn 


Pro 


Gly 


Asp 


Ser 


He 


Glu 


Gly 


Val 


Cys 










85 










90 










95 




lie 


Thr 


Val 


Glu 


Gin 


Gly 


Glu 


Leu 


Cys 


He 


He 


Asn 


Lys 


Ser 


Ser 


Val 








100 










105 










110 






His 


Glu 


Phe 


Asn 


He 


Leu 


Val 


Ser 


Leu 


His 


Lys 


Asp 


Leu 


Phe 


Gly 


Glu 






115 










120 










125 








Asp 


He 


Leu 


Asp 


Gly 


He 


Glu 


Thr 


Ala 


Ser 


Arg 


Glu 


Glu 


Ser 


Arg 


Ser 




130 










135 










140 










lie 


His 


Leu 


Tyr 


Leu 


Glu 


Ala 


Gly 


Gin 


Ser 


He 


Arg 


Thr 


Pro 


He 


Pro 


145 










150 










155 










160 


Arg 


Pro 


Glu 


Gly 


Thr 


Asn 


Thr 


Val 


Asn 


Tyr 


Thr 


He 


Val 


Phe 


Ser 


Asn 



165 



170 



175 



Gin Val Thr 



Val 
180 



<210> 260 
<211> 3543 
<212> DNA 
<213> SHRIMP 



<400> 260 

atggaagcgg 

gaggacgata 

gaggaaatga 

gacacgggag 

tttgtgagag 

gaggaagagg 

gaggaagaag 

gctgaccctg 

aaccagtgga 



catcaaatct 
tcaataataa 
acgaggatga 
ttagaaatgg 
atgtcactaa 
aagatgaaga 
aaggagatga 
cgttattggc 
aaagtatagt 



gcgtataact 
tgttgattat 
ggaggaggag 
aagaaataaa 
tgacatgtac 
gggagaagaa 
atatgaagat 
gctagcggca 
gaacaccccc 



gaaggtgcgg 
agtaact tgt 
gaggaggagg 
gatcctccat 
gatgatgatg 

gggggtgaat 

gacaatgaag 
caacaagaag 
tcgccagtag 



gagtgttgga 
atgaagacga 
at tacgaaga 
cttctaaaaa 
atgaagaaga 
atgacggaaa 
gtgaagggga 
atgcgacaat 
ggccaaatag 



tatcgacaac 
agaggatgag 
tgaagatgag 
acaaagtaaa 
agaggaagag 
tctagaagac 
agaagatgag 
tatacctgaa 
gcaagttctt 



60 

120 

180 

240 

300 

360 

420 

480 

540 
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cccatgttga 
cagaagaaca 
gaagaggaag 
gaaaaagaac 
aatgaaagcg 
gatggaatgg 
gaagaagagg 
gaagagcatg 
gaagaatatg 
actatgcatc 
gagtgcgagg 
ataaaggaac 
gaacaatatg 
cctccagaca 
atgcctcctc 
accaatattc 
aacaacaaca 
gggaaagagg 
gtaaaaactc 
ttccaagatc 
atcatcgctg 
aaatctgccg 
atcatgtcca 
gctagcaaaa 
gaattaaacg 
gacgagtcta 
cattcttcat 
ctgaaagaag 
ttcctaaaca 
aatatgcttt 
gcccaagagg 
tcatctaaac 
ggttctatag 
aacaaactat 
ttaaattggc 
atttctagtt 
tttgataagg 
agtgaagacg 
gataatataa 
gcgtattctg 
ggcgggggaa 
cctatttttg 
ctaagcgcag 
ataaaatata 
tctacatctg 
tctccctcca 
gttggtagga 
agcagtgtgt 
cttttgaata 
acaaaatatt 
taa 



actttttact 
aagaagatga 
aacaggagga 
ctatagaaca 
tgcactctca 
attatttctt 
atgaagaaga 
tagaaactaa 
aagaatatga 
aatttaacaa 
atgcagttgt 
cttcaccaaa 
accaactaaa 
atgaaatggg 
ctcctcctcc 
ttcttcctcc 
atccatcctt 
ccgaagaaag 
ttctagaaac 
cagaaaaccc 
cagaattaac 
agctgttcag 
accctctttt 
taagaccatc 
aggatgaaac 
ttgctggagg 
tttatagtaa 
acgacgataa 
acctcatatc 
cggacatagt 
attttgaaaa 
taaaacagat 
gggcagaaat 
gtatggacgt 
caaaagagtt 
gttcttctaa 
cagtcgatga 
gcaacatgtt 
gatacgggga 
gagtgagtaa 
ggtttggtgt 
ataatgcatt 
tagtgatcaa 
agctaaatca 
ttgtgcctat 
ccccagcagc 
aaagaacgtt 
atattgacca 
acaaatctag 
acaattcttc 



tgaaaatgtg 
taaccaacaa 
ggaagaggaa 
agaaaaaaat 
tagagttgaa 
ctcttcaata 
gggagaggaa 
agagtctgtt 
agatgaatct 
tgcgcccagg 
gttccctcca 
agcacctaga 
cgatatagca 
agaagaaagt 
acctccacca 
ccctccacct 
tttgagtaca 
attacacaag 
aacaaagaac 
agtaaaacca 
catgaaggca 
gaaaacaaat 
ttcagcagca 
gactcctgct 
ttccatgtct 
aggaggagga 
tcaaactcaa 
tcaaccatgt 
ctttaaaaca 
gaccaaggca 
acatcaaact 
gagtaaagaa 
ttgtttcggg 
gggaaggtta 
cactttgtca 
aatggctgtc 
agaggatgaa 
ttcagagaag 
cagtaatgac 
caataacagc 
agcatacagt 
aaatacacta 
acttttaaag 
agcatctgag 
gcgaaactta 
agcaacagca 
ttctttttca 
agaatcttcc 
cgtcaattct 
atccaattca 



aacgccatgg 
atagaacccg 
gaggaagaag 
gaacccgaga 
tccagcccta 
gcaggtggcg 
gaggaggagg 
caatctcaca 
cggcacacac 
gttcgaagat 
atcatgaaag 
atgttttcta 
ccaccgcctg 
agagacatca 
ccacctcatc 
ccacctacaa 
gtggttggaa 
actatggagt 
cttcaatgct 
tccgagaaag 
tttttagatt 
gagaaattgg 
tacgcgtcaa 
ccttctctta 
tcatctgctg 
ggaggaggag 
gcaaatttgc 
caaacgtata 
tcaagtgctg 
tcagtggcac 
gaaacaaatg 
tctgcaaaca 
gcaatttcta 
acaattttct 
aaagattata 
cctcccattt 
gatggtaatg 
gacaagaagg 
cgtacatcat 
aagaatagta 
gcaggctcat 
gtcaatttca 
aaagctaaat 
aaatacaata 
atgtaccgtc 
atggacgtgc 
aatgacatca 
acaccttcta 
ctagcaaagc 
gaagacgacg 



gcggatcagc 
tagaagagga 
aacaggagga 
aggatgaaga 
tgagtgaagg 
gcaa tga taa 
aggaacctgc 
cggaatatat 
tcgaggatga 
ctccccctcc 
agacagatat 
ttcttggtag 
ttccatctat 
tggaccaaga 
aacctcctca 
atcagtcctt 
aggttaacaa 
ctataatctt 
ctgaactggt 
taatggagcg 
ctgctgcggt 
agttgtttca 
cttatataat 
aaaaggttga 
gtggagtatg 
gaggaggaga 
acatggaact 
agcttggtca 
tttcttggtc 
tgttcggaga 
a tgtatcaga 
taatggaaga 
caatcataga 
tgaacatacc 
aagttcttct 
acgtcttaaa 
gaagtgaggc 
aagcaattcg 
taaaccac tt 
tgtttgatct 
caatcatcga 
tggacaagag 
tgtccattga 
aaaagggtaa 
ct tctaaaaa 
ctagcag tgt 
attccaacat 
gacggaggac 
aagtgaaaag 
atgaagacga 



aggtgaagaa 
agaggatgaa 
ggaagaggaa 
tgcaatagaa 
aggtaatgat 
tgaagaggat 
acaaaaatct 
tgaagaagag 
agaaataagt 
agatatacaa 
tctcccccag 
tggaggtgag 
cgtgactttt 
ttcgatgctg 
actcaaacct 
attttcaaac 
cacattaggt 
aaaaacaagg 
gaaagttgtg 
ccttaaaaat 
gacagacatt 
gaggaagcaa 
gggggaaaga 
aagtatatct 
tgctgaaggc 
ggtggtggag 
gataaatata 
aagactggcg 
tagattagtc 
caccaacaaa 
tttgtctacc 
aatggggtta 
gaaacacatc 
aattgtacta 
tttggatagt 
cagtatacaa 
agagaaaaga 
tcgagtatac 
tttcggtgac 
ccagacacaa 
acacagatcc 
gaaacacttg 
agtgtactgc 
gcacggaaaa 
tcaagatgtt 
atcttctcat 
gagtagcgct 
ttttatggac 
aatgaagcac 
ccaatacgag 



600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3543 



<210> 261 
<211> 1174 
<212> PRT 
<213> SHRIMP 

<400> 261 

Met Glu Ala Asn Leu Arg lie Thr Glu Gly Ala Gly Val Leu Asp lie 

15 10 15 

Asp Asn Glu Asp Asp lie Asn Asn Asn Val Asp Tyr Ser Asn Leu Tyr 
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20 










25 










30 






Glu 


Asp 


Glu 


Glu 


Asp 


Glu 


Glu 


Glu 


Met 


Asn 


Glu 


Asp 


Glu 


Glu 


Glu 


Glu 






35 










40 










45 








Glu 


Glu 


Glu 


Asp 


Tyr 


Glu 


Asp 


Glu 


Asp 


Glu 


Asp 


Thr 


Gly 


Val 


Arg 


Asn 




50 










55 










60 










Gly 


Arg 


Asn 


Lys 


Asp 


Pro 


Pro 


Ser 


Ser 


Lys 


Lys 


Gin 


Ser 


Lys 


Phe 


Val 


65 










70 










75 










80 


Arg 


Asp 


Val 


Thr 


Asn 


Asp 


Met 


Tyr 


Asp 


Asp 


Asp 


Asp 


Glu 


Glu 


Glu 


Glu 










85 










90 










95 




Glu 


Glu 


Glu 


Glu 


Glu 


Glu 


Asp 


Glu 


Glu 


Gly 


Glu 


Glu 


Gly 


Gly 


Glu 


Tyr 








100 










105 










110 






Asp 


Gly 


Asn 


Leu 


Glu 


Asp 


Glu 


Glu 


Glu 


Glu 


Gly 


Asp 


Glu 


Tyr 


Glu 


Asp 






115 










120 










125 








Asp 


Asn 


Glu 


Gly 


Glu 


Gly 


Glu 


Glu 


Asp 


Glu 


Ala 


Asp 


Pro 


Ala 


Leu 


Leu 




130 










135 










140 










Ala 


Ala 


Gin 


Gin 


Glu 


Asp 


Ala 


Thr 


He 


He 


Pro 


Glu 


Asn 


Gin 


Trp 


Lys 


145 










150 










155 










160 


Ser 


He 


Val 


Asn 


Thr 


Pro 


Ser 


Pro 


Val 


Gly^x 


Pro 


Asn 


Arg 


Gin 


Val 


Leu 










165 










170 










175 




Pro 


Met 


Leu 


Asn 


Phe 


Leu 


Leu 


Glu 


Asn 


Val 


Asn 


Ala 


Met 


Gly 


Gly 


Ser 








180 










185 










190 






Ala 


Gly 


Glu 


Glu 


Gin 


Lys 


Asn 


Lys 


Glu 


Asp 


Asp 


Asn 


Gin 


Gin 


He 


Glu 






195 










200 










205 








Pro 


Val 


Glu 


Glu 


Glu 


Glu 


Asp 


Glu 


Glu 


Glu 


Glu 


Glu 


Gin 


Glu 


Glu 


Glu 




210 










215 










220 










Glu 


Glu 


Glu 


Glu 


Glu 


Glu 


Gin 


Glu 


Glu 


Glu 


Glu 


Glu 


Glu 


Lys 


Glu 


Pro 


225 










230 










235 










240 


lie 


Glu 


Gin 


Glu 


Lys 


Asn 


Glu 


Pro 


Glu 


Lys 


Asp 


Glu 


Asp 


Ala 


He 


Glu 










245 










250 










255 




Asn 


Glu 


Ser 


Val 


His 


Ser 


His 


Arg 


Val 


Glu 


Ser 


Ser 


Pro 


Met 


Ser 


Glu 








260 










265 










270 






Gly 


Gly 


Asn 


Asp 


Asp 


Gly 


Met 


Asp 


Tyr 


Phe 


Phe 


Ser 


Ser 


He 


Ala 


Gly 






275 










280 










285 








Gly 


Gly 


Asn 


Asp 


Asn 


Glu 


Glu 


Asp 


Glu 


Glu 


Glu 


Asp 


Glu 


Glu 


Glu 


Gly 




290 










295 










300 










Glu 


Glu 


Glu 


Glu 


Glu 


Glu 


Glu 


Pro 


Ala 


Gin 


Lys 


Ser 


Glu 


Glu 


His 


Val 


305 










310 










315 










320 


Glu 


Thr 


Lys 


Glu 


Ser 


Val 


Gin 


Ser 


His 


Thr 


Glu 


Tyr 


He 


Glu 


Glu 


Glu 










325 










330 










335 




Glu 


Glu 


Tyr 


Glu 


Glu 


Tyr 


Glu 


Asp 


Glu 


Ser 


Arg 


His 


Thr 


Leu 


Glu 


Asp 








340 










345 










350 






Glu 


Glu 


He 


Ser 


Thr 


Met 


His 


Gin 


Phe 


Asn 


Asn 


Ala 


Pro 


Arg 


Val 


Arg 






355 










360 










365 








Arg 


Ser 


Pro 


Pro 


Pro 


Asp 


He 


Gin 


Glu 


Cys 


Glu 


Asp 


Ala 


Val 


Val 


Phe 




370 










375 










380 










Pro 


Pro 


Tie 


Met 


Lys 


Glu 


Thr 


Asp 


He 


Leu 


Pro 


Gin 


He 


Lys 


Glu 


Pro 


385 










390 










395 










400 


Ser 


Pro 


Lys 


Ala 


Pro 


Arg 


Met 


Phe 


Ser 


He 


Leu 


Gly 


Ser 


Gly 


Gly 


Glu 










405 










410 










415 




Glu 


Gin 


Tyr 


Asp 


Gin 


Leu 


Asn 


Asp 


He 


Ala 


Pro 


Pro 


Pro 


Val 


Pro 


Ser 








420 










425 










430 






lie 


Val 


Thr 


Phe 


Pro 


Pro 


Asp 


Asn 


Glu 


Met 


Gly 


Glu 


Glu 


Ser 


Arg 


Asp 






435 










440 










445 








He 


Met 


Asp 


Gin 


Asp 


Ser 


Met 


Leu 


Met 


Pro 


Pro 


Pro 


Pro 


Pro 


Pro 


Pro 




450 










455 










460 










Pro 


Pro 


Pro 


Pro 


His 


Gin 


Pro 


Pro 


Gin 


Leu 


Lys 


Pro 


Thr 


Asn 


He 


Leu 


465 










470 










475 










480 


Leu 


Pro 


Pro 


Pro 


Pro 


Pro 


Pro 


Pro 


Thr 


Asn 


Gin 


Ser 


Leu 


Phe 


Ser 


Asn 










485 










490 










495 




Asn 


Asn 


Asn 


Asn 


Pro 


Ser 


Phe 


Leu 


Ser 


Thr 


Val 


Val 


Gly 


Lys 


Val 


Asn 



500 505 510 
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Asn 


Thr 


Leu 


Gly 


Gly 


Lys 


Glu 


Ala 


Glu 


Glu 


Arg 


Leu 


His 


Lys 


Thr 


Met 






515 










520 










525 








Glu 


Ser 


He 


He 


Leu 


Lys 


Thr 


Arg 


Val 


Lys 


Thr 


Leu 


Leu 


Glu 


Thr 


Thr 




530 










535 










540 










Lys 


Asn 


Leu 


Gin 


Cys 


Ser 


Glu 


Leu 


Val 


Lys 


Val 


Val 


Phe 


Gin 


Asp 


Pro 


545 










550 










555 










560 


Glu 


Asn 


Pro 


Val 


Lys 


Pro 


Ser 


Glu 


Lys 


Val 


Met 


Glu 


Arg 


Leu 


Lys 


Asn 










565 










570 










575 




He 


He 


Ala 


Ala 


Glu 


Leu 


Thr 


Met 


Lys 


Ala 


Phe 


Leu 


Asp 


Ser 


Ala 


Ala 








580 










585 










590 






Val 


Thr 


Asp 


He 


Lys 


Ser 


Ala 


Glu 


Leu 


Phe 


Arg 


Lys 


Thr 


Asn 


Glu 


Lys 






595 










600 










605 








Leu 


Glu 


Leu 


Phe 


Gin 


Arg 


Lys 


Gin 


He 


Met 


Ser 


Asn 


Pro 


Leu 


Phe 


Ser 




610 










615 










620 










Ala 


Ala 


Tyr 


Ala 


Ser 


Thr 


Tyr 


He 


Met 


Gly 


Glu 


Arg 


Ala 


Ser 


Lys 


He 


625 










630 










635 










640 


Arg 


Pro 


Ser 


Thr 


Pro 


Ala 


Pro 


Ser 


Leu 


Lys 


Lys 


Val 


Glu 


Ser 


He 


Ser 










645 










650 










655 




Glu 


Leu 


Asn 


Glu 


Asp 


Glu 


Thr 


Ser 


Met 


Ser 


Ser 


Ser 


Ala 


Gly 


Gly 


Val 








660 










665 










670 






Cys 


Ala 


Glu 


Gly 


Asp 


Glu 


Ser 


He 


Ala 


Gly 


Gly 


Gly 


Gly 


Gly 


Gly 


Gly 






675 










680 










685 








Gly 


Gly 


Gly 


Gly 


Glu 


Val 


Val 


Glu 


His 


Ser 


Ser 


Phe 


Tyr 


Ser 


Asn 


Gin 




690 










695 










700 










Thr 


Gin 


Ala 


Asn 


Leu 


His 


Met 


Glu 


Leu 


He 


Asn 


He 


Leu 


Lys 


Glu 


Asp 


705 










710 










715 










720 


Asp 


Asp 


Asn 


Gin 


Pro 


Cys 


Gin 


Thr 


Tyr 


Lys 


Leu 


Gly 


Gin 


Arg 


Leu 


Ala 










725 










730 










735 




Phe 


Leu 


Asn 


Asn 


Leu 


He 


Ser 


Phe 


Lys 


Thr 


Ser 


Ser 


Ala 


Val 


Ser 


Trp 








740 










745 










750 






Ser 


Arg 


Leu 


Val 


Asn 


Met 


Leu 


Ser 


Asp 


He 


Val 


Thr 


Lys 


Ala 


Ser 


Val 






755 










760 










765 








Phe 


Gly 


Asp 


Thr 


Asn 


Lys 


Ala 


Gin 


Glu 


Asp 


Phe 


Glu 


Lys 


His 


Gin 


Thr 




770 










775 










780 










Glu 


Thr 


Asn 


Asp 


Val 


Ser 


Asp 


Leu 


Ser 


Thr 


Ser 


Ser 


Lys 


Leu 


Lys 


Gin 


785 










790 










795 










800 


Met 


Ser 


Lys 


Glu 


Ser 


Ala 


Asn 


He 


Met 


Glu 


Glu 


Met 


Gly 


Leu 


Gly 


Ser 










805 










810 










815 




He 


Gly 


Ala 


Glu 


He 


Cys 


Phe 


Gly 


Ala 


He 


Ser 


Thr 


He 


He 


Glu 


Lys 








820 










825 










830 






His 


He 


Asn 


Lys 


Leu 


Cys 


Met 


Asp 


Val 


Gly 


Arg 


Leu 


Thr 


He 


Phe 


Leu 






835 










840 










845 








Asn 


He 


Pro 


He 


Val 


Leu 


Leu 


Asn 


Trp 


Pro 


Lys 


Glu 


Phe 


Thr 


Leu 


Ser 




850 










855 










860 










Lys 


Asp 


Tyr 


Lys 


Val 


Leu 


Leu 


Leu 


Asp 


Ser 


He 


Ser 


Ser 


Cys 


Ser 


Ser 


865 










870 










875 










880 


Lys 


Met 


Ala 


Val 


Pro 


Pro 


He 


Tyr 


Val 


Leu 


Asn 


Ser 


He 


Gin 


Phe 


Asp 










885 










890 










895 




Lys 


Ala 


Val 


Asp 


Glu 


Glu 


Asp 


Glu 


Asp 


Gly 


Asn 


Gly 


Ser 


Glu 


Ala 


Glu 








900 










905 










910 






Lys 


Arg 


Ser 


Glu 


Asp 


Gly 


Asn 


Met 


Phe 


Ser 


Glu 


Lys 


Asp 


Lys 


Lys 


Glu 






915 










920 










925 








Ala 


He 


Arg 


Arg 


Val 


Tyr 


Asp 


Asn 


He 


Arg 


Tyr 


Gly 


Asp 


Ser 


Asn 


Asp 




930 










935 










940 










Arg 


Thr 


Ser 


Leu 


Asn 


His 


Phe 


Phe 


Gly 


Asp 


Ala 


Tyr 


Ser 


Gly 


Val 


Ser 


945 










950 










955 










960 


Asn 


Asn 


Asn 


Ser 


Lys 


Asn 


Ser 


Met 


Phe 


Asp 


Leu 


Gin 


Thr 


Gin 


Gly 


Gly 










965 










970 










975 




Gly 


Arg 


Phe 


Gly 


Val 


Ala 


Tyr 


Ser 


Ala 


Gly 


Ser 


Ser 


He 


He 


Glu 


His 








980 










985 










990 






Arg 


Ser 


Pro 


He 


Phe 


Asp 


Asn 


Ala 


Leu 


Asn 


Thr 


Leu 


Val 


Asn 


Phe 


Met 
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995 1000 1005 

Asp Lys Arg Lys His Leu Leu Ser Ala Val Val lie Lys Leu Leu Lys 

1010 1015 1020 

Lys Ala Lys Leu Ser lie Glu Val Tyr Cys lie Lys Tyr Lys Leu Asn 
1025 1030 1035 1040 

Gin Ala Ser Glu Lys Tyr Asn Lys Lys Gly Lys His Gly Lys Ser Thr 

1045 1050 1055 

Ser Val Val Pro Met Arg Asn Leu Met Tyr Arg Pro Ser Lys Asn Gin 

1060 1065 1070 

Asp Val Ser Pro Ser Thr Pro Ala Ala Ala Thr Ala Met Asp Val Pro 

1075 1080 1085 

Ser Ser Val Ser Ser His Val Gly Arg Lys Arg Thr Phe Ser Phe Ser 

1090 1095 1100 

Asn Asp lie Asn Ser Asn Met Ser Ser Ala Ser Ser Val Tyr lie Asp 
1105 1110 1115 1120 

Gin Glu Ser Ser Thr Pro Ser Arg Arg Arg Thr Phe Met Asp Leu Leu 

1125 1130 1135 

Asn Asn Lys Ser Ser Val Asn Ser Leu Ala Lys Gin Val Lys Arg Met 

1140 1145 1150 

Lys His Thr Lys Tyr Tyr Asn Ser Ser Ser Asn Ser Glu Asp Asp Asp 

1155 1160 1165 

Glu Asp Asp Gin Tyr Glu 
1170 



<210> 262 
<211> 786 
<212> DNA 
<213> SHRIMP 



<400> 262 

atgtctaatg 

gaaagacgtg 

aacgaattgt 

ctggatgtga 

aaacaaggaa 

caaaatcttg 

caacaagaaa 

ggtgacgatg 

acggcaaaat 

cgtagtgacg 

gtagtcatat 

aaatctagat 

cgtataaaaa 

gagtaa 



gcgcaactat 
gtgtttctaa 
tgaagaacac 
gatctcgcct 
tagatgttag 
tacattggta 
ctggtcaagt 
gagaatttac 
tgtttcgttc 
tagttaatga 
tctttaatgt 
ataaacacac 
gaggcgatac 



aagcgatgag 
tctatctgaa 
tgtaagacat 
agaagacata 
acaaataata 
tcggccccca 
agggcggtgt 
aaggtacgat 
cataaatgat 
cgtgctaagc 
gattgttgaa 
ccatatacta 

ssggg^cgat 



<210> 263 
<211> 261 
<212> PRT 
<213> SHRIMP 



aggttgatcc 
ttgttgatac 
ggagacagag 
aaaaaggatt 
actgattact 
atatcttgca 
agtgttgcta 
ttctccattc 
gaggatgtag 
tttgaagcat 
gggaagagta 
aacggagaat 
atattgtttg 



tcatccttga 
accccataac 
tttacatgaa 
gtgttttaaa 
tggctaaacg 
cagatataga 
cgtacaattt 
ccttgggaga 
atgcagtgat 
ttaatcgaac 
aagatattga 
ctgcaacata 
caatcactgc 



taaaattgta 
caaacacata 
ggatgcagaa 
ggcaattgaa 
aaaactaacg 
cgaaaaaatt 
gagaattggt 
ttttaaaata 
tcttgtgtct 
aggagaacgc 
tattgtatgt 
cgctgttaaa 
ttttaaggag 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

786 



<400> 263 

Met Ser Asn Gly Ala Thr lie Ser Asp Glu Arg Leu lie Leu lie Leu 

15 10 15 

Asp Lys lie Val Glu Arg Arg Gly Val Ser Asn Leu Ser Glu Leu Leu 

20 25 30 

lie His Pro lie Thr Lys His lie Asn Glu Leu Leu Lys Asn Thr Val 

35 40 45 

Arg His Gly Asp Arg Val Tyr Met Lys Asp Ala Glu Leu Asp Val Arg 

50 55 60 

Ser Arg Leu Glu Asp lie Lys Lys Asp Cys Val Leu Lys Ala lie Glu 
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65 70 75 80 

Lys Gin Gly lie Asp Val Arg Gin lie lie Thr Asp Tyr Leu Ala Lys 

85 90 95 

Arg Lys Leu Thr Gin Asn Leu Val His Trp Tyr Arg Pro Pro lie Ser 

100 105 110 

Cys Thr Asp lie Asp Glu Lys lie Gin Gin Glu Thr Gly Gin Val Gly 

115 120 125 

Arg Cys Ser Val Ala Thr Tyr Asn Leu Arg lie Gly Gly Asp Asp Gly 

130 135 140 

Glu Phe Thr Arg Tyr Asp Phe Ser lie Pro Leu Gly Asp Phe Lys lie 
145 150 155 160 

Thr Ala Lys Leu Phe Arg Ser lie Asn Asp Glu Asp Val Asp Ala Val 

165 170 175 

lie Leu Val Ser Arg Ser Asp Val Val Asn Asp Val Leu Ser Phe Glu 

180 185 190 

Ala Phe Asn Arg Thr Gly Glu Arg Val Val He Phe Phe Asn Val He 

195 200 205 

Val Glu Gly Lys Ser Lys Asp He Asp He Val Cys Lys Ser Arg Tyr 

210 215 220 

Lys His Thr His He Leu Asn Gly Glu Ser Ala Thr Tyr Ala Val Lys 
225 230 235 240 

Arg He Lys Arg Gly Asp Thr Arg Asp Asp He Leu Phe Ala He Thr 

245 250 255 

Ala Phe Lys Glu Glu 
260 



<210> 264 
<2H> 852 
<212> DNA 
<213> SHRIMP 



<400> 264 

atgtcgtcta 

caattgggag 

tatcgcttag 

gatcaagtaa 

tacgaagacg 

gttatggaaa 

actgaaattt 

ttttctttca 

ataaacgata 

gat ttaagtt 

cgcaaatcca 

ctaagaaata 

tatattagtc 

acaat tcaca 

tttgtttttt 



acggagatga 
ctgctcacca 
gtgcaattgc 
ggcaagaaat 
ctttcaatcc 
aagtatacga 
taaaagagga 
agtacaatac 
aacacatgtt 
tatcgttcaa 
cgttcgctgg 
gtgattcttc 
atgatacaaa 
tgcctgaaat 
ag 



<210> 265 
<211> 283 
<212> PRT 
<213> SHRIMP 



gcctgctgtg 
tgacaattct 
ctattcggta 
caatagccat 
tgttatctgt 
tgtgaataga 
gcggtggaga 
tattgccttt 
atcaatcact 
aatagattct 
tattgtaaac 
tgataacata 
tggcataatc 
cgaaaacgcg 



actgaagctg 
tggatcacaa 
gcaaaaaatg 
ttaactaatg 
gaagcaattt 
cggatcatgg 
agatatatac 
accgactctt 
ccccctggag 
tcagccaaaa 
agaccaaaaa 
ggtgattatc 
aacccatccg 
agtgatgatt 



aaatcgcttc 
gaaagagtga 
cctcta taaa 
taatgact tt 
ttgagaaagg 
aacccaggga 
c tggtttt ta 
caacttcatt 
ctgctcaagg 
ctctcacgtt 
gtgtagtgat 
taaagagaaa 
aggattcggc 
tatacataga 



agtggaggct 
ccaattaaag 
atatatagag 
tgaacatctt 
aataccagtg 
agatttcata 
tcatacatca 
tagtgtacca 
ggatttaatt 
agaatttaac 
attatcaaat 
tgatcctata 
ctctctcatt 
tttcaatctg 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

852 



<400> 265 

Met Ser Ser Asn Gly Asp Glu Pro Ala Val Thr Glu Ala Glu He Ala 

15 10 15 

Ser Val Glu Ala Gin Leu Gly Ala Ala His His Asp Asn Ser Trp He 

20 25 30 

Thr Arg Lys Ser Asp Gin Leu Lys Tyr Arg Leu Gly Ala He Ala Tyr 
35 40 45 



wo 01/38351 



357 



PCT/USOO/28888 



Ser Val Ala Lys Asn Ala Ser lie Lys Tyr lie Glu Asp Gin Val Arg 

50 55 60 

Gin Glu lie Asn Ser His Leu Thr Asn Val Met Thr Phe Glu His Leu 
65 70 75 80 

Tyr Glu Asp Ala Phe Asn Pro Val lie Cys Glu Ala lie Phe Glu Lys 

85 90 95 

Gly lie Pro Val Val Met Glu Lys Val Tyr Asp Val Asn Arg Arg lie 

100 105 110 

Met Glu Pro Arg Glu Asp Phe lie Thr Glu lie Leu Lys Glu Glu Arg 

115 120 125 

Trp Arg Arg Tyr lie Pro Gly Phe Tyr His' Thr Ser Phe Ser Phe Lys 

130 135 140 

Tyr Asn Thr lie Ala Phe Thr Asp Ser Ser Thr Ser Phe Ser Val Pro 
145 150 155 160 

lie Asn Asp Lys His Met Leu Ser lie Thr Pro Pro Gly Ala Ala Gin 

165 170 175 

Gly Asp Leu lie Asp Leu Ser Leu Ser Phe Lys lie Asp Ser Ser Ala 

180 185 190 

Lys Thr Leu Thr Leu Glu Phe Asn Arg Lys Ser Thr Phe Ala Gly lie 

195 200 205 

Val Asn Arg Pro Lys Ser Val Val lie Leu Ser Asn Leu Arg Asn Ser 

210 215 220 

Asp Ser Ser Asp Asn lie Gly Asp Tyr Leu Lys Arg Asn Asp Pro lie 
225 230 235 240 

Tyr lie Ser His Asp Thr Asn Gly lie lie Asn Pro Ser Glu Asp Ser 

245 250 255 

Ala Ser Leu lie Thr lie His Met Pro Glu lie Glu Asn Ala Ser Asp 

260 265 270 

Asp Leu Tyr lie Asp Phe Asn Leu Phe Val Phe 
275 280 



<210> 266 
<211> 1302 
<212> DNA 
<213> SHRIMP 



<400> 266 

atggcacttt 

ggagcttcta 

aatgccaaaa 

gaaaaaactg 

atcaatgaga 

gctgaggatg 

aacgtatggt 

ataaacgtcc 

acgaaccatg 

a tcacttacg 

gttgatcctt 

gaaatggtga 

caacctacac 

ggaggttt ta 

aaca tgaaac 

gttggat tag 

aaagcccgga 

aacaaaacag 

tcagactatt 

aataaaaatg 

ctggacattt 

agacctatta 



caaacaatgg 
ttgccctctt 
caaaaaagag 
acgacgataa 
cagcgcttca 
aattcttcac 
taa taaagga 
ccgatgtcgg 
acgaagtcaa 
aacaactcgt 
ctgatgaaaa 
tgttgacaac 
ctgattacct 
tcagtgatcg 
agaggacatt 
aacccgtgct 
tagagttttc 
aggactgttt 
ggataagctc 
atgaaacaag 
ttatgacaat 
cagataataa 



aggaatatac 
ctttgctatc 
taagaaatat 
taataataat 
acgtcaaacg 
caaattagca 
taaaataaca 
acaggcaatt 
ggaaacgatg 
aatagacttg 
tggggatgaa 
aacaccttct 
tgcccggtac 
tgatatgcgc 
taatgttatt 
acaaaaacaa 
atttgtagta 
ttgtagttta 
tgcaagcaca 
tttcttctat 
agattgtgca 
tataatggtt 



attgtttttg 
tcgggcgtag 
aaattagact 
ggaggaggag 
agagagcatt 
gatcaggaat 
gatggaaaag 
gctgatgaaa 
gatgaagttg 
accaatattt 
agcctacaga 
tcacaactag 
tcaaaggaat 
acttggcaag 
agtgcagcaa 
ggtagagcag 
gaaggtaacc 
ctccccaact 
gctaaggaaa 
aactttgagg 
cctaatcttc 
gcactgtcat 



cggttattgt 
ggaagggaac 
ctaaatacac 
ggggaggaac 
ttgcaagaac 
ttgacacata 
tt tcaatccc 
acttgttcga 
ttgcacaaaa 
tattgtttgg 
gatcaacaga 
ctagacaaca 
tggtgataaa 
gacgaatgtc 
cgaatctgga 
ctgtgggagg 
gtgtacgggt 
gttataatgt 
aaacgtactt 
aaggtgttga 
ctttcattaa 
aa 



tttggtaata 
tctacattca 
tgacgatgat 
agttgatgtt 
tcttgaaaaa 
caaa tcagaa 
tgaaggtgac 
tctcataggg 
atctaccaat 
tacagtaaca 
cccagacgca 
acaacctcct 
taatatacga 
tgtacatgtc 
ttctctacaa 
acgtattgaa 
atacgctaca 
taaaaaggca 
gtttattgct 
agaaattgac 
aaatttacca 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1302 
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<210> 267 
<211> 431 
<212> PRT 
<213> SHRIMP 



<400> 267 



Met 


Ala 


Leu 


Ser 


Asn 


Asn 


Gly 


Gly 


He 


Tyr 


1 








5 










10 


Val 


Leu 


Val 


lie 


Gly 


Ala 


Ser 


He 


Ala 


Leu 








20 










25 




Val 


Gly 


Lys 


Gly 


Thr 


Leu 


His 


Ser 


Asn 


Ala 






35 










40 






Lys 


Tyr 


Lys 


Leu 


Asp 


Ser 


Lys 


Tyr 


Thr 


Asp 




50 










55 








Asp 


Asp 


Asn 


Asn 


Asn 


Asn 


Gly 


Gly 


Gly 


Gly 


65 










70 , 










lie 


Asn 


Glu 


Thr 


Ala 


Leu 


Gin 


Arg 


Gin 


Thr 










85 










90 


Thr 


Leu 


Glu 


Lys 


Ala 


Glu 


Asp 


Glu 


Phe 


Phe 








100 










105 




Glu 


Phe 


Asp 


Thr 


Tyr 


Lys 


Ser 


Glu 


Asn 


Val 






115 










120 






lie 


Thr 


Asp 


Gly 


Lys 


Val 


Ser 


He 


Pro 


Glu 




130 










135 








Asp 


Val 


Gly 


Gin 


Ala 


He 


Ala 


Asp 


Glu 


Asn 


145 










150 










Thr 


Asn 


His 


Asp 


Glu 


Val 


Lys 


Glu 


Thr 


Met 










165 










170 


Lys 


Ser 


Thr 


Asn 


He 


Thr 


Tyr 


Glu 


Gin 


Leu 








180 










185 




lie 


Leu 


Leu 


Phe 


Gly 


Thr 


Val 


Thr 


Val 


Asp 






195 










200 






Asp 


Glu 


Ser 


Leu 


Gin 


Arg 


Ser 


Thr 


Asp 


Pro 




210 










215 








Leu 


Thr 


Thr 


Thr 


Pro 


Ser 


Ser 


Gin 


Leu 


Ala 


225 










230 










Gin 


Pro 


Thr 


Pro 


Asp 


Tyr 


Leu 


Ala 


Arg 


Tyr 










245 










250 


Asn 


Asn 


lie 


Arg 


Gly 


Gly 


Phe 


He 


Ser 


Asp 








260 










265 




Gin 


Gly 


Arg 


Met 


Ser 


Val 


His 


Val 


Asn 


Met 






275 










280 






Val 


lie 


Ser 


Ala 


Ala 


Thr 


Asn 


Leu 


Asp 


Ser 




290 










295 








Pro 


Val 


Leu 


Gin 


Lys 


Gin 


Gly 


Arg 


Ala 


Ala 


305 










310 










Lys 


Ala 


Arg 


He 


Glu 


Phe 


Ser 


Phe 


Val 


Val 










325 










330 


Val 


Tyr 


Ala 


Thr 


Asn 


Lys 


Thr 


Glu 


Asp 


Cys 








340 










345 




Asn 


Cys 


Tyr 


Asn 


Val 


Lys 


Lys 


Ala 


Ser 


Asp 






355 










360 






Ser 


Thr 


Ala 


Lys 


Glu 


Lys 


Thr 


Tyr 


Leu 


Phe 




370 










375 








Glu 


Thr 


Ser 


Phe 


Phe 


Tyr 


Asn 


Phe 


Glu 


Glu 


385 










390 










Leu 


Asp 


lie 


Phe 


Met 


Thr 


He 


Asp 


Cys 


Ala 










405 










410 


Lys 


Asn 


Leu 


Pro 


Arg 


Pro 


He 


Thr 


Asp 


Asn 








420 










425 





PCT/USOO/28888 



He 


Val 


Phe 


Ala 


Val 


He 










15 




Phe 


Phe 


Ala 


He 


Ser 


Gly 








30 






Lys 


Thr 


Lys 


Lys 


Ser 


Lys 






45 








Asp 


Asp 


Glu 


Lys 


Thr 


Asp 




60 










Gly 


Gly 


Thr 


Val 


Asp 


Val 


75 










80 


Arg 


Glu 


His 


Phe 


Ala 


Arg 










95 




Thr 


Lys 


Leu 


Ala 


Asp 


Gin 








110 






Trp 


Leu 


He 


Lys 


Asp 


Lys 






125 








Gly 


Asp 


He 


Asn 


Val 


Pro 




140 










Leu 


Phe 


Asp 


Leu 


He 


Gly 


155 










160 


Asp 


Glu 


Val 


Val 


Ala 


Gin 










175 




Val 


He 


Asp 


Leu 


Thr 


Asn 








190 






Pro 


Ser 


Asp 


Glu 


Asn 


Gly 






205 








Asp 


Ala 


Glu 


Met 


Val 


Met 




220 










Arg 


Gin 


Gin 


Gin 


Pro 


Pro 


235 










240 


Ser 


Lys 


Glu 


Leu 


Val 


He 










255 




Arg 


Asp 


Met 


Arg 


Thr 


Trp 








270 






Lys 


Gin 


Arg 


Thr 


Phe 


Asn 






285 








Leu 


Gin 


Val 


Gly 


Leu 


Glu 




300 










Val 


Gly 


Gly 


Arg 


He 


Glu 


315 










320 


Glu 


Gly 


Asn 


Arg 


Val 


Arg 










335 




Phe 


Cys 


Ser 


Leu 


Leu 


Pro 








350 






Tyr 


Trp 


He 


Ser 


Ser 


Ala 






365 








He 


Ala 


Asn 


Lys 


Asn 


Asp 




380 










Gly 


Val 


Glu 


Glu 


He 


Asp 


395 










400 


Pro 


Asn 


Leu 


Pro 


Phe 


He 










415 




Asn 


He 


Met 


Val 


Ser 





430 
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<210> 268 
<211> 207 
<212> DNA 
<213> SHRIMP 



<400> 268 

atgtctgata tgaccagaaa catcatcgtc ggcttggccg ttgtcgtcat cgcattgagc 60 
atggtcgctt tcatgctttc tgttactcct gcacttaccg gattcctcct aggtttgggt 120 
gtatcagcac taggagttac actctttgga tgtcccacta tgaaatctcc agggggagga 180 
aatgctacaa tcaaccccgt ggcataa 207 

<210> 269 
<211> 68 
<212> PRT 
<213> SHRIMP 

<400> 269 

Met Ser Asp Met Thr Arg Asn lie lie Val Gly Leu Ala Val Val Val 

15 10 15 

lie Ala Leu Ser Met Val Ala Phe Met Leu Ser Val Thr Pro Ala Leu 

20 25 30 

Thr Gly Phe Leu Leu Gly Leu Gly Val Ser Ala Leu Gly Val Thr Leu 

35 40 45 

Phe Gly Cys Pro Thr Met Lys Ser Pro Gly Gly Gly Asn Ala Thr lie 

50 55 60 

Asn Pro Val Ala 
65 



<2I0> 270 

<211> 552 

<212> DNA 

<213> SHRIMP 



<400> 270 

atgttccaga 

acatgtgtat 

acatcgcatt 

tcctctccca 

gacgaccata 

ggagtacaat 

ttccctgttc 

gaaaatgaag 

gtgtccaaag 

tcaaacaaat 



aatggtttga 
gctcagttta 
cgcatgaggg 
ccagtggaaa 
cgagatggta 
tcatggtata 
acctccaaat 
aagatatgag 
acacagtgct 
aa 



atcgtttctg 
ttcatatttt 
tataaaaatc 
gatgtgtaac 
tgacaaaaat 
cgagttccat 
acacaacagg 
ggtcaggaac 
tgtagagaat 



gattcttccc 
tccccttgtc 
catcctccat 
caccaccaca 
acatcatgca 
ctaacaccaa 
aatactgaga 
at tcatccaa 
gggtttcggt 



gacctagata 
ggaaacatat 
caatattgaa 
agagattgta 
tctatcttga 
agaacaatca 
aaacatccct 
aatccaagat 
acaaggtgaa 



tctggatacg 
aaaattttcc 
ccataatact 
cctgagcact 
agatattgga 
actattctcc 
cctcgtattt 
attgatcccc 
aattgtatta 



60 

120 

180 

240 

300 

360 

420 

480 

540 

552 



<210> 271 

<211> 183 

<212> PRT 

<213> SHRIMP 



<400> 271 

Met Phe Gin Lys Trp Phe Glu Ser Phe Leu Asp Ser Ser Arg Pro Arg 

15 10 15 

Tyr Leu Asp Thr Thr Cys Val Cys Ser Val Tyr Ser Tyr Phe Ser Pro 

20 25 30 

Cys Arg Lys His lie Lys Phe Ser Thr Ser His Ser His Glu Gly lie 

35 40 45 

Lys lie His Pro Pro Ser lie Leu Asn His Asn Thr Ser Ser Pro Thr 
50 55 60 
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Ser 


Gly 


Lys 


Met 


Cys 


Asn 


His 


His 


His 


Lys 


Arg 


Leu 


Tyr 


Leu 


Ser 


Thr 


d c 
DO 










70 










75 










80 


Asp 


Asp 


His 


Thr 


Arg 


Trp 


Tyr 


Asp 


Lys 


Asn 


Thr 


Ser 


Cys 


He 


Tyr 


Leu 










85 










90 










95 




Glu 


Asp 


He 


Gly 


Gly 


Val 


Gin 


Phe 


Met 


Val 


Tyr 


Glu 


Phe 


His 


Leu 


Thr 








100 










105 










110 






Pro 


Lys 


Asn 


Asn 


Gin 


Leu 


Phe 


Ser 


Phe 


Pro 


Val 


His 


Leu 


Gin 


He 


His 






115 










120 










125 








Asn 


Arg 


Asn 


Thr 


Glu 


Lys 


Thr 


Ser 


Leu 


Leu 


Val 


Phe 


Glu 


Asn 


Glu 


Glu 




130 










135 










140 










Asp 


Met 


Arg 


Val 


Arg 


Asn 


He 


His 


Pro 


Lys 


Ser 


Lys 


He 


Leu 


He 


Pro 


145 










150 










155 










160 


Val 


Ser 


Lys 


Asp 


Thr 


Val 


Leu 


Val 


Glu 


Asn 


Gly 


Phe 


Arg 


Tyr 


Lys 


Val 










165 










170 










175 




Lys 


lie 


Val 


Leu 


Ser 


Asn 


Lys 





















180 



<210> 272 
<211> 684 
<212> DNA 
<213> SHRIMP 



<400> 272 

atggactcac 

aacgcggaca 

tgcgagtctc 

gagagtgtac 

ctcaggcgat 

aaggagcttt 

ccattccacc 

ggaaaatcta 

aaagatattg 

ttcctccatt 

acatctgctg 

ccatatgaat 



ttataagcaa 
gcatgttcat 
cattgtacaa 
tgtcttcttc 
tccctgtcct 
acaagcacgg 
ccaatatccc 
tcattatgag 
gccaattctt 
actttgcagg 
atgtgttacg 
tgatgaagaa 



attggaaaac 
gcaaaccatg 
gaacaagtcg 
ttcggacgaa 
acttcattgc 
gaagtttgcc 
cgccatgccg 
tgaagtcatg 
tgctgtattg 
aggaaatacc 
cattat tgct 
ataa 



atattctcca 
ctcctcccta 
ggagggaaga 
aagatgagct 
aactacaagc 
ctcc tcggcg 
tttgataaa t 
accaaggagc 
aatgtaacta 
atgagggatg 
gagataacaa 



<210> 273 
<211> 227 
<212> PRT 
<213> SHRIMP 



ttgccgagca 
ccgacgccat 
atattgtcac 
tcaaagtgct 
agacgaatac 
acctggtgtt 
cccccatttg 
ttttgtacaa 
accccattac 
gggaagggga 
tacagaaaac 



ggactttttc 
gttcaccgat 
cgatgttgga 
gtcccacgta 
gcccctgtgg 
attctccaac 
tgacaccact 
gttggccgac 
tggagattct 
taaaatctgc 
tggcaagatg 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

684 



<400> 273 



Met 


Asp 


Ser 


Leu 


He 


Ser 


Lys 


Leu 


Glu 


Asn 


He 


Phe 


Ser 


He 


Ala 


Glu 


1 








5 










10 










15 




Gin 


Asp 


Phe 


Phe 
20 


Asn 


Ala 


Asp 


Ser 


Met 
25 


Phe 


Met 


Gin 


Thr 


Met 
30 


Leu 


Leu 


Pro 


Thr 


Asp 
35 


Ala 


Met 


Phe 


Thr 


Asp 
40 


Cys 


Glu 


Ser 


Pro 


Leu 
45 


Tyr 


Lys 


Asn 


Lys 


Ser 
50 


Gly 


Gly 


Lys 


Asn 


He 
55 


Val 


Thr 


Asp 


Val 


Gly 
60 


Glu 


Ser 


Val 


Leu 


Ser 


Ser 


Ser 


Ser 


Asp 


Glu 


Lys 


Met 


Ser 


Phe 


Lys 


Val 


Leu 


Ser 


His 


Val 


65 










70 










75 










80 


Leu 


Arg 


Arg 


Phe 


Pro 
85 


Val 


Leu 


Leu 


His 


Cys 
90 


Asn 


Tyr 


Lys 


Gin 


Thr 
95 


Asn 


Thr 


Pro 


Leu 


Trp 
100 


Lys 


Glu 


Leu 


Tyr 


Lys 
105 


His 


Gly 


Lys 


Phe 


Ala 
110 


Leu 


Leu 


Gly 


Asp 


Leu 
115 


Val 


Leu 


Phe 


Ser 


Asn 
120 


Pro 


Phe 


His 


Pro 


Asn 
125 


He 


Pro 


Ala 


Met 


Pro 
130 


Phe 


Asp 


Lys 


Ser 


Pro 

135 


He 


Cys 


Asp 


Thr 


Thr 
140 


Gly 


Lys 


Ser 


He 
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lie 


Met 


Ser 


Glu 


Val 


Met 


Thr 


Lys 


Glu 


Leu 


Leu 


Tyr 


Lys 


Leu 


Ala 


Asp 


145 










150 










155 










160 


Lys 


Asp 


He 


Gly 


Gin 


Phe 


Phe 


Ala 


Val 


Leu 


Asn 


Val 


Thr 


Asn 


Pro 


He 










165 










170 










175 




Thr 


Gly 


Asp 


Ser 


Phe 


Leu 


His 


Tyr 


Phe 


Ala 


Gly 


Gly 


Asn 


Thr 


Met 


Arg 








180 










185 










190 






Asp 


Gly 


Glu 


Gly 


Asp 


Lys 


He 


Cys 


Thr 


Ser 


Ala 


Asp 


Val 


Leu 


Arg 


He 






195 










200 










205 








He 


Ala 


Glu 


He 


Thr 


He 


Gin 


Lys 


Thr 


Gly 


Lys 


Met 


Pro 


Tyr 


Glu 


Leu 




210 










215 










220 










Met 


Lys 


Lys 




























225 

































<210> 274 

<211> 2193 

<212> DNA 

<213> SHRIMP 



<400> 274 

atggagggtg 

tcgaaaacgc 

gccatagctg 

tctccttatt 

aaagcaccgc 

tttaa tagaa 

aatccaacgt 

tctgtgttga 

ccaaattgtg 

gtcctccgtt 

gaaacaaatg 

cccccgtgtt 

ccgttccatt 

tcgtccaact 

tttttgtggt 

ggtagtaaaa 

aagaataatg 

tttaagcttt 

gaggtcactg 

cacgtccaac 

gaagt taaag 

gacaaggaaa 

agaatgaaag 

gattttgcag 

gagaatgttg 

atgcaagatc 

caagctactg 

tcaa t ttcag 

tttgttttgg 

ccggacataa 

gattttggtt 

gagaggttct 

tcagaaactg 

attaagagaa 

attgtatcta 

cctgtaattg 

aaacat tea t 

<210> 275 
<211> 724 
<212> PRT 



gggaccaacg 
caggagaagg 
tgaaatcttg 
ctcttaggcc 
ctcctgtgac 
ataagaagaa 
atgcgacaga 
ggaagcaccg 
caaaattctc 
cccatgtcaa 
tagtaaagag 
actttgcagt 
tgattagaca 
acgagaccat 
ctaacaaccg 
acctatccgt 
gtaaagggtg 
cccctggagc 
cccaatgtaa 
cagaatatgc 
tcggggtata 
tgtactgtga 
gcgcagtagg 
gcatt tctgt 
tacagcaacc 
atggcagctt 
gccgcctgcc 
ccatagctat 
tagaaattgt 
aaagtgacaa 
taattgtacc 
ttagtaacca 
ccatgacttt 
ctgcagattt 
aaatgtatga 
gtgca tgttt 
tgtatagcaa 



gacaaaactt 
agaaggagga 
ttgctctaaa 
catgaagaga 
tttgaggctc 
gaggcctatt 
cattatcaag 
cgccaatgcc 
tactgtcaat 
ggggccaaaa 
gaacttttct 
ggcagaggct 
ggccgcaaaa 
aaaacagagc 
tactaaacaa 
tttagacact 
gataaaaacc 
atgtgaatct 
gaaatgggaa 
ttctcaagtc 
ttacatgctt 
atttgtattt 
tgtgagaaag 
gattgatgta 
taacccctcg 
gtgtaggatg 
tgaaggtcta 
cgttggaaaa 
aaatattgtc 
tatagtaatt 
atgtaaaaag 
tcctcataca 
tgggttggct 
gtctgccaat 
ccaggaaaag 
cccgttcaag 
gaaggttaag 



acgccagcaa 
gaaggaggag 
aacgctactc 
ctaaagaaga 
cgcgaggact 
actattgatg 
aagcagcaat 
gacacccagt 
ttgaaggcta 
cacttgaaat 
tccattgaca 
gataccaata 
ttaggcctca 
tgtattgacg 
cccccttcat 
agccctctct 
atgagcatcg 
atattggaga 
agttaccgca 
gtaatgattg 
gaaactggaa 
gaaactgttt 
atgtgtgttg 
ttaaatggag 
actacttcct 
agggatactc 
caatctaaat 
atgagggaga 
actcggttgt 
gatggaataa 
tactacaatt 
gctcctgaat 
tatttgttaa 
tctatctata 
accaataggc 
gataatattg 
tag 



ccgtgatggg 
ggcaattcaa 
gccgatcccc 
ataatggaga 
acgagagcac 
aaaatcaatt 
tgccttctgt 
acagaaaaag 
gagactatac 
cttcttgtga 
agtgggtcaa 
ttgcagccgg 
tttctgacgt 
caaaggaaaa 
cttggtggcc 
tgaactggaa 
atcacatggc 
agaaaactac 
gaaatat tec 
gaccatctga 
aagttatcaa 
ttagtcacgc 
aaggtttttg 
acctgaaatg 
ccaagccagc 
tgtacggtgt 
gcaagaaacc 
gaatgttaaa 
ctcaacaagg 
ctggtcaacc 
ttaaatgt tg 
ttattaacag 
tagacatgtt 
caaacat tec 
cgagagcgta 
ctaaactttt 



actttaccaa 
gatacct tea 
tccctcagat 
ggtgggagga 
accttaeaac 
tgcaaeatta 
tagtgccgcg 
attctctcat 
tccactgtct 
taccgtgact 
gctagaaaaa 
tctagaatct 
gcaagatgtg 
agcgtccaag 
tgttgggttt 
eaggttatgc 
aaagaatgtt 
actcttgggg 
tgtaccagca 
attatatctc 
gtt tatgacg 
tcttgaggga 
tgtcgagatg 
taaaatggac 
cgctgagctc 
taggatgctt 
cattacggat 
ccaa t tgccc 
attagtgaat 
taagatgatt 
gggaactgat 
tgagttgtgt 
gtcca ttttg 
atttttgtct 
tgaaattgcg 
ccagtcacct 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2193 
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<213> SHRIMP 



<400> 275 



Met 


Glu 


Gly 


Gly Asp 


Gin 


Arg 


Thr 


Lys 


Leu 


1 








5 










10 


Gly 


Gin 


Ser 


Lys 


Thr 


Pro 


Gly 


Glu 


Gly 


Glu 








20 










25 




Gin 


Phe 


Lys 


He 


Pro 


Ser 


Ala 


He 


Ala 


Val 






35 










40 






Asn 


Ala 


Thr 


Arg 


Arg 


Ser 


Pro 


Pro 


Ser 


Asp 




50 










55 








Pro 


Met 


Lys 


Arg 


Leu 


Lys 


Lys 


Asn 


Asn 


Gly 


65 










70 










Pro 


Pro 


Pro 


Val 


Thr 


Leu 


Arg 


Leu 


Arg 


Glu 










85 










90 


Tyr 


Asn 


Phe 


Asn 


Arg 


Asn 


Lys 


Lys 


Lys 


Arg 








100 










105 




Asn 


Gin 


Phe 


Ala 


Thr 


Leu 


Asn 


Pro 


Thr 


Tyr 






115 










120 






Lys 


Gin 


Gin 


Leu 


Pro 


Ser 


Val 


Ser 


Ala 


Ala 




130 










135 








Arg 


Ala 


Asn 


Ala 


Asp 


Thr 


Gin 


Tyr 


Arg 


Lys 


145 










150 










Cys 


Ala 


Lys 


Phe 


Ser 


Thr 


Val 


Asn 


Leu 


Lys 










165 










170 


Leu 


Ser 


Val 


Leu 


Arg 


Ser 


His 


Val 


Lys 


Gly 








180 










185 




Ser 


Cys 


Asp 


Thr 


Val 


Thr 


Glu 


Thr 


Asn 


Val 






195 










200 






Ser 


He 


Asp 


Lys 


Trp 


Val 


Lys 


Leu 


Glu 


Lys 




210 










215 








Val 


Ala 


Glu 


Ala 


Asp 


Thr 


Asn 


He 


Ala 


Ala 


225 










^30 










His 


Leu 


He 


Arg 


Gin 


Ala 


Ala 


Lys 


Leu 


Gly 










245 










250 


Asp 


Val 


Ser 


Ser 


Asn 


Tyr 


Glu 


Thr 


He 


Lys 








260 










265 




Lys 


Glu 


Lys 


Ala 


Ser 


Lys 


Phe 


Leu 


Trp 


Ser 






275 










280 






Pro 


Pro 


Ser 


Ser 


Trp 


Trp 


Pro 


Val 


Gly 


Phe 




290 










295 








Val 


Leu 


Asp 


Thr 


Ser 


Pro 


Leu 


Leu 


Asn 


Trp 


305 










310 










Asn 


Gly 


Lys 


Gly 


Trp 


He 


Lys 


Thr 


Met 


Ser 










325 










330 


Asn 


Val 


Phe 


Lys 


Leu 


Ser 


Pro 


Gly 


Ala 


Cys 








340 










345 




Thr 


Leu 


Leu 


Gly 


Glu 


Val 


Thr 


Ala 


Gin 


Cys 






355 










360 






Arg 


Arg 


Asn 


He 


Pro 


Val 


Pro 


Ala 


His 


Val 




370 










375 








Gin 


Val 


Val 


Met 


He 


Gly 


Pro 


Ser 


Glu 


Leu 


385 










390 










Gly 


Val 


Tyr 


Tyr 


Met 


Leu 


Glu 


Thr 


Gly 


Lys 










405 










410 


Asp 


Lys 


Glu 


Met 


Tyr 


Cys 


Glu 


Phe 


Val 


Phe 








420 










425 




Ala 


Leu 


Glu 


Gly 


Arg 


Met 


Lys 


Gly 


Ala 


Val 






435 










440 






Val 


Glu 


Gly 


Phe 


Cys 


Val 


Glu 


Met 


Asp 


Phe 




450 










455 









PCT/USOO/28888 



Thr 


Pro 


Ala 


Thr 


Val 
15 


Met 


Gly 


Gly 


Glu 


Gly 
30 


Gly 


Gly 


Lys 


Ser 


Cys 
45 


Cys 


Ser 


Lys 


Ser 


Pro 
60 


Tyr 


Ser 


Leu 


Arg 


Glu 


Val 


Gly 


Gly 


Lys 


Ala 


75 










80 


Asp 


Tyr 


Glu 


Ser 


Thr 
95 


Pro 


Pro 


He 


Thr 


He 
110 


Asp 


Glu 


Ala 


Thr 


Asp 
125 


He 


He 


Lys 


Ser 


Val 
140 


Leu 


Arg 


Lys 


His 


Arg 


Phe 


Ser 


His 


Pro 


Asn 


155 










160 


Ala 


Arg 


Asp 


Tyr 


Thr 

175 


Pro 


Pro 


Lys 


His 


Leu 
190 


Lys 


Ser 


Val 


Lys 


Arg 
205 


Asn 


Phe 


Ser 


Pro 


Pro 
220 


Cys 


Tyr 


Phe 


Ala 


Gly 


Leu 


Glu 


Ser 


Pro 


Phe 


235 










240 


Leu 


He 


Ser 


Asp 


Val 
255 


Gin 


Gin 


Ser 


Cys 


He 
270 


Asp 


Ala 


Asn 


Asn 


Arg 
285 


Thr 


Lys 


Gin 


Gly 


Ser 
300 


Lys 


Asn 


Leu 


Ser 


Asn 


Arg 


Leu 


Cys 


Lys 


Asn 


315 










320 


He 


Asp 


His 


Met 


Ala 

335 


Lys 


Glu 


Ser 


He 


Lys 
350 


Lys 


Thr 


Lys 


Lys 


Trp 
365 


Glu 


Ser 


Tyr 


Gin 


Pro 

380 


Glu 


Tyr 


Ala 


Ser 


Tyr 


Leu 


Glu 


Val 


Lys 


Val 


395 










400 


Val 


He 


Lys 


Phe 


Met 
415 


Thr 


Glu 


Thr 


Val 


Phe 
430 


Ser 


His 


Gly 


Val 


Arg 
445 


Lys 


Met 


Cys 


Ala 


Gly 


He 


Ser 


Val 


He 



460 
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Asp 


Val 


Leu 


Asn 


Gly 


Asp 


Leu 


Lys 


Cys 


Lys 


Met 


Asp 


Glu 


Asn 


Val 


Val 


465 










470 










475 










480 


Gin 


Gin 


Pro 


Asn 


Pro 


Ser 


Thr 


Thr 


Ser 


Ser 


Lys 


Pro 


Ala 


Ala 


Glu 


Leu 










4 85 










4 90 










4 95 




Met 


Gin 


Asp 


His 


Gly 


Ser 


Leu 


Cys 


Arg 


Met 


Arg 


Asp 


Thr 


Leu 


Tyr 


Gly 








500 










505 










510 






Val 


Arg 


Met 


Leu 


Gin 


Ala 


Thr 


Gly 


Arg 


Leu 


Pro 


Glu 


Gly 


Leu 


Gin 


Ser 






515 










520 










525 








Lys 


Cys 


Lys 


Lys 


Pro 


He 


Thr 


Asp 


Ser 


He 


Ser 


Ala 


He 


Ala 


He 


Val 




530 










535 










540 










Gly 


Lys 


Met 


Arg 


Glu 


Arg 


Met 


Leu 


Asn 


Gin 


Leu 


Pro 


Phe 


Val 


Leu 


Val 


545 










550 










555 










560 


Glu 


He 


Val 


Asn 


He 


Val 


Thr 


Arg 


Leu 


Ser 


Gin 


Gin 


Gly 


Leu 


Val 


Asn 










565 










570 










575 




Pro 


Asp 


He 


Lys 


Ser 


Asp 


Asn 


He 


Val 


He 


Asp 


Gly 


He 


Thr 


Gly 


Gin 








580 










585 










590 






Pro 


Lys 


Met 


He 


Asp 


Phe 


Gly 


Leu 


He 


Val 


Pro 


Cys 


Lys 


Lys 


Tyr 


Tyr 






595 










600 










605 








Asn 


Phe 


Lys 


Cys 


Trp 


Gly 


Thr 


Asp 


Glu 


Arg 


Phe 


Phe 


Ser 


Asn 


His 


Pro 




610 










615 










620 










His 


Thr 


Ala 


Pro 


Glu 


Phe 


He 


Asn 


Ser 


Glu 


Leu 


Cys 


Ser 


Glu 


Thr 


Ala 


625 










630 










635 










640 


Met 


Thr 


Phe 


Gly 


Leu 


Ala 


Tyr 


Leu 


Leu 


He 


Asp 


Met 


Leu 


Ser 


He 


Leu 










645 










650 










655 




He 


Lys 


Arg 


Thr 


Ala 


Asp 


Leu 


Ser 


Ala 


Asn 


Ser 


He 


Tyr 


Thr 


Asn 


He 








660 










665 










670 






Pro 


Phe 


Leu 


Ser 


He 


Val 


Ser 


Lys 


Met 


Tyr 


Asp 


Gin 


Glu 


Lys 


Thr 


Asn 






675 










680 










685 








Arg 


Pro 


Arg 


Ala 


Tyr 


Glu 


He 


Ala 


Pro 


Val 


He 


Gly 


Ala 


Cys 


Phe 


Pro 




690 










695 










700 










Phe 


Lys 


Asp 


Asn 


He 


Ala 


Lys 


Leu 


Phe 


Gin 


Ser 


Pro 


Lys 


His 


Ser 


Lys 


705 










710 










715 










720 


Lys 


Lys 


Val 


Lys 



























<210> 276 
<211> 615 
<212> DNA 
<213> SHRIMP 



<400> 276 

atgtctagcg 

cttctagatg 

gccggttccc 

agacggggaa 

tcagtatcag 

tttgctcagt 

gttgagagct 

cgcttgtatt 

tacaacaaga 

aagtaccatg 

tctcaaagga 



gaaaagtaac 
gaggtgcagc 
tccacgataa 
aaacaaacag 
ttgtccaaca 
acaatatggg 
gcaagaagca 
actttaacaa 
taatcttccc 
cttctattcg 
tgtaa 



ctacgaaa tc 
aatctgtctg 
cctcttcaag 
caaaaatctg 
cccgacagaa 
taatggaaga 
cgaaagggtc 
gtgtcttcac 
tgccagaatt 
agatttctcc 



gttgaagggg 
cagtctaatt 
atgctaggat 
gcca taattg 
ccagaaaggt 
aaatgttact 
cacaaatctt 
gcgatcgcca 
gggtgcgcgg 
acaatcattg 



gattgttgaa 
gtgttgcaag 
ttggcgaccc 
aagatagacc 
tttgctccat 
tccctaacga 
ccacagaaat 
aatcacctgc 
cagctggagg 
ataaggaagt 



caacaagtac 
aaaacgtcac 
ctataaacag 
tcaactcggg 
gacattctta 
caaagagtat 
gaaaagattg 
aatgaagaag 
agattgggag 
gataatagtg 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

615 



<210> 277 
<2H> 204 
<212> PRT 
<213> SHRIMP 



<400> 277 

Met Ser Ser Gly Lys Val Thr Tyr Glu He Val Glu Gly Gly Leu Leu 
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15 10 15 

Asn Asn Lys Tyr Leu Leu Asp Gly Gly Ala Ala lie Cys Leu Gin Ser 

20 25 30 

Asn Cys Val Ala Arg Lys Arg His Ala Gly Ser Leu His Asp Asn Leu 

35 40 45 

Phe Lys Met Leu Gly Phe Gly Asp Pro Tyr Lys Gin Arg Arg Gly Lys 

50 55 60 

Thr Asn Ser Lys Asn Leu Ala lie lie Glu Asp Arg Pro Gin Leu Gly 
65 70 75 80 

Ser Val Ser Val Val Gin His Pro Thr Glu Pro Glu Arg Phe Cys Ser 

85 90 95 

Met Thr Phe Leu Phe Ala Gin Tyr Asn Met Gly Asn Gly Arg Lys Cys 

100 105 110 

Tyr Phe Pro Asn Asp Lys Glu Tyr Val Glu Ser Cys Lys Lys His Glu 

115 120 125 

Arg Val His Lys Ser Ser Thr Glu Met Lys Arg Leu Arg Leu Tyr Tyr 

130 135 140 

Phe Asn Lys Cys Leu His Ala lie Ala Lys Ser Pro Ala Met Lys Lys 
145 150 155 160 

Tyr Asn Lys lie He Phe Pro Ala Arg He Gly Cys Ala Ala Ala Gly 

165 170 175 

Gly Asp Trp Glu Lys Tyr His Ala Ser He Arg Asp Phe Ser Thr He 

180 185 190 

He Asp Lys Glu Val He He Val Ser Gin Arg Met 
195 200 



<210> 278 
<211> 828 
<212> DNA 
<213> SHRIMP 



<400> 278 

atgtcttcca 

agatggacaa 

ccccaacaac 

cctccattcc 

gaccagtgca 

agggaaagag 

agatcttccc 

aaccaacaca 

cttgtgaatt 

gctcgcgccc 

gttgtcatcg 

ccagatattc 

gattcttcta 

gtttattctg 



accgattcag 
ctgtcaagaa 
aacaacaaca 
ccatcattag 
gaaattgcga 
ttcgctcatt 
ccttccttaa 
tgtaccccct 
ggcgagaccc 
ctacacccac 
aggaagaaac 
ccgcaataac 
cagtatttct 
cattatttgc 



tcagctgagg 
caggaggaac 
tcaaaaaaga 
ttggggagcc 
tgaaactggc 
aaacaacact 
taagaagaag 
caacaagtac 
ttcacaagaa 
tccccaagaa 
tgcaccagaa 
tgcaactact 
cagaaatgtt 
aaaatgtatt 



ggcaacgagg 
agacagcaac 
acatcaacca 
ctcggcagct 
tattacaat t 
ccaagtgaag 
gacgttgacg 
agtt tccgtg 
aaacaggaca 
aaggaaccag 
ccagaaccag 
actactacta 
attctgagta 
agatctaaga 



agatggttgg 
agtattccca 
attctcctcc 
actcaatgta 
tccactctta 
gcatgtggcg 
aagctccacc 
aatatactcc 
agatcttaca 
aagtagaaac 
aaccagcccc 
cagttgcaac 
tcgtgttttg 
aggaataa 



ggactattca 
tagtttccgt 
tgctccacct 
tcgactggat 
tgatagaaag 
gcgcacaagt 
tcctcaatca 
ttcatcaaag 
agaggaagaa 
taaagatgat 
agttccagac 
acgtcacgac 
gtttctgggt 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

828 



<210> 279 
<211> 275 
<212> PRT 
<213> SHRIMP 



<400> 279 
Met Ser Ser Asn 
1 

Gly Asp Tyr Ser 
20 

Gin Gin Tyr Ser 
35 

Lys Arg Thr Ser 



Arg Phe Ser Gin 
5 

Arg Trp Thr Thr 

His Ser Phe Arg 
40 

Thr Asn Ser Pro 



Leu Arg Gly Asn 
10 

Val Lys Asn Arg 
25 

Pro Gin Gin Gin 

Pro Ala Pro Pro 



Glu Glu Met Val 
15 

Arg Asn Arg Gin 
30 

Gin Gin His Gin 
45 

Pro Pro Phe Pro 
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50 55 60 



He 


He 


Ser 


Trp 


Gly 


Ala 


Leu 


Gly 


Ser 


Tyr 


Ser 


Met 


Tyr 


Arg 


Leu 


Asp 


65 










70 










75 










80 


Asp 


Gin 


Cys 


Arg 


Asn 
85 


Cys 


Asp 


Glu 


Thr 


Gly 
90 


Tyr 


Tyr 


Asn 


Phe 


His 
95 


Ser 


Tyr 


Asp 


Arg 


Lys 
100 


Arg 


Glu 


Arg 


Val 


Arg 
105 


Ser 


Leu 


Asn 


Asn 


Thr 
110 


Pro 


Ser 


Glu 


Gly 


Met 
115 


Trp 


Arg 


Arg 


Thr 


Ser 
120 


Arg 


Ser 


Ser 


Pro 


Phe 
125 


Leu 


Asn 


Lys 


Lys 


Lys 
130 


Asp 


Val 


Asp 


Glu 


Ala 
135 


Pro 


Pro 


Pro 


Gin 


Ser 
140 


Asn 


Gin 


His 


Met 


Tyr 


Pro 


Leu 


Asn 


Lys 


Tyr 


Ser 


Phe 


Arg 


Glu 


Tyr 


Thr 


Pro 


Ser 


Ser 


Lys 


145 










150 










155 










160 


Leu 


Val 


Asn 


Trp 


Arg 

165 


Asp 


Pro 


Ser 


Gin 


Glu 
170 


Lys 


Gin 


Asp 


Lys 


He 
175 


Leu 


Gin 


Glu 


Glu 


Glu 
180 


Ala 


Arg 


Ala 


Pro 


Thr 
185 


Pro 


Thr 


Pro 


Gin 


Glu 
190 


Lys 


Glu 


Pro 


Glu 


Val 
195 


Glu 


Thr 


Lys 


Asp 


Asp 
200 


Val 


Val 


He 


Glu 


Glu 
205 


Glu 


Thr 


Ala 


Pro 


Glu 
210 


Pro 


Glu 


Pro 


Glu 


Pro 
215 


Ala 


Pro 


Val 


Pro 


Asp 
220 


Pro 


Asp 


He 


Pro 


Ala 


He 


Thr 


Ala 


Thr 


Thr 


Thr 


Thr 


Thr 


Thr 


Val 


Ala 


Thr 


Arg 


His 


Asp 


225 










230 










235 










240 


Asp 


Ser 


Ser 


Thr 


Val 
245 


Phe 


Leu 


Arg 


Asn 


Val 
250 


He 


Leu 


Ser 


He 


Val 
255 


Phe 


Trp 


Phe 


Leu 


Gly 
260 


Val 


Tyr 


Ser 


Ala 


Leu 
265 


Phe 


Ala 


Lys 


Cys 


He 
270 


Arg 


Ser 



Lys Lys Glu 
275 



<210> 280 
<211> 2025 
<212> DNA 
<213> SHRIMP 



<400> 280 

atggcaggga 

gaacaaggca 

aacttaaaat 

gcgtctacgc 

gacatcaacc 

tctatacgga 

gaaccaatct 

gcaaatgcaa 

atagtgacag 

gctgtaaact 

gggaa tg tag 

tcagaaatat 

acccctgaca 

gttcttatag 

ctgt tea tat 

aaca tttctt 

aacctgtaca 

tcaaaaaagg 

gt tgtggaga 

tcatggcctg 

ttaatgagac 

aactccttga 

aagattttgc 

tcagtcaaag 



atagaaccca 
tggagtgttg 
tgggcgattc 
ccctccttgg 
taatggctca 
gagttgtagt 
cacgccctca 
taagggacaa 
cggccttggg 
acatgtacga 
tctatctttc 
tagataaggt 
attcgttccc 
atctcatcac 
gtaaccattt 
cttcatccga 
atagactgga 
agtccttttc 
gttttcctcc 
ttattaataa 
tagcgataag 
caacagcagt 
tcgggcgagc 
agacgggaga 



gttcgtatcg 
tggcagacaa 
tcttaaagaa 
ggaattaaac 
tttctcctac 
ggctaatcat 
cccgctggga 
gttcatcagc 
ggctattgcc 
tgtagaattt 
caccaaaatg 
atcaaaacga 
ttaccaattg 
caatgcgatc 
aagggcagtt 
taattatttt 
atggttcatg 
aggagctgac 
cgttctcttg 
cagcaataat 
gacgagtagc 
gaccaaccgt 
tctggacgaa 
agaaaaggat 



tccctgattg 
gcacaggatg 
actgatgtta 
tacgacaaac 
gctgctttgg 
caacgtagaa 

ggggtagaac 

atgggggcgt 
agtgaacgtg 
gcagaaagag 
gacgaagatg 
cccgcaaagg 
atttggggcg 
gtgcctaata 
attaggagta 
gaggatggac 
ttagtagtta 
aatgttaacg 
gacactgaat 
aatagtacac 
ggtgcccgac 
attaccttcc 
gaagaagctg 
aagaacaata 



ctaaatgtat 
cattaatgac 
atttggaa ta 
aacaatatgc 
gtatagaaag 
ataatggaaa 
ctcctctatc 
tggacagatt 
aactattct t 
atgctgctac 
aagatgacat 
aaggtataga 
atgattctg t 
tttttatggc 
tgagggaaat 
gtaaatggtg 
gatttgtaa t 
tgaaaagact 
gggtcaagac 
tccctgtgac 
atcctatttt 
agtctgcaga 
gaacaaaaat 
cgttctcttc 



atcagacgtg 
tcgcctagcc 
cttgagatac 
ggcaacagtt 
tatactgaat 
aaaaccttct 
gtcagagttg 
gaattcagca 
acgtgaaaat 
tacagataca 
aataaagcgt 
ctggcgcccc 
agatgatact 
aaaatttatc 
tttatacggt 
cttctggttg 
tt tcctccac 
tctggtggta 
taatataacg 
agaagacacc 
cgacgaaatt 
a t tctgcaca 
gctagtaaaa 
atttggttta 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 
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ttactgaaga 
actacagatg 
tatgcgttta 
gagagtgacg 
gtgtccgact 
agtgtgtggt 
aaggtacaat 
aaaattgctg 
gagcaagaag 
gtcgttcaga 



acacaaaaaa 
tggcttgttg 
aaaaaatatg 
ccattacatc 
ggagttgctt 
tacaaatcct 
ctctttcatc 
ccgaacacgt 
cagcactatc 
acactctcaa 



tgaagaattg 
ggcacgtact 
gggccttgag 
ctttgtcacc 
actattacat 
gaaagatttt 
tgagatttgt 
tcgcagtgta 
tacagaacac 
ccttctggag 



gaaataaaca 
tcctcgacat 
gatgcaagtg 
gataagagca 
ccctgttgta 
tcccaggaaa 
cagaaatcaa 
aaaaagttat 
tgtatttggt 
aattttcccg 



taggcgataa 
cctttatccg 
atgtagtcga 
gtcctctcct 
aagcaccggc 
atataaaaac 
acgaccgttt 
taaatacgat 
taacgatttt 
tataa 



cgatgatgag 
taataggaca 
gctgaagcga 
atttccgtat 
cataattaaa 
tataaatgaa 
taaaaataaa 
aagcaacagg 
gtggaaacaa 



1500 
1560 
162 0- 
1680 
1740 
1800 
1860 
1920 
1980 
2025 



<210> 281 
<211> 672 
<212> PRT 
<213> SHRIMP 



<400> 281 



Met 


Ala 


Gly 


Asn 


Arg 


Thr 


Gin 


Phe 


Val 


Ser 


Ser 


Leu 


He 


Ala 


Lys 


Cys 


1 








5 










10 










15 




lie 


Ser 


Asp 


Val 


Glu 


Gin 


Gly 


Met 


Glu 


Cys 


Cys 


Gly 


Arg 


Gin 


Ala 


Gin 








20 










25 










30 






Asp 


Ala 


Leu 


Met 


Thr 


Arg 


Leu 


Ala 


Asn 


Leu 


Lys 


Leu 


Gly 


Asp 


Ser 


Leu 






35 










40 










45 








Lys 


Glu 


Thr 


Asp 


Val 


Asn 


Leu 


Glu 


Tyr 


Leu 


Arg 


Tyr 


Ala 


Ser 


Thr 


Pro 




50 










55 










60 










Leu 


Leu 


Gly 


Glu 


Leu 


Asn 


Tyr 


Asp 


Lys 


Gin 


Gin 


Tyr 


Ala 


Ala 


Thr 


Val 


65 










70 










75 










80 


Asp 


He 


Asn 


Leu 


Met 


Ala 


His 


Phe 


Ser 


Tyr 


Ala 


Ala 


Leu 


Gly 


He 


Glu 










85 










90 










95 




Ser 


He 


Leu 


Asn 


Ser 


He 


Arg 


Arg 


Val 


Val 


Val 


Ala 


Asn 


His 


Gin 


Arg 








100 










105 










110 






Arg 


Asn 


Asn 


Gly 


Lys 


Lys 


Pro 


Ser 


Glu 


Pro 


He 


Ser 


Arg 


Pro 


His 


Pro 






115 










120 










125 








Leu 


Gly 


Gly 


Val 


Glu 


Pro 


Pro 


Leu 


Ser 


Ser 


Glu 


Leu 


Ala 


Asn 


Ala 


He 




130 










135 










140 










Arg 


Asp 


Lys 


Phe 


He 


Ser 


Met 


Gly 


Ala 


Leu 


Asp 


Arg 


Leu 


Asn 


Ser 


Ala 


145 










150 










155 










160 


He 


Val 


Thr 


Ala 


Ala 


Leu 


Gly 


Ala 


He 


Ala 


Ser 


Glu 


Leu 


Phe 


Leu 


Arg 










165 










170 










175 




Glu 


Asn 


Ala 


Val 


Asn 


Tyr 


Met 


Tyr 


Asp 


Val 


Glu 


Phe 


Ala 


Glu 


Arg 


Asp 








180 










185 










190 






Ala 


Ala 


Thr 


Thr 


Asp 


Thr 


Gly 


Asn 


Val 


Val 


Tyr 


Leu 


Ser 


Thr 


Lys 


Met 






195 










200 










205 








Asp 


Glu 


Asp 


Glu 


Asp 


Asp 


He 


He 


Lys 


Arg 


Ser 


Glu 


He 


Leu 


Asp 


Lys 




210 










215 










220 










Val 


Ser 


Lys 


Arg 


Pro 


Ala 


Lys 


Glu 


Gly 


He 


Asp 


Trp 


Arg 


Pro 


Thr 


Pro 


225 










230 










235 










240 


Asp 


Asn 


Ser 


Phe 


Pro 


Tyr 


Gin 


Leu 


He 


Trp 


Gly 


Asp 


Asp 


Ser 


Val 


Asp 










245 










250 










255 




Asp 


Thr 


Val 


Leu 


He 


Asp 


Leu 


He 


Thr 


Asn 


Ala 


He 


Val 


Pro 


Asn 


He 








260 










265 










270 






Phe 


Met 


Ala 


Lys 


Phe 


He 


Leu 


Phe 


He 


Cys 


Asn 


His 


Leu 


Arg 


Ala 


Val 






275 










280 










285 








He 


Arg 


Ser 


Met 


Arg 


Glu 


He 


Leu 


Tyr 


Gly 


Asn 


He 


Ser 


Ser 


Ser 


Ser 




290 










295 










300 










Asp 


Asn 


Tyr 


Phe 


Glu 


Asp 


Gly 


Arg 


Lys 


Trp 


Cys 


Phe 


Trp 


Leu 


Asn 


Leu 


305 










310 










315 










320 


Tyr 


Asn 


Arg 


Leu 


Glu 


Trp 


Phe 


Met 


Leu 


Val 


Val 


Arg 


Phe 


Val 


He 


Phe 










325 










330 










335 




Leu 


His 


Ser 


Lys 


Lys 


Glu 


Ser 


Phe 


Ser 


Gly 


Ala 


Asp 


Asn 


Val 


Asn 


Val 



340 345 350 
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Lys 


Arg 


Leu 


Leu 


Val 






355 






Asp 


Thr 


Glu 


Trp 


Val 




370 








Asn 


Ser 


Asn 


Asn 


Asn 


385 










Arg 


Leu 


Ala 


lie 


Arg 










405 


Glu 


lie 


Asn 


Ser 


Leu 








420 




Ser 


Ala 


Glu 


Phe 


Cys 






435 






Glu 


Glu 


Ala 


Gly 


Thr 




450 








Glu 


Glu 


Lys 


Asp 


Lys 


465 










Lys 


Asn 


Thr 


Lys 


Asn 










485 


Asp 


Glu 


Thr 


Thr 


Asp 








500 




Phe 


lie 


Arg 


Asn 


Arg 






515 






Asp 


Ala 


Ser 


Asp 


Val 




530 








Ser 


Phe 


Val 


Thr 


Asp 


545 










Asp 


Trp 


Ser 


Cys 


Leu 










565 


lie 


Lys 


Ser 


Val 


Trp 








580 




lie 


Lys 


Thr 


lie 


Asn 






595 






Gin 


Lys 


Ser 


Asn 


Asp 




610 








Val 


Arg 


Ser 


Val 


Lys 


625 










Glu 


Ala 


Ala 


Leu 


Ser 










645 


Lys 


Gin 


Val 


Val 


Gin 



660 



Val Val Val Glu Ser 
360 

Lys Thr Asn lie Thr 
375 

Ser Thr Leu Pro Val 
390 

Thr Ser Ser Gly Ala 
410 

Thr Thr Ala Val Thr 
425 

Thr Lys lie Leu Leu 
440 

Lys Met Leu Val Lys 
455 

Asn Asn Thr Phe Ser 
470 

Glu Glu Leu Glu lie 
490 

Val Ala Cys Trp Ala 
505 

Thr Tyr Ala Phe Lys 
520 

Val Glu Leu Lys Arg 
535 

Lys Ser Ser Pro Leu 
550 

Leu Leu His Pro Cys 
570 

Leu Gin lie Leu Lys 
585 

Glu Lys Val Gin Ser 
600 

Arg Phe Lys Asn Lys 
615 

Lys Leu Leu Asn Thr 
630 

Thr Glu His Cys lie 
650 

Asn Thr Leu Asn Leu 
665 



Phe 


Pro 


Pro 


Val 


Leu 


Leu 






365 








Ser 


Trp 


Pro 


Val 


He 


Asn 




380 










Thr 


Glu 


Asp 


Thr 


Leu 


Met 


395 










400 


Arg 


His 


Pro 


He 


Phe 


Asp 










415 




Asn 


Arg 


He 


Thr 


Phe 


Gin 








430 






Gly 


Arg 


Ala 


Leu 


Asp 


Glu 






445 








Ser 


Val 


Lys 


Glu 


Thr 


Gly 




460 










Ser 


Phe 


Gly 


Leu 


Leu 


Leu 


475 










480 


Asn 


lie 


Gly 


Asp 


Asn 


Asp 










495 




Arg 


Thr 


Ser 


Ser 


Thr 


Ser 








510 






Lys 


lie 


Trp 


Gly 


Leu 


Glu 






525 








Glu 


Ser 


Asp 


Ala 


He 


Thr 




540 










Leu 


Phe 


Pro 


Tyr 


Val 


Ser 


555 










560 


Cys 


Lys 


Ala 


Pro 


Ala 


He 










575 




Asp 


Phe 


Ser 


Gin 


Glu 


Asn 








590 






Leu 


Ser 


Ser 


Glu 


He 


Cys 






605 








Lys 


lie 


Ala 


Ala 


Glu 


His 




620 










lie 


Ser 


Asn 


Arg 


Glu 


Gin 


635 










640 


Trp 


Leu 


Thr 


He 


Leu 


Trp 










655 




Leu 


Glu 


Asn 


Phe 


Pro 


Val 



670 



<210> 282 
<211> 2535 
<212> DNA 
<213> SHRIMP 



<400> 282 

atgttcactc 

attgacgttc 

aatccggaat 

acaatagttg 

atcaagaagg 

ctctgggtac 

gagattgtga 

aactatcaaa 

attttaatga 

aacaacacca 

atgctccaca 

gctagtagtt 

gcaatccaag 



acttgacccg 
accgagtggt 
catccatata 
gtgattacat 
tcatcgaaca 
accttctgga 
atgccaagta 
ctgggaagag 
gatgtgga tc 
actttcttca 
ttgctgaaat 
tccctgatgc 
aagataatgc 



agccttccgc 
ggccgagtta 
tagaactccg 
cctctctccg 
cttcccagta 
cgaacgccat 
caggggta tt 
tctactgatg 
atacaaattc 
ctgctgtgca 
gttctttgcc 
tgcagcagag 
tgatgcactc 



aagatgaaca 
tcctatccag 
atatccctct 
aagacggact 
atcttccact 
catcgcctgc 
gttgtcatcc 
agcaaactgg 
atctcattaa 
agtaaatggg 
aaccctacta 
gacgacaagg 
ctgttcctga 



acctagtcaa 
aattcgaaga 
tccaaaacaa 
cattccaagt 
gcacccacaa 
tccagagcct 
catactacag 
cgtccgtaaa 
tgtgcatgat 
gagaagttgg 
ctagccaaca 
ggaaaacacc 
tctccctcta 



tcgcagcttc 
ggatgtaaag 
ggatattgtt 
tctataccca 
taatgcccct 
gctgacgtac 
gcgccccatc 
agttctggac 
caacaagaag 
aagcaagatg 
cctatccgac 
tgcccatcta 
cggcgcaccc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 
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tggtttcaag 
gtcaaggtac 
caactagaac 
gaagggaaaa 
gaatgcctga 
gatgtgggcg 
gctgagagat 
cagatggaag 
ccagccagaa 
gctgaagcca 
ggagaagaat 
tttggaagac 
gctgaactgt 
cacatctcta 
tccttcatct 
cgagttggag 
cacgcagctt 
ttggactttc 
agtcctcttc 
gcagagagta 
cacatcaccg 
atcaggactg 
cgagatattg 
ccccctgttc 
gtagatttca 
agaaagtaca 
gaaagggtgg 
tacaacacta 
gatttgggtg 
tttactttaa 



ataacaactc 
tatcctttgc 
cagataccat 
caacaagtct 
tgggaatgtg 
acgaagtact 
ctgaacacaa 
caatgaaacc 
gaggcgaacg 
cagctcaagg 
atgaagaaga 
catttctgga 
ctagacgagg 
cagtaaatat 
acaacatgct 
gagaaagagt 
cttccatgct 
aagcagtgta 
acgcctaccc 
tcagatacga 
aagcattaga 
gaaaatctta 
ctgaggaatt 
atgtacatcc 
tccaccactg 
ttgaacacga 
aagttttaag 
gaatgtccct 
tgtctaaatt 
tctaa 



gtacatgaaa 
agctgacaag 
gtgtggagtg 
ttcctggtac 
tgctgctgct 
ggaaagatgc 
tcgtgtactt 
tcccagagtt 
agccatcaga 
agatgtcaac 
gggagaagaa 
agatgaggaa 
cgtttctgtc 
tgtgcaacca 
caagaatgac 
gcccgtcatg 
ttgtgacttt 
tgatccccat 
aggagccgtc 
caatgatatt 
tgtctttgag 
ctctaactgg 
tgttagggta 
ctttagagat 
catcacctgg 
gaatacaaat 
ggtatctcaa 
cagcaccaaa 
tggtgtatac 



tctgcccttg 
tacgagattt 
tgtgcaacat 
cagatgaatt 
ggcaatgtac 
cctcctacaa 
tttgaagcaa 
gttgttcctc 
atcgcaagag 
tcctacttcc 
ttcttcaatt 
gaagccagac 
aatattatta 
gtttatggag 
gtctt tgagt 
aacctgtcca 
gccacagaaa 
cacatatcca 
acttttctgg 
gtctcattct 
ggtagtttat 
aacgaccatt 
tgtgaaaact 
ggagcaatcc 
tctatgcaag 
gtgcacctgt 
ctcagatgga 
agattgagcc 
aaactcctag 



aactcaagtc 
tacccaacat 
ctgtggaaga 
gcaagcatta 
aatgccccat 
tatttagatg 
aaaagcaaga 
ctcgcaggac 
aaattgccac 
ctgttctcat 
ctgaagagga 
aaatacagat 
acaatgataa 
ttgaaaagtc 
ctatacgctc 
atgacaagag 
caaactctca 
actatatcga 
acggggccca 
cagaaatggc 
tatccccact 
tgaggcgtag 
ctctagcttc 
ccattctcat 
ttaatgcact 
taaacttgcg 
gccgcttgtt 
tcatgaagat 
atattattga 



taacaagtgt 
taacaacaat 
agatgaaaat 
catccattgc 
gtgccgtgag 
gttaaaactg 
attctataag 
atttctcacc 
caacgccatc 
tgacgggagc 
ggcgcttgct 
gcgccagttt 
tcctcatcga 
acctgctgct 
aagagatact 
ggcattattc 
aattgttgga 
gacgtttggt 
ggactattat 
aagtgagttg 
gttcaagaaa 
aaattatgct 
acgcgaacac 
tgaatatata 
ccattgtatg 
tcctactgat 
caatgaacaa 
cttcaaccat 
aatgtactgt 



840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2535 



<210> 283 
<211> 838 
<212> PRT 
<213> SHRIMP 



<400> 283 
Met Phe Thr His 
1 

Asn Arg Ser Phe 
20 

Pro Glu Phe Glu 
35 

Thr Pro lie Ser 
50 

Asp Tyr lie Leu 
65 

lie Lys Lys Val 

Asn Asn Ala Pro 
100 

Leu Leu Gin Ser 
115 

Gly He Val Val 
130 

Gly Lys Ser Leu 
145 

He Leu Met Arg 

He Asn Lys Lys 
180 



Leu Thr Arg Ala 

5 

He Asp Val His 

Glu Asp Val Lys 
40 

Leu Phe Gin Asn 
55 

Ser Pro Lys Thr 
70 

He Glu His Phe 
85 

Leu Trp Val His 

Leu Leu Thr Tyr 
120 

He Pro Tyr Tyr 
135 

Leu Met Ser Lys 
150 

Cys Gly Ser Tyr 
165 

Asn Asn Thr Asn 



Phe Arg Lys Met 
10 

Arg Val Val Ala 
25 

Asn Pro Glu Ser 

Lys Asp He Val 
60 

Asp Ser Phe Gin 
75 

Pro Val He Phe 
90 

Leu Leu Asp Glu 
105 

Glu He Val Asn 

Arg Arg Pro He 
140 

Leu Ala Ser Val 
155 

Lys Phe He Ser 
170 

Phe Leu His Cys 
185 



Asn 


Asn 


Leu 


Val 






15 




Glu 


Leu 


Ser 


Tyr 




30 






Ser 


He 


Tyr 


Arg 


45 








Thr 


He 


Val 


Gly 


Val 


Leu 


Tyr 


Pro 








80 


His 


Cys 


Thr 


His 






95 




Arg 


His 


His 


Arg 




110 






Ala 


Lys 


Tyr 


Arg 


125 








Asn 


Tyr 


Gin 


Thr 


Lys 


Val 


Leu 


Asp 








160 


Leu 


Met 


Cys 


Met 






175 




Cys 


Ala 


Ser 


Lys 




190 
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Trp Gly Glu Val Gly Ser Lys Met Met Leu His lie Ala Glu Met Phe 

195 200 205 

Phe Ala Asn Pro Thr Thr Ser Gin His Leu Ser Asp Ala Ser Ser Phe 

210 215 220 

Pro Asp Ala Ala Ala Glu Asp Asp Lys Gly Lys Thr Pro Ala His Leu 
225 230 235 240 

Ala lie Gin Glu Asp Asn Ala Asp Ala Leu Leu Phe Leu lie Ser Leu 

245 250 255 

Tyr Gly Ala Pro Trp Phe Gin Asp Asn Asn Ser Tyr Met Lys Ser Ala 

260 265 270 

Leu Glu Leu Lys Ser Asn Lys Cys Val Lys Val Leu Ser Phe Ala Ala 

275 280 285 

Asp Lys Tyr Glu lie Leu Pro Asn lie Asn Asn Asn Gin Leu Glu Pro 

290 295 300 

Asp Thr Met Cys Gly Val Cys Ala Thr Ser Val Glu Glu Asp Glu Asn 
305 310 315 320 

Glu Gly Lys Thr Thr Ser Leu Ser Trp Tyr Gin Met Asn Cys Lys His 

325 330 335 

Tyr lie His Cys Glu Cys Leu Met Gly Met Cys Ala Ala Ala Gly Asn 

340 345 350 

Val Gin Cys Pro Met Cys Arg Glu Asp Val Gly Asp Glu Val Leu Glu 

355 360 365 

Arg Cys Pro Pro Thr lie Phe Arg Trp Leu Lys Leu Ala Glu Arg Ser 

370 375 380 

Glu His Asn Arg Val Leu Phe Glu Ala Lys Lys Gin Glu Phe Tyr Lys 
385 390 395 400 

Gin Met Glu Ala Met Lys Pro Pro Arg Val Val Val Pro Pro Arg Arg 

405 410 415 

Thr Phe Leu Thr Pro Ala Arg Arg Gly Glu Arg Ala lie Arg lie Ala 

420 425 430 

Arg Glu lie Ala Thr Asn Ala lie Ala Glu Ala Thr Ala Gin Gly Asp 

435 440 445 

Val Asn Ser Tyr Phe Pro Val Leu lie Asp Gly Ser Gly Glu Glu Tyr 

450 455 460 

Glu Glu Glu Gly Glu Glu Phe Phe Asn Ser Glu Glu Glu Ala Phe Gly 
465 470 475 480 

Arg Pro Phe Leu Glu Asp Glu Glu Glu Ala Arg Gin He Gin Met Arg 
485 490 495 

- Gin Phe Ala Glu Leu Ser Arg Arg Gly Val Ser Val Asn He He Asn 
500 505 510 

Asn Asp Asn Pro His Arg His Thr Val Asn He Val Gin Pro Val Tyr 

515 520 525 

Gly Val Glu Lys Ser Pro Ala Ala Ser Phe He Tyr Asn Met Leu Lys 

530 535 540 

Asn Asp Val Phe Glu Ser He Arg Ser Arg Asp Thr Arg Val Gly Gly 
545 550 555 560 

Glu Arg Val Pro Val Met Asn Leu Ser Asn Asp Lys Arg Ala Leu Phe 

565 570 575 

His Ala Ala Ser Ser Met Leu Cys Asp Phe Ala Thr Glu Thr Asn Ser 

580 585 590 

Gin He Val Gly Leu Asp Phe Gin Ala Val Tyr Asp Pro His His Asn 

595 600 605 

Tyr He Glu Thr Phe Gly Ser Pro Leu His Ala Tyr Pro Gly Ala Val 

610 615 620 

Thr Phe Leu Asp Gly Ala Gin Asp Tyr Tyr Ala Glu Ser He Arg Tyr 
625 630 635 640 

Asp Asn Asp He Val Ser Phe Ser Glu Met Ala Ser Glu Leu His He 

645 650 655 

Thr Glu Ala Leu Asp Val Phe Glu Gly Ser Leu Leu Ser Pro Leu Phe 

660 665 670 

Lys Lys He Arg Thr Gly Lys Ser Tyr Ser Asn Trp Asn Asp His Leu 
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675 










680 










685 








Arg 


Arg 


Arg 


Asn 


Tyr 


Ala 


Arg 


Asp 


He 


Ala 


Glu 


Glu 


Phe 


Val 


Arg 


Val 




690 










695 










700 










Cys 


Glu 


Asn 


Ser 


Leu 


Ala 


Ser 


Arg 


Glu 


His 


Pro 


Pro 


Val 


His 


Val 


His 


705 










710 










715 










720 


Pro 


Phe 


Arg 


Asp 


Gly 


Ala 


He 


Pro 


He 


Leu 


He 


Glu 


Tyr 


He 


Val 


Asp 










725 










730 










735 




Phe 


He 


His 


His 


Cys 


He 


Thr 


Trp 


Ser 


Met 


Gin 


Val 


Asn 


Ala 


Leu 


His 








740 










745 










750 






Cys 


Met 


Arg 


Lys 


Tyr 


He 


Glu 


His 


Glu 


Asn 


Thr 


Asn 


Val 


His 


Leu 


Leu 






755 










760 










765 








Asn 


Leu 


Arg 


Pro 


Thr 


Asp 


Glu 


Arg 


Val 


Glu 


Val 


Leu 


Arg 


Val 


Ser 


Gin 




770 










775 










780 










Leu 


Arg 


Trp 


Ser 


Arg 


Leu 


Phe 


Asn 


Glu 


Gin 


Tyr 


Asn 


Thr 


Arg 


Met 


Ser 


785 










790 










795 










800 


Leu 


Ser 


Thr 


Lys 


Arg 


Leu 


Ser 


Leu 


Met 


Lys 


He 


Phe 


Asn 


His 


Asp 


Leu 










805 










810 










815 




Gly 


Val 


Ser 


Lys 


Phe 


Gly 


Val 


Tyr 


Lys 


Leu 


Leu 


Asp 


He 


He 


Glu 


Met 








820 










825 










830 






Tyr 


Cys 


Phe 


Thr 


Leu 


He 























<210> 284 
<211> 2799 
<212> DNA 
<213> SHRIMP 



<400> 284 

atggcagcag 

attgccgata 

gcagctaatg 

tggtcccaag 

gaggcattta 

gcaactggac 

tcaaatacaa 

agtgctgaaa 

cccgtgtact 

gcatccaaga 

ctgattgttg 

gaactagaac 

gatgaaatgt 

cttcccca tc 

atcaacattg 

tctataccat 

cccaataact 

agtggagatt 

ttctatggcg 

acctgttctt 

acacctgaaa 

ttccattcac 

aacccttttt 

gaagtat tgg 

atggacagat 

gatagaataa 

tgtaaaattc 

actgctacgt 

tatttgtgta 

gaagatcaca 

ctcgaaccta 

a ttaagatgc 



acctcctaga 
ctgaattcct 
gacggccata 
ataacaggaa 
tagctggttc 
taggataccc 
ctaattacaa 
ttgataagcg 
tgaaactgat 
ggacggccaa 
ccatattaat 
agcaagtatt 
gtgtagtggc 
atttgtacaa 
aaaattcctc 
caagtatggt 
tgatgactgc 
acggatggtt 
accgaaaaca 
attggaacaa 
ataactgtgg 
cggtaacgcg 
ttattagaca 
aaaatgttgg 
ttggtagtct 
agtctctaga 
tagaaaccaa 
ctgatgatcc 
atattctaga 
ctaaaaccgg 
aacaacattt 
tgagaaatga 



gttggctatc 
caattatctt 
tcttcctaca 
gctgaaactc 
tatagatgct 
tctattgaaa 
cttgtacatt 
tccttcatca 
cgacagaaga 
ggaaattgga 
acctgcatac 
ggatgatgca 
catattgagt 
tgcacctttc 
attctttaat 
aataaaacac 
agcagaacgg 
ttctgtaatt 
tctaataaat 
gtatatggat 
ttttcgtata 
agtcaacaag 
cacagaacct 
cgtgaccctc 
attacttgga 
aactatttct 
gtataactac 
tataaagatg 
atttgctata 
caatggtggt 
aattgttaaa 
ttgcgaacgc 



caggaaacaa 
ccccataaaa 
ctagaaatga 
ttggggcatt 
gaaacgtgcg 
aaactagccc 
gatagaaatt 
atacagaaca 
accgagtgtc 
gcagccagaa 
acagaaacgt 
tattttaatg 
actttgcaca 
cgtcttcctc 
gaagatacga 
catacgcgaa 
tctatatttt 
gtaggatcta 
acagtcaaat 
tgcagatctt 
agggctgcaa 
aagaaaacga 
aagtggtaca 
gaacaacacg 
cgagaaaaga 
aacaatctca 
aaatcttctt 
aagattatcg 
atttcatcag 
tgtgcgtttt 
gtgaataaat 
aacaagtgta 



tccagtctga 
ctggcatctg 
ggaacgaagt 
tttgtggcaa 
tgggcttttt 
tgattgcccg 
caatgatgaa 
cttctcacac 
ttgccctgga 
aggtttgttt 
ttgttcttga 
cggagaataa 
atttatttgt 
cctttggaca 
cacctattct 
aaaatagtag 
tacgtggagt 
ctataatgcc 
ctaataactt 
atgggtttga 
ttgattgctc 
gcattattaa 
acaaaaatgc 
tccgtgttag 
aatggacgtg 
agggtaaaat 
ccttatacta 
cctctataaa 
agaaaaagga 
caaagtataa 
acattgaggc 
ggtttaaaga 



attggaagaa 
cgaagaagct 
tgaccatttc 
cttgtatgtg 
gaggtcgcaa 
tgaggataaa 
acaagttttt 
aaaa tct tec 
ttggctggac 
cctccagaac 
tacaggcaac 
agataaagtg 
taggaaaagt 
acaccctatc 
agcgtcaatt 
atggagatgt 
tttaacagtc 
tagtgtactt 
ttctgccata 
gattatagac 
gaacacagat 
tgcggtaaag 
catgtgtggt 
tgatgagtat 
taat tatcta 
tgacaccatg 
taagcaaata 
caagaggagg 
tgaggtggag 
gaaaaaacaa 
tttttcgtta 
ggctgaaatt 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 
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agagagtgcg 
cacgatctgg 
aacgctaaca 
aaagattcgc 
ccaataagac 
actgttacct 
gactttttgg 
agactaaaaa 
aattctgata 
agtacagttg 
ttttcaatcg 
tcgagtagtg 
atgaataact 
tgcacagata 
gtaaaaagta 



ccaacgaact 
tattgaagcg 
ctatcatgtc 
gcgatgttaa 
actctatgct 
gttttgaatg 
atcgttacga 
aatgtagtag 
cactttattc 
taaaaaatga 
gtaaaggaga 
acgacgacga 
gcagcagcag 
ttctcaatgt 
tggatgtgtt 



ggtacgagaa 
aactaatgtc 
tttaatacct 
gttgttgaat 
ggagttagta 
gttaaaatgg 
gaaaacgggg 
agatcacact 
ttattctctc 
cactgacgga 
aaaaacaacc 
tgaagaagaa 
catcaagaac 
tgtagaacct 
gaatttgtta 



ctgtataggg 
cacttgacat 
aaatgtaagc 
tttctgagaa 
tacggagagg 
tgctcgaaaa 
gaggaagata 
aaggatataa 
gataaaaatg 
aaaacgtcta 
aaaaaacgaa 
gaggaagaag 
aagagcaaga 
tctctaccta 
tgatgttaa 



cttcggccag 
ggcaacgccc 
tacacacagt 
cgagggatgg 
agtacgctaa 
aaggtgtgat 
aagacgaaag 
aaaaaataga 
tgcaaaccca 
tggtgggctg 
aactggaaac 
atgaaggaaa 
acaagaatgg 
atactttatc 



aagttatgtc 
ttacgatgaa 
attgtacgat 
aaattataac 
agatgtcagt 
taaatatgaa 
ggaattcttt 
aaatgtacta 
cgcatcttct 
ggattatatt 
gatagatata 
aagaatgaaa 
gagaatgtgt 
gttcaattgt 



1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2799 



<210> 285 
<211> 924 
<212> PRT 
<213> SHRIMP 



<400> 285 



Met 


Ala 


Ala 


Asp 


Leu 


Leu 


Glu 


Leu 


Ala 


He 


Gin 


Glu 


Thr 


He 


Gin 


Ser 


1 








5 










10 










15 




Glu 


Leu 


Glu 


Glu 


He 


Ala 


Asp 


Thr 


Glu 


Phe 


Leu 


Asn 


Tyr 


Leu 


Pro 


His 








20 










25 










30 






Lys 


Thr 


Gly 


lie 


Cys 


Glu 


Glu 


Ala 


Ala 


Ala 


Asn 


Gly 


Arg 


Pro 


Tyr 


Leu 






35 










40 










45 








Pro 


Thr 


Leu 


Glu 


Met 


Arg 


Asn 


Glu 


Val 


Asp 


His 


Phe 


Trp 


Ser 


Gin 


Asp 




50 










55 










60 










Asn 


Arg 


Lys 


Leu 


Lys 


Leu 


Leu 


Gly 


His 


Phe 


Cys 


Gly 


Asn 


Leu 


Tyr 


Val 


65 










70 










75 










80 


Glu 


Ala 


Phe 


lie 


Ala 


Gly 


Ser 


He 


Asp 


Ala 


Glu 


Thr 


Cys 


Val 


Gly 


Phe 










85 










90 










95 




Leu 


Arg 


Ser 


Gin 


Ala 


Thr 


Gly 


Leu 


•Gly 


Tyr 


Pro 


Leu 


Leu 


Lys 


Lys 


Leu 








100 










105 










110 






Ala 


Leu 


lie 


Ala 


Arg 


Glu 


Asp 


Lys 


Ser 


Asn 


Thr 


Thr 


Asn 


Tyr 


Asn 


Leu 






115 










120 










125 








Tyr 


lie 


Asp 


Arg 


Asn 


Ser 


Met 


Met 


Lys 


Gin 


Val 


Phe 


Ser 


Ala 


Glu 


He 




130 










135 










140 










Asp 


Lys 


Arg 


Pro 


Ser 


Ser 


He 


Gin 


Asn 


Thr 


Ser 


His 


Thr 


Lys 


Ser 


Ser 


145 










150 










155 










160 


Pro 


Val 


Tyr 


Leu 


Lys 


Leu 


He 


Asp 


Arg 


Arg 


Thr 


Glu 


Cys 


Leu 


Ala 


Leu 










165 










170 










175 




Asp 


Trp 


Leu 


Asp 


Ala 


Ser 


Lys 


Arg 


Thr 


Ala 


Lys 


Glu 


He 


Gly 


Ala 


Ala 








180 










185 










190 






Arg 


Lys 


Val 


Cys 


Phe 


Leu 


Gin 


Asn 


Leu 


He 


Val 


Ala 


He 


Leu 


He 


Pro 






195 










200 










205 








Ala 


Tyr 


Thr 


Glu 


Thr 


Phe 


Val 


Leu 


Asp 


Thr 


Gly 


Asn 


Glu 


Leu 


Glu 


Gin 




210 










215 










220 










Gin 


Val 


Leu 


Asp 


Asp 


Ala 


Tyr 


Phe 


Asn 


Ala 


Glu 


Asn 


Lys 


Asp 


Lys 


Val 


225 










230 










235 










240 


Asp 


Glu 


Met 


Cys 


Val 


Val 


Ala 


He 


Leu 


Ser 


Thr 


Leu 


His 


Asn 


Leu 


Phe 










245 










250 










255 




Val 


Arg 


Lys 


Ser 


Leu 


Pro 


His 


His 


Leu 


Tyr 


Asn 


Ala 


Pro 


Phe 


Arg 


Leu 








260 










265 










270 






Pro 


Pro 


Phe 


Gly 


Gin 


His 


Pro 


He 


He 


Asn 


He 


Glu 


Asn 


Ser 


Ser 


Phe 






275 










280 










285 








Phe 


Asn 


Glu 


Asp 


Thr 


Thr 


Pro 


He 


Leu 


Ala 


Ser 


He 


Ser 


He 


Pro 


Ser 




290 










295 










300 










Ser 


Met 


Val 


He 


Lys 


His 


His 


Thr 


Arg 


Lys 


Asn 


Ser 


Arg 


Trp 


Arg 


Cys 



wo 01/38351 



372 



PCT/USOO/28888 



305 310 315 320 



Pro 


Asn 


Asn 


Leu 


Met 
325 


Thr 


Ala 


Ala 


Glu 


Arg 
330 


Ser 


He 


Phe 


Leu 


Arg 
335 


Gly 


Val 


Leu 


Thr 


Val 
340 


Ser 


Gly 


Asp 


Tyr 


Gly 
345 


Trp 


Phe 


Ser 


Val 


He 
350 


Val 


Gly 


Ser 


Thr 


He 

355 


Met 


Pro 


Ser 


Val 


Leu 
360 


Phe 


Tyr 


Gly 


Asp 


Arg 
365 


Lys 


His 


Leu 


lie 


Asn 
370 


Thr 


Val 


Lys 


Ser 


Asn 
375 


Asn 


Phe 


Ser 


Ala 


He 
380 


Thr 


Cys 


Ser 


Tyr 


Trp 


Asn 


Lys 


Tyr 


Met 


Asp 


Cys 


Arg 


Ser 


Tyr 


Gly 


Phe 


Glu 


He 


He 


Asp 


385 










390 










395 










400 


Thr 


Pro 


Glu 


Asn 


Asn 
405 


Cys 


Gly 


Phe 


Arg 


He 
410 


Arg 


Ala 


Ala 


He 


Asp 
415 


Cys 


Ser 


Asn 


Thr 


Asp 
420 


Phe 


His 


Ser 


Pro 


Val 

425 


Thr 


Arg 


Val 


Asn 


Lys 
430 


Lys 


Lys 


Thr 


Ser 


He 
435 


He 


Asn 


Ala 


Val 


Lys 
440 


Asn 


Pro 


Phe 


Phe 


He 
445 


Arg 


His 


Thr 


Glu 


Pro 
450 


Lys 


Trp 


Tyr 


Asn 


Lys 
455 


Asn 


Ala 


Met 


Cys 


Gly 
460 


Glu 


Val 


Leu 


Glu 


Asn 


Val 


Gly 


Val 


Thr 


Leu 


Glu 


Gin 


His 


Val 


Arg 


Val 


Ser 


Asp 


Glu 


Tyr 


4 65 










470 










475 










480 


Met 


Asp 


Arg 


Phe 


Gly 
485 


Ser 


Leu 


Leu 


Leu 


Gly 
490 


Arg 


Glu 


Lys 


Lys 


Trp 
495 


Thr 


Cys 


Asn 


Tyr 


Leu 
500 


Asp 


Arg 


He 


Lys 


Ser 
505 


Leu 


Glu 


Thr 


He 


Ser 
510 


Asn 


Asn 


Leu 


Lys 


Gly 
515 


Lys 


He 


Asp 


Thr 


Met 
520 


Cys 


Lys 


He 


Thr 


Lys 
525 


Tyr 


Asn 


Tyr 


Lys 


Ser 
530 


Ser 


Ser 


Leu 


Tyr 


Tyr 
535 


Lys 


Gin 


He 


Thr 


Ala 
540 


Thr 


Ser 


Asp 


Asp 


Pro 


He 


Lys 


Met 


Lys 


He 


He 


Ala 


Ser 


He 


Asn 


Lys 


Arg 


Arg 


Tyr 


Leu 


545 










550 










555 










560 


Cys 


Asn 


He 


Phe 


Ala 
565 


He 


He 


Ser 


Ser 


Glu 
570 


Lys 


Lys 


Asp 


Glu 


Val 
575 


Glu 


Glu 


Asp 


His 


Thr 
580 


Lys 


Thr 


Gly 


Asn 


Gly 

585 


Gly 


Cys 


Ala 


Phe 


Ser 
590 


Lys 


Tyr 


Lys 


Lys 


Lys 
595 


Gin 


Leu 


Glu 


Pro 


Lys 
600 


Gin 


His 


Leu 


He 


Val 
605 


Lys 


Val 


Asn 


Lys 


Tyr 
610 


He 


Glu 


Ala 


Phe 


Ser 
615 


Leu 


He 


Lys 


Met 


Leu 
620 


Arg 


Asn 


Asp 


Cys 


Glu 


Arg 


Asn 


Lys 


Cys 


Arg 


Phe 


Lys 


Glu 


Ala 


Glu 


He 


Arg 


Glu 


Cys 


Ala 


625 










630 










635 










640 


Asn 


Glu 


Leu 


Val 


Arg 
645 


Glu 


Leu 


Tyr 


Arg 


Ala 
650 


Ser 


Ala 


Arg 


Ser 


Tyr 
655 


Val 


His 


Asp 


Leu 


Val 
660 


Leu 


Lys 


Arg 


Thr 


Asn 
665 


Val 


His 


Leu 


Thr 


Trp 

670 


Gin 


Arg 


Pro 


Tyr 


Asp 
675 


Glu 


Asn 


Ala 


Asn 


Thr 
680 


He 


Met 


Ser 


Leu 


He 
685 


Pro 


Lys 


Cys 


Lys 


Leu 
690 


His 


Thr 


Val 


Leu 


Tyr 
695 


Asp 


Lys 


Asp 


Ser 


Arg 
700 


Asp 


Val 


Lys 


Leu 


Leu 


Asn 


Phe 


Leu 


Arg 


Thr 


Arg 


Asp 


Gly 


Asn 


Tyr 


Asn 


Pro 


He 


Arg 


His 


705 










710 










715 










720 


Ser 


Met 


Leu 


Glu 


Leu 

725 


Val 


Tyr 


Gly 


Glu 


Glu 

730 


Tyr 


Ala 


Lys 


Asp 


Val 

735 


Ser 


Thr 


Val 


Thr 


Cys 
740 


Phe 


Glu 


Trp 


Leu 


Lys 
745 


Trp 


Cys 


Ser 


Lys 


Lys 
750 


Gly 


Val 


He 


Lys 


Tyr 
755 


Glu 


Asp 


Phe 


Leu 


Asp 
760 


Arg 


Tyr 


Glu 


Lys 


Thr 
765 


Gly 


Glu 


Glu 


Asp 


i-ys 

770 


Asp 


Glu 


Arg 


Glu 


Phe 
775 


Phe 


Arg 


Leu 


Lys 


Lys 
780 


Cys 


Ser 


Arg 


Asp 


His 


Thr 


Lys 


Asp 


He 


Lys 


Lys 


He 


Glu 


Asn 


Val 


Leu 


Asn 


Ser 


Asp 


Thr 



785 790 795 800 
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Lys 


Tyr 


Ser 


Leu 


Asp 
805 


Lys 


Asn 


Val 


Gin 


Thr 
810 


His 


Ala 


Ser 


Ser 


Ser 
815 


Thr 


Val 


Val 


Lys 


Asn 
820 


Asp 


Thr 


Asp 


Gly 


Lys 
825 


Thr 


Ser 


Met 


Val 


Gly 
830 


Trp 


Asp 


Tyr 


He 


Phe 
835 


Ser 


He 


Gly 


Lys 


Gly 
840 


Glu 


Lys 


Thr 


Thr 


Lys 
845 


Lys 


Arg 


Lys 


Leu 


Glu 
850 


Thr 


He 


Asp 


He 


Ser 
855 


Ser 


Ser 


Asp 


Asp 


Asp 
860 


Asp 


Glu 


Glu 


Glu 


Glu 


Glu 


Glu 


Asp 


Glu 


Gly 


Lys 


Arg 


Met 


Lys 


Met 


Asn 


Asn 


Cys 


Ser 


Ser 


865 










870 










875 










880 


Ser 


He 


Lys 


Asn 


Lys 
885 


Ser 


Lys 


Asn 


Lys 


Asn 
890 


Gly 


Arg 


Met 


Cys 


Cys 
895 


Thr 


Asp 


He 


Leu 


Asn 
900 


Val 


Val 


Glu 


Pro 


Ser 
905 


Leu 


Pro 


Asn 


Thr 


Leu 

910 


Ser 


Phe 


Asn 


Cys 


Val 
915 


Lys 


Ser 


Met 


Asp 


Val 
920 


Leu 


Asn 


Leu 


Leu 











<210> 286 
<211> 635 
<212> DNA 
<213> SHRIMP 



<400> 286 

catttcttcg 

actattaact 

gagtcaggga 

atagccatgg 

ccatatgaaa 

agaaatttta 

tttgtgttga 

ccaaatccca 

gtcagcacca 

ggtgaagaag 

gtaaaggcta 



tctacataaa 
ccacaaattc 
cgtttccatc 
atggagatta 
cctacaaaga 
aacctttaaa 
tcgaatcacc 
tttacaattc 
ctggtacaga 
gagttacaga 
gtttcaaatc 



gaagaataac 
tggaggagta 
tagggtacaa 
tgcgacatta 
cgtggaggat 
cggtgatgaa 
atattataat 
tactgaaaag 
tgaaaggttc 
aaataggtac 
agtcatgccc 



<210> 287 
<211> 431 
<212> DNA 
<213> SHRIMP 



tggttctctg 
ggaggattat 
tattccacat 
gtaaggaacg 
agccagttct 
aataaagacc 
ggagggtttt 
ccgtatatta 
ttctgcctcg 
tttttacgcc 
actat 



gggccattag 
gggataatgt 
caacgaaaaa 
gaatgtccac 
atttacattt 
accttgaaag 
tatcatacaa 
acacggagat 
agaaggaata 
acatggcaag 



taatgacact 
ttctgattca 
tttagtcaat 
aaatcaaagg 
tttccacgtt 
ggatgaaagt 
tatcaacaac 
aacttccatc 
cgtcgaagat 
taattatgtt 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

635 



<400> 287 

ctgaagaata 

gactgacgcc 

gacttttgat 

gtgcctaaag 

gactggagat 

agacggtgac 

tttcttccct 

aaaaaaaaaa 



ttgaaagaag 
gatttcttgc 
actgttgagg 
cagggatcta 
tcaacgctgg 
gggagactaa 
atgtttaaaa 
a 



acttcttgaa 
tggtggggga 
cagtcaggaa 
cttttgtcct 
agattcaacc 
tatctttctt 
atttgtcttg 



gaggaccgat 
tgatactagt 
gagtga tcta 
caatggaggc 
catgattgtg 
agtttcccgt 
gttaaaaaaa 



aaaaaaatgg 
agatatgaag 
ga tgaccgtg 
atcgaagaat 
ccaacaacag 
cacggtgaaa 
taaaacgaaa 



ccaccttcca 
aagtgatgaa 
tttacatggt 
tgcgtctttt 
aataaaataa 
atgttggtta 
actgtcaaaa 



60 

120 

180 

240 

300 

360 

420 

431 



<210> 288 
<211> 1103 
<212> DNA 
<213> SHRIMP 



<400> 288 

ctattttagg gtactagaaa acgttgtcag aaccaatcat gaggtttatc ccgatgttga 60 

taattgcgct tatcgctgca tttgttattg cagctcttct tactgccaat tcaaattacc 120 

tggatcacaa tatcaataag gaactaaatt tgacccgttc ccttcaactt cggggaacat 180 

tcacaccaga agatatagct cataacaaca gaattctccc ttccaagctg agtgttttag 240 
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aaagagggtc 
ataattctca 
catcccaatc 
cctgtatgtg 
aatatattaa 
agtattaaga 
cttgcaagtg 
tccttttctt 
tcctacaaca 
tccagggaat 
tcttcttctt 
gtaccttcta 
gaaga tatta 
agattttcta 
attctatatt 



aatcatttta 
agagaggagg 
atcttcccaa 
acgagtctcg 
aggaggaaga 
tgatgaaacc 
cgatagggag 
ccatgacgtt 
ataaaaggat 
tgttatctga 
cttcttcaac 
attgtacttc 
accgtttaag 
aaattaatag 
ccaaaaaaaa 



gcagaaatgg 
aacatttcta 
gttgaacttt 
caccgacgag 
agttagtgga 
tggttcaaaa 
tagaaagaaa 
gaatgaagga 
aaaaaacaat 
aactacggtg 
gtcatcatcc 
taaaactgta 
gtataacata 
attaactaca 
aaa 



acaagtacaa 
ctcctcaaca 
aaattctata 
aaagaagatt 
ggagaaaagg 
atcaaaacaa 
gctgaagaat 
ggacgtctgg 
cgttctaggc 
agggaggaaa 
acttcttcgt 
tgcgattttg 
atacatatat 
tgtatacaag 



aaatgcacaa 
acaacaacag 
agaaaatgct 
ttaacaacga 
ttgtaaatat 
agtacaggct 
atattgaacg 
cttcagcgat 
ctgtgaagct 
tactacggaa 
tttcttcaat 
taaaacacat 
ctgaagaaaa 
gtataagtaa 



cctacagtaa 
accaccccct 
gatttttgtt 
tgaggaagaa 
cgtagtaggg 
ctactccctc 
tctatacaat 
ttattgtcca 
gatccatgcc 
atctcccgct 
ttttttgttt 
acagtatgaa 
ctatgcttca 
gacaataaag 



300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1103 



<210> 289 
<211> 234 
<212> DNA 
<213> SHRIMP 



<400> 289 

acaacctcca catctccttc taattctaac acacaagaaa aggataaaaa taaaataaat 60 
attgatgcta acaaaattta ttcaagaggt cacaaacatg taccaatatc aattgtaaaa 120 
agcacgttaa acaaagcgtt aaaaatagca ggtatgtctg ctatgagcaa agagcaagtg 180 
ttaacaatgt accaattaat aaaccttagg taggaaagag aaaaaaaaaa aaaa 234 

<210> 290 
<211> 597 
<212> DNA 
<213> SHRIMP 



<400> 290 

aaaa tactcc 

tcgtggagga 

gtcacatttg 

gctgcgtcct 

cagttaattg 

ttcatcctta 

caactccacc 

ggcaatcagt 

aagaagaaaa 

ttttttctca 



tcaacttcca 
agggcaagta 
ctatctagga 
cctgctggtg 
tgatgtgagc 
ccattcttct 
agatactaaa 
tactgtgaat 
agcaa tgggt 
atatggcaaa 



acaagaatgg 
aacgactacg 
agaggaggaa 
actgtcatga 
ccacagagct 
tccacaacca 
aaagttgacg 
aatggaggtg 
atcaatacag 
aaataaaata 



gctctgctaa 
ggtctaccaa 
gaattgcatg 
ctttgttgat 
actcgccgcc 
ccactacttc 
acgactatga 
ttttcatcaa 
ataatggagg 
tggtatatta 



cttgaacgac 
caagaaacaa 
tgtcatcgca 
tgtggtactg 
gccgcagccg 
cactactact 
tgacgacgtc 
tggaagaaaa 
atttgtt tgg 
aaaaaaaaaa 



cgaactctac 
ccagtgtgct 
tctgttctgg 
ggaactgcac 
ccggtgcagt 
actactacac 
aatattggag 
ctctcaaaag 
aagaatggtg 
aaaaaaa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

597 



<210> 291 

<211> 335 

<212> DNA 

<213> SHRIMP 



<400> 291 

cgagaacaaa 

caagaccaaa 

gcggaggagc 

ttcctcaaag 

acaacaacta 

tccattgaaa 



agattgttgt ctaggaacaa agaaacatta aaaatggttg cccgaagctc 60 
tcccgccgtg gaagcaagaa gaggtccacc actgctggac gcatctccaa 120 
ccatcaatga agaagcgtgc aggaaagaag agctccactg tccgtcgccg 180 
agcggaaaga agtctggagc ccgcaagtca aggcgttaat tcttccctgt 240 
tgttatttaa ttgatttttt ttcttctgaa taattggaaa taataaaaca 300 
cttaaaaaaa aaaaaaaaaa aaaaa 335 



<210> 292 
<211> 225 
<212> DNA 
<213> SHRIMP 
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<400> 292 

ggagctctgt ca tttctggt 
acttgacatc agaccgaccc 
ttgaaaagtt tttgagatgg 
accaataaat ttagtggtaa 

<210> 293 
<211> 107 
<212> DNA 
<213> SHRIMP 



gcaatctgac attggctcga cccagcggtc caccctccaa 60 

agcggtccac cctctaaact cgagcgaccc agaaaaattt 120 

aggaagagta aaattcccta gtgtaaacag taccaagaag 180 

taaaactaca catatgatta aaaaa 225 



<400> 293 

tcctcctcct tcttctcctc 
atgtacactc tgttaatttt 



ctgcacataa aaaatcacgt cttccggatg aaggcgaaaa 60 
ttcaaacaat aaactaacca ccttgta 107 



